Special Issue Paper
Configuring large-scale storage using a middleware with machine learning
Article first published online: 29 MAR 2011
Copyright © 2011 John Wiley & Sons, Ltd.
Concurrency and Computation: Practice and Experience
Volume 23, Issue 17, pages 2063–2077, 10 December 2011
How to Cite
Eyers, D. M., Routray, R., Zhang, R., Willcocks, D. and Pietzuch, P. (2011), Configuring large-scale storage using a middleware with machine learning. Concurrency Computat.: Pract. Exper., 23: 2063–2077. doi: 10.1002/cpe.1716
- Issue published online: 20 OCT 2011
- Manuscript Accepted: 23 DEC 2010
- Manuscript Revised: 16 NOV 2010
- Manuscript Received: 31 MAY 2010
Keywords
- configuration policy;
- best practices;
- machine learning
The proliferation of cloud services and other forms of service-oriented computing continues to accelerate, and with it the need for storage within the data centres that host these services. Management applications used by cloud providers to configure their infrastructure should ideally operate in terms of high-level policy goals, and not burden administrators with the details of particular storage systems. One common technology used by cloud providers is the Storage Area Network (SAN). Support for seamless scalability is engineered into SAN devices. However, SAN infrastructure has a very large parameter space: finding an optimal deployment is difficult, and so is the subsequent management of that deployment in cloud storage.
In this article, we discuss our work on SAN configuration middleware, which aims to provide users of large-scale storage infrastructure, such as cloud providers, with tools to assist in the management and evolution of heterogeneous SAN environments. We propose a middleware rather than a stand-alone tool so that the middleware can act as a proxy for interacting with, and informing, a central repository of SAN configurations. Storage system users can have their SAN configurations validated against a knowledge base of best practices contained within the central repository. Desensitized information is exported from local management applications to the repository, and the local middleware can subscribe to updates that proactively notify storage users when particular configurations come to be considered sub-optimal or unsafe.
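The interaction described in the abstract (validation against a repository of best practices, export of desensitized configuration data, and subscription to updates) can be illustrated with a minimal sketch. This is not the authors' implementation: every class, function, rule name, and configuration key below (`CentralRepository`, `Middleware`, `desensitize`, `paths_per_host`, and so on) is a hypothetical stand-in chosen for illustration, and the in-process callbacks stand in for whatever network protocol a real deployment would use.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Config = Dict[str, object]  # hypothetical flat view of a SAN configuration


@dataclass
class BestPractice:
    """A hypothetical best-practice rule held in the central repository."""
    rule_id: str
    description: str
    is_violated: Callable[[Config], bool]


@dataclass
class CentralRepository:
    """Stand-in for the central repository of configurations and rules."""
    rules: List[BestPractice] = field(default_factory=list)
    subscribers: List[Callable[[BestPractice], None]] = field(default_factory=list)
    submissions: List[Config] = field(default_factory=list)

    def subscribe(self, callback: Callable[[BestPractice], None]) -> None:
        self.subscribers.append(callback)

    def publish_rule(self, rule: BestPractice) -> None:
        # Store the rule, then push it to every subscribed middleware.
        self.rules.append(rule)
        for callback in self.subscribers:
            callback(rule)

    def submit(self, config: Config) -> None:
        self.submissions.append(config)


# Assumption: which keys count as sensitive is site-specific.
SENSITIVE_KEYS = {"hostname", "customer"}


def desensitize(config: Config) -> Config:
    """Drop identifying fields before a configuration leaves the site."""
    return {k: v for k, v in config.items() if k not in SENSITIVE_KEYS}


class Middleware:
    """Local proxy between a management application and the repository."""

    def __init__(self, repo: CentralRepository, local_config: Config) -> None:
        self.repo = repo
        self.local_config = local_config
        self.alerts: List[str] = []
        repo.subscribe(self._on_new_rule)

    def validate(self) -> List[str]:
        """Return the ids of all known rules the local configuration violates."""
        return [r.rule_id for r in self.repo.rules
                if r.is_violated(self.local_config)]

    def export(self) -> None:
        """Send a desensitized copy of the local configuration upstream."""
        self.repo.submit(desensitize(self.local_config))

    def _on_new_rule(self, rule: BestPractice) -> None:
        # Proactive notification: record an alert if a new rule is violated.
        if rule.is_violated(self.local_config):
            self.alerts.append(rule.rule_id)


repo = CentralRepository()
mw = Middleware(repo, {"hostname": "san-01.example", "paths_per_host": 1})
repo.publish_rule(BestPractice(
    "redundant-paths",
    "Each host should have at least two paths to storage (illustrative rule).",
    lambda c: c.get("paths_per_host", 0) < 2,
))
mw.export()
```

In this sketch the middleware learns of the new rule through its subscription and records an alert without polling, which mirrors the proactive notification the abstract describes. How rules are actually derived, for example by the machine learning component named in the title, is outside the scope of the sketch.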