Large complex service centers must provide many services to many users with separate service contracts, while managing their overall costs. A scalable hybrid optimization procedure is described for a minimum-cost deployment of services on nodes, taking into account processing requirements and resource contention. This is a heuristic for a problem which is in general NP-hard. It iterates between a fast linear programming (LP) subproblem, and a nonlinear performance model, both of which scale easily to thousands of services. The approach can be adapted to minimize cost subject to performance constraints, or to optimize a combined quality of service measure subject to cost constraints. It can be combined with tracked performance models to periodically re-optimize deployment for autonomic QOS management.

Allocation, Autonomic control, Cloud computing, Optimal deployment, Performance, Performance management, Service systems
6th International Conference on Autonomic Computing, ICAC'09
Department of Systems and Computer Engineering

Li, J. (Jim), Chinneck, J, Woodside, C.M, & Litoiu, M. (Marin). (2009). Fast scalable optimization to configure service systems having cost and quality of service constraints. Presented at the 6th International Conference on Autonomic Computing, ICAC'09. doi:10.1145/1555228.1555268