Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
194 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Response-Time-Optimized Distributed Cloud Resource Allocation (1601.06262v2)

Published 23 Jan 2016 in cs.NI

Abstract: A current trend in networking and cloud computing is to provide compute resources over widely dispersed places exemplified by initiatives like Network Function Virtualisation. This paves the way for a widespread service deployment and can improve service quality; a nearby server can reduce the user-perceived response times. But always using the nearest server is a bad decision if that server is already highly utilized. This paper investigates the optimal assignment of users to widespread resources -- a convex capacitated facility location problem with integrated queuing systems. We determine the response times depending on the number of used resources. This enables service providers to balance between resource costs and the corresponding service quality. We also present a linear problem reformulation showing small optimality gaps and faster solving times; this speed-up enables a swift reaction to demand changes. Finally, we compare solutions by either considering or ignoring queuing systems and discuss the response time reduction by using the more complex model. Our investigations are backed by large-scale numerical evaluations.

Citations (22)

Summary

We haven't generated a summary for this paper yet.