On Universal Scaling of Distributed Queues under Load Balancing

Abstract

This paper considers the steady-state performance of load balancing algorithms in a many-server system with distributed queues. The system has N servers, and each server maintains a local queue with buffer size b-1, i.e. a server can hold at most one job in service and b-1 jobs in the queue. Jobs in the same queue are served according to the first-in-first-out (FIFO) order. The system is operated in a heavy-traffic regime such that the workload per server is λ = 1 - N-α for 0.5≤ α<1. We identify a set of algorithms such that the steady-state queues have the following universal scaling, where universal means that it holds for any α∈[0.5,1): (i) the number of of busy servers is λ N-o(1); and (ii) the number of servers with two jobs (one in service and one in queue) is O(Nα N); and (iii) the number of servers with more than two jobs is O(1Nr(1-α)-1), where r can be any positive integer independent of N. The set of load balancing algorithms that satisfy the sufficient condition includes join-the-shortest-queue (JSQ), idle-one-first (I1F), and power-of-d-choices (Pod) with d≥ Nα2 N. We further argue that the waiting time of such an algorithm is near optimal order-wise.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…