Rectangular Matrix Multiplication in the Low-Bandwidth Model
Abstract
We study rectangular matrix multiplication in the low-bandwidth model of distributed computing. There are n computers; initially the input matrices are distributed evenly between computers, and in each communication round every computer can send and receive an O( n)-bit message. Eventually each computer must output its designated part of the product matrix. While prior work has focused primarily on square n × n multiplication under various sparsity assumptions, we study rectangular instances with no sparsity assumption. We denote by a,b,c the task of multiplying an a× b matrix by a b× c matrix in this model. We concentrate on two natural aspect ratios, n,d,n and d,n,d, for d n, and we study how the round complexity depends on n and d. When d n, both n,d,n and d,n,d approach n,n,n, which is the usual task of multiplying square matrices. If we consider multiplication over semirings, the current best upper bound in that case is O(n4/3) rounds, and there is a trivial unconditional lower bound of Ω(n). We show that for d,n,d, we can achieve the complexity of O(d4/3), which seems like a natural generalization of the upper bound O(n4/3) when d=n. However, the case of n,d,n is fundamentally different, and also exhibits a phase transition. We show that for d n, the complexity of n,d,n is Θ(d n); we have matching upper and lower bounds. However, the behavior is genuinely different in the region d n, where the upper bound is O(d2/3 n2/3).
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.