Coded Matrix Chain Multiplication

Matrix multiplication is a fundamental building block in many machine learning models. Because the input matrices may be too large to multiply on a single server, it is common to split them into multiple submatrices and execute the multiplications on different servers. In a distributed infrastructure, however, it is common to observe stragglers: servers that temporarily run slower than the others. To mitigate the adverse effects of potential stragglers, various coding schemes for distributed matrix multiplication have recently been proposed. While most existing works consider only the simplest case, in which two matrices are multiplied, in this paper we investigate the more general case in which multiple matrices are multiplied, and propose a coding scheme whose result can be decoded directly in one round rather than over multiple rounds of computation. Compared to completing the matrix chain multiplication in multiple rounds, our coding scheme can reduce completion time by up to 90.3%.
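To make the idea of straggler-tolerant coding concrete, here is a minimal sketch of the simplest two-matrix case mentioned above. It is not the one-round chain scheme proposed in the paper. It assumes three hypothetical workers (`w1`, `w2`, `w3`), an input A with an even number of rows, and a single parity block A1 + A2, so the product A·B can be recovered from any two of the three worker results. All function and worker names are illustrative.

```python
# Illustrative (3, 2) parity code for A @ B; NOT the paper's chain scheme.

def matmul(X, Y):
    """Plain nested-list matrix product."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

def add(X, Y):
    """Element-wise sum of two equally sized matrices."""
    return [[x + y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def sub(X, Y):
    """Element-wise difference of two equally sized matrices."""
    return [[x - y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]

def encode(A):
    """Split A into top/bottom row blocks and add one parity block (assumes even row count)."""
    half = len(A) // 2
    A1, A2 = A[:half], A[half:]
    return {"w1": A1, "w2": A2, "w3": add(A1, A2)}

def decode(results):
    """Recover A @ B from the results of any two of the three workers."""
    if "w1" in results and "w2" in results:
        top, bottom = results["w1"], results["w2"]
    elif "w1" in results:
        top = results["w1"]
        bottom = sub(results["w3"], top)      # (A1 + A2)B - A1 B = A2 B
    else:
        bottom = results["w2"]
        top = sub(results["w3"], bottom)      # (A1 + A2)B - A2 B = A1 B
    return top + bottom                       # stack the row blocks

A = [[1, 2], [3, 4], [5, 6], [7, 8]]
B = [[1, 0], [2, 1]]
tasks = encode(A)
# Pretend worker w2 is a straggler and never replies.
results = {name: matmul(block, B) for name, block in tasks.items() if name != "w2"}
assert decode(results) == matmul(A, B)
```

Each worker multiplies only half of A by B, and the slowest worker can be ignored. Extending this kind of redundancy to a chain of several matrices without waiting between rounds is the problem the paper addresses.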

Link to the Publication

Pedro Juan Soto
