AllReduce

Perform an (MPI) AllReduce over the contents of a matrix stored on different process (which overwrites every process’s input with the summation over all process’s copies).

C++ API

void AllReduce(Matrix<T> &A, mpi::Comm comm, mpi::Op op = mpi::SUM)
void AllReduce(AbstractDistMatrix<T> &A, mpi::Comm comm, mpi::Op op = mpi::SUM)

C API

TODO

Python API

TODO