Convex optimization
ADMM solvers
The following functions are inspired by the MATLAB scripts accompanying S. Boyd, N. Parikh, E. Chu, B. Peleato, and J. Eckstein, "Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers", which describes a wide variety of optimization schemes based on the Alternating Direction Method of Multipliers (ADMM).
BasisPursuit
Attempt to minimize \(\| x \|_1\) such that \(Ax=b\) using ADMM.
int BasisPursuit(const Matrix<F> &A, const Matrix<F> &b, Matrix<F> &x, Matrix<F> &z, Matrix<F> &u, Base<F> rho = 1, Base<F> alpha = 1.2, Int maxIter = 500, Base<F> absTol = 1e-6, Base<F> relTol = 1e-4, bool usePinv = true, Base<F> pinvTol = 0, bool progress = false)

int BasisPursuit(const DistMatrix<F> &A, const DistMatrix<F> &b, DistMatrix<F> &x, DistMatrix<F> &z, DistMatrix<F> &u, Base<F> rho = 1, Base<F> alpha = 1.2, Int maxIter = 500, Base<F> absTol = 1e-6, Base<F> relTol = 1e-4, bool usePinv = true, Base<F> pinvTol = 0, bool progress = false)
These functions are inspired by this basis pursuit ADMM implementation. Elemental’s implementations use parallel (dense) linear algebra and allow the user to choose between directly computing a (thresholded) pseudoinverse and using an LQ factorization. The return value is the number of ADMM iterations performed.
Parameters:

A (Input): part of constraint \(Ax=b\)
b (Input): part of constraint \(Ax=b\)
x (Output): primal solution (first term)
z (Output): primal solution (second term)
u (Output): dual solution
rho (Input): augmented-Lagrangian parameter
alpha (Input): over-relaxation parameter (usually in \([1,1.8]\))
maxIter (Input): maximum number of ADMM iterations
absTol (Input): absolute convergence tolerance
relTol (Input): relative convergence tolerance
usePinv (Input): directly compute a pseudoinverse?
pinvTol (Input): the pseudoinverse inversion tolerance
progress (Input): display detailed progress information?
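A minimal usage sketch of the distributed overload follows. The header name, the elem namespace, and the Gaussian test-data helper are assumptions about the surrounding release (newer versions use "El.hpp" and the El namespace) and are not prescribed by this documentation:

    #include "elemental.hpp"   // assumed header; newer releases use "El.hpp"
    using namespace elem;      // assumed namespace; newer releases use El

    int main( int argc, char* argv[] )
    {
        Initialize( argc, argv );
        {
            // Underdetermined test system: many x satisfy Ax=b, and basis
            // pursuit seeks one with small l1 norm.
            const int m = 50, n = 200;
            DistMatrix<double> A, b, x, z, u;
            Gaussian( A, m, n );   // random test data (helper assumed available)
            Gaussian( b, m, 1 );

            // Solve with the documented defaults (rho=1, alpha=1.2, ...);
            // the return value is the number of ADMM iterations performed.
            const int numIts = BasisPursuit( A, b, x, z, u );
        }
        Finalize();
        return 0;
    }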
LinearProgram

Attempt to solve the linear program

\[ \min_x \; c^T x \;\; \mbox{such that } Ax = b, \; x \ge 0 \]

using ADMM.
int LinearProgram(const Matrix<Real> &A, const Matrix<Real> &b, const Matrix<Real> &c, Matrix<Real> &x, Matrix<Real> &z, Matrix<Real> &u, Real rho = 1., Real alpha = 1.2, Int maxIter = 500, Real absTol = 1e-6, Real relTol = 1e-4, bool inv = false, bool progress = true)

int LinearProgram(const DistMatrix<Real> &A, const DistMatrix<Real> &b, const DistMatrix<Real> &c, DistMatrix<Real> &x, DistMatrix<Real> &z, DistMatrix<Real> &u, Real rho = 1., Real alpha = 1.2, Int maxIter = 500, Real absTol = 1e-6, Real relTol = 1e-4, bool inv = true, bool progress = true)
These functions are inspired by this ADMM linear program solver. Elemental’s implementations make use of parallel (dense) linear algebra and a custom structured factorization, and allow the user to optionally invert the (pivoted) Schur complement directly in order to accelerate its application. The return value is the number of ADMM iterations performed.
Parameters:

A (Input): part of constraints, \(Ax=b\)
b (Input): part of constraints, \(Ax=b\)
c (Input): part of the objective, \(c^T x\)
x (Output): primal solution (first term)
z (Output): primal solution (second term)
u (Output): dual solution
rho (Input): augmented-Lagrangian parameter
alpha (Input): over-relaxation parameter (usually in \([1,1.8]\))
maxIter (Input): maximum number of ADMM iterations
absTol (Input): absolute convergence tolerance
relTol (Input): relative convergence tolerance
inv (Input): directly compute Schur-complement inverse?
progress (Input): display detailed progress information?
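As a sketch of how the distributed overload might be called (the header, namespace, and Uniform test-data helper are assumptions as above; a randomly generated problem may of course be infeasible or unbounded, so real data should ensure feasibility):

    #include "elemental.hpp"   // assumed header; newer releases use "El.hpp"
    using namespace elem;      // assumed namespace; newer releases use El

    int main( int argc, char* argv[] )
    {
        Initialize( argc, argv );
        {
            const int m = 20, n = 100;
            DistMatrix<double> A, b, c, x, z, u;
            Uniform( A, m, n );   // random test data (helper assumed available)
            Uniform( b, m, 1 );
            Uniform( c, n, 1 );

            // min c^T x  such that  Ax = b, x >= 0, with the documented defaults;
            // the return value is the number of ADMM iterations performed.
            const int numIts = LinearProgram( A, b, c, x, z, u );
        }
        Finalize();
        return 0;
    }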
QuadraticProgram

Attempt to solve the quadratic program

\[ \min_x \; \frac{1}{2} x^T P x + q^T x \;\; \mbox{such that } l_b \le x \le u_b \]

using ADMM.
int QuadraticProgram(const Matrix<Real> &P, const Matrix<Real> &q, Real lb, Real ub, Matrix<Real> &x, Matrix<Real> &z, Matrix<Real> &u, Real rho = 1., Real alpha = 1.2, Int maxIter = 500, Real absTol = 1e-6, Real relTol = 1e-4, bool inv = false, bool progress = true)

int QuadraticProgram(const DistMatrix<Real> &P, const DistMatrix<Real> &q, Real lb, Real ub, DistMatrix<Real> &x, DistMatrix<Real> &z, DistMatrix<Real> &u, Real rho = 1., Real alpha = 1.2, Int maxIter = 500, Real absTol = 1e-6, Real relTol = 1e-4, bool inv = true, bool progress = true)
These functions are inspired by this ADMM quadratic program solver. Elemental’s implementations make use of parallel (dense) linear algebra and allow the user to optionally invert the Cholesky factor directly in order to improve the parallel performance of applying its inverse.
Parameters:

P (Input): SPD and part of objective, \(\frac{1}{2}x^T P x + q^T x\)
q (Input): part of objective
lb (Input): lower bound of constraints, \(l_b \le x \le u_b\)
ub (Input): upper bound of constraints, \(l_b \le x \le u_b\)
x (Output): primal solution (first term)
z (Output): primal solution (second term)
u (Output): dual solution
rho (Input): augmented-Lagrangian parameter
alpha (Input): over-relaxation parameter (usually in \([1,1.8]\))
maxIter (Input): maximum number of ADMM iterations
absTol (Input): absolute convergence tolerance
relTol (Input): relative convergence tolerance
inv (Input): compute inverse of Cholesky factor?
progress (Input): display detailed progress information?
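A sketch of a call to the distributed overload, under the same assumptions about the header, namespace, and test-matrix helpers; HermitianUniformSpectrum and Uniform are used here only to produce illustrative data (HermitianUniformSpectrum yields an SPD matrix when given a positive spectrum):

    #include "elemental.hpp"   // assumed header; newer releases use "El.hpp"
    using namespace elem;      // assumed namespace; newer releases use El

    int main( int argc, char* argv[] )
    {
        Initialize( argc, argv );
        {
            const int n = 100;
            DistMatrix<double> P, q, x, z, u;
            HermitianUniformSpectrum( P, n, 1., 2. ); // SPD with spectrum in [1,2]
            Uniform( q, n, 1 );

            // min (1/2) x^T P x + q^T x  such that  0 <= x <= 1
            const int numIts = QuadraticProgram( P, q, 0., 1., x, z, u );
        }
        Finalize();
        return 0;
    }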
SparseInvCov

Attempt to find a sparse inverse covariance matrix which generated the given observations by solving the program

\[ \min_{X \succ 0} \; \mbox{tr}(S X) - \log(\det(X)) + \lambda \| \mbox{vec}(X) \|_1, \]

where \(S\) is the empirical covariance of the observations in \(D\), using ADMM.
int SparseInvCov(const Matrix<Real> &D, Matrix<Real> &X, Matrix<Real> &Z, Matrix<Real> &U, Real lambda, Real rho = 1., Real alpha = 1.2, Int maxIter = 500, Real absTol = 1e-6, Real relTol = 1e-4, bool progress = true)

int SparseInvCov(const DistMatrix<Real> &D, DistMatrix<Real> &X, DistMatrix<Real> &Z, DistMatrix<Real> &U, Real lambda, Real rho = 1., Real alpha = 1.2, Int maxIter = 500, Real absTol = 1e-6, Real relTol = 1e-4, bool progress = true)
These functions are inspired by this ADMM solver. Elemental’s implementations make use of parallel (dense) linear algebra (including PMRRR for the symmetric tridiagonal eigensolver).
Parameters:

D (Input): observations
X (Output): primal solution (first term)
Z (Output): primal solution (second term)
U (Output): dual solution
lambda (Input): coefficient for the vector-\(\ell_1\) penalty
rho (Input): augmented-Lagrangian parameter
alpha (Input): over-relaxation parameter (usually in \([1,1.8]\))
maxIter (Input): maximum number of ADMM iterations
absTol (Input): absolute convergence tolerance
relTol (Input): relative convergence tolerance
progress (Input): display detailed progress information?
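A sketch of a call to the distributed overload follows. The header, namespace, and the Gaussian helper are assumptions as above, and the synthetic observation matrix is only a stand-in; real applications would supply genuine observations in the orientation the library expects:

    #include "elemental.hpp"   // assumed header; newer releases use "El.hpp"
    using namespace elem;      // assumed namespace; newer releases use El

    int main( int argc, char* argv[] )
    {
        Initialize( argc, argv );
        {
            const int numObs = 1000, n = 50;
            DistMatrix<double> D, X, Z, U;
            Gaussian( D, numObs, n );   // synthetic observation matrix

            const double lambda = 0.1;  // strength of the vector-l1 penalty
            const int numIts = SparseInvCov( D, X, Z, U, lambda );
            // X and Z hold the two primal blocks of the estimated inverse covariance
        }
        Finalize();
        return 0;
    }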
Utilities

The following utility routines are widely used within first-order optimization methods, and some, such as singular-value soft-thresholding (SVT), warrant non-trivial implementations.
Clip

Force every entry of a matrix to lie within a given (half-)interval.

void LowerClip(DistMatrix<Real> &X, Real lowerBound = 0)
    Force every entry to be at least lowerBound.

void UpperClip(DistMatrix<Real> &X, Real upperBound = 0)
    Force every entry to be at most upperBound.

void Clip(DistMatrix<Real> &X, Real lowerBound = 0, Real upperBound = 1)
    Force every entry to lie within the interval defined by lowerBound and upperBound.
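For concreteness, a small sketch of the clipping semantics, under the same assumptions as the earlier sketches about the header, namespace, and random test data:

    #include "elemental.hpp"   // assumed header; newer releases use "El.hpp"
    using namespace elem;      // assumed namespace; newer releases use El

    int main( int argc, char* argv[] )
    {
        Initialize( argc, argv );
        {
            DistMatrix<double> X;
            Gaussian( X, 10, 10 );   // standard-normal entries

            LowerClip( X, 0. );      // every entry is now >= 0
            UpperClip( X, 1. );      // every entry is now <= 1
            Clip( X, 0., 1. );       // equivalent to the two calls above
        }
        Finalize();
        return 0;
    }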
LogBarrier
Uses a careful calculation of the log of the determinant in order to return the log barrier of a Hermitian positive-definite matrix A, \(-\log(\mbox{det}(A))\).
Base<F> LogBarrier(UpperOrLower uplo, const Matrix<F> &A)

Base<F> LogBarrier(UpperOrLower uplo, const DistMatrix<F> &A)

Base<F> LogBarrier(UpperOrLower uplo, Matrix<F> &A, bool canOverwrite = false)

Base<F> LogBarrier(UpperOrLower uplo, DistMatrix<F> &A, bool canOverwrite = false)
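The standard identity underlying such a careful calculation (stated here for reference, without implying anything about Elemental’s internal details) uses the Cholesky factorization \(A = L L^H\): since \(\mbox{det}(A) = \prod_i \ell_{ii}^2\) for the positive diagonal entries \(\ell_{ii}\) of \(L\),

\[ -\log(\mbox{det}(A)) = -2 \sum_i \log(\ell_{ii}), \]

which can be accumulated term by term without ever forming \(\mbox{det}(A)\), avoiding the overflow and underflow that the determinant itself would risk.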
LogDetDiv

The log-det divergence of a pair of \(n \times n\) Hermitian positive-definite matrices \(A\) and \(B\) is

\[ D_{\mathrm{ld}}(A,B) = \mbox{tr}(A B^{-1}) - \log(\mbox{det}(A B^{-1})) - n, \]

which can be greatly simplified using the Cholesky factors of \(A\) and \(B\). In particular, if we set \(Z = L_B^{-1} L_A\), where \(A=L_A L_A^H\) and \(B=L_B L_B^H\) are Cholesky factorizations, then

\[ D_{\mathrm{ld}}(A,B) = \| Z \|_F^2 - 2 \log(\mbox{det}(Z)) - n. \]
Base<F> LogDetDivergence(UpperOrLower uplo, const Matrix<F> &A, const Matrix<F> &B)

Base<F> LogDetDivergence(UpperOrLower uplo, const DistMatrix<F> &A, const DistMatrix<F> &B)
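A brief justification of the simplification, using the cyclic property of the trace and the fact that \(Z\) is lower triangular with positive diagonal entries \(\zeta_{ii}\):

\[ \mbox{tr}(A B^{-1}) = \mbox{tr}\left( L_B^{-1} L_A L_A^H L_B^{-H} \right) = \mbox{tr}(Z Z^H) = \| Z \|_F^2, \qquad \log(\mbox{det}(A B^{-1})) = 2 \log(\mbox{det}(Z)) = 2 \sum_i \log(\zeta_{ii}). \]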
Singular-value soft-thresholding
Overwrites \(A\) with \(U S_{\tau}(\Sigma) V^H\), where \(U \Sigma V^H\) is the singular-value decomposition of \(A\) upon input and \(S_{\tau}\) performs soft-thresholding with parameter \(\tau\). The return value is the rank of the soft-thresholded matrix.
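Explicitly, since the singular values are nonnegative, \(S_{\tau}\) acts on \(\Sigma = \mbox{diag}(\sigma_1,\sigma_2,\ldots)\) as

\[ S_{\tau}(\Sigma) = \mbox{diag}\left( \max(\sigma_i - \tau, 0) \right), \]

so the reported rank is simply the number of singular values that exceed the threshold.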
int SVT(DistMatrix<F> &A, Base<F> tau, bool relative = false)
    Runs the default SVT algorithm. In the sequential case, this is currently svt::Normal, and, in the parallel case, it is svt::Cross.

int SVT(DistMatrix<F> &A, Base<F> tau, int relaxedRank, bool relative = false)
    Runs a faster (for small ranks), but less accurate, algorithm given an upper bound on the rank of the soft-thresholded matrix. The current implementation preprocesses with relaxedRank steps of (Businger-Golub) column-pivoted QR using the routine svt::PivotedQR.

int SVT(DistMatrix<F, U, STAR> &A, Base<F> tau, bool relative = false)
    Runs an SVT algorithm designed for tall-skinny matrices. The current implementation is based on TSQR factorization and is svt::TSQR.
namespace svt

int svt::Normal(DistMatrix<F> &A, Base<F> tau, bool relative = false)
    Runs a standard SVD, soft-thresholds the singular values, and then reforms the matrix.

int svt::Cross(DistMatrix<F> &A, Base<F> tau, bool relative = false)
    Forms the normal matrix, computes its Hermitian EVD, soft-thresholds the eigenvalues, and then reforms the matrix. Note that Elemental’s parallel Hermitian EVD is much faster than its parallel SVD; this is typically worth the loss of accuracy in the computed small (truncated) singular values and is therefore the default choice for parallel SVT.

int svt::PivotedQR(DistMatrix<F> &A, Base<F> tau, int numStepsQR, bool relative = false)
    Computes an approximate SVT by first approximating A as the rank-numStepsQR approximation produced by numStepsQR iterations of column-pivoted QR.
Soft-thresholding

Overwrites each entry of \(A\) with its soft-thresholded value.

void SoftThreshold(DistMatrix<F> &A, Base<F> tau, bool relative = false)
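For reference, the standard scalar soft-thresholding map with parameter \(\tau\), which is what an entrywise routine of this kind conventionally applies, is

\[ S_{\tau}(a) = \frac{a}{|a|} \, \max(|a| - \tau, 0), \qquad S_{\tau}(0) = 0, \]

which reduces to \(\mbox{sgn}(a)\max(|a| - \tau, 0)\) for real \(a\).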