\( \newcommand{\bm}[1]{\boldsymbol{#1}} \newcommand{\textm}[1]{\textsf{#1}} \newcommand{\textnormal}[1]{\textsf{#1}} \def\T{{\mkern-2mu\raise-1mu\mathsf{T}}} \newcommand{\R}{\mathbb{R}} % real numbers \newcommand{\E}{{\rm I\kern-.2em E}} \newcommand{\w}{\bm{w}} % bold w \newcommand{\bmu}{\bm{\mu}} % bold mu \newcommand{\bSigma}{\bm{\Sigma}} % bold Sigma \newcommand{\bigO}{O} %\mathcal{O} \renewcommand{\d}[1]{\operatorname{d}\!{#1}} \)

Exercises

Exercise 11.1 (Change of variable) Show why \(\bSigma\bm{x} = \bm{b}/\bm{x}\) can be equivalently solved as \(\bm{C}\tilde{\bm{x}} = \bm{b}/\tilde{\bm{x}}\) under the change of variable \(\tilde{\bm{x}} = \bm{D}^{1/2}\bm{x}\), where \(\bm{C}\) is the correlation matrix defined as \(\bm{C} = \bm{D}^{-1/2}\bSigma\bm{D}^{-1/2}\), with \(\bm{D}\) a diagonal matrix containing \(\textm{diag}(\bSigma)\) along its main diagonal. Would it be possible to use instead \(\bm{C} = \bm{M}^{-1/2}\bSigma\bm{M}^{-1/2}\), where \(\bm{M}\) is not necessarily a diagonal matrix?
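A quick numerical check can build intuition before the proof: for any \(\bm{x} > \bm{0}\), the residuals of the two systems are related by \(\bSigma\bm{x} - \bm{b}/\bm{x} = \bm{D}^{1/2}\left(\bm{C}\tilde{\bm{x}} - \bm{b}/\tilde{\bm{x}}\right)\), so one vanishes exactly when the other does. A minimal NumPy sketch (randomly generated data, naming ours) verifying this identity:

```python
import numpy as np

rng = np.random.default_rng(42)
n = 5

# Random positive-definite covariance matrix Sigma and risk-budget vector b.
A = rng.standard_normal((n, n))
Sigma = A @ A.T + n * np.eye(n)
b = np.full(n, 1.0 / n)

# D = Diag(diag(Sigma)) and the correlation matrix C = D^{-1/2} Sigma D^{-1/2}.
d = np.diag(Sigma)
C = Sigma / np.sqrt(np.outer(d, d))

# For any x > 0, with x_tilde = D^{1/2} x:
#   Sigma x - b/x = D^{1/2} (C x_tilde - b/x_tilde),
# so x solves the original system iff x_tilde solves the correlation one.
x = rng.uniform(0.5, 1.5, n)
x_tilde = np.sqrt(d) * x
lhs = Sigma @ x - b / x
rhs = np.sqrt(d) * (C @ x_tilde - b / x_tilde)
print(np.allclose(lhs, rhs))  # True
```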

Exercise 11.2 (Naive diagonal risk parity portfolio) If the covariance matrix is diagonal, \(\bSigma = \bm{D}\), then the system of nonlinear equations \(\bSigma\bm{x} = \bm{b}/\bm{x}\) has the closed-form solution \(\bm{x} = \sqrt{\bm{b}/\textm{diag}(\bm{D})}\). Explore whether a closed-form solution can be obtained for the rank-one plus diagonal case \(\bSigma = \bm{u}\bm{u}^\T + \bm{D}\).
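A minimal NumPy sketch (random data, naming ours) verifying the diagonal closed form elementwise:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4
d = rng.uniform(0.1, 1.0, n)       # diag(D), i.e., the asset variances
b = rng.uniform(0.1, 1.0, n)
b /= b.sum()                       # risk-budget vector summing to one

# Closed form for diagonal Sigma = D: x = sqrt(b / diag(D)).
x = np.sqrt(b / d)

# Verify the risk parity system D x = b / x holds elementwise.
print(np.allclose(d * x, b / x))   # True
```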

Exercise 11.3 (Vanilla convex risk parity portfolio) The solution to the formulation \[ \begin{array}{ll} \underset{\bm{x}\ge\bm{0}}{\textm{maximize}} & \bm{b}^\T\log(\bm{x})\\ \textm{subject to} & \sqrt{\bm{x}^\T\bSigma\bm{x}} \le \sigma_0 \end{array} \] satisfies the optimality condition \[\lambda\bSigma\bm{x} = \sqrt{\bm{x}^\T\bSigma\bm{x}}\,\left(\bm{b}/\bm{x}\right),\] where \(\lambda\) is the Lagrange multiplier of the volatility constraint. Can you solve for \(\lambda\) and rewrite the solution in a more compact way without \(\lambda\)?
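To get a feel for the optimality condition, one can solve the formulation numerically and check that the elementwise ratio \(\left(\sqrt{\bm{x}^\T\bSigma\bm{x}}\;\bm{b}/\bm{x}\right)/(\bSigma\bm{x})\) is constant, equal to \(\lambda\). A minimal sketch, assuming the CVXPY package is available (the volatility constraint is rewritten as a second-order cone via a Cholesky factor; data and naming are ours):

```python
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(1)
n = 5
A = rng.standard_normal((n, n))
Sigma = A @ A.T + n * np.eye(n)     # positive-definite covariance
b = np.full(n, 1.0 / n)             # risk-budget vector
sigma0 = 1.0

# maximize b' log(x)  s.t.  sqrt(x' Sigma x) <= sigma0, written as the
# SOC constraint ||L' x|| <= sigma0 with Sigma = L L' (Cholesky).
L = np.linalg.cholesky(Sigma)
x = cp.Variable(n, pos=True)
prob = cp.Problem(cp.Maximize(b @ cp.log(x)),
                  [cp.norm(L.T @ x) <= sigma0])
prob.solve()

# At the optimum, lambda * Sigma x = sqrt(x' Sigma x) * (b/x), so this
# elementwise ratio should be (nearly) constant, equal to lambda.
xv = x.value
vol = np.sqrt(xv @ Sigma @ xv)
print(vol * (b / xv) / (Sigma @ xv))  # approximately constant vector
```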

Exercise 11.4 (Newton's method) Newton's method requires computing the direction \(\bm{d} = \mathsf{H}^{-1}\nabla f\) or, equivalently, solving the system of linear equations \(\mathsf{H}\,\bm{d} = \nabla f\) for \(\bm{d}\). Explore whether a more efficient solution is possible by exploiting the structure of the gradient and Hessian: \[ \begin{aligned} \nabla f &= \bSigma\bm{x} - \bm{b}/\bm{x},\\ \mathsf{H} &= \bSigma + \textm{Diag}(\bm{b}/\bm{x}^2). \end{aligned} \]
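As a baseline for the exploration, the direction can be obtained with a generic Cholesky solve of \(\mathsf{H}\,\bm{d} = \nabla f\), which costs \(\bigO(n^3)\); the question is whether the special structure of \(\mathsf{H}\) (a fixed matrix plus a diagonal that changes with \(\bm{x}\)) allows something cheaper. A minimal sketch of the baseline, assuming NumPy and SciPy (data and naming ours):

```python
import numpy as np
from scipy.linalg import cho_factor, cho_solve

rng = np.random.default_rng(2)
n = 5
A = rng.standard_normal((n, n))
Sigma = A @ A.T + n * np.eye(n)     # positive-definite covariance
b = np.full(n, 1.0 / n)             # risk-budget vector
x = np.full(n, 1.0 / np.sqrt(n))    # current iterate, x > 0

grad = Sigma @ x - b / x            # gradient of f at x
H = Sigma + np.diag(b / x**2)       # Hessian of f at x

# Generic approach: factor H once (Cholesky, O(n^3)) and solve H d = grad,
# instead of forming H^{-1} explicitly.
d = cho_solve(cho_factor(H), grad)
print(np.allclose(H @ d, grad))     # True
```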

Exercise 11.5 (MM algorithm) The MM algorithm requires the largest eigenvalue \(\lambda_\textm{max}\) of the matrix \(\bSigma\), which can be obtained from its eigenvalue decomposition. A more efficient alternative is the power iteration method. Program both methods and compare their computational cost.
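A minimal NumPy sketch of the power iteration method, checked against a full eigendecomposition (the function name, tolerances, and stopping rule are ours):

```python
import numpy as np

def power_iteration(Sigma, num_iter=1000, tol=1e-12, seed=0):
    """Largest eigenvalue of a symmetric PSD matrix via power iteration."""
    rng = np.random.default_rng(seed)
    v = rng.standard_normal(Sigma.shape[0])
    v /= np.linalg.norm(v)
    lam = 0.0
    for _ in range(num_iter):
        w = Sigma @ v
        lam_new = v @ w                   # Rayleigh quotient estimate
        v = w / np.linalg.norm(w)
        if abs(lam_new - lam) <= tol * max(1.0, abs(lam_new)):
            return lam_new
        lam = lam_new
    return lam

rng = np.random.default_rng(3)
n = 200
A = rng.standard_normal((n, n))
Sigma = A @ A.T                           # symmetric PSD test matrix

lam_pi = power_iteration(Sigma)           # O(n^2) per iteration
lam_eig = np.linalg.eigvalsh(Sigma)[-1]   # full decomposition, O(n^3)
print(np.isclose(lam_pi, lam_eig))        # True (up to the tolerance)
```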

Exercise 11.6 (Coordinate descent vs. SCA methods) Consider the vanilla convex formulation \[ \begin{array}{ll} \underset{\bm{x}\ge\bm{0}}{\textm{minimize}} & \frac{1}{2}\bm{x}^\T\bSigma\bm{x} - \bm{b}^\T\log(\bm{x}). \end{array} \] Implement the cyclical coordinate descent method and the parallel SCA method in a high-level programming language (e.g., R, Python, Julia, or MATLAB) and compare their convergence versus CPU time. Then re-implement the two methods in a low-level programming language (e.g., C, C++, C#, or Rust) and compare the convergence again. Comment on the differences observed.
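For reference, one possible form of the cyclical coordinate update: minimizing the objective over \(x_i\) alone, with the other coordinates fixed, reduces to the positive root of the quadratic \([\bSigma]_{ii}x_i^2 + c_i x_i - b_i = 0\) with \(c_i = \sum_{j\neq i}[\bSigma]_{ij}x_j\). A minimal NumPy sketch (naming, initialization, and iteration count are ours):

```python
import numpy as np

def risk_parity_cd(Sigma, b, num_cycles=200):
    """Cyclical coordinate descent for min 0.5 x'Sigma x - b' log(x).

    Each coordinate minimization solves sigma_ii x_i^2 + c_i x_i - b_i = 0
    with c_i = sum_{j != i} sigma_ij x_j; we take the positive root.
    """
    n = len(b)
    x = np.full(n, 1.0 / np.sqrt(n))
    for _ in range(num_cycles):
        for i in range(n):
            c = Sigma[i] @ x - Sigma[i, i] * x[i]
            x[i] = (-c + np.sqrt(c**2 + 4 * Sigma[i, i] * b[i])) \
                   / (2 * Sigma[i, i])
    return x

rng = np.random.default_rng(4)
n = 5
A = rng.standard_normal((n, n))
Sigma = A @ A.T + n * np.eye(n)
b = np.full(n, 1.0 / n)

x = risk_parity_cd(Sigma, b)
rc = x * (Sigma @ x)          # risk contributions (up to normalization)
print(rc / rc.sum())          # approximately equal to b
```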