9.3 Methods for mapping prevalence to the optimal policy

When donor groups are identical except for the number of donors ( $n_i$ ) and the prevalence of each TTI ( $p_{ik}$ ), a $k$ -dimensional function $f(\textbf{p})$ mapping the prevalence by TTI to the optimal policy can be defined as follows:

$(\textbf{m}, \textbf{a}, z)^* = f(\textbf{p}), \quad \text{where} \text{ } f(\textbf{p}) = {\arg\min}_{\textbf{m}, \textbf{a}, z} \mathbb{E}[\mathcal{C}(z, \textbf{m}, \textbf{a}, \textbf{p}) ].$

For notational convenience, we define a policy $\pi$ as a set of decision variables $\pi = (z, \textbf{m}, \textbf{a})$ . For a given policy and a single donor group, expected cost as a function of prevalence is:

$\mathbb{E}[\mathcal{C}(\textbf{p}) \mid \pi] = (1- z)d + z \Big[w + \textbf{a}^\top\boldsymbol{\phi} + \textbf{m}^\top \boldsymbol{\psi} + \big(1- s_2\big)\big(\textbf{v}_1 \circ \textbf{v}_3 \circ \textbf{p} \big)^\top \textbf{c} + gs_2(\textbf{a},\textbf{p}) \Big],$

where vectors $\textbf{v}_1$ and $\textbf{v}_3$ are functions of $\pi$ and the constant $s_2$ is a function of both $\pi$ and $\textbf{p}$ . By grouping terms that are constant in $\textbf{p}$ , this can be rewritten as

$\mathbb{E}[\mathcal{C}(\textbf{p}) \mid \pi] = \alpha(\pi) + \big(1- s_2(\textbf{a},\textbf{p})\big) \beta(\pi) \textbf{p}+ gs_2(\textbf{a},\textbf{p}) \Big],$

where $\alpha(\pi) = (1- z)d + z \Big[w + \textbf{a}^\top\boldsymbol{\phi} + \textbf{m}^\top \boldsymbol{\psi} \Big]$ and $\beta(\pi) = z (\textbf{v}_1 \circ \textbf{v}_3 \circ \textbf{c} )^\top$ .

For a policy $\pi$ , we define its ‘policy region’ as the $k$ -dimensional set of prevalence values for which $\mathbb{E}[\mathcal{C}(\textbf{p}) \mid \pi] = \min_{\pi'} \mathbb{E}[\mathcal{C}(\textbf{p}) \mid \pi']$ . For two policies $\pi_1$ and $\pi_2$ , the $k$ -dimensional ‘pairwise decision boundary’ (PDB) between their respective policy regions is the solution to $\mathbb{E}[\mathcal{C}(\textbf{p}) \mid \pi_1] = \mathbb{E}[\mathcal{C}(\textbf{p}) \mid \pi_2]$ . Policy regions can be identified through the following procedure:

Solve for the PDB for each unique pairing of policies ( $0.5T(T-1)$ pairs)
Divide the prevalence space into mutually exclusive and collectively exhaustive PBD-regions, indexed by $g$ , and identify $\Lambda_g$ , the optimal policy for each region, by evaluating any point in the region ( $\Lambda_g = {\arg\min}_{\Lambda} \mathbb{E}[\mathcal{C}(\textbf{p} \mid \Lambda) ]$ for any $\textbf{p}$ in PDB region $g$ ).
Aggregate PBDs for which $\Lambda_g = \Lambda_t$ into policy region $\Lambda_t$ .

Alternatively, decision boundaries could be approximated through grid search or other search algorithms.