Chapter 3 Single tree model
3.1 Model specification
Define the following model. Suppose we have the observation of a tree node as: yij,i=1,…,nj,j=1…,m where yij is observation i in group j. There are different numbers of observations nj in each group, so for example n1 is the number of observations in group 1, etc. There are m groups. The total number of observations is n=∑mj=1nj
Then, for each tree node, suppose we have the likelihood: yij∼N(μj,τ−1)
so each group has an overall mean μj, with an overall precision term τ.
We then have a hierarchical prior distribution:
μj∼N(μ,k1(τ−1))
where k1 will be taken as a constant for simplicity, and the hyper-parameter prior distributions are:
μ∼N(0,τμ=k2(τ−1)) τ∼Ga(α,β)
Where the values k1,k2,α,β are all fixed.
3.2 Maths
- The likelihood of each tree node will be:
L=m∏j=1nj∏i=1p(yij|μj,τ)L∝τn/2exp{−τ2m∑j=1nj∑i=1(yij−μj)2}
with prior distributions:
- μj|μ,τ,k1
p(μ1,…,μm|μ,τ)∝(τ/k1)m/2exp{−τ2k1m∑j=1(μj−μ)2}
- τ|α,β
p(τ|α,β)∝τα−1exp{−βτ}
- μ|τμ=k2(τ−1)
p(μ|k2,τ)∝(τ/k2)1/2exp{−τ2k2μ2}}
and their joint distribution as:
- p(τ,μ1,…,μm,μ|y,k1,k2,τ,α,β)
p(τ,μ1,…,μm,μ|y,k1,k2,τ,α,β)∝τα−1exp{−βτ}×(τ/k1)m/2exp{−τ2k1m∑j=1(μj−μ)2}×(τ/k2)1/2exp{−τ2k2μ2}×τn/2exp{−τ2m∑j=1nj∑i=1(yij−μj)2}