Chapter 3 Regression estimation

The relation of two random variables X and Y can be completely characterized by their joint cdf F, or equivalently, by the joint pdf f if (X,Y) is continuous, the case we will address. In the regression setting, we are interested in predicting/explaining the response Y by means of the predictor X from a sample (X1,Y1),,(Xn,Yn). The role of the variables is not symmetric: X is used to predict/explain Y.

The complete knowledge of Y when X=x is given by the conditional pdf: fY|X=x(y)=f(x,y)fX(x). While this pdf provides full knowledge about Y|X=x, it is also a challenging task to estimate it: for each x we have to estimate a curve! A simpler approach, yet still challenging, is to estimate the conditional mean (a scalar) for each x. This is the so-called regression function8

m(x):=E[Y|X=x]=ydFY|X=x(y)=yfY|X=x(y)dy.

Thus we aim to provide information about Y’s expectation, not distribution, by X.

Finally, recall that Y can expressed in terms of m by means of the location-scale model:

Y=m(X)+σ(X)ε,

where σ2(x):=Var[Y|X=x] and ε is independent from X and such that E[ε]=0 and Var[ε]=1.


  1. Recall that we assume that (X,Y) is continuous.↩︎