Chapter 3 Regression estimation
The relation of two random variables X and Y can be completely characterized by their joint cdf F, or equivalently, by the joint pdf f if (X,Y) is continuous, the case we will address. In the regression setting, we are interested in predicting/explaining the response Y by means of the predictor X from a sample (X1,Y1),…,(Xn,Yn). The role of the variables is not symmetric: X is used to predict/explain Y.
The complete knowledge of Y when X=x is given by the conditional pdf: fY|X=x(y)=f(x,y)fX(x). While this pdf provides full knowledge about Y|X=x, it is also a challenging task to estimate it: for each x we have to estimate a curve! A simpler approach, yet still challenging, is to estimate the conditional mean (a scalar) for each x. This is the so-called regression function8
m(x):=E[Y|X=x]=∫ydFY|X=x(y)=∫yfY|X=x(y)dy.
Thus we aim to provide information about Y’s expectation, not distribution, by X.
Finally, recall that Y can expressed in terms of m by means of the location-scale model:
Y=m(X)+σ(X)ε,
where σ2(x):=Var[Y|X=x] and ε is independent from X and such that E[ε]=0 and Var[ε]=1.
Recall that we assume that (X,Y) is continuous.↩︎