Chapter 8 Specifying Models
In addition to its role in limiting endogeneity, model specification plays an important role in modeling in economics. The defining characteristic of economic reasoning is marginalism so having functional forms that make marginal sense is important. In this chapter we will learn to estimate some important functional forms in economics.
We will use the following libraries.
library(tidyverse)
library(broom)
library(estimatr)
data(mtcars)
8.1 Polynomial Models
Estimating a model of the form y=β0+β1x+β2x2+β3x3+⋯+βkxk+ϵ is straightforward in R. There is no need to create new variable within the data frame since the variables can be created directly as arguments within a function, e.g., lm
.
Create a scatter diagram of miles per gallon vs horse power from the mtcars built in data set.
p <-%>%
mtcars ggplot(aes(x= hp, y = mpg)) +
geom_point() +
labs(y = "Mile per Gallon", x = "Horse Power", title = "Scatter Diagram") +
theme(plot.title = element_text(hjust = 0.5))
p
It appears as if mpg falls at a diminishing rate as horse power increases. There are a number of functional forms that will gives us that basic shape. One such form is quadratic. Let’s estimate a quadratic form of the model mpg=β0+β1hp+β2hp+ϵ
%>%
mtcars lm(mpg ~ hp + I(hp^2), .) # uses the AsIs function I() to create hp squared
Call:
lm(formula = mpg ~ hp + I(hp^2), data = .)
Coefficients:
(Intercept) hp I(hp^2)
40.409117 -0.213308 0.000421
Add the quadratic fit to the scatter diagram
+
p geom_smooth(method = "lm", formula = y ~ x + I(x^2), se = F)
Adding higher order polynomials is accomplished by adding additional AsIs functions I()
to the model.
8.2 Logarithmic
Logarithms have a wide variety of uses in Econometrics.
8.2.1 Constant Elasticity (log-log or double log)
Log-log models are of the form lny=β0+β1lnx. So β1=dlnydlnx
Recall that dlnX=dXX; that is, the change in the logarithm is a percent change in a variable. For example, ΔlnP is the % change in P.
So, β1 is the percent change in y resultant from a 1% change in x. Or, a one percent change in x will induce a β1 percent change in y.
Constant elasticity demand functions are estimated using log-log models. Let, q=αpβ where β<0 be the demand function. Recall he elasticity of demand is given by η=%Δq%Δp=dq/qdp/p=dqdppq. The elasticity of demand for our demand function is η=βαpβ−1pq=βαpβq Substitute for q=αpβ η=βαpβαpβ=β The elasticity of demand is β which is invariant. To make q=αpβ estimable with OLS take the logarithm of both sides: ln(q)=ln(αpβϵ) ln(q)=ln(α)+ln(pβ)+ln(ϵ) ln(q)=γ+βln(p)+δ is now in estimable form and can estimated by lm(log(q) ~ log(p), data = df)
Cobb-Douglas production functions are also estimated as log-log models. Let’s estimate a Cobb-Douglas model for horsepower as a function of number of cylinders and displacement in cubic inches from the mtcars data. The production function is: hp=β0cylβ1dispβ2ϵ The estimable form of the model is lnhp=α0+β1lncyl+β2lndisp+γ
%>%
mtcars lm(log(hp) ~ log(cyl) + log(disp), .)
Call:
lm(formula = log(hp) ~ log(cyl) + log(disp), data = .)
Coefficients:
(Intercept) log(cyl) log(disp)
1.851 0.817 0.299
ˆβ1 indicates that a 1% increase in the number of cylinders, ceteris paribus, increases horsepower by 0.8170%. ˆβ2 indicates that a 1% increase in the displacement, ceteris paribus increases horsepower by 0.2986%.
8.2.2 Constant Growth in a Dependent Variable (log-lin or semilog)
The log-lin model has the form lny=β0+β1x. β1=dlnydx or the percent change in y resultant from a 1 unit change in x. Or, a unit change in x will induce a 100∗β1 percent change in y.
Suppose we have a variable P that is growing at a constant rate of g per period t such that Pt=(1+g)Pt−1. By repeated substitution we get Pt=P0(1+g)t where P0 is the initial value of Y.^[Note the relationship to the time value of money. Let FV=PV(1+r)t. Suppose we’d like to calculate the average annual rate of return, r, on $PV, given a future value of $FV. After appropriate algebraic gymnastics, r=t√FVPV−1. We could calculate g without regression as g=t√PtP0−1. We could calculate g with a regression with the aid of logarithms. Let the model be Pt=P0(1+g)tϵ. Taking the logarithm of both sides yields ln(Pt)=ln(P0)+tln(1+g)+ln(ϵ) Which we can estimate as Yt=β0+β1t+ut Where β0=ln(P0), ut=ln(ϵt), and, most importantly, β1=ln(1+g). We can get our estimate of g from ˆβ1=^ln(1+g) by exponentiating both sides and solving for g. eˆβ1=e^ln(1+g) ˆg=eˆβ1−122
Suppose we’d like to know the percent change in fuel mileage resultant from adding a cylinder to the car using the mtcars data. Estimate the model lnmpg=β0+β1cyl+ϵ
%>%
mtcars lm(log(mpg) ~ cyl, .)
Call:
lm(formula = log(mpg) ~ cyl, data = .)
Coefficients:
(Intercept) cyl
3.839 -0.143
ˆβ1 indicates that adding one cylinder reduces fuel mileage by 14.25%.
8.2.3 Constant % Change in Dependent Variable (lin-log)
The lin-log model has the form y=β0+β1lnx. β1=dydlnx or the change in y from a 1 percent change in x. Or, one percent change in x will induce a β1100 unit change in y.
Suppose we’d like to know the change in the number of miles per gallon as a function of the percent change in displacement. Estimate the model mpg=β0+beta1lndisp+ϵ
%>%
mtcars lm(mpg ~ log(disp), .)
Call:
lm(formula = mpg ~ log(disp), data = .)
Coefficients:
(Intercept) log(disp)
69.21 -9.29
ˆβ1 indicates that a one percent change in displacement will decrease fuel mileage by .093 mpg.
8.3 Other Useful Functional Forms
We know that marginal analysis is the sine qua non of economic reasoning. So our economic models should be built with an eye toward what we think the marginal relationship should look like from theory. Use a sort “a poor man’s” differential equations to work from a marginal model to the functional form. For example, consider production as a function of labor holding capital constant Q=Q(ˉK,L) with K fixed at ˉK. From microeconomic theory we know that the marginal product of labor, MPL=dQdl at some point diminishes. This suggests MPL=β1+β2L+β3L2 where β1≥0,β2>0, and β3<0 as the model product of labor. The production function that will yield this marginal product is Q=β0+β1L+β2L2+β3L3. Thus the usefulness of a polynomial functional form in economics.
There are other functional forms that will yield diminishing marginal functions. For example, another functional form that will yield a diminishing marginal is a rectangular hyperbola y=β0+β1x. The marginal effect of x on y is dydx=−β1x2. A one unit change in X leads to a 1x change in y. Notice as x increases the change in y decreases, i.e., diminishing marginal.
The plot of miles per gallon vs. horsepower above suggests a negative diminishing relationship between the variables. Suppose we postulate that miles per gallon is a function of the inverse of horsepower mpg=β0+β1hp+ϵ.
%$%
mtcars lm(mpg ~ I(1/hp))
Call:
lm(formula = mpg ~ I(1/hp))
Coefficients:
(Intercept) I(1/hp)
9.43 1259.88
Add the fitted model to scatter diagram.
+
p geom_smooth(method = lm, formula = y ~ I(1/x), se = F)
We know that β1=d(lnPt)dt. Since d(lnPt)=dPtPt or the percent change in Pt, β1 is known as the instantaneous rate of growth.↩︎