 X Z 

advertisement
BINARY CHOICE MODELS: PROBIT ANALYSIS
1.00
F (Z )
0.4
p  F (Z )
0.3
0.50
0.2
0.25
Marginal effect
Cumulative effect
0.75
0.1
0.00
0
-3
-2
-1
0
1
2
Z
Z   1   2 X 2  ...   k X k
In the case of probit analysis, the sigmoid function is the cumulative standardized normal
distribution.
1
BINARY CHOICE MODELS: PROBIT ANALYSIS
1.00
1
1 2Z 2
f (Z ) 
e
2
0.3
0.50
0.2
0.25
Marginal effect
Cumulative effect
0.75
0.4
0.1
0.00
0
-3
-2
-1
0
1
2
Z
Z   1   2 X 2  ...   k X k
The maximum likelihood principle is again used to obtain estimates of the parameters.
2
BINARY CHOICE MODELS: PROBIT ANALYSIS
. probit GRAD ASVABC SM SF MALE
Iteration
Iteration
Iteration
Iteration
Iteration
0:
1:
2:
3:
4:
log
log
log
log
log
likelihood
likelihood
likelihood
likelihood
likelihood
Probit estimates
Log likelihood = -96.624926
=
=
=
=
=
-118.67769
-98.195303
-96.666096
-96.624979
-96.624926
Number of obs
LR chi2(4)
Prob > chi2
Pseudo R2
=
=
=
=
540
44.11
0.0000
0.1858
-----------------------------------------------------------------------------GRAD |
Coef.
Std. Err.
z
P>|z|
[95% Conf. Interval]
-------------+---------------------------------------------------------------ASVABC |
.0648442
.0120378
5.39
0.000
.0412505
.0884379
SM | -.0081163
.0440399
-0.18
0.854
-.094433
.0782004
SF |
.0056041
.0359557
0.16
0.876
-.0648677
.0760759
MALE |
.0630588
.1988279
0.32
0.751
-.3266368
.4527544
_cons | -1.450787
.5470608
-2.65
0.008
-2.523006
-.3785673
------------------------------------------------------------------------------
Here is the result of the probit regression using the example of graduating from high
school.
3
BINARY CHOICE MODELS: PROBIT ANALYSIS
. probit GRAD ASVABC SM SF MALE
Iteration
Iteration
Iteration
Iteration
Iteration
0:
1:
2:
3:
4:
log
log
log
log
log
likelihood
likelihood
likelihood
likelihood
likelihood
Probit estimates
Log likelihood = -96.624926
=
=
=
=
=
-118.67769
-98.195303
-96.666096
-96.624979
-96.624926
Number of obs
LR chi2(4)
Prob > chi2
Pseudo R2
=
=
=
=
540
44.11
0.0000
0.1858
-----------------------------------------------------------------------------GRAD |
Coef.
Std. Err.
z
P>|z|
[95% Conf. Interval]
-------------+---------------------------------------------------------------ASVABC |
.0648442
.0120378
5.39
0.000
.0412505
.0884379
SM | -.0081163
.0440399
-0.18
0.854
-.094433
.0782004
SF |
.0056041
.0359557
0.16
0.876
-.0648677
.0760759
MALE |
.0630588
.1988279
0.32
0.751
-.3266368
.4527544
_cons | -1.450787
.5470608
-2.65
0.008
-2.523006
-.3785673
------------------------------------------------------------------------------
As with logit analysis, the coefficients have no direct interpretation. However, we can use
them to quantify the marginal effects of the explanatory variables on the probability of
graduating from high school.
4
BINARY CHOICE MODELS: PROBIT ANALYSIS
p  F (Z )
Z   1   2 X 2  ... k X k
 1  12 Z 2 
p dp Z
  i

 f ( Z )  i  
e
X i dZ X i
 2

As with logit analysis, the marginal effect of Xi on p can be written as the product of the
marginal effect of Z on p and the marginal effect of Xi on Z.
5
BINARY CHOICE MODELS: PROBIT ANALYSIS
p  F (Z )
Z   1   2 X 2  ... k X k
1
dp
1 2Z 2
f (Z ) 

e
dZ
2
 1  12 Z 2 
p dp Z
  i

 f ( Z )  i  
e
X i dZ X i
 2

The marginal effect of Z on p is given by the standardized normal distribution. The marginal
effect of Xi on Z is given by i.
6
BINARY CHOICE MODELS: PROBIT ANALYSIS
p  F (Z )
Z   1   2 X 2  ... k X k
1
dp
1 2Z 2
f (Z ) 

e
dZ
2
 1  12 Z 2 
p dp Z
  i

 f ( Z )  i  
e
X i dZ X i
 2

As with logit analysis, the marginal effects vary with Z. A common procedure is to evaluate
them for the value of Z given by the sample means of the explanatory variables.
7
BINARY CHOICE MODELS: PROBIT ANALYSIS
. sum GRAD ASVABC SM SF MALE
Variable |
Obs
Mean
Std. Dev.
Min
Max
-------------+-------------------------------------------------------GRAD |
540
.9425926
.2328351
0
1
ASVABC |
540
51.36271
9.567646
25.45931
66.07963
SM |
540
11.57963
2.816456
0
20
SF |
540
11.83704
3.53715
0
20
MALE |
540
.5
.5004636
0
1
As with logit analysis, the marginal effects vary with Z. A common procedure is to evaluate
them for the value of Z given by the sample means of the explanatory variables.
8
BINARY CHOICE MODELS: PROBIT ANALYSIS
Probit: Marginal Effects
mean
b
product
f(Z)
f(Z)b
ASVABC
51.36
0.065
3.328
0.068
0.004
SM
11.58
–0.008
–0.094
0.068
–0.001
SF
11.84
0.006
0.066
0.068
0.000
MALE
0.50
0.063
0.032
0.068
0.004
constant
1.00
–1.451
–1.451
Total
Z   1   2 X 2  ... k X k
 1.881
1.881
In this case Z is equal to 1.881 when the X variables are equal to their sample means.
9
BINARY CHOICE MODELS: PROBIT ANALYSIS
Probit: Marginal Effects
mean
b
product
f(Z)
f(Z)b
ASVABC
51.36
0.065
3.328
0.068
0.004
SM
11.58
–0.008
–0.094
0.068
–0.001
SF
11.84
0.006
0.066
MALE
0.50
0.063
0.032
constant
1.00
–1.451
–1.451
Total
1  12 Z 2
0.000
f ( Z0.068
)
e
 0.068
2
0.068
0.004
1.881
We then calculate f(Z).
10
BINARY CHOICE MODELS: PROBIT ANALYSIS
Probit: Marginal Effects
mean
b
product
f(Z)
f(Z)b
ASVABC
51.36
0.065
3.328
0.068
0.004
SM
11.58
–0.008
–0.094
0.068
–0.001
SF
11.84
0.006
0.066
0.068
0.000
MALE
0.50
0.063
0.032
0.068
0.004
constant
1.00
–1.451
–1.451
Total
1.881
p dp Z

 f ( Z ) i
X i dZ X i
The estimated marginal effects are f(Z) multiplied by the respective coefficients. We see
that a one-point increase in ASVABC increases the probability of graduating from high
school by 0.4 percent.
11
BINARY CHOICE MODELS: PROBIT ANALYSIS
Probit: Marginal Effects
mean
b
product
f(Z)
f(Z)b
ASVABC
51.36
0.065
3.328
0.068
0.004
SM
11.58
–0.008
–0.094
0.068
–0.001
SF
11.84
0.006
0.066
0.068
0.000
MALE
0.50
0.063
0.032
0.068
0.004
constant
1.00
–1.451
–1.451
Total
1.881
p dp Z

 f ( Z ) i
X i dZ X i
Every extra year of schooling of the mother decreases the probability of graduating by 0.1
percent. Father's schooling has no discernible effect. Males have 0.4 percent higher
probability than females.
12
BINARY CHOICE MODELS: PROBIT ANALYSIS
Logit
Probit
Linear
f(Z)b
f(Z)b
b
0.004
0.004
0.007
SM
–0.001
–0.001
–0.002
SF
0.000
0.000
0.001
MALE
0.004
0.004
–0.007
ASVABC
The logit and probit results are displayed for comparison. The coefficients in the
regressions are very different because different mathematical functions are being fitted.
13
BINARY CHOICE MODELS: PROBIT ANALYSIS
Logit
Probit
Linear
f(Z)b
f(Z)b
b
0.004
0.004
0.007
SM
–0.001
–0.001
–0.002
SF
0.000
0.000
0.001
MALE
0.004
0.004
–0.007
ASVABC
Nevertheless the estimates of the marginal effects are usually similar.
14
BINARY CHOICE MODELS: PROBIT ANALYSIS
Logit
Probit
Linear
f(Z)b
f(Z)b
b
0.004
0.004
0.007
SM
–0.001
–0.001
–0.002
SF
0.000
0.000
0.001
MALE
0.004
0.004
–0.007
ASVABC
However, if the outcomes in the sample are divided between a large majority and a small
minority, they can differ.
15
BINARY CHOICE MODELS: PROBIT ANALYSIS
Logit
Probit
Linear
f(Z)b
f(Z)b
b
0.004
0.004
0.007
SM
–0.001
–0.001
–0.002
SF
0.000
0.000
0.001
MALE
0.004
0.004
–0.007
ASVABC
This is because the observations are then concentrated in a tail of the distribution.
Although the logit and probit functions share the same sigmoid outline, their tails are
somewhat different.
16
BINARY CHOICE MODELS: PROBIT ANALYSIS
Logit
Probit
Linear
f(Z)b
f(Z)b
b
0.004
0.004
0.007
SM
–0.001
–0.001
–0.002
SF
0.000
0.000
0.001
MALE
0.004
0.004
–0.007
ASVABC
This is the case here, but even so the estimates are identical to three decimal places.
According to a leading authority, Amemiya, there are no compelling grounds for preferring
logit to probit or vice versa.
17
BINARY CHOICE MODELS: PROBIT ANALYSIS
Logit
Probit
Linear
f(Z)b
f(Z)b
b
0.004
0.004
0.007
SM
–0.001
–0.001
–0.002
SF
0.000
0.000
0.001
MALE
0.004
0.004
–0.007
ASVABC
Finally, for comparison, the estimates for the corresponding regression using the linear
probability model are displayed.
18
BINARY CHOICE MODELS: PROBIT ANALYSIS
Logit
Probit
Linear
f(Z)b
f(Z)b
b
0.004
0.004
0.007
SM
–0.001
–0.001
–0.002
SF
0.000
0.000
0.001
MALE
0.004
0.004
–0.007
ASVABC
If the outcomes are evenly divided, the LPM coefficients are usually similar to those for logit
and probit. However, when one outcome dominates, as in this case, they are not very good
approximations.
19
Binary Response Models:
Interpretation II
• Probit: g(0)=.4
• Logit: g(0)=.25
• Linear probability model: g(0)=1
– To make the logit and probit slope estimates
comparable, we can multiply the probit
estimates by .4/.25=1.6.
– The logit slope estimates should be divided by
4 to make them roughly comparable to the
LPM (Linear Probability Model) estimates.
Download