Economics 105 Mark C. Foley Statistics Jan - May 2005

advertisement
Economics 105
Statistics
Mark C. Foley
Jan - May 2005
Modeling Presentations
The goal of this group project is to develop a regression model to answer one of the research
questions below. You will not be estimating the regression. This is a modeling exercise. You
should use economic theory, peer-reviewed literature, and your own intuition and knowledge to
build a well-specified econometric model of the following form
(1)
Yi = β 0 + β 1 X 1i + β 2 X 2i + ... + β K X Ki + ε i
where Y is the dependent variable, whose variation you are trying to explain with the independent
(explanatory) variables, X 1 , X 2 ,..., X K , and ε represents random error. The subscript i indexes
observations on the unit of analysis, which can be individuals, states, cities, countries, or firms
(called cross-sectional data), depending on the research question, or it could index a time period
such as months, quarters, or years if the research question is estimating a relationship over time
(called time-series data).
Note that the generic model above does not specify the full number of explanatory variables (the
X’s); it uses K to indicate the last explanatory variable. That is what you are to determine for
your chosen research question. In other words, you decide how many, and which, explanatory
variables to include.
The model is a multiple regression. Typically, there is one “variable of interest” in a research
paper. A simple regression is not adequate to capture the effect of that “variable of interest” on
the outcome since it, by virtue of having only 1 explanatory variable, does not control for other
factors which influence the dependent variable. In other words, a simple regression does not
impose the idea of ceteris paribus. A multiple regression model does. You are interested in the
effect of one variable, the “variable of interest,” on the dependent variable, holding all other
relevant factors constant. In particular, you, as the researcher, are interested in the sign,
magnitude, statistical significance, and economic significance of the estimated coefficient on the
variable of interest, but you can’t address these without actually estimating the model.
1
Requirements
The assignment is to give a 13-minute oral presentation using Powerpoint in which you present
and justify an econometric model to answer your research question. There are 8 groups and 4
topics. I’ve allocated two class periods to the presentations and discussion, so there will be 4
presentations per day, and groups with the same topic will present on the same day. The nonpresenting groups will vote on the “winner”, i.e., the group that they feel had the best
presentation and model. The four winning groups will receive a grade bonus. In addition,
groups must ask at least one question of the presenters according to the following: group 1 asks
8, 2 asks 7, 3 asks 6, 4 asks 5, 5 asks 1, 6 asks 2, 7 asks 3, and 8 asks 4. You can decide before
class who will ask a question. The quality of the questions will factor into your group’s grade.
One, two, three, or all group members may participate in the presentation. All group members
should participate in developing the content for the slides. There will be one grade for the whole
group. Please bring a printout of your slides (printed two per page, front and back) to class on
the day of the presentation, and given it to me before you start. The grade for the assignment
will be based on the clarity, content, and correctness of the presentation and the slides.
Research Questions
You will be assigned to a group and one of these research questions.
(A)
(B)
(C)
(D)
What is the effect of more police on crime?
Does smaller class size lead to better student performance?
What is the effect of
on child mortality?
What is the effect of children on a female labor market outcome?
I. Motivation (1-2 slides)
Introduce and motivate the topic by stating the specific research question and why it is
important. Incorporate “big picture” stylized facts or statistics to help place your study in
context and provide motivation for asking the research question. For example, if you are
modeling the effect of class size on student performance, it would be useful to find data on the
average class size and how it has been changing over time. Or if you were looking at the effect
of welfare on employment, for example, it would be useful for the reader to know the total
amount of money the government spends on welfare (last year, in an average year, or for some
reasonable and relevant time frame) and what percent of the budget that represents. Additional
motivation can be provided by discussing the public policy implications of having accurate
answers to the question.
II. Literature Review (2-3 slides)
Have other people studied this issue or one related to it (in particular, search the peer-reviewed
literature with broad enough key words to find appropriate studies), and what conclusions did
they reach? Be sure to cite all authors of a publication, not just the lead author. It is not
2
sufficient to read only the abstract and cite the whole article. You should have 4 – 8 peerreviewed articles in this section.
Create a table which notes the following for each paper:
1. Author(s), year of publication, and journal (see list below for abbreviations of main
journals)
2. Data used (including sample size and description of the sample group)
3. Time period – what year is outcome measured, what year are key explanatory variables
measured.
4. Definition of outcome variable
5. Methodology (OLS, GLS, 2SLS, IV, probit, logit, etc.) I realize that you only know
OLS.
6. List of independent variables included in the regressions
7. Coefficient estimate & standard error or t-stat for the variable of interest
See pages 1845 - 1849 and 1858 – 1862 and 1864 - 1868 of Haveman and Wolfe (1995), “The
Determinants of Children’s Attainments,” Journal of Economic Literature, vol. 33, pp. 1829 –
1878. There is a copy of this paper in the P: space you use for lab.
III. Model Development (about 3 slides)
Carefully develop the theory that you are testing in the context of a linear regression model as
given in equation (1). In this section, be sure to:
1. Clearly explain which variables are the independent variables and which is the dependent
variable.
2. Justify why each variable is included, and indicate its expected sign (positive, negative,
or indeterminate – two competing effects)
3. Define the units of all variables.
4. Note any potentially endogenous explanatory variables and explain why they might be
endogenous.
You will find several journal articles on your topic, each with different model specifications.
Your final theoretical model should be a combination of these, plus your own ideas. It’s okay to
follow closely a model in one of the papers (they don’t often state explicitly why each variable is
included), but you should argue why the variables included in other papers do not belong in your
study. Or perhaps you think the previous research has left out an important variable; if so, argue
for its inclusion. Do not just state that you think “variable X” belongs in the model. Tell me
why.
3
The following are among the higher quality journals in economics (typical abbreviation):
American Economic Review (AER – generally regarded as the top journal; rest of list is not in rank order)
Brookings Papers on Economic Activity (BPEA)
Brookings Papers on Economic Activity: Microeconomics
European Economic Review (EER)
Journal of Applied Econometrics (JAE)
Journal of Business and Economic Statistics (JBES)
Journal of Economic History (JEH)
Journal of Health Economics (JHE)
Journal of Human Resources (JHR)
Journal of International Economics (JIE)
Journal of Labor Economics (JoLE)
Journal of Law and Economics (JLawE)
Journal of Money, Credit, and Banking (JMCB)
Journal of Political Economy (JPE)
Journal of Sports Economics (JSportE)
Quarterly Journal of Economics (QJE)
Review of Economic Studies (REStud)
Review of Economics and Statistics (REStat)
Journal of Economic Perspectives (JEP)
Journal of Economic Literature (JEL)
4
Download