Install this application on your home screen for quick and easy access when you’re on the go.
Just tap then “Add to Home Screen”
Install this application on your home screen for quick and easy access when you’re on the go.
Just tap then “Add to Home Screen”
Member rate £492.50
Non-Member rate £985.00
Save £45 Loyalty discount applied automatically*
Save 5% on each additional course booked
*If you attended our Methods School in the last calendar year, you qualify for £45 off your course fee.
Monday 29 July – Friday 2 August
This course places an emphasis on the connections between panel data methods and causal inference, which is the primary goal of social science research.
Panel data is particularly advantageous for causal inference because it allows researchers to control for entity-specific factors (individual heterogeneity) such as geography and culture, which may not be observed and can be difficult to measure.
Panel data usually contain more degrees of freedom and more sample variability than purely cross-sectional data or time-series data, and hence improve the efficiency of the parameter estimates.
The course begins with a quick overview of causal inference and a review of the standard OLS assumptions.
It then moves on with simple panel data methods, fixed and random effect estimators, then more advanced methods such as clustered samples, panel instrumental variable methods, panel corrected standard error estimator and dynamic panel regressions, which deal with various violations of the standard OLS assumptions.
Two or three lab sessions put these methods into practice, using Stata.
The last session is a seminar in which those who want extra credits can present their research or research proposals and receive feedback from the Instructor and peers. You are therefore encouraged to bring your own data to the course.
2 credits Meet the attendance requirements and participate actively in class.
3 credits As above, plus give a presentation on your research or research proposals during the final session.
4 credits As above, plus submit a 2,500–3,000 word research paper/proposal that uses panel data analysis in the research design.
Andrew is an assistant professor at CEU's Department of International Relations. He obtained his PhD from the National University of Singapore and King’s College London.
His research interests include international political economy, research design, and quantitative methods. He teaches the Research Design and Methods in IR course series at CEU.
This course introduces various econometric models and techniques that can be applied to panel data. It also emphasises the connections between these methods and causal inference, which is the primary goal of social science research.
Day 1
We begin with an overview of causal inference in the context of observational studies. We quickly survey the idea of causality and relevant concepts such as average treatment effect (ATE). Importantly, we will draw connections between these concepts and regression analysis. We also quickly review the set of assumptions on which OLS regression hinges.
In the second half of the day, we look at simple panel data methods. Specifically, we make a distinction between ‘independently pooled across section’ data and ‘panel’ data. The latter is the focus of this course. We’ll see that panel data differs in some important respects from an independently pooled cross-section in that a panel consists of the same entities (individuals, firms, countries or whatever) across time. We then study two basic panel data methods: two-period panel data analysis and first differencing.
Day 2
We focus on slightly more advanced methods for estimating the unobserved effects in the context of panel data analysis. We introduce the fixed effects estimator, which, like first differencing, uses a transformation to remove the unobserved effect prior to estimation. As a result, any time-constant explanatory variables are removed in the process. In contrast, we introduce the random effect estimator, which looks attractive when we think the unobserved effect is uncorrelated with all the explanatory variables. For example, when we have good knowledge about the factors affecting the dependent variable and have controlled for these factors in the equation, random effect estimator can sometimes be the preferred strategy. With these foundations, we then look into a relatively new correlated random effects approach, which provides a synthesis of fixed effects and random effects methods, and has been shown to be practically very useful. In the meantime, you’ll also learn that the usefulness of these methods critically hinges on the assumptions made with respect to the error term and the relationship between the regressors and the error term.
Day 3
We begin with a lab session during which we put into practice the methods introduced in the previous two days. We carry out these analyses in Stata and learn how to interpret the results. If you have brought your own data, this is a good opportunity to carry out the analysis for your current research projects. In the second session, we return to the classroom to learn more about the research designs and methods to which panel data can be applied. We start to relax the assumptions made in the previous discussion and learn methods designed to deal with various violations of these assumptions. The first is instrumental variable (IV) method, which deals with violations to the strict exogeneity assumption. Again, I emphasise the connection between this method and causal inference.
Day 4
I introduce several more advanced panel data methods that address further violations of the standard assumptions. Since panel data contains repeated observations of the same entity over time, we may have different error variances for different panels as well as correlation of the error terms across panels and/or time. To deal with these potential challenges, we study methods such as clustering and robust estimation, panel-corrected standard error (PCSE) estimates and dynamic panel methods (Arellano-Bond estimator). For these more advanced methods, we will not look into the technical details but rely more on an intuitive understanding of the challenges these methods address.
Day 5
In the first session, we return to the lab and put these more advanced methods into practice. We learn how to carry out panel IV analysis, obtain the PCSE and Arellano-Bond estimators and calculate cluster robust standard errors. Again, this is a good opportunity for you to carry out the analyses using your own datasets and check the robustness of the results across various model specifications. The last session is a seminar. Students who want extra credits can present their research or research proposals that use panel data methods, and receive feedback from the Instructor and fellow participants.
This course is a general survey of panel data methods. It attempts to achieve a balance between theories and practical/implementation. If you are interested only in implementing these methods, you may find this course less beneficial.
The limited time available means we won’t have the luxury of going deeply into methodological niches, such as presenting all the techniques for tackling autocorrelation.
This course builds on ordinary least square (OLS) regression and extends it to data with a time-series-cross-sectional structure.
You need
If you do not have this knowledge, I strongly encourage you to take one of the following, either before or concurrently with this course:
Introduction to Inferential Statistics: What you need to know before you take regression
Multiple Regression Analysis: Estimation, Diagnostics, and Modelling
This course is designed for students who are somewhat familiar with the content covered up to Chapter 9 of Wooldridge’s 2016 textbook (see below) but do not have a good knowledge of the materials beyond that. This is an easy way to check if this is the right course for you.
Day | Topic | Details |
---|---|---|
1 | Overview of causal inference Review of OLS assumptions Simple panel data methods |
Lecture (3 hours) |
2 | Fixed effect estimator Random effect estimator Correlated random effects approach |
Lecture (3 hours) |
3 | Practical session 1 Instrumental variable method |
Lab (1.5 hours) Lecture (1.5 hours) |
4 | Clustering and robust estimation Heteroskedasticity Autocorrelation Dynamic panel models |
Lecture (3 hours) |
5 | Practical session 2 Student presentations |
Lab (1.5 hours) Seminar (1.5 hours) |
Day | Readings |
---|---|
2 |
Wooldridge, Jeffrey M Beck, Nathaniel |
1 |
Rubin, Donald B Wooldridge, Jeffrey M |
3 |
Cameron, Adrian Colin and Pravin K. Trivedi Wooldridge, Jeffrey M |
4 |
Arellano, Manuel, and Olympia Bover Beck, Nathaniel, and Jonathan N. Katz Bond, Stephen R Newey, Whitney K., and Kenneth D. West Wooldridge, Jeffrey M |
5 |
Cameron, Adrian Colin and Pravin K. Trivedi |
Stata (version 15)
None – the course will be held in a lab
Holland, Paul W
'Statistics and Causal Inference'
Journal of the American Statistical Association 81, no. 396 (1986): 945-960.
Imbens, Guido W., and Jeffrey M. Wooldridge'
Recent developments in the econometrics of program evaluation'
Journal of Eeconomic Literature 47, no. 1 (2009): 5-86
Nickell, Stephen
'Biases in dynamic models with fixed effects'
Econometrica: Journal of the Econometric Society (1981): 1417-1426
Blundell, Richard, Stephen Bond, and Frank Windmeijer
'Estimation in dynamic panel data models: improving on the performance of the standard GMM estimator'
In Nonstationary panels, panel cointegration, and dynamic panels, pp. 53-91
Emerald Group Publishing Limited, 2001
Summer School
Introduction to STATA
Introduction to Inferential Statistics: What you need to know before you take regression
Multiple Regression Analysis: Estimation, Diagnostics, and Modelling
Summer School
Advanced Topics in Applied Regression
Causal Inference in the Social Sciences II: Difference in Difference, Regression Discontinuity and Instruments