Book contents
- Frontmatter
- Contents
- Preface: Learning to Think Like a Social Scientist
- About the Contributors
- PART I MODELS AND METHODS IN THE SOCIAL SCIENCES
- PART II HISTORY
- PART III ECONOMICS
- PART IV SOCIOLOGY
- PART V POLITICAL SCIENCE
- PART VI PSYCHOLOGY
- PART VII TO TREAT OR NOT TO TREAT: CAUSAL INFERENCE IN THE SOCIAL SCIENCES
- 21 The Potential-Outcomes Model of Causation
- 22 Some Statistical Tools for Causal Inference with Observational Data
- 23 Migration and Solidarity
- References
- Index
22 - Some Statistical Tools for Causal Inference with Observational Data
Published online by Cambridge University Press: 05 June 2012
- Frontmatter
- Contents
- Preface: Learning to Think Like a Social Scientist
- About the Contributors
- PART I MODELS AND METHODS IN THE SOCIAL SCIENCES
- PART II HISTORY
- PART III ECONOMICS
- PART IV SOCIOLOGY
- PART V POLITICAL SCIENCE
- PART VI PSYCHOLOGY
- PART VII TO TREAT OR NOT TO TREAT: CAUSAL INFERENCE IN THE SOCIAL SCIENCES
- 21 The Potential-Outcomes Model of Causation
- 22 Some Statistical Tools for Causal Inference with Observational Data
- 23 Migration and Solidarity
- References
- Index
Summary
PROPENSITY-SCORE MATCHING
In order to apply the potential-outcome framework to get causal estimates that don't depend too strongly on untestable assumptions, we first need to make sure that the distributions of the treatment and control groups are balanced. This means, in other words, that we need to make sure that we are comparing apples with apples. To do so, we need to match those units that receive the treatment and those that do not receive the treatment, using a number of covariates (X). Going back to our example in Chapter 21, we need to find households that are identical in all possible, pre-treatment aspects (income, education, health, number of siblings, geographical region of origin, etc.) but that differ in their migratory experience. This procedure would create a smaller dataset with only the matched households. Once we accomplish this, we just need to estimate the average difference in means (E(Yγ − Yγ′) = E(Yγ) − E(Yγ′)) to find the impact of migration on children's emotional state. The life of an applied researcher, however, is not that easy. The introduction of a significant number of covariates (X) such as income, education, health, number of siblings, geographical region, and so on, makes it very difficult to match treated and control households. For example, if we match two households on income, then we are probably going to unmatch them on another dimension, such as number of siblings. Therefore, matching on a large number of covariates creates a high-dimensionality problem (Dehejia 2004).
- Type
- Chapter
- Information
- A Quantitative Tour of the Social Sciences , pp. 309 - 318Publisher: Cambridge University PressPrint publication year: 2009