Book contents
- Ethics in Econometrics
- Ethics in Econometrics
- Copyright page
- Contents
- Figures
- Tables
- Preface
- Acknowledgments
- Recommended Reading
- Introduction
- 1 Ethical Guidelines
- 2 Scientific Misconduct
- 3 Influential Observations
- 4 Model Selection
- 5 Estimation and Interpretation
- 6 Missing Data
- 7 Spurious Relations
- 8 Blinded by the Data
- 9 Predictability
- 10 Adjustment of Forecasts
- 11 Big Data
- 12 Algorithms
- Conclusion
- Index
11 - Big Data
Published online by Cambridge University Press: 14 November 2024
- Ethics in Econometrics
- Ethics in Econometrics
- Copyright page
- Contents
- Figures
- Tables
- Preface
- Acknowledgments
- Recommended Reading
- Introduction
- 1 Ethical Guidelines
- 2 Scientific Misconduct
- 3 Influential Observations
- 4 Model Selection
- 5 Estimation and Interpretation
- 6 Missing Data
- 7 Spurious Relations
- 8 Blinded by the Data
- 9 Predictability
- 10 Adjustment of Forecasts
- 11 Big Data
- 12 Algorithms
- Conclusion
- Index
Summary
Currently we may have access to large databases, sometimes coined as Big Data, and for those large datasets simple econometric models will not do. When you have a million people in your database, such as insurance firms or telephone providers or charities, and you have collected information on these individuals for many years, you simply cannot summarize these data using a small-sized econometric model with just a few regressors. In this chapter we address diverse options for how to handle Big Data. We kick off with a discussion about what Big Data is and why it is special. Next, we discuss a few options such as selective sampling, aggregation, nonlinear models, and variable reduction. Methods such as ridge regression, lasso, elastic net, and artificial neural networks are also addressed; these latter concepts are nowadays described as so-called machine learning methods. We see that with these methods the number of choices rapidly increases, and that reproducibility can reduce. The analysis of Big Data therefore comes at a cost of more analysis and of more choices to make and to report.
Keywords
- Type
- Chapter
- Information
- Ethics in EconometricsA Guide to Research Practice, pp. 248 - 268Publisher: Cambridge University PressPrint publication year: 2024