Sinem Hacioglu Hoke and Kerem Tuzcuoglu
We economists want to have our cake and eat it. We have far more data series at our disposal now than ever before. But using all of them in regressions would lead to wild “over-fitting” – finding random correlations in the data rather than explaining the true underlying relationships. Researchers using large data sets have historically experienced this dilemma – you can either throw away some of the information and retain clean, interpretable models; or keep most of the information but lose interpretability. This trade-off is particularly frustrating in a policy environment where understanding the identified relationships is crucial. However, in a recent working paper we show how to sidestep this trade-off by estimating a factor model with intuitive results.