Skip to Main Content
HBS Home
  • About
  • Academic Programs
  • Alumni
  • Faculty & Research
  • Baker Library
  • Giving
  • Harvard Business Review
  • Initiatives
  • News
  • Recruit
  • Map / Directions
Faculty & Research
  • Faculty
  • Research
  • Featured Topics
  • Academic Units
  • …→
  • Harvard Business School→
  • Faculty & Research→
Publications
Publications
  • August 2021
  • Article
  • Sociological Methods & Research

Multiple Imputation Using Gaussian Copulas

By: F.M. Hollenbach, I. Bojinov, S. Minhas, N.W. Metternich, M.D. Ward and A. Volfovsky
  • Format:Print
ShareBar

Abstract

Missing observations are pervasive throughout empirical research, especially in the social sciences. Despite multiple approaches to dealing adequately with missing data, many scholars still fail to address this vital issue. In this paper, we present a simple-to-use method for generating multiple imputations using a Gaussian copula. The Gaussian copula for multiple imputation (Hoff, 2007) allows scholars to attain estimation results that have good coverage and small bias. The use of copulas to model the dependence among variables will enable researchers to construct valid joint distributions of the data, even without knowledge of the actual underlying marginal distributions. Multiple imputations are then generated by drawing observations from the resulting posterior joint distribution and replacing the missing values. Using simulated and observational data from published social science research, we compare imputation via Gaussian copulas with two other widely used imputation methods: MICE and Amelia II. Our results suggest that the Gaussian copula approach has a slightly smaller bias, higher coverage rates, and narrower confidence intervals compared to the other methods. This is especially true when the variables with missing data are not normally distributed. These results, combined with theoretical guarantees and ease-of-use suggest that the approach examined provides an attractive alternative for applied researchers undertaking multiple imputations.

Keywords

Missing Data; Bayesian Statistics; Imputation; Categorical Data; Estimation

Citation

Hollenbach, F.M., I. Bojinov, S. Minhas, N.W. Metternich, M.D. Ward, and A. Volfovsky. "Multiple Imputation Using Gaussian Copulas." Special Issue on New Quantitative Approaches to Studying Social Inequality. Sociological Methods & Research 50, no. 3 (August 2021): 1259–1283. (0049124118799381.)
  • Find it at Harvard
  • Read Now

About The Author

Iavor I. Bojinov

Technology and Operations Management
→More Publications

More from the Authors

    • March 2022
    • Faculty Research

    Prediction & Machine Learning

    By: Iavor I. Bojinov, Michael Parzen and Paul J. Hamilton
    • March 2022
    • Faculty Research

    Linear Regression

    By: Iavor I. Bojinov, Michael Parzen and Paul J. Hamilton
    • March 2022
    • Faculty Research

    Statistical Inference

    By: Iavor I. Bojinov, Michael Parzen and Paul J. Hamilton
More from the Authors
  • Prediction & Machine Learning By: Iavor I. Bojinov, Michael Parzen and Paul J. Hamilton
  • Linear Regression By: Iavor I. Bojinov, Michael Parzen and Paul J. Hamilton
  • Statistical Inference By: Iavor I. Bojinov, Michael Parzen and Paul J. Hamilton
ǁ
Campus Map
Harvard Business School
Soldiers Field
Boston, MA 02163
→Map & Directions
→More Contact Information
  • Make a Gift
  • Site Map
  • Jobs
  • Harvard University
  • Trademarks
  • Policies
  • Digital Accessibility
Copyright © President & Fellows of Harvard College