Bias in regression coefficient estimates when assumptions for handling missing data are violated: a simulation study
The purpose of this simulation study is to assess the performance of multiple imputation compared to complete case analysis when assumptions of missing data mechanisms are violated.
The authors performed a stochastic simulation study to assess the performance of Complete Case (CC) analysis and Multiple Imputation (MI) with different missing data mechanisms (missing completely at random (MCAR), at random (MAR), and not at random (MNAR)). The study focused on the point estimation of regression coefficients and standard errors.
When data were MAR conditional on Y, CC analysis resulted in biased regression coefficients; they were all underestimated in our scenarios. In these scenarios, analysis after MI gave correct estimates. Yet, in case of MNAR MI yielded biased regression coefficients, while CC analysis performed well.
This work is licensed under a Creative Commons Attribution 4.0 International License.