Role of Covariates in Case Control Studies with Skewed Exposure: Evidence from Monte Carlo Simulations
DOI:
https://doi.org/10.54103/2282-0930/27302Abstract
Case-control studies, a widely used observational study design, are essential for investigating the association between exposure and outcomes. In such studies, logistic regression is commonly employed to analyse the relationship between binary outcome and exposure, accounting for covariates, confounders, and effect modifiers. However, skewed exposure distributions, where the exposure is disproportionately distributed among cases and controls, pose significant challenges.
This study aims to address these challenges by conducting a series of Monte Carlo simulation experiments to assess the impact of skewed exposure on the power of the Wald test and the bias in estimated logistic regression coefficients. The simulations focus on the role of continuous covariates in producing reliable estimates of exposure effects. The study highlights the importance of preliminary knowledge of exposure and covariate effects, as these factors play a crucial role in selecting an appropriate sample size. These simulations, which required significant computational time, highlight the robustness of the estimates with larger sample sizes and a greater number of covariates, eliminating the potential bias introduced by skewed exposure.
Downloads
References
1. Setia MS. Methodology Series Module 2: Case-control Studies. Indian Journal of Dermatology. 2016; 61(2):146–51. DOI: https://doi.org/10.4103/0019-5154.177773
2. Rosendaal FR. Bridging Case-control Studies and Randomized Trials. Current Controlled Trials in Cardiovascular Medicine. 2001; 2(3):109–10. DOI: https://doi.org/10.1186/cvm-2-3-109
3. Suárez E, Pérez CM, Rivera R, Martínez MN, Applications of Regression Models in Epidemiology. 2017. DOI: https://doi.org/10.1002/9781119212515
4. Foxman B, Marsh J, Gillespie B, Rubin N, Koopman JS, Spear S. Condom Use and First-Time Urinary Tract Infection. Epidemiology. 1997; 8(6):637–41. DOI: https://doi.org/10.1097/00001648-199711000-00008
5. Day NE, Kerridge DF. A General Maximum Likelihood Discriminant. Biometrics. 1967; 23(2):313–23. DOI: https://doi.org/10.2307/2528164
6. Albert A, Anderson JA. On the Existence of Maximum Likelihood Estimates in Logistic Regression Mod-els. Biometrika. 1984; 71(1):1–10. DOI: https://doi.org/10.1093/biomet/71.1.1
7. Anderson JA. Separate Sample Logistic Discrimination. Biometrika. 1972 Apr; 59(1):19–35. DOI: https://doi.org/10.1093/biomet/59.1.19
8. Alkhalaf A, Zumbo BD. The Impact of Predictor Variable(s) with Skewed Cell Probabilities on Wald Tests in Binary Logistic Regression. Journal of Modern Ap-plied Statistical Methods. 2017; 16(2):40–80. DOI: https://doi.org/10.22237/jmasm/1509494640
9. Agresti A. Categorical Data Analysis. Wiley Series in Probability and Statistics. 2002 Jul 3. DOI: https://doi.org/10.1002/0471249688
10. Heinze G, Schemper M. A Solution to the Problem of Separation in Logistic Regression. Statistics in Medicine. 2002; 21(16):2409–19. DOI: https://doi.org/10.1002/sim.1047
11. Heinze G, Puhr R. Bias-reduced and Separation-proof Conditional Logistic Regression with Small or Sparse Data Sets. Statistics in Medicine. 2010; 29(7-8):770–7. DOI: https://doi.org/10.1002/sim.3794
12. Mansournia MA, Geroldinger A, Greenland S, Heinze G. Separation in Logistic Regression: Causes, Consequences, and Control. American Journal of Epidemiology. 2017; 187(4):864–70. DOI: https://doi.org/10.1093/aje/kwx299
13. Bradley, J. V. Robustness? British Journal of Mathematical and Statistical Psychology.1978; 31(2):144–52. DOI: https://doi.org/10.1111/j.2044-8317.1978.tb00581.x
14. Ferguson CJ. An Effect Size Primer: a Guide for Clinicians and Researchers. Professional Psychology: Research and Practice. 2009; 40(5):532–8. DOI: https://doi.org/10.1037/a0015808
15. Kuo CL, Duan Y, Grady J. Unconditional or Conditional Logistic Regression Model for Age-Matched Case–Control Data? Frontiers in Public Health. 2018; 6(57). DOI: https://doi.org/10.3389/fpubh.2018.00057
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Yashaswini K, Anna Maria Pinto

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Accepted 2025-01-15
Published 2025-02-10


