A Comparison of Baseline and Time-Dependent Approaches in Cox Model
DOI:
https://doi.org/10.54103/2282-0930/29526Abstract
INTRODUCTION
In clinical and epidemiological research, risk factors vary throughout the observation period. Time-dependent covariates reflect evolving exposures and provide a dynamic view of the individual. In longitudinal settings, repeated observations within a subject are typically correlated and this correlation often decreases as the time interval between measurements increases.
Handling variables that change over time can be challenging in survival analysis framework, especially when their values are influenced by the outcome process itself. This interdependence may limit their application during specific phases of the analysis [1,2]. The extended Cox model provides a robust approach to incorporate such variables under a properly specified time-dependent structure [3,4] and is widely used in epidemiological settings. However, it is common practice, especially in cohort studies, to simplify time-varying covariates by relying on baseline measurements, potentially introducing bias when assessing the instantaneous risk.
OBJECTIVES
This contribution aims to assess the impact of modeling time-varying covariates as fixed at baseline within Cox model. A series of simulations was conducted to quantify the resulting information loss and identify key factors driving the discrepancy in results between baseline and time-dependent specifications.
METHODS
Simulation data were generated for 1.000 individuals, with a single time-dependent covariate drawn from a standard multivariate normal distribution. The covariance matrix was modeled with a first-order autoregressive structure, setting the coefficient to 0.3,0.6 and 0.9. The repeated measurements, defining the number of change-points, were evenly spaced over a fixed maximum follow-up, with the number of measures M set to 4, 8, or 16. Event times were generated using a Weibull distribution with shape parameter k set to 0.5,1 and 2, reflecting decreasing, constant and increasing hazard rates. The covariate effect size (log-hazard) was set to 0,0.2,0.4 or 1. Survival and censoring times were simulated using the permutational algorithm described in [5]. The expected censoring rate was approximately 50%. Baseline and time-dependent models were implemented and compared across 1000 repetitions for each scenario. For both models, the distributions of the estimated coefficients and their difference (, bias, and empirical statistical power were assessed over the simulation runs.
RESULTS
The highest discrimination between the models (median was observed in lower correlation and higher measurement frequency scenarios. Baseline models underestimated the true covariate effect, as shown in Table 1. Other scenarios based on a different number of measurements reflected these patterns.
Across all scenarios, correlation decrease between repeated measurements led to a progressive increase in the relative bias of the estimates. This pattern can be relieved especially in the increasing hazard rate (k=2) settings, meanwhile the underestimation was mitigated in decreasing hazard rate scenarios (k=0.5).
Moreover, baseline model power improved with higher correlation between measures; conversely, low correlation reduces power, especially when measurement frequency is high. This relationship holds across various effect sizes. Type I error remained controlled in all conditions.
CONCLUSIONS
Our study shows that relying on baseline covariates may lead to underestimation of the true association between variables and outcomes, particularly in scenarios with frequent measurements and low correlation between repeated values. In these settings, the negative bias grows, revealing that the baseline value fails to represent the trajectory of the covariate adequately and in some extreme configurations indicates a near-complete failure to detect the effect. A limitation of our work is using a simulation framework based on controlled assumptions, which may not fully capture the variability and complexity of real-world data. Future research will move towards a formalization of these relationships, while also examining broader settings and additional factors to strengthen these findings. Moreover, future efforts will focus on applying these insights and pursuing further explorations within environmental epidemiology framework, which is characterized by a large amount of data, as well as numerous challenges in individual exposure assessment that may interact in various ways with the time-dependent dynamics of exposure.
ACKNOWLEDGMENTS
This project was carried out with the technical support and funding of the Italian Ministry of Health – PNC PREV-A-2022-12376981 Aria outdoor e salute: un atlante integrato a supporto delle decisioni e della ricerca.
Downloads
References
[1] Kalbfleisch J.D., Prentice R.L., The statistical analysis of failure time data. John Wiley & Sons, 2002
[2] Fisher L.D., Lin D.Y., Time-dependent covariates in the Cox proportional-hazards regression model. Annu Rev Public Health, 1999; 20(1):145-57
[3] Therneau T.M., Grambsch P.M., Modeling Survival Data: Extending the Cox Model. Statistics for Biology and Health, Springer, 2000
[4] Therneau T., Crowson C., Atkinson E., Using time dependent covariates and time dependent coefficients in the Cox model. Survival Vignettes, 2017; 2(3):1-25 (Updated December 2024; accessed on April 14, 2025)
[5] Sylvestre M.P., Abrahamowicz M., Comparison of algorithms to generate event times conditional on time‐dependent covariates. Stat Med., 2008; 27(14):2618-34
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Paola Schiattarella , Annafrancesca Smimmo , Teresa Speranza , Piergiacomo Di Gennaro , Giovanni Nicolao , Mario Fordellone , Vittorio Simeon , Simona Signoriello , Massimo Stafoggia, Andrea Ranzi, Laura Bonvicini, Serena Broccoli, Nicola Caranci, Nicolás Zengarini, Paola Angelini, Paolo Giorgi Rossi, Paolo Chiodini

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Funding data
-
Ministero della Salute
Grant numbers PREV-A-2022-12376981


