Effect estimation in point-exposure studies with binary outcomes and high-dimensional covariate data
Correct specification of the inverse probability weighting (IPW) model is necessary for consistent inference from a marginal structural Cox model (MSCM). In practical applications, researchers are typically unaware of the true specification of the weight model. Nonetheless, IPWs are commonly estimated using parametric models, such as the main-effects logistic regression model. In practice, assumptions underlying such models may not hold and data-adaptive statistical learning methods may provide an alternative. Many candidate statistical learning approaches are available in the literature. However, the optimal approach for a given dataset is impossible to predict. Super learner (SL) has been proposed as a tool for selecting an optimal learner from a set of candidates using cross-validation. In this study, we evaluate the usefulness of a SL in estimating IPWin four differentMSCM simulation scenarios, in which we varied the specification of the true weight model specification (linear and/or additive). Our simulations show that, in the presence of weight model misspecification, with a rich and diverse set of candidate algorithms, SL can generally offer a better alternative to the commonly used statistical learning approaches in terms of MSE as well as the coverage probabilities of the estimated effect in an MSCM. The findings from the simulation studies guided the application of the MSCM in a multiple sclerosis cohort from British Columbia, Canada (1995–2008), to estimate the impact of beta-interferon treatment in delaying disability progression.
Pang M, Schuster T, Filion KB, Schnitzer M, Eberg M, Platt RW. Effect Estimation in Point-exposure Studies with Binary Outcomes and High-dimensional Covariate Data – a comparison of Targeted Maximum Likelihood Estimation and Inverse Probability of Treatment Weighting. Int J Biostat. 2016 Nov 1;12(2).