Targeted learning with an undersmoothed LASSO propensity score model for large-scale covariate adjustment in health-care database studies

Abstract Least absolute shrinkage and selection operator (LASSO) regression is widely used for large-scale propensity score (PS) estimation in health-care database studies. In these settings, previous work has shown that undersmoothing (overfitting) LASSO PS models can improve confounding control, b...

Full description

Saved in:
Bibliographic Details
Published in:American journal of epidemiology Vol. 193; no. 11; pp. 1632 - 1640
Main Authors: Wyss, Richard, van der Laan, Mark, Gruber, Susan, Shi, Xu, Lee, Hana, Dutcher, Sarah K, Nelson, Jennifer C, Toh, Sengwee, Russo, Massimiliano, Wang, Shirley V, Desai, Rishi J, Lin, Kueiyu Joshua
Format: Journal Article
Language:English
Published: Oxford Oxford University Press 04-11-2024
Oxford Publishing Limited (England)
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Least absolute shrinkage and selection operator (LASSO) regression is widely used for large-scale propensity score (PS) estimation in health-care database studies. In these settings, previous work has shown that undersmoothing (overfitting) LASSO PS models can improve confounding control, but it can also cause problems of nonoverlap in covariate distributions. It remains unclear how to select the degree of undersmoothing when fitting large-scale LASSO PS models to improve confounding control while avoiding issues that can result from reduced covariate overlap. Here, we used simulations to evaluate the performance of using collaborative-controlled targeted learning to data-adaptively select the degree of undersmoothing when fitting large-scale PS models within both singly and doubly robust frameworks to reduce bias in causal estimators. Simulations showed that collaborative learning can data-adaptively select the degree of undersmoothing to reduce bias in estimated treatment effects. Results further showed that when fitting undersmoothed LASSO PS models, the use of cross-fitting was important for avoiding nonoverlap in covariate distributions and reducing bias in causal estimates.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0002-9262
1476-6256
1476-6256
DOI:10.1093/aje/kwae023