Performance characteristics of code‐based algorithms to identify urinary tract infections in large United States administrative claims databases

Background In real‐world evidence research, reliability of coding in healthcare databases dictates the accuracy of code‐based algorithms in identifying conditions such as urinary tract infection (UTI). This study evaluates the performance characteristics of code‐based algorithms to identify UTI. Met...

Full description

Saved in:

Bibliographic Details
Published in:	Pharmacoepidemiology and drug safety Vol. 31; no. 9; pp. 953 - 962
Main Authors:	Fortin, Stephen P., Swerdel, Joel, Sarnecki, Michal, Doua, Joachim, Colasurdo, Jamie, Geurtsen, Jeroen
Format:	Journal Article
Language:	English
Published:	Chichester, UK John Wiley & Sons, Inc 01-09-2022 Wiley Subscription Services, Inc
Subjects:	administrative claims databases Algorithms Codes code‐based algorithms Diagnosis Literature reviews observational research Observational studies performance characteristics phenotype PheValuator Urinalysis Urinary tract Urinary tract infections Urogenital system United States > US
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Background In real‐world evidence research, reliability of coding in healthcare databases dictates the accuracy of code‐based algorithms in identifying conditions such as urinary tract infection (UTI). This study evaluates the performance characteristics of code‐based algorithms to identify UTI. Methods Retrospective observational study of adults contained within three large U.S. administrative claims databases on or after January 1, 2010. A targeted literature review was performed to inform the development of 10 code‐based algorithms to identify UTIs consisting of combinations of diagnosis codes, antibiotic exposure for the treatment of UTIs, and/or ordering of a urinalysis or urine culture. For each database, a probabilistic gold standard was developed using PheValuator. The performance characteristics of each code‐based algorithm were assessed compared with the probabilistic gold standard. Results A total of 2 950 641, 1 831 405, and 2 294 929 patients meeting study criteria were identified in each database. Overall, the code‐based algorithm requiring a primary UTI diagnosis code achieved the highest positive predictive values (PPV; >93.8%) but the lowest sensitivities (<12.9%). Algorithms requiring three UTI diagnosis codes achieved similar PPV (>0.899%) and improved sensitivity (<41.6%). Algorithms requiring a single UTI diagnosis code in any position achieved the highest sensitivities (>72.1%) alongside a slight reduction in PPVs (<78.3%). All‐time prevalence estimates of UTI ranged from 21.6% to 48.6%. Conclusions Based on these findings, we recommend use of algorithms requiring a single UTI diagnosis code, which achieved high sensitivity and PPV. In studies where PPV is critical, we recommend code‐based algorithms requiring three UTI diagnosis codes rather than a single primary UTI diagnosis code.
Bibliography:	Funding information Janssen Research & Development LLC ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-3 content type line 23 ObjectType-Review-1
ISSN:	1053-8569 1099-1557
DOI:	10.1002/pds.5492