Preprint
Finding Long-COVID: Temporal Topic Modeling of Electronic Health Records from the N3C and RECOVER Programs
medRxiv : the preprint server for health sciences
Cold Spring Harbor Laboratory
06/11/2024
DOI: 10.1101/2023.09.11.23295259
PMCID: PMC11213052
PMID: 38947087
Abstract
Post-Acute Sequelae of SARS-CoV-2 infection (PASC), also known as Long-COVID, encompasses a variety of complex and varied outcomes following COVID-19 infection that are still poorly understood. We clustered over 600 million condition diagnoses from 14 million patients available through the National COVID Cohort Collaborative (N3C), generating hundreds of highly detailed clinical phenotypes. Assessing patient clinical trajectories using these clusters allowed us to identify individual conditions and phenotypes strongly increased after acute infection. We found many conditions increased in COVID-19 patients compared to controls, and using a novel method to associate patients with clusters over time, we additionally found phenotypes specific to patient sex, age, wave of infection, and PASC diagnosis status. While many of these results reflect known PASC symptoms, the resolution provided by this unprecedented data scale suggests avenues for improved diagnostics and mechanistic understanding of this multifaceted disease.
Details
- Title: Subtitle
- Finding Long-COVID: Temporal Topic Modeling of Electronic Health Records from the N3C and RECOVER Programs
- Creators
- Shawn T O'Neil - University of Colorado Anschutz Medical CampusCharisse Madlock-Brown - University of Tennessee Health Science CenterKenneth J Wilkins - National Institute of Diabetes and Digestive and Kidney DiseasesBrenda M McGrath - OchinHannah E Davis - Patient-Led Research CollaborativeGina S Assaf - Patient-Led Research CollaborativeHannah Wei - Patient-Led Research CollaborativeParya Zareie - University of California, DavisEvan T French - Virginia Commonwealth UniversityJohanna Loomba - University of VirginiaJulie A McMurry - University of Colorado Anschutz Medical CampusAndrea Zhou - University of VirginiaChristopher G ChuteRichard A Moffitt - Emory UniversityEmily R Pfaff - University of North Carolina School of MedicineYun Jae Yoo - Emory UniversityPeter Leese - University of North Carolina School of MedicineRobert F Chew - RTI InternationalMichael Lieberman - University of PortlandMelissa A Haendel - University of Colorado Anschutz Medical Campus
- Resource Type
- Preprint
- Publication Details
- medRxiv : the preprint server for health sciences
- DOI
- 10.1101/2023.09.11.23295259
- PMID
- 38947087
- PMCID
- PMC11213052
- Publisher
- Cold Spring Harbor Laboratory; United States
- Grant note
- UL1 TR002243 / NCATS NIH HHS U54 GM115677 / NIGMS NIH HHS UL1 TR001453 / NCATS NIH HHS UL1 TR003142 / NCATS NIH HHS UL1 TR003107 / NCATS NIH HHS UL1 TR002535 / NCATS NIH HHS UL1 TR001881 / NCATS NIH HHS UL1 TR001857 / NCATS NIH HHS UL1 TR001450 / NCATS NIH HHS UL1 TR002538 / NCATS NIH HHS
- Language
- English
- Date posted
- 06/11/2024
- Academic Unit
- Nursing
- Record Identifier
- 9984721250502771
Metrics
2 Record Views