Catalogue Search | MBRL

Coding long COVID: characterizing a new disease through an ICD-10 lens

by Madlock-Brown, Charisse , Kelly, Elizabeth , Kostka, Kristin in Algorithms , Biomedicine , Clinical coding

2023

Background Naming a newly discovered disease is a difficult process; in the context of the COVID-19 pandemic and the existence of post-acute sequelae of SARS-CoV-2 infection (PASC), which includes long COVID, it has proven especially challenging. Disease definitions and assignment of a diagnosis code are often asynchronous and iterative. The clinical definition and our understanding of the underlying mechanisms of long COVID are still in flux, and the deployment of an ICD-10-CM code for long COVID in the USA took nearly 2 years after patients had begun to describe their condition. Here, we leverage the largest publicly available HIPAA-limited dataset about patients with COVID-19 in the US to examine the heterogeneity of adoption and use of U09.9, the ICD-10-CM code for “Post COVID-19 condition, unspecified.” Methods We undertook a number of analyses to characterize the N3C population with a U09.9 diagnosis code ( n = 33,782), including assessing person-level demographics and a number of area-level social determinants of health; diagnoses commonly co-occurring with U09.9, clustered using the Louvain algorithm; and quantifying medications and procedures recorded within 60 days of U09.9 diagnosis. We stratified all analyses by age group in order to discern differing patterns of care across the lifespan. Results We established the diagnoses most commonly co-occurring with U09.9 and algorithmically clustered them into four major categories: cardiopulmonary, neurological, gastrointestinal, and comorbid conditions. Importantly, we discovered that the population of patients diagnosed with U09.9 is demographically skewed toward female, White, non-Hispanic individuals, as well as individuals living in areas with low poverty and low unemployment. Our results also include a characterization of common procedures and medications associated with U09.9-coded patients. Conclusions This work offers insight into potential subtypes and current practice patterns around long COVID and speaks to the existence of disparities in the diagnosis of patients with long COVID. This latter finding in particular requires further research and urgent remediation.

Journal Article

Share this book

Add to My Shelf

Long COVID risk and pre-COVID vaccination in an EHR-based cohort study from the RECOVER program

by Redfield, Signe , Girvin, Andrew T. , Moffitt, Richard A. in 631/326/596/4130 , 692/699/255/2514 , 692/700/459/1748

2023

Long COVID, or complications arising from COVID-19 weeks after infection, has become a central concern for public health experts. The United States National Institutes of Health founded the RECOVER initiative to better understand long COVID. We used electronic health records available through the National COVID Cohort Collaborative to characterize the association between SARS-CoV-2 vaccination and long COVID diagnosis. Among patients with a COVID-19 infection between August 1, 2021 and January 31, 2022, we defined two cohorts using distinct definitions of long COVID—a clinical diagnosis ( n = 47,404) or a previously described computational phenotype ( n = 198,514)—to compare unvaccinated individuals to those with a complete vaccine series prior to infection. Evidence of long COVID was monitored through June or July of 2022, depending on patients’ data availability. We found that vaccination was consistently associated with lower odds and rates of long COVID clinical diagnosis and high-confidence computationally derived diagnosis after adjusting for sex, demographics, and medical history. The extent to which COVID-19 vaccination protects against long COVID is not well understood. Here, the authors use electronic health record data from the United States and find that, for people who received their vaccination prior to infection, vaccination was associated with lower incidence of long COVID.

Journal Article

Share this book

Add to My Shelf

Risk factors associated with post-acute sequelae of SARS-CoV-2: an N3C and NIH RECOVER study

by Sharma, Suchetha , Xie, Catherine , Madlock-Brown, Charisse in Adolescent , Adult , Aged

2023

Background More than one-third of individuals experience post-acute sequelae of SARS-CoV-2 infection (PASC, which includes long-COVID). The objective is to identify risk factors associated with PASC/long-COVID diagnosis. Methods This was a retrospective case–control study including 31 health systems in the United States from the National COVID Cohort Collaborative (N3C). 8,325 individuals with PASC (defined by the presence of the International Classification of Diseases, version 10 code U09.9 or a long-COVID clinic visit) matched to 41,625 controls within the same health system and COVID index date within ± 45 days of the corresponding case's earliest COVID index date. Measurements of risk factors included demographics, comorbidities, treatment and acute characteristics related to COVID-19. Multivariable logistic regression, random forest, and XGBoost were used to determine the associations between risk factors and PASC. Results Among 8,325 individuals with PASC, the majority were > 50 years of age (56.6%), female (62.8%), and non-Hispanic White (68.6%). In logistic regression, middle-age categories (40 to 69 years; OR ranging from 2.32 to 2.58), female sex (OR 1.4, 95% CI 1.33–1.48), hospitalization associated with COVID-19 (OR 3.8, 95% CI 3.05–4.73), long (8–30 days, OR 1.69, 95% CI 1.31–2.17) or extended hospital stay (30 + days, OR 3.38, 95% CI 2.45–4.67), receipt of mechanical ventilation (OR 1.44, 95% CI 1.18–1.74), and several comorbidities including depression (OR 1.50, 95% CI 1.40–1.60), chronic lung disease (OR 1.63, 95% CI 1.53–1.74), and obesity (OR 1.23, 95% CI 1.16–1.3) were associated with increased likelihood of PASC diagnosis or care at a long-COVID clinic. Characteristics associated with a lower likelihood of PASC diagnosis or care at a long-COVID clinic included younger age (18 to 29 years), male sex, non-Hispanic Black race, and comorbidities such as substance abuse, cardiomyopathy, psychosis, and dementia. More doctors per capita in the county of residence was associated with an increased likelihood of PASC diagnosis or care at a long-COVID clinic. Our findings were consistent in sensitivity analyses using a variety of analytic techniques and approaches to select controls. Conclusions This national study identified important risk factors for PASC diagnosis such as middle age, severe COVID-19 disease, and specific comorbidities. Further clinical and epidemiological research is needed to better understand underlying mechanisms and the potential role of vaccines and therapeutics in altering PASC course.

Journal Article

Share this book

Add to My Shelf

Insights from an N3C RECOVER EHR-based cohort study characterizing SARS-CoV-2 reinfections and Long COVID

by Patel, Saaya , Zhou, Andrea , Loomba, Johanna in 692/308/174 , 692/308/53 , Antigens

2024

Background Although the COVID-19 pandemic has persisted for over 3 years, reinfections with SARS-CoV-2 are not well understood. We aim to characterize reinfection, understand development of Long COVID after reinfection, and compare severity of reinfection with initial infection. Methods We use an electronic health record study cohort of over 3 million patients from the National COVID Cohort Collaborative as part of the NIH Researching COVID to Enhance Recovery Initiative. We calculate summary statistics, effect sizes, and Kaplan–Meier curves to better understand COVID-19 reinfections. Results Here we validate previous findings of reinfection incidence (6.9%), the occurrence of most reinfections during the Omicron epoch, and evidence of multiple reinfections. We present findings that the proportion of Long COVID diagnoses is higher following initial infection than reinfection for infections in the same epoch. We report lower albumin levels leading up to reinfection and a statistically significant association of severity between initial infection and reinfection (chi-squared value: 25,697, p -value: <0.0001) with a medium effect size (Cramer’s V : 0.20, DoF = 3). Individuals who experienced severe initial and first reinfection were older in age and at a higher mortality risk than those who had mild initial infection and reinfection. Conclusions In a large patient cohort, we find that the severity of reinfection appears to be associated with the severity of initial infection and that Long COVID diagnoses appear to occur more often following initial infection than reinfection in the same epoch. Future research may build on these findings to better understand COVID-19 reinfections. Plain language summary More than three years after the start of the COVID-19 pandemic, individuals are frequently reporting multiple COVID-19 infections. However, these reinfections remain poorly understood. Here, we investigate COVID-19 reinfections in a large electronic health record cohort of over 3 million patients. We use data summary techniques and statistical tests to characterize reinfections and their relationships with disease severity, biomarkers, and Long COVID. We find that individuals with severe initial infection are more likely to experience severe reinfection, that some protein levels are lower, leading to reinfection, and that a lower proportion of individuals are diagnosed with Long COVID following reinfection than initial infection. Our work highlights the prevalence and impact of reinfections and suggests the need for further research. Hadley et al. characterize COVID-19 re-infections utilizing electronic health record study cohort data of over 3 million patients. They find severe initial COVID-19 infection linked to severe reinfections and less frequent long COVID diagnosis after reinfection.

Journal Article

Share this book

Add to My Shelf

Identifying commonalities and differences between EHR representations of PASC and ME/CFS in the RECOVER EHR cohort

by Seltzer, Jaime , Powers, John P. , Hornig, Mady in 692/699 , 692/700/139 , Chronic fatigue syndrome

2025

Background Shared symptoms and biological abnormalities between post-acute sequelae of SARS-CoV-2 infection (PASC) and myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) could suggest common pathophysiological bases and would support coordinated treatment efforts. Empirical studies comparing these syndromes are needed to better understand their commonalities and differences. Methods We analyzed electronic health record data from 6.5 million adult patients from the National COVID Cohort Collaborative. PASC and ME/CFS diagnostic groups were defined based on recorded diagnoses, and other recorded conditions within the two groups were used to train separate machine learning-driven computable phenotypes (CPs). The most predictive conditions for each CP were examined and compared, and the overlap of patients labeled by each CP was examined. Condition records from the diagnostic groups were also used to statistically derive condition clusters. Rates of subphenotypes based on these clusters were compared between PASC and ME/CFS groups. Results Approximately half of patients labeled by one CP are also labeled by the other. Dyspnea, fatigue, and cognitive impairment are the most-predictive conditions shared by both CPs, whereas other most-predictive conditions are specific to one CP. Recorded conditions separate into cardiopulmonary, neurological, and comorbidity clusters, with the cardiopulmonary cluster showing partial specificity for the PASC groups. Conclusions Data-driven approaches indicate substantial overlap in the condition records associated with PASC and ME/CFS diagnoses. Nevertheless, cardiopulmonary conditions are somewhat more commonly associated with PASC diagnosis, whereas other conditions, such as pain and sleep disturbances, are more associated with ME/CFS diagnosis. These findings suggest that symptom management approaches to these illnesses could overlap. Plain language summary Post-acute sequelae of SARS-CoV-2 infection (PASC; also known as Long COVID) and myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) appear to share much in common. Understanding their similarities and differences could help to guide treatment for these complex illnesses. We analyzed data from 6.5 million adult patients from the National COVID Cohort Collaborative to evaluate patterns in their health records. We find several conditions associated with both PASC and ME/CFS diagnoses, such as difficulty breathing, fatigue, and concentration difficulties. We also find some differences. Cardiac and respiratory conditions are more typical with PASC diagnoses. Records of pain, sleep disturbances, and neuropsychiatric conditions more commonly accompany ME/CFS diagnoses. Overall, the similarities we see could support overlapping symptom management approaches across these illnesses. Powers et al. investigate commonalities and differences between electronic health record patterns associated with post-acute sequelae of SARS-CoV-2 infection and myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS). Records of dyspnea, fatigue, and cognitive impairment are common diagnoses and recommend common symptom management.

Journal Article

Share this book

Add to My Shelf

Finding Long-COVID: temporal topic modeling of electronic health records from the N3C and RECOVER programs

by Madlock-Brown, Charisse , Zhou, Andrea , Assaf, Gina S. in 631/114 , 692/699 , Biomedicine

2024

Post-Acute Sequelae of SARS-CoV-2 infection (PASC), also known as Long-COVID, encompasses a variety of complex and varied outcomes following COVID-19 infection that are still poorly understood. We clustered over 600 million condition diagnoses from 14 million patients available through the National COVID Cohort Collaborative (N3C), generating hundreds of highly detailed clinical phenotypes. Assessing patient clinical trajectories using these clusters allowed us to identify individual conditions and phenotypes strongly increased after acute infection. We found many conditions increased in COVID-19 patients compared to controls, and using a novel method to associate patients with clusters over time, we additionally found phenotypes specific to patient sex, age, wave of infection, and PASC diagnosis status. While many of these results reflect known PASC symptoms, the resolution provided by this unprecedented data scale suggests avenues for improved diagnostics and mechanistic understanding of this multifaceted disease.

Journal Article

Share this book

Add to My Shelf

Clinical Data: Sources and Types, Regulatory Constraints, Applications

by Robinson, Peter N. , Tatonetti, Nicholas , Pfaff, Emily R. in Accountability , Agreements , Asthma

2019

Background Recognizing the need to respect and protect patient privacy, numerous regulations have been established to govern the use of clinical data by researchers, including the federal Health Insurance Portability and Accountability Act of 1996 (HIPAA) and the European Union General Data Protection Regulation. Institution‐specific guidelines and governing bodies such as institutional review boards (IRBs) also address research involving patient data and other sensitive data available in electronic medical records (e.g., administrative data), in part as a result of concerns regarding the liability of healthcare providers and institutions. Clinical data types, regulatory access restrictions, and applications Clinical data type Brief description Regulatory access restrictions Applications Fully identified clinical data sets Observational patient data derived from paper‐based or electronic medical records IRB approval is required; an executed data use agreement is possibly required a Clinical interpretation and scientific inference and discovery HIPAA‐limited clinical data sets Observational patient data containing only a limited set of HIPAA‐defined PHI IRB approval is required; an executed data use agreement is possibly required a Clinical interpretation and scientific inference and discovery, but with the understanding that certain data elements have been removed from the data and/or transformed Deidentified clinical data sets Observational patient data, but with all HIPAA‐defined PHI elements removed IRB approval is not required b; IRB “Request for Determination of Human Subjects Research” is typically recommended; an executed data use agreement is possibly required Clinical interpretation and scientific inference and discovery, but with the understanding that inferences regarding time and potentially other factors cannot be made HuSH+ clinical data sets Observational patient data, fully compliant with HIPAA Safe Harbor, but unlike deidentified clinical data sets, HuSH+ clinical data sets have been altered such that (i) real patient identifiers (including geocodes) have been replaced with random patient identifiers and (ii) dates (including birth dates) have been shifted by a random number of days (maximum of ± 50 days), with all dates for a given patient shifted by the same number of days Data are derived from UNC Health Care System An executed data use agreement is required c Clinical interpretation and scientific inference and discovery, but with the understanding that any inferences based on date/time and location (geocode) cannot be made with precision, and all other inferences must consider date/time and location as potentially hidden covariates Clinical profiles Statistical profiles of disease and associated phenotypic presentation derived from observational patient data Data are derived from Johns Hopkins Medicine IRB approval is required to generate clinical profiles; no other restrictions apply Clinical interpretation and scientific inference, but with the understanding that the data represent statistical profiles Synthetic clinical data sets Realistic, but not real, observational patient data generated statistically using population distributions of observational patient data None Feasibility assessments and algorithm validation; generation of clinical profiles COHD Counts of observational clinical co‐occurrences (e.g., co‐occurrences of specific diagnoses and prescribed medications), as well as their relative frequency and observed–expected frequency ratio Data are derived from Columbia University Irving Medical Center None Clinical interpretation and scientific inference, but with the understanding that the data are restricted to co‐occurrences ICEES Patient‐level or visit‐level counts of observational patient data integrated at the patient and visit level with a variety of environmental exposures derived from multiple public data sources Data are derived from UNC Health Care System and a variety of public data sources on environmental exposures IRB approval is required to generate ICEES integrated feature tables; no other restrictions apply Clinical interpretation and scientific inference, but with the understanding that the raw data have been transformed (e.g., binned or categorized) COHD, Columbia Open Health Data; HIPAA, Health Insurance Portability and Accountability Act; HuSH+, HIPAA Safe Harbor Plus; ICEES, Integrated Clinical and Environmental Exposures Service; IRB, institutional review board; PHI, protected health information; UNC, University of North Carolina. aIndividual institutions may require a secure workspace for data access and use. bWhile HIPAA and IRB regulations do not apply, institutional approvals may be required. cHuSH+ clinical data sets were conceptualized and created by UNC as part of the National Center for Advancing Translational Sciences–funded Biomedical Data Translator program. The tables are generated using PHI (geocodes and dates), but the data are then binned or recoded and stripped of PHI. [...]the ICEES pipeline must be executed under an approved IRB protocol, but subsequent steps are not subject to IRB regulation, and ICEES is publicly accessible via an Application Programming Interface.

Journal Article

Share this book

Add to My Shelf

Ensuring a safe(r) harbor: Excising personally identifiable information from structured electronic health record data

by Kostka, Kristin , Pfaff, Emily R. , Palchuk, Matvey B. in Codes , data privacy , Data warehouses

2022

Recent findings have shown that the continued expansion of the scope and scale of data collected in electronic health records are making the protection of personally identifiable information (PII) more challenging and may inadvertently put our institutions and patients at risk if not addressed. As clinical terminologies expand to include new terms that may capture PII (e.g., Patient First Name, Patient Phone Number), institutions may start using them in clinical data capture (and in some cases, they already have). Once in use, PII-containing values associated with these terms may find their way into laboratory or observation data tables via extract-transform-load jobs intended to process structured data, putting institutions at risk of unintended disclosure. Here we aim to inform the informatics community of these findings, as well as put out a call to action for remediation by the community.

Journal Article

Share this book

Add to My Shelf

Using a Patient Portal to Increase Enrollment in a Newborn Screening Research Study: Observational Study

by Gwaltney, Angela , Duparc, Martin , Andrews, Sara M in Original Paper

2022

Many research studies fail to enroll enough research participants. Patient-facing electronic health record applications, known as patient portals, may be used to send research invitations to eligible patients. The first aim was to determine if receipt of a patient portal research recruitment invitation was associated with enrollment in a large ongoing study of newborns (Early Check). The second aim was to determine if there were differences in opening the patient portal research recruitment invitation and study enrollment by race and ethnicity, age, or rural/urban home address. We used a computable phenotype and queried the health care system's clinical data warehouse to identify women whose newborns would likely be eligible. Research recruitment invitations were sent through the women's patient portals. We conducted logistic regressions to test whether women enrolled their newborns after receipt of a patient portal invitation and whether there were differences by race and ethnicity, age, and rural/urban home address. Research recruitment invitations were sent to 4510 women not yet enrolled through their patient portals between November 22, 2019, through March 5, 2020. Among women who received a patient portal invitation, 3.6% (161/4510) enrolled their newborns within 27 days. The odds of enrolling among women who opened the invitation was nearly 9 times the odds of enrolling among women who did not open their invitation (SE 3.24, OR 8.86, 95% CI 4.33-18.13; P<.001). On average, it took 3.92 days for women to enroll their newborn in the study, with 64% (97/161) enrolling their newborn within 1 day of opening the invitation. There were disparities by race and urbanicity in enrollment in the study after receipt of a patient portal research invitation but not by age. Black women were less likely to enroll their newborns than White women (SE 0.09, OR 0.29, 95% CI 0.16-0.55; P<.001), and women in urban zip codes were more likely to enroll their newborns than women in rural zip codes (SE 0.97, OR 3.03, 95% CI 1.62-5.67; P=.001). Black women (SE 0.05, OR 0.67, 95% CI 0.57-0.78; P<.001) and Hispanic women (SE 0.07, OR 0.73, 95% CI 0.60-0.89; P=.002) were less likely to open the research invitation compared to White women. Patient portals are an effective way to recruit participants for research studies, but there are substantial racial and ethnic disparities and disparities by urban/rural status in the use of patient portals, the opening of a patient portal invitation, and enrollment in the study. ClinicalTrials.gov NCT03655223; https://clinicaltrials.gov/ct2/show/NCT03655223.

Journal Article

Share this book

Add to My Shelf

Characteristics, Outcomes, and Severity Risk Factors Associated With SARS-CoV-2 Infection Among Children in the US National COVID Cohort Collaborative

by Girvin, Andrew T. , Gersing, Ken R. , Neumann, Andrew J. in Adolescent , Age Distribution , Child

2022

Understanding of SARS-CoV-2 infection in US children has been limited by the lack of large, multicenter studies with granular data. To examine the characteristics, changes over time, outcomes, and severity risk factors of children with SARS-CoV-2 within the National COVID Cohort Collaborative (N3C). A prospective cohort study of encounters with end dates before September 24, 2021, was conducted at 56 N3C facilities throughout the US. Participants included children younger than 19 years at initial SARS-CoV-2 testing. Case incidence and severity over time, demographic and comorbidity severity risk factors, vital sign and laboratory trajectories, clinical outcomes, and acute COVID-19 vs multisystem inflammatory syndrome in children (MIS-C), and Delta vs pre-Delta variant differences for children with SARS-CoV-2. A total of 1 068 410 children were tested for SARS-CoV-2 and 167 262 test results (15.6%) were positive (82 882 [49.6%] girls; median age, 11.9 [IQR, 6.0-16.1] years). Among the 10 245 children (6.1%) who were hospitalized, 1423 (13.9%) met the criteria for severe disease: mechanical ventilation (796 [7.8%]), vasopressor-inotropic support (868 [8.5%]), extracorporeal membrane oxygenation (42 [0.4%]), or death (131 [1.3%]). Male sex (odds ratio [OR], 1.37; 95% CI, 1.21-1.56), Black/African American race (OR, 1.25; 95% CI, 1.06-1.47), obesity (OR, 1.19; 95% CI, 1.01-1.41), and several pediatric complex chronic condition (PCCC) subcategories were associated with higher severity disease. Vital signs and many laboratory test values from the day of admission were predictive of peak disease severity. Variables associated with increased odds for MIS-C vs acute COVID-19 included male sex (OR, 1.59; 95% CI, 1.33-1.90), Black/African American race (OR, 1.44; 95% CI, 1.17-1.77), younger than 12 years (OR, 1.81; 95% CI, 1.51-2.18), obesity (OR, 1.76; 95% CI, 1.40-2.22), and not having a pediatric complex chronic condition (OR, 0.72; 95% CI, 0.65-0.80). The children with MIS-C had a more inflammatory laboratory profile and severe clinical phenotype, with higher rates of invasive ventilation (117 of 707 [16.5%] vs 514 of 8241 [6.2%]; P < .001) and need for vasoactive-inotropic support (191 of 707 [27.0%] vs 426 of 8241 [5.2%]; P < .001) compared with those who had acute COVID-19. Comparing children during the Delta vs pre-Delta eras, there was no significant change in hospitalization rate (1738 [6.0%] vs 8507 [6.2%]; P = .18) and lower odds for severe disease (179 [10.3%] vs 1242 [14.6%]) (decreased by a factor of 0.67; 95% CI, 0.57-0.79; P < .001). In this cohort study of US children with SARS-CoV-2, there were observed differences in demographic characteristics, preexisting comorbidities, and initial vital sign and laboratory values between severity subgroups. Taken together, these results suggest that early identification of children likely to progress to severe disease could be achieved using readily available data elements from the day of admission. Further work is needed to translate this knowledge into improved outcomes.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter