Journal Article DZNE-2024-01332

http://join2-wiki.gsi.de/foswiki/pub/Main/Artwork/join2_logo100x88.png
An open-source framework for end-to-end analysis of electronic health record data.

 ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;  ;

2024
Nature America Inc. New York, NY

Nature medicine 30(11), 3369 - 3380 () [10.1038/s41591-024-03214-0]

This record in other databases:    

Please use a persistent id in citations: doi:

Abstract: With progressive digitalization of healthcare systems worldwide, large-scale collection of electronic health records (EHRs) has become commonplace. However, an extensible framework for comprehensive exploratory analysis that accounts for data heterogeneity is missing. Here we introduce ehrapy, a modular open-source Python framework designed for exploratory analysis of heterogeneous epidemiology and EHR data. ehrapy incorporates a series of analytical steps, from data extraction and quality control to the generation of low-dimensional representations. Complemented by rich statistical modules, ehrapy facilitates associating patients with disease states, differential comparison between patient clusters, survival analysis, trajectory inference, causal inference and more. Leveraging ontologies, ehrapy further enables data sharing and training EHR deep learning models, paving the way for foundational models in biomedical research. We demonstrate ehrapy's features in six distinct examples. We applied ehrapy to stratify patients affected by unspecified pneumonia into finer-grained phenotypes. Furthermore, we reveal biomarkers for significant differences in survival among these groups. Additionally, we quantify medication-class effects of pneumonia medications on length of stay. We further leveraged ehrapy to analyze cardiovascular risks across different data modalities. We reconstructed disease state trajectories in patients with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) based on imaging data. Finally, we conducted a case study to demonstrate how ehrapy can detect and mitigate biases in EHR data. ehrapy, thus, provides a framework that we envision will standardize analysis pipelines on EHR data and serve as a cornerstone for the community.

Keyword(s): Humans (MeSH) ; Electronic Health Records (MeSH) ; COVID-19: epidemiology (MeSH) ; SARS-CoV-2 (MeSH) ; Pneumonia: epidemiology (MeSH) ; Deep Learning (MeSH)

Classification:

Contributing Institute(s):
  1. Aging and Immunity (AG Aschenbrenner)
Research Program(s):
  1. 354 - Disease Prevention and Healthy Aging (POF4-354) (POF4-354)

Appears in the scientific report 2024
Database coverage:
Medline ; Creative Commons Attribution CC BY 4.0 ; OpenAccess ; BIOSIS Previews ; Biological Abstracts ; Clarivate Analytics Master Journal List ; Current Contents - Clinical Medicine ; Current Contents - Life Sciences ; DEAL Nature ; Ebsco Academic Search ; Essential Science Indicators ; IF >= 80 ; JCR ; NationallizenzNationallizenz ; SCOPUS ; Science Citation Index Expanded ; Web of Science Core Collection
Click to display QR Code for this record

The record appears in these collections:
Institute Collections > BN DZNE > BN DZNE-AG Aschenbrenner
Document types > Articles > Journal Article
Full Text Collection
Public records
Publications Database

 Record created 2024-11-19, last modified 2024-11-26


OpenAccess:
Download fulltext PDF Download fulltext PDF (PDFA)
External link:
Download fulltextFulltext by Pubmed Central
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)