Cross-Representation Benchmarking in Time-Series Electronic Health Records for Clinical Outcome Prediction

2510.09159v1 cs.LG, cs.AI, cs.DB 2025-10-14

Authors:

Tianyi Chen, Mingcheng Zhu, Zhiyao Luo, Tingting Zhu

Abstract

Electronic Health Records (EHRs) enable deep learning for clinical prediction, but the best way to represent patient data remains unclear because evaluation practices are inconsistent. We present the first systematic benchmark comparing EHR representation methods: multivariate time-series, event streams, and textual event streams for LLMs. The benchmark standardises data curation and evaluation across two distinct clinical settings: the MIMIC-IV dataset for ICU tasks (mortality, phenotyping) and the EHRSHOT dataset for longitudinal care (30-day readmission, 1-year pancreatic cancer). For each paradigm, we evaluate appropriate modelling families (Transformers, MLPs, LSTMs, and RETAIN for time-series; CLMBR and count-based models for event streams; LLMs with 8-20B parameters for textual streams) and analyse the impact of feature pruning based on data missingness. Our experiments reveal that event stream models consistently deliver the strongest performance. Pre-trained models such as CLMBR are highly sample-efficient in few-shot settings, though simpler count-based models can be competitive given sufficient data. Furthermore, we find that feature selection strategies must be adapted to the clinical setting: pruning sparse features improves ICU predictions, while retaining them is critical for longitudinal tasks. Our results, enabled by a unified and reproducible pipeline, provide practical guidance for selecting EHR representations based on the clinical context and data regime.
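
To make the three representation paradigms and the missingness-based pruning step concrete, the following is a minimal sketch on toy data. It is not the paper's pipeline: the event tuples, the function names (`to_time_series`, `to_count_features`, `to_text`, `prune_sparse`), the hourly grid, the text format, and the 0.5 missingness threshold are all assumptions chosen for illustration.

```python
from collections import Counter

# Hypothetical toy events for one patient: (hour, code, value).
events = [
    (0, "heart_rate", 88.0),
    (0, "lactate", 2.1),
    (1, "heart_rate", 92.0),
    (3, "heart_rate", 95.0),
]


def to_time_series(events, features, n_hours):
    """Multivariate time-series view: dense hourly grid, None = missing."""
    grid = [[None] * len(features) for _ in range(n_hours)]
    for hour, code, value in events:
        if code in features and 0 <= hour < n_hours:
            grid[hour][features.index(code)] = value
    return grid


def to_count_features(events, vocabulary):
    """Count-based event-stream view: how often each code occurs."""
    counts = Counter(code for _, code, _ in events)
    return [counts[code] for code in vocabulary]


def to_text(events):
    """Textual event-stream view: events in time order, one per line."""
    return "\n".join(
        f"t={hour}h {code}={value}" for hour, code, value in sorted(events)
    )


def missing_rate(grid, column):
    """Fraction of time steps with no value in the given column."""
    return sum(row[column] is None for row in grid) / len(grid)


def prune_sparse(features, grid, max_missing):
    """Keep features whose missing rate does not exceed max_missing."""
    return [
        name
        for i, name in enumerate(features)
        if missing_rate(grid, i) <= max_missing
    ]


features = ["heart_rate", "lactate"]
grid = to_time_series(events, features, n_hours=4)
counts = to_count_features(events, features)
text = to_text(events)
kept = prune_sparse(features, grid, max_missing=0.5)  # assumed threshold
```

In this toy case the grid view leaves `lactate` missing in three of four hours, so pruning at the assumed threshold drops it, while the count and text views carry no explicit missing entries at all. This is one way to see why the abstract reports that the effect of pruning sparse features depends on the clinical setting.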

