U.S Department of Health & Human ServicesHHSNational Institutes of HealthNIHNCATSNCATSCTSA ProgramCTSA
CTSA CCOS

Coordination, Communication, and Operations Support

RegisterLog In
CTSA Groups and Meetings
Need Help
Log in to view meeting materials such as meeting slides and recordings

2025 Fall CTSA Program Annual Meeting

CTSA Banner Image

2025 Fall CTSA Program 

Annual Meeting

A foundational transformer leveraging full-night, multichannel sleep study representations estimate cardiovascular mortality risk

Hub not available

Abstract

Title: A Foundational Transformer Leveraging Full-Night, Multichannel Sleep Study Representations to Estimate Cardiovascular Mortality Risk
Authors: Benjamin Fox, Sajila Wickramaratne, Ankit Parekh, Girish Nadkarni
Affiliations: Icahn School of Medicine at Mount Sinai
Objective:
This study introduces PSG-JEPA, a foundational transformer model trained on multichannel, full-night polysomnography (PSG) data using a Joint Embedding Predictive Architecture (JEPA). The model aims to learn robust, task-agnostic sleep representations to estimate 10-year cardiovascular (CV) mortality risk.
Background:
Sleep disorders are linked to several leading causes of death, including cardiovascular disease. PSG data, when combined with advanced AI/ML techniques, can yield meaningful representations for clinical risk prediction. JEPA, a recent self-supervised learning approach, offers a promising framework for learning from complex signal data.
Methods:
The model was trained on 33,551 PSG studies (~1.9 million hours) from four large datasets: the Human Sleep Project, APPLES, MESA, and Mt. Sinai’s Sleep Data Warehouse. Each study included seven signal channels (EEG, EOG, EMG, ECG, SpO₂, thoracic and abdominal respiratory rates), resampled to 128 Hz and segmented into 3-second patches. A depthwise convolutional encoder embedded these into a 768-dimensional space. JEPA was used to predict masked target patches from context patches using a mean squared error loss.
An attentive classifier was trained on the learned representations using the SHHS and MrOS datasets (n=8,693) to estimate CV mortality risk, with time discretized into yearly bins over a 10-year horizon. The Cox proportional hazards model was used as the loss function.
Results:

PSG-JEPA achieved a mean AUROC of 0.68 and an integrated Brier score of 0.04 on the held-out test set.
Risk scores showed strong calibration and discrimination. Kaplan-Meier curves demonstrated statistically significant survival differences across risk quartiles (p < 0.0001).
Individuals with an apnea-hypopnea index (AHI) ≥ 5 had significantly lower survival probabilities than those with AHI < 5 (p < 0.01).
The model generalized well without fine-tuning, indicating its potential for broader clinical applications.

Conclusions:
PSG-JEPA is the first JEPA-based model to learn full-night, multichannel PSG representations for CV mortality risk prediction. Its ability to generate meaningful, task-agnostic features without fine-tuning highlights its utility for downstream clinical tasks. Future directions include evaluating performance on other outcomes (e.g., stroke, MI, sleep staging) and exploring sub-cohorts such as CPAP-treated patients.
Funding: K25HL151912, R01HL171813, R21HL165320, TL1TR004420

Authors

First Author

Benjamin Fox, PhD

ben.fox@icahn.mssm.edu

Contributing Authors

Sajila Wickramaratne, PhD

Ankit Parekh, PhD

Girish Nadkarni, MD, MPH

Poster

null Poster

Coordination, Communication, and Operations Support (CCOS) is funded by theNational Center for Advancing Translational Sciences, National Institutes of Health.

HomeContact UsPrivacy PolicyAccessibility Statement
Freedom of Information (FOIA)Office of Inspector General (OIG)Cookie Preference