Skip to content

colla00/IDI-Multinational-Validation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Multinational External Validation of the Intensive Documentation Index for ICU Mortality Prediction

License: MIT PhysioNet MIMIC-IV PhysioNet HiRID

Overview

This repository contains all analysis code for:

Collier AM, Shalhout SZ. Intensive Documentation Index as an All-Cause Mortality Predictor in Critically Ill Patients: A Multi-Center External Validation Study. npj Digital Medicine (under review, 2026).

The Intensive Documentation Index (IDI) is a zero-burden mortality prediction framework derived exclusively from nursing documentation timestamps — no laboratory values, no imaging, no manual scoring required.


Key Results

Cohort Dataset N (model) Mortality AUROC AUPRC Brier
MIMIC-IV (HF, USA) Derivation 26,133 15.99% 0.6491 0.2530 0.1299
HiRID (All-ICU, Switzerland) External Validation 33,897 6.08% 0.9063 0.4546 0.1168
  • Calibration slope (MIMIC-IV): Baseline 1.07, IDI-Enhanced 1.05
  • Calibration slope (HiRID): 0.98 (near-perfect)
  • Zero data-entry burden: IDI uses only passively recorded timestamps
  • AUROC vs baseline (MIMIC-IV): +0.0249 (p < 0.001, DeLong test)
  • Split: 80/20 stratified random split (seed=42); train n=20,906, test n=5,227
  • Leakage-corrected: idi_events_24h and idi_events_per_hour excluded (|r|>0.30 with ICU LOS)

Repository Structure

IDI-Multinational-Validation/
├── README.md
├── LICENSE
├── .gitignore
├── requirements.txt
├── data/
│   └── README.md              # How to access MIMIC-IV and HiRID via PhysioNet
├── src/
│   ├── cohort_selection.py    # Inclusion/exclusion criteria → cohort.csv
│   ├── idi_features.py        # 9 IDI temporal features from timestamps
│   ├── leakage_filter.py      # Remove features with |r|>0.30 with ICU LOS
│   ├── model.py               # Train baseline + IDI-enhanced models
│   ├── hirid_validation.py    # External validation on HiRID cohort
│   └── metrics.py             # AUROC, DeLong, calibration slope, AUPRC, Brier
├── results/
│   ├── figures/               # ROC curves, calibration plots, feature importance
│   └── tables/                # Performance tables (CSV)
└── notebooks/
    ├── 01_cohort_selection.ipynb
    ├── 02_feature_engineering.ipynb
    ├── 03_model_training.ipynb
    ├── 04_hirid_validation.ipynb
    └── 05_figures.ipynb

Data Access

MIMIC-IV (v2.2):

https://physionet.org/content/mimiciv/2.2/ Requires PhysioNet credentialed access + CITI training.

HiRID (v1.1.1):

https://physionet.org/content/hirid/1.1.1/ Requires PhysioNet credentialed access + data use agreement.

No patient data is included in this repository. Place downloaded data in data/raw/ (excluded by .gitignore).


Installation

git clone https://github.com/colla00/IDI-Multinational-Validation.git
cd IDI-Multinational-Validation
pip install -r requirements.txt

Usage

# 1. Build cohorts
python src/cohort_selection.py --mimic /data/raw/mimic-iv --hirid /data/raw/hirid

# 2. Extract IDI features
python src/idi_features.py --cohort data/processed/mimic_cohort.csv

# 3. Apply leakage filter (remove |r|>0.30 with ICU LOS)
python src/leakage_filter.py

# 4. Train and evaluate models (80/20 stratified random split, seed=42)
python src/model.py --split random --seed 42

# 5. HiRID external validation
python src/hirid_validation.py

# 6. Compute all metrics
python src/metrics.py

Citation

@article{collier2026npj,
  title   = {Intensive Documentation Index as an All-Cause Mortality Predictor
             in Critically Ill Patients: A Multi-Center External Validation Study},
  author  = {Collier, Alexis M. and Shalhout, Sophia Z.},
  journal = {npj Digital Medicine},
  year    = {2026},
  note    = {Under review},
  doi     = {10.5281/zenodo.18943883}
}

Competing Interests & Patents

Dr. Collier is Founder and CEO of VitaSignal LLC. This work relates to:

  • USPTO Provisional App. No. 63/976,293 — NurseRhythm IDI Engine (filed Feb 2026)
  • USPTO Provisional App. No. 63/946,187 — CDS-EQUITY (filed Dec 2025)
  • USPTO Provisional App. No. 63/932,953 — CRIS-E (filed Dec 2025)

Licensing: info@vitasignal.ai | NIH Award No. 1OT2OD032581 (Bayh-Dole applies)


License

MIT License — see LICENSE for details.

About

Multinational External Validation of the Intensive Documentation Index for ICU Mortality Prediction

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages