Add MixLSTM model for temporally shifting clinical time-series (Oh et al. 2019) #1127
Open
amanluth03 wants to merge 19 commits into sunlabuiuc:master from
Conversation
Contributors
aluth3 — aluth3@illinois.edu
tmitta3 — tmitta3@illinois.edu
siddesh2 — siddesh2@illinois.edu
Type of Contribution
Model (Option 2)
Paper
Jeeheh Oh, Jiaxuan Wang, Shengpu Tang, Michael Sjoding, Jenna Wiens.
"Relaxed Parameter Sharing: Effectively Modeling Time-Varying Relationships in Clinical Time-Series." MLHC 2019.
https://arxiv.org/abs/1906.02898
Description
This PR implements the MixLSTM architecture from Oh et al. (2019), which addresses temporal conditional shift in clinical time-series prediction.
Standard LSTMs share the same parameters across all time steps, which makes it
difficult to capture relationships between inputs and outcomes that change over
the course of a patient's hospital stay. MixLSTM relaxes this constraint by
maintaining K independent LSTM cells whose parameters are dynamically combined at each time step via learned mixing coefficients constrained to the simplex.
This allows the model to smoothly transition between different temporal dynamics
without enforcing hard segment boundaries.
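The per-timestep parameter mixing described above can be sketched as follows. This is an illustrative reimplementation, not the PR's actual code: the class name, initialization scale, and the choice of a learned per-timestep logit table for the mixing coefficients are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixLSTMCellSketch(nn.Module):
    """Sketch of relaxed parameter sharing: K independent LSTM parameter
    sets, mixed at each time step by simplex-constrained coefficients."""

    def __init__(self, input_dim: int, hidden_dim: int, K: int, T: int):
        super().__init__()
        self.hidden_dim = hidden_dim
        # K parameter sets for the 4 stacked LSTM gates (i, f, g, o).
        self.W_ih = nn.Parameter(torch.randn(K, 4 * hidden_dim, input_dim) * 0.1)
        self.W_hh = nn.Parameter(torch.randn(K, 4 * hidden_dim, hidden_dim) * 0.1)
        self.bias = nn.Parameter(torch.zeros(K, 4 * hidden_dim))
        # Unconstrained logits per time step; softmax keeps the mixing
        # coefficients on the simplex (non-negative, summing to 1).
        self.mix_logits = nn.Parameter(torch.zeros(T, K))

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (B, T, input_dim)
        B, T, _ = x.shape
        h = x.new_zeros(B, self.hidden_dim)
        c = x.new_zeros(B, self.hidden_dim)
        outs = []
        for t in range(T):
            lam = F.softmax(self.mix_logits[t], dim=-1)        # (K,)
            # Convex combination of the K parameter sets for this step.
            W_ih = torch.einsum("k,kij->ij", lam, self.W_ih)
            W_hh = torch.einsum("k,kij->ij", lam, self.W_hh)
            b = torch.einsum("k,kj->j", lam, self.bias)
            gates = x[:, t] @ W_ih.T + h @ W_hh.T + b
            i, f, g, o = gates.chunk(4, dim=-1)
            c = torch.sigmoid(f) * c + torch.sigmoid(i) * torch.tanh(g)
            h = torch.sigmoid(o) * torch.tanh(c)
            outs.append(h)
        return torch.stack(outs, dim=1)                        # (B, T, hidden_dim)
```

Because the coefficients are produced by a softmax, adjacent time steps can blend the K cells smoothly rather than switching between them at hard segment boundaries.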
The implementation inherits from PyHealth's BaseModel and dynamically infers input dimensions and sequence length from any SampleDataset passed to it, so it can plug into existing PyHealth tasks without modification. It supports both per-timestep regression (output shape (B, T, 1)) and classification (output shape (B, num_classes)), auto-detected from the dataset's output schema.
File Guide
pyhealth/models/mixlstm.py — MixLSTM model implementation
pyhealth/models/__init__.py — register MixLSTM for import
tests/mixlstm_test.py — unit tests using synthetic data
examples/mimic3_synthetic_mixlstm.py — ablation study example
docs/api/models/pyhealth.models.MixLSTM.rst — API documentation
docs/api/models.rst — add MixLSTM to the models index
pyproject.toml — project dependencies
Ablation Study Summary
We ran two ablations on a synthetic non-stationary time-series regression task
(1,000 sequences per split, T=30 timesteps, input_dim=3, 90% sparse inputs,
lookback l=10, drift δ=0.05 per step). Each configuration is evaluated via
random search (20 runs × 30 epochs) over MixLSTM with K=2 experts and hidden
size sampled from
{100, 150, 300, 500, 700, 900, 1100}.
1. Learning rate sweep (Adam): lr ∈ {0.0001, 0.0005, 0.001, 0.005, 0.01}
lr=0.0001 was the worst performer across all hidden sizes.
lr=0.001 (the paper's choice) was strongest for hidden sizes ≥ 500.
lr=0.005 performed best at smaller hidden sizes (100–500).
lr=0.01 was unstable and produced erratic loss curves.
2. Optimizer comparison (Adam vs. SGD at lr=0.001):
Adam dramatically outperforms SGD on this task — SGD fails to converge within
the 30-epoch budget. This is consistent with the paper's use of Adam as the
default optimizer.
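The synthetic non-stationary task described above (sparse inputs, weights drifting per step) could be generated along these lines. This is a hedged sketch, not the PR's actual script (which lives in examples/mimic3_synthetic_mixlstm.py); the function name and the random-walk drift model are assumptions, and the paper's lookback-window structure (l=10) is omitted for brevity.

```python
import numpy as np

def make_drifting_task(n_seq=1000, T=30, input_dim=3, sparsity=0.9,
                       drift=0.05, seed=0):
    """Toy non-stationary regression task: the true input->target weights
    drift a little each step, so the input/outcome relationship changes
    over time -- the temporal conditional shift MixLSTM is meant to model."""
    rng = np.random.default_rng(seed)
    # Sparse inputs: roughly `sparsity` of the entries are zeroed out.
    X = rng.normal(size=(n_seq, T, input_dim))
    X *= rng.random((n_seq, T, input_dim)) > sparsity
    # Time-varying true weights: a random walk with step scale `drift`.
    w = np.cumsum(rng.normal(scale=drift, size=(T, input_dim)), axis=0) + 1.0
    # Per-timestep regression targets, shape (n_seq, T, 1).
    y = np.einsum("bti,ti->bt", X, w)[..., None]
    return X.astype(np.float32), y.astype(np.float32), w
```

A time-shared LSTM has to fit a single averaged weight vector for this task, while a mixing model can track the drift, which is what makes it a useful ablation target.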
Outputs: The script produces six .png visualizations: loss vs. hidden size, predictions vs. ground truth on held-out test sequences, and heatmaps of
the synthetic task's time-varying weight distribution — for both the LR sweep
and the optimizer comparison. Full runtime is approximately 30–45 minutes on
CPU, faster on GPU.
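The random search described above (20 runs × 30 epochs over hidden size and learning rate) can be sketched as a simple loop. This is illustrative only: `train_and_eval` is a hypothetical callable standing in for the example script's training code, assumed to return a final validation loss.

```python
import random

def random_search(train_and_eval, n_runs=20, n_epochs=30, seed=0):
    """Sample a (hidden size, learning rate) configuration per run, train
    for a fixed epoch budget, and keep the best validation loss."""
    rng = random.Random(seed)
    hidden_sizes = [100, 150, 300, 500, 700, 900, 1100]
    lrs = [0.0001, 0.0005, 0.001, 0.005, 0.01]
    best = {"val_loss": float("inf")}
    for _ in range(n_runs):
        cfg = {"hidden": rng.choice(hidden_sizes), "lr": rng.choice(lrs)}
        # Hypothetical hook: trains MixLSTM with `cfg` for `n_epochs`
        # epochs and returns the final validation loss.
        val_loss = train_and_eval(cfg, n_epochs)
        if val_loss < best["val_loss"]:
            best = {"val_loss": val_loss, **cfg}
    return best
```

Random search over this small grid keeps the per-ablation budget fixed (20 runs × 30 epochs) regardless of how many hidden sizes or learning rates are in play.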