DILA Model Implementation by sanjanasarkar · Pull Request #1054 · sunlabuiuc/PyHealth

sanjanasarkar · 2026-04-21T02:33:03Z

Contributors

David Mendoza (dmendo24@illinois.edu)
Alex Rau (arau4@illinois.edu)
Sanjana Sarkar (ssarkar8@illinois.edu)

Type of Contribution

Model (Option 2)

Link to Original Paper

DILA: Dictionary Label Attention for Mechanistic Interpretability in High-Dimensional Multi-Label Medical Coding Prediction (https://arxiv.org/pdf/2409.10504)

High-level Description

This PR implements the Dictionary Label Attention (DILA) model for multi-label clinical classification tasks, such as ICD coding. DILA provides mechanistic interpretability by disentangling dense embeddings into sparse, human-auditable medical concepts.

File Guide

docs/api/models.rst
docs/api/models/pyhealth.models.dila.rst: API documentation for the model.
pyhealth/models/dila.py: Main model class integrating PLM, SAE, and Attention modules.
pyhealth/models/dila_sparse_autoencoder.py: SAE implementation with centering and elastic-net loss.
pyhealth/models/dila_dict_label_attention.py: Label attention logic and ICD projection initialization.
tests/core/test_dila.py: Fast unit tests using synthetic tensors.
examples/mimic3_icd_coding_dila.py: Script for hyperparameter ablation studies.
examples/dila_mimic3_evaluation.ipynb: Notebook for visualization and evaluation.
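
For readers unfamiliar with the sparse autoencoder stage, the following is a minimal NumPy sketch of what "centering and elastic-net loss" could look like. It is not the code in pyhealth/models/dila_sparse_autoencoder.py. The function names, the choice of subtracting the decoder bias as the centering step, and the penalty weights `l1` and `l2` are all assumptions made for illustration.

```python
import numpy as np


def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    """Encode centered embeddings into non-negative features, then reconstruct.

    Assumption: "centering" means subtracting the decoder bias before encoding.
    """
    centered = x - b_dec
    f = np.maximum(centered @ W_enc + b_enc, 0.0)  # ReLU keeps features non-negative
    x_hat = f @ W_dec + b_dec
    return f, x_hat


def elastic_net_loss(x, x_hat, f, l1=1e-3, l2=1e-4):
    """Reconstruction error plus L1 and squared-L2 penalties on the features.

    The weights l1 and l2 are hypothetical defaults, not values from the PR.
    """
    recon = np.mean(np.sum((x - x_hat) ** 2, axis=-1))
    l1_penalty = np.mean(np.sum(np.abs(f), axis=-1))
    l2_penalty = np.mean(np.sum(f ** 2, axis=-1))
    return recon + l1 * l1_penalty + l2 * l2_penalty


# Synthetic stand-ins for PLM token embeddings (shapes chosen arbitrarily).
rng = np.random.default_rng(0)
n_tokens, d_model, d_dict = 8, 16, 64
x = rng.normal(size=(n_tokens, d_model))
W_enc = rng.normal(scale=0.1, size=(d_model, d_dict))
W_dec = rng.normal(scale=0.1, size=(d_dict, d_model))
b_enc = np.zeros(d_dict)
b_dec = x.mean(axis=0)  # initialise the centering term at the data mean

f, x_hat = sae_forward(x, W_enc, b_enc, W_dec, b_dec)
loss = elastic_net_loss(x, x_hat, f)
```

The L1 term pushes most dictionary features toward zero for any given token, while the smaller squared-L2 term discourages a few features from growing very large.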

Implements the two-stage DILA (Dictionary Label Attention) pipeline:

- SparseAutoencoder for learning sparse dictionary features from PLM embeddings
- DictionaryLabelAttention for ICD-guided label attention mechanism
- DILA BaseModel integrating both stages with pretrain_sparse_autoencoder utility
- Unit and integration tests covering all three modules

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
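
As a rough illustration of the second stage, the sketch below shows one way sparse dictionary features could drive a label attention step. It is a hedged NumPy example, not the DictionaryLabelAttention class from this PR. The projection matrix `A_proj`, the per-label output weights `W_out` and `b_out`, and all shapes are hypothetical, and the random initialisation stands in for whatever ICD-guided initialisation the PR uses.

```python
import numpy as np


def softmax(z, axis):
    """Numerically stable softmax along the given axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def dictionary_label_attention(x, f, A_proj, W_out, b_out):
    """Attend over tokens per label using sparse features as attention keys.

    x: (n_tokens, d_model) dense token embeddings
    f: (n_tokens, d_dict) sparse dictionary features for the same tokens
    A_proj: (d_dict, n_labels) hypothetical feature-to-label projection
    """
    scores = f @ A_proj                 # (n_tokens, n_labels)
    attn = softmax(scores, axis=0)      # normalise over tokens for each label
    label_repr = attn.T @ x             # (n_labels, d_model)
    logits = np.sum(label_repr * W_out, axis=-1) + b_out
    return attn, logits


# Synthetic inputs; in the real pipeline f would come from the pretrained SAE.
rng = np.random.default_rng(1)
n_tokens, d_model, d_dict, n_labels = 6, 8, 12, 4
x = rng.normal(size=(n_tokens, d_model))
f = rng.random(size=(n_tokens, d_dict)) * (rng.random(size=(n_tokens, d_dict)) > 0.7)
A_proj = rng.normal(scale=0.1, size=(d_dict, n_labels))
W_out = rng.normal(scale=0.1, size=(n_labels, d_model))
b_out = np.zeros(n_labels)

attn, logits = dictionary_label_attention(x, f, A_proj, W_out, b_out)
probs = 1.0 / (1.0 + np.exp(-logits))   # independent sigmoid per label (multi-label)
```

Because each attention score is a sum over a few active dictionary features, a given label's attention to a token can be traced back to the specific features that contributed to it.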

dila model and ablation script

Add embeddings and sparse autoencoder implementation

feat: add DILA model for interpretable ICD coding

Dila eval

alexander-rau and others added 13 commits April 12, 2026 13:15

Merge remote-tracking branch 'origin/master' into david/dila-model

3fac329

completed DILA model subtask implementation and ablation study

5ba25ff

Merge pull request #3 from sanjanasarkar/david/dila-model

4f53a57

dila model and ablation script

Add embeddings and sparse autoencoder implementation

f0de163

Fixes to get test cases passing

8b131e3

revert deps changes

922c2d3

Add embeddings and sparse autoencoder implementation

725228a

Add embeddings and sparse autoencoder implementation

feat: add DILA model for interpretable ICD coding

0940d9d

feat: add DILA model for interpretable ICD coding

reformat to PEP8 with 88 char line length

21c8624

Add partial DILA eval on MIMIC-III dataset

8fc59e2

Re-add DILA notebook with state metadata

0877868

Merge pull request #4 from sanjanasarkar/DILA-eval

6e7cc31

Dila eval

sanjanasarkar wants to merge 13 commits into sunlabuiuc:master from sanjanasarkar:master

4 participants
