(Full path) Racial Disparities and Mistrust in End-of-Life Care by aaronx2-illinois · Pull Request #1061 · sunlabuiuc/PyHealth

aaronx2-illinois · 2026-04-21T14:28:40Z

Contributors:
Aaron Xing (Email: aaronx2@illinois.edu / NetID: aaronx2)
Amy Sui Hwang (Email: ahwang22@illinois.edu / NetID: ahwang22)
Patrick Yuxuan Wu (Email: pywu4@illinois.edu / NetID: pywu4)

Contribution Type:
Full path (Dataset + Task + Model + Paper Replication)

Original Paper:
Boag et al. (2018), Racial Disparities and Mistrust in End-of-Life Care
https://proceedings.mlr.press/v85/boag18a.html

Description:
This PR adds our replication of Racial Disparities and Mistrust in End-of-Life Care to PyHealth.

We implemented the full pipeline, including the dataset, downstream tasks, model, and tests.

Main additions:

EOLMistrustDataset(BaseDataset): a dataset wrapper built on the combined MIMIC-III CSV files
EOLMistrustDownstreamMIMIC3(BaseTask): a downstream task base class with 3 binary prediction tasks:
- Left AMA
- Code Status
- In-hospital Mortality
EOLMistrustClassifier(BaseModel): a multimodal classifier using sequence pooling, linear projection, hashed-text embedding, and an MLP
Research-pipeline utilities for cohort construction, mistrust proxy scoring, acuity analysis, downstream AUC evaluation, and managed result writing
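
To illustrate the "hashed-text embedding" idea mentioned for EOLMistrustClassifier, here is a minimal standard-library sketch of the hashing trick: tokens are mapped to a fixed number of buckets, and the bucket vectors are averaged. The function names (`token_bucket`, `make_embedding_table`, `embed_text`), the whitespace tokenizer, and the random table are all assumptions made for illustration; the classifier in this PR may differ in every one of these details.

```python
import hashlib
import random
from typing import List


def token_bucket(token: str, num_buckets: int) -> int:
    """Map a token to a stable bucket index.

    hashlib is used instead of the built-in hash(), which is salted per
    process and would give different buckets on every run.
    """
    digest = hashlib.md5(token.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_buckets


def make_embedding_table(num_buckets: int, dim: int, seed: int = 0) -> List[List[float]]:
    """Build a fixed random table (a stand-in for a learned embedding matrix)."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(num_buckets)]


def embed_text(text: str, table: List[List[float]]) -> List[float]:
    """Mean-pool the bucket vectors of all whitespace-separated tokens."""
    dim = len(table[0])
    tokens = text.lower().split()
    if not tokens:
        return [0.0] * dim  # empty text maps to the zero vector
    total = [0.0] * dim
    for tok in tokens:
        row = table[token_bucket(tok, len(table))]
        for i, value in enumerate(row):
            total[i] += value
    return [value / len(tokens) for value in total]


table = make_embedding_table(num_buckets=64, dim=8)
vector = embed_text("patient refused treatment", table)
```

Because the vocabulary is never stored, unseen tokens still land in some bucket; the trade-off is that unrelated tokens can collide.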

This PR supports two research paths, plus a PyHealth-native demo, all from the same entry point:

PyHealth-native demo
(BaseDataset -> set_task -> BaseModel -> Trainer.train/evaluate)
python examples/eol_mistrust_mortality_classifier.py --root EOL_Workspace/eol_mistrust_required_combined --task-demo --task-demo-train-eval
Normal (corrected) path
python examples/eol_mistrust_mortality_classifier.py --root EOL_Workspace/eol_mistrust_required_combined --repetitions 10
Paper-like (notebook-faithful) path
python examples/eol_mistrust_mortality_classifier.py --root EOL_Workspace/eol_mistrust_required_combined --paper-like-dataset-prepare --repetitions 10
Without MIMIC-III credentials
Synthetic/unit tests use generated pseudo data and do not require MIMIC-III credentials. Slow full-pipeline tests are marked with @pytest.mark.slow.
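
The flags used in the commands above could be wired up roughly as follows. This is a sketch with `argparse`, not the parser in `examples/eol_mistrust_mortality_classifier.py`: the flag names come from the commands, but the defaults, help strings, and the `describe_mode` helper are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags shown in the PR description."""
    parser = argparse.ArgumentParser(description="EOL mistrust pipeline (sketch)")
    parser.add_argument("--root", required=True,
                        help="directory holding the combined MIMIC-III CSV files")
    parser.add_argument("--task-demo", action="store_true",
                        help="run the PyHealth-native demo")
    parser.add_argument("--task-demo-train-eval", action="store_true",
                        help="also train and evaluate in the demo")
    parser.add_argument("--paper-like-dataset-prepare", action="store_true",
                        help="use the notebook-faithful preparation")
    # The default of 1 is an assumption; the commands above pass 10 explicitly.
    parser.add_argument("--repetitions", type=int, default=1,
                        help="number of repeated runs")
    return parser


def describe_mode(args: argparse.Namespace) -> str:
    """Name the path a given flag combination selects (illustrative only)."""
    if args.task_demo:
        return "demo"
    if args.paper_like_dataset_prepare:
        return "paper-like"
    return "normal"


args = build_parser().parse_args(["--root", "EOL_Workspace/data", "--repetitions", "10"])
mode = describe_mode(args)
```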

Results are written under:
EOL_Workspace/EOL_Result/EOL_(normal|Paperlike)_<timestamp>/
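
As a rough illustration of that naming scheme, a run directory could be derived like this. The helper name `result_dir` and the `%Y%m%d_%H%M%S` timestamp format are assumptions; the PR does not state the exact format it uses.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional


def result_dir(workspace: Path, paper_like: bool,
               now: Optional[datetime] = None) -> Path:
    """Build EOL_Result/EOL_(normal|Paperlike)_<timestamp> under the workspace."""
    # Assumed timestamp format; swap in whatever the pipeline actually writes.
    stamp = (now or datetime.now()).strftime("%Y%m%d_%H%M%S")
    mode = "Paperlike" if paper_like else "normal"
    return workspace / "EOL_Result" / f"EOL_{mode}_{stamp}"


example = result_dir(Path("EOL_Workspace"), paper_like=False,
                     now=datetime(2026, 4, 21, 14, 28, 40))
```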

We also included:

an example script for running the pipeline
an ablation option (--ablation-study) to compare the two preparation modes
fast synthetic unit tests for core logic
slow full-pipeline tests marked with @pytest.mark.slow

Files to Review:

Main implementation:

pyhealth/datasets/eol_mistrust_dataset.py
pyhealth/datasets/eol_mistrust.py
pyhealth/datasets/configs/eol_mistrust.yaml
pyhealth/tasks/eol_mistrust.py
pyhealth/models/eol_mistrust_classifier.py
pyhealth/models/eol_mistrust.py

Example:

examples/eol_mistrust_mortality_classifier.py

Documentation:

docs/api/datasets/pyhealth.datasets.EOLMistrustDataset.rst
docs/api/tasks/pyhealth.tasks.eol_mistrust.rst
docs/api/models/pyhealth.models.EOLMistrustClassifier.rst
docs/api/datasets.rst
docs/api/tasks.rst
docs/api/models.rst

Tests:

tests/core/test_eol_mistrust_dataset.py
tests/core/test_eol_mistrust_task.py
tests/core/test_eol_mistrust_model.py
tests/core/test_eol_mistrust_Integration.py
tests/core/test_eol_mistrust_TrainingAndEvaluation.py
tests/core/test_eol_mistrust_module.py

Relation to paper replication:
This PR directly implements our replication of Boag et al. (2018). The paper-like path is designed to reproduce the original notebook's behavior, while the normal path is a corrected, PyHealth-native implementation written after reviewing the assumptions in the original pipeline.

Commit Detail: Add EOL mistrust workflow tasks and target helpers
- Implemented task definitions for the EOL mistrust workflow, including:
  - Left AMA prediction
  - In-hospital mortality prediction
  - Code status prediction
- Added helper functions for data normalization, age calculation, length-of-stay calculation, and mapping of ethnicity and insurance.
- Created a base task class for downstream predictions and a task wrapper for each target.
- Added input-data validation and defined the input and output schemas.
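
For readers unfamiliar with the age and length-of-stay helpers this commit mentions, here is a minimal sketch of how such calculations are commonly done on admission timestamps. The function names, the 365.25-day year, and the age cap are assumptions, not the PR's implementation. The cap reflects the fact that MIMIC-III shifts the birth dates of patients older than 89, so raw differences come out near 300 years; capping at 90 is one common convention.

```python
from datetime import datetime

SECONDS_PER_DAY = 86400.0
DAYS_PER_YEAR = 365.25  # assumed average year length


def age_years(dob: datetime, admit: datetime, max_age: float = 90.0) -> float:
    """Age at admission in years, capped at max_age (the cap is an assumption)."""
    age = (admit - dob).days / DAYS_PER_YEAR
    return min(age, max_age)


def los_days(admit: datetime, discharge: datetime) -> float:
    """Length of stay in fractional days."""
    return (discharge - admit).total_seconds() / SECONDS_PER_DAY


age = age_years(datetime(2100, 1, 1), datetime(2150, 1, 1))
stay = los_days(datetime(2150, 1, 1, 0, 0), datetime(2150, 1, 3, 12, 0))
```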

Commit Detail Refactor code structure for improved readability and maintainability

… Logic commitDetail
- Added a new function to load example modules for integration tests.
- Updated test data in `test_eol_mistrust_Integration.py` for clarity and accuracy.
- Enhanced insurance mapping to include a fallback for unrecognized plans.
- Modified cohort-building tests to reflect changes in admission criteria.
- Renamed tests for clarity and updated assertions to match the new logic.
- Improved note-corpus filtering so that only relevant discharge summaries are included.
- Adjusted noncompliance and autopsy label tests to better reflect case sensitivity and context.
- Added new tests for race-based treatment analysis by acuity and ensured proper binning.
- Updated model training tests to verify correct hyperparameter settings.
- Enhanced module implementation tests to ensure accurate cohort and label processing.
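
The insurance-mapping fallback mentioned in this commit can be pictured as a dictionary lookup with a default. The sketch below is illustrative only: the grouping of raw MIMIC-III insurance values into Public, Private, and Self-Pay, the `"Other"` fallback label, and the name `map_insurance` are assumptions, not taken from the PR's code.

```python
from typing import Optional

# Assumed grouping of raw insurance values into coarse categories.
INSURANCE_GROUPS = {
    "medicare": "Public",
    "medicaid": "Public",
    "government": "Public",
    "private": "Private",
    "self pay": "Self-Pay",
}


def map_insurance(raw: Optional[str], fallback: str = "Other") -> str:
    """Normalize a raw insurance string and map it, falling back if unknown."""
    key = (raw or "").strip().lower()
    return INSURANCE_GROUPS.get(key, fallback)


known = map_insurance(" Medicare ")
unknown = map_insurance("Tricare")
```

Returning an explicit fallback label rather than raising keeps unrecognized plans in the cohort while still making them easy to count.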

CommitMsg:
1. Update test_eol_mistrust_module.py for new scoring logic and data handling
2. Add assertions for discharge categories and sentiment analysis
3. Add test_eol_mistrust_task.py for task module coverage
4. Test code status target building and task mapping consistency
5. Use dummy patient and event classes for test cases

…rsion
1. Added tests for EOLMistrustClassifier to confirm it extends BaseModel and accepts task-style inputs.
2. Added end-to-end classifier tests for both normal and paper-like dataset preparation.
3. Updated assertions to match new expected treatment totals and code status outputs.
4. Cleaned up dataset setup code in tests for better clarity.

- PEP8 88-char cleanup across 6 core files + example
- Added synthetic-data reproducibility note in example docstring
- Added 3 RST stubs + index updates for dataset/model/tasks
- Added conftest.py with `--run-slow` opt-in; tagged slow tests
- Refactored models/eol_mistrust.py helper for readability
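
A `--run-slow` opt-in of the kind described here is usually implemented with three pytest hooks in `conftest.py`. The sketch below follows the standard pytest pattern and is not copied from the PR; the help text and skip reason are assumptions.

```python
import pytest


def pytest_addoption(parser):
    """Register the opt-in flag for slow tests."""
    parser.addoption("--run-slow", action="store_true", default=False,
                     help="run tests marked with @pytest.mark.slow")


def pytest_configure(config):
    """Declare the marker so pytest does not warn about an unknown mark."""
    config.addinivalue_line(
        "markers", "slow: full-pipeline test, skipped unless --run-slow is given")


def pytest_collection_modifyitems(config, items):
    """Skip every slow-marked test unless --run-slow was passed."""
    if config.getoption("--run-slow"):
        return
    skip_slow = pytest.mark.skip(reason="needs --run-slow to run")
    for item in items:
        if "slow" in item.keywords:
            item.add_marker(skip_slow)
```

With this in place, a plain `pytest` run skips the full-pipeline tests, and `pytest --run-slow` includes them.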

Additional updates to PyHealth - EOL project - Amy Hwang

- Expand module-level docstrings in dataset, model, and task modules with Boag et al. 2018 paper citation and URL
- Add missing docstrings to private helper functions and methods: _path_variants, _table_assets_exist, _discover_optional_tables, _infer_tensor_input_size, _mean_pool_sequence, _project_tensor, _embed_text_field, _require_columns, _coerce_timestamp, _normalize_token, _normalize_code_status_mode, _normalize_dataset_prepare_mode, _calculate_age_years, _calculate_los_days, _calculate_paper_like_los_days, _build_code_status_target_normal, _build_code_status_target_paper_like
- Add section comments to eol_mistrust.yaml config for table groups
- Enhance RST documentation with paper links and modality descriptions
- Add paper reference to example script header
- Add docstrings to four public wrapper functions in pyhealth/models/eol_mistrust.py that bind the generic proxy helpers to specific label columns

add paper references, missing docstrings, and YAML comments

aaronx2-illinois and others added 9 commits April 4, 2026 09:05

CommitName : checkin with testing Py file

91fbf19

Commit Detail Refactor code structure for improved readability and maintainability

Merge pull request #1 from hwangsamy1/Aaronx2Branch

251610d

Additional updates to PyHealth - EOL project - Amy Hwang

Merge pull request #3 from pattyboy227/Aaronx2Branch

a9e31c2

add paper references, missing docstrings, and YAML comments

aaronx2-illinois wants to merge 9 commits into sunlabuiuc:master from aaronx2-illinois:Racial-Disparities-and-Mistrust-in-End-of-Life-Care


3 participants
