Skip to content

The Self-Supervision Regime and Encoder Fit for Histopathology Image Analysis#1132

Open
crovo77 wants to merge 1 commit intosunlabuiuc:masterfrom
armaan11:master
Open

The Self-Supervision Regime and Encoder Fit for Histopathology Image Analysis#1132
crovo77 wants to merge 1 commit intosunlabuiuc:masterfrom
armaan11:master

Conversation

@crovo77
Copy link
Copy Markdown

@crovo77 crovo77 commented Apr 23, 2026

Contributors: Armaan Mehta (armaanm3@illinois.edu), Stanley Su (ssu603@illinois.edu), Carson Crovo (ccrovo2@illinois.edu)

Contribution Type: New Dataset, New Task, New Model

Description: Added support for the TCGA-CRCk dataset and an associated task (groups tiles from Whole Slide Images into patient-level samples). Added model for self-attention Multiple Instance Learning which uses SimCLR encodings obtained through a replication of "The Self-Supervision Regime and Encoder Fit for Histopathology Image Analysis" (https://proceedings.mlr.press/v259/azhar25a.html).

Files to Review:

  • pyhealth/datasets/tcga_crck.py - Main dataset implementation
  • pyhealth/datasets/configs/tcga_crck.yaml
  • pyhealth/tasks/tcga_crck_msi_classification.py - Associated task
  • pyhealth/models/tcga_crck_simclr_mil.py - Associated model
  • tests/core/test_tcga_crck.py - Test Cases for dataset and task
  • tests/core/test_tcga_crck_simclr_mil.py - Test Cases for model
  • examples/tcga_crck_simclr/tcga_crck_msi_classification_tissue_aware_simclr.py - Example full pipeline
  • docs/api/datasets/pyhealth.datasets.TCGACRCkDataset.rst
  • docs/api/tasks/pyhealth.tasks.TCGACRCkMSIClassification.rst
  • docs/api/models/pyhealth.models.TissueAwareSimCLR.rst

@armaan11 armaan11 force-pushed the master branch 7 times, most recently from b44c19a to d2783cf Compare April 23, 2026 08:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants