An open API service indexing awesome lists of open source software.

https://github.com/georg-wolflein/pathology-foundation-models

List of pathology feature extractors and foundation models
https://github.com/georg-wolflein/pathology-foundation-models

computational-pathology deep-learning feature-extraction foundation-models pathology self-supervised-learning

Last synced: 5 months ago
JSON representation

List of pathology feature extractors and foundation models

Awesome Lists containing this project

README

          


Pathology Feature Extractors and Foundation Models


We are witnessing the emergence of many new feature extractors trained using self-supervised learning on large pathology datasets.
This repository aims to provide a comprehensive list of these models, alongside key information about them.

I aim to update this list as new models are released, but please submit a pull request / issue for any models I have missed!

## Patch-level models

| Name | Group | Weights | Released | SSL | WSIs | Tiles | Patients | Batch size | Iterations | Architecture | Parameters | Embed dim | Input size | Dataset | Links |
| ------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------- | ------------------ | ----------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------- | ------------------------------- | -------- | ---------- | ---------- | ---------- | ---------------------- | ---------- | --------- | ---------- | ------------------------------------------------ | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| [CTransPath](https://www.sciencedirect.com/science/article/abs/pii/S1361841522002043) | Sichuan University / Tencent AI Lab | :white_check_mark: | Dec 2021[\*](https://github.com/Xiyue-Wang/TransPath/commit/4b1c67655dd38cb192567b0981b6c1e9ade59ecf) | [SRCL](https://www.sciencedirect.com/science/article/abs/pii/S1361841522002043) | 32K | 16M | | | | Swin-Transformer | 28M | 768 | 224 | TCGA, PAIP | [](https://github.com/Xiyue-Wang/TransPath) |
| [RetCCL](https://www.sciencedirect.com/science/article/abs/pii/S1361841522002043) | Sichuan University / Tencent AI Lab | :white_check_mark: | Dec 2021[\*](https://github.com/Xiyue-Wang/RetCCL/commit/e6faf0bd85c8e7e617882dd5d74e644d28eac771) | [CCL](https://www.sciencedirect.com/science/article/abs/pii/S1361841522002043) | 32K | 16M | | | | ResNet-50 | 26M | 2048 | 224 | TCGA, PAIP | [](https://github.com/Xiyue-Wang/RetCCL) |
| [REMEDIS](https://www.nature.com/articles/s41551-023-01049-7) | [Google Research](https://research.google) | :white_check_mark: | May 2022[\*](https://arxiv.org/abs/2205.09723v1) | SimCLR/BiT | 29K | 50M | 11K cases | 4096 | 1.2M | ResNet-50 | 26M | 2048 | 224 | TCGA | [](https://github.com/google-research/medical-ai-research-foundations) |
| [HIPT](https://ieeexplore.ieee.org/document/9880275) | [Mahmood Lab](https://faisal.ai) | :white_check_mark: | Jun 2022[\*](https://arxiv.org/abs/2206.02647v1) | DINOv1 | 11K | 100M | | 256 | 400K | ViT-S | 22M | 384 | 256 | TCGA | [](https://github.com/mahmoodlab/HIPT) |
| [Lunit-DINO](https://arxiv.org/abs/2212.04690) | [Lunit](https://www.lunit.io) | :white_check_mark: | Dec 2022[\*](https://arxiv.org/abs/2212.04690v1) | DINOv1 | 21K | | | | | ViT-S | 22M | 384 | 224 | TCGA | [](https://github.com/lunit-io/benchmark-ssl-pathology) |
| [Lunit-{BT,MoCoV2,SwAV}](https://arxiv.org/abs/2212.04690) | [Lunit](https://www.lunit.io) | :white_check_mark: | Dec 2022[\*](https://arxiv.org/abs/2212.04690v1) | {BT,MoCoV2,SwAV} | 21K | | | | | ResNet-50 | | 2048 | 224 | TCGA | [](https://github.com/lunit-io/benchmark-ssl-pathology) |
| [Phikon](https://www.medrxiv.org/content/10.1101/2023.07.21.23292757v2) | [Owkin](https://www.owkin.com) | :white_check_mark: | Jul 2023[\*](https://www.medrxiv.org/content/10.1101/2023.07.21.23292757v1) | iBOT | 6.1K | 43M | 5.6K | 1440 | 155K | ViT-B | 86M | 768 | 224 | TCGA | [](https://github.com/owkin/HistoSSLscaling) [](https://huggingface.co/owkin/phikon) |
| [CONCH](https://www.nature.com/articles/s41591-024-02856-4) (VL) | [Mahmood Lab](https://faisal.ai) | :white_check_mark: | Jul 2023[\*](https://arxiv.org/abs/2307.12914v1) | iBOT & vision-language pretraining | 21K | 16M | | 1024 | 80 epochs | ViT-B | 86M | 768 | 224 | proprietary | [](https://github.com/mahmoodlab/CONCH) [](https://huggingface.co/MahmoodLab/CONCH) |
| **[UNI](https://www.nature.com/articles/s41591-024-02857-3)** | [Mahmood Lab](https://faisal.ai) | :white_check_mark: | Aug 2023[\*](https://arxiv.org/abs/2308.15474v1) | DINOv2 | **100K** | 100M | | | | ViT-L | | 1024 | 224 | proprietary (Mass-100K) | [](https://github.com/mahmoodlab/UNI) [](https://huggingface.co/MahmoodLab/UNI) |
| **[Virchow](https://www.nature.com/articles/s41591-024-03141-0)** | [Paige](https://paige.ai) / [Microsoft](https://www.microsoft.com/en-us/research/lab/microsoft-health-futures) | :white_check_mark: | Sep 2023[\*](https://arxiv.org/abs/2309.07778v1) | DINOv2 | **1.5M** | | 120K | | | ViT-H | 632M | 2560 | 224 | proprietary (from MSKCC) | [](https://huggingface.co/paige-ai/Virchow) |
| **[Campanella _et al._](https://arxiv.org/abs/2310.07033)** (DINO) | [Thomas Fuchs Lab](https://www.hpims.org/labs/thomas-fuchs-lab/) | :white_check_mark: | Oct 2023[\*](https://arxiv.org/abs/2310.07033v1) | DINOv1 | **420K** | 3.3B | 77K | 1080 | 1.3K INE | ViT-S | 22M | 384 | 224 | proprietary (MSHS) | [](https://huggingface.co/MountSinaiCompPath/SP22M) ([](https://huggingface.co/MountSinaiCompPath/SP85M)) |
| **[Campanella _et al._](https://arxiv.org/abs/2310.07033)** (MAE) | [Thomas Fuchs Lab](https://www.hpims.org/labs/thomas-fuchs-lab/) | :x: | Oct 2023[\*](https://arxiv.org/abs/2310.07033v1) | MAE | **420K** | 3.3B | 77K | 1440 | 2.5K INE | ViT-L | 303M | 1024 | 224 | proprietary (MSHS) |
| [Path Foundation](https://arxiv.org/abs/2310.13259) | [Google](https://research.google) | :white_check_mark: | Oct 2023[\*](https://arxiv.org/abs/2310.13259v1) | SimCLR, MSN | 6K | 60M | | 1024 | | ViT-S | 22M | 384 | 224 | TCGA | [](https://github.com/Google-Health/imaging-research/tree/master/path-foundation) |
| [PathoDuet](https://arxiv.org/abs/2312.09894) | [Shanghai Jiao Tong University](https://life.sjtu.edu.cn/) | :white_check_mark: | Dec 2023[\*](https://arxiv.org/abs/2312.09894v1) | inspired by MoCoV3 | 11K | 13M | | 2048 | 100 epochs | ViT-B | | 4096 | 224 | TCGA | [](https://github.com/openmedlab/PathoDuet) |
| **[RudolfV](https://arxiv.org/abs/2401.04079)** | [Aignostics](https://www.aignostics.com) | :x: | Jan 2024[\*](https://arxiv.org/abs/2401.04079v1) | DINOv2 | **130K** | 750M | 36K | | | ViT-L | 300M | | 224 | proprietary (from EU & US), TCGA |
| [kaiko](https://arxiv.org/abs/2404.15217) | [kaiko.ai](https://www.kaiko.ai) | :white_check_mark: | Mar 2024[\*](https://arxiv.org/abs/2404.15217v1) | DINOv2 | 29K | 260M\*\* | | 512 | 200 INE | ViT-L | | 1024 | 224 | TCGA | [](https://github.com/kaiko-ai/towards_large_pathology_fms) |
| **[PLUTO](https://arxiv.org/abs/2405.07905)** | [PathAI](https://www.pathai.com) | :x: | May 2024[\*](https://arxiv.org/abs/2405.07905v1) | DINOv2 (+ MAE and Fourier loss) | **160K** | 200M | | | | FlexiViT-S | 22M | | 224 | proprietary (PathAI) | |
| [BEPH](https://www.biorxiv.org/content/10.1101/2024.05.16.594499) | [Shanghai Jiao Tong University](https://life.sjtu.edu.cn/) | :white_check_mark: | May 2024[\*](https://www.biorxiv.org/content/10.1101/2024.05.16.594499v1) | BEiTv2 | 12K | 12M | | 1024 | | ViT-B | 193M | 1024 | 224 | TCGA | [](https://github.com/Zhcyoung/BEPH) |
| **[Prov-GigaPath](https://www.nature.com/articles/s41586-024-07441-w)** | [Microsoft](https://www.microsoft.com/en-us/research/) / [Providence](https://www.providence.org) | :white_check_mark: | May 2024[\*](https://www.nature.com/articles/s41586-024-07441-w) | DINOv2 | **170K** | 1.4B | 30K | 384 | | ViT | | 1536 | 224 | proprietary (Providence) | [](https://github.com/prov-gigapath/prov-gigapath) [](https://huggingface.co/prov-gigapath/prov-gigapath) |
| **[Hibou-B](https://arxiv.org/abs/2406.05074)** | [HistAI](https://www.hist.ai) | :white_check_mark: | Jun 2024[\*](https://arxiv.org/abs/2406.05074v1) | DINOv2 | **1.1M** | 510M | 310K cases | 1024 | 500K | ViT-B | 86M | 768 | 224 | proprietary | [](https://github.com/HistAI/hibou) [](https://huggingface.co/histai/hibou-b) |
| **[Hibou-L](https://arxiv.org/abs/2406.05074)** | [HistAI](https://www.hist.ai) | :white_check_mark: | Jun 2024[\*](https://arxiv.org/abs/2406.05074v1) | DINOv2 | **1.1M** | 1.2B | 310K cases | 1024 | 1.2M | ViT-L | 304M | 1024 | 224 | proprietary | [](https://github.com/HistAI/hibou) [](https://huggingface.co/histai/hibou-L) |
| **[H-optimus-0](https://www.bioptimus.com/news/bioptimus-launches-h-optimus-0-the-worlds-largest-open-source-ai-foundation-model-for-pathology)** | [Bioptimus](https://www.bioptimus.com) | :white_check_mark: | Jul 2024[\*](https://github.com/bioptimus/releases/commit/f967dd8d6de387fc0926cbe29b35b3cc5abc5500) | DINOv2/iBOT | **500K** (across 4,000 clinics) | >100M | 200K | | | ViT-G with 4 registers | 1.1B | 1536 | 224 | proprietary | [](https://github.com/bioptimus/releases/tree/main/models/h-optimus/v0) [](https://huggingface.co/bioptimus/H-optimus-0) |
| [mSTAR](https://arxiv.org/abs/2407.15362) (VL) | [Smart Lab](https://hkustsmartlab.github.io) | :x: | Jul 2024[\*](https://arxiv.org/abs/2407.15362v1) | mSTAR (multimodal) | 10K | | 10K | | | ViT-L | | | 224 | TCGA | |
| **[Virchow 2](https://arxiv.org/abs/2408.00738)** | [Paige](https://paige.ai) / [Microsoft](https://www.microsoft.com/en-us/research/lab/microsoft-health-futures) | :white_check_mark: | Aug 2024[\*](https://arxiv.org/abs/2408.00738v1) | DINOv2 (+ ECT and KDE) | **3.1M** | 2B | 230K | 4096 | | ViT-H with 4 registers | 632M | 3584 | 224 | proprietary (from MSKCC and international sites) | [](https://huggingface.co/paige-ai/Virchow2) |
| **[Virchow 2G](https://arxiv.org/abs/2408.00738)** | [Paige](https://paige.ai) / [Microsoft](https://www.microsoft.com/en-us/research/lab/microsoft-health-futures) | :x: | Aug 2024[\*](https://arxiv.org/abs/2408.00738v1) | DINOv2 (+ ECT and KDE) | **3.1M** | 2B | 230K | 3072 | | ViT-G with 8 registers | 1.9B | 3584 | 224 | proprietary (from MSKCC and international sites) | |
| [Phikon-v2](https://arxiv.org/abs/2409.09173) | [Owkin](https://www.owkin.com) | :white_check_mark: | Sep 2024[\*](https://arxiv.org/abs/2409.09173v1) | DINOv2 | 58.4K | 456M | | 4096 | 250K | ViT-L | 307M | 1024 | 224 | PANCAN-XL (TCGA, CPTAC, GTEx, proprietary) | [](https://huggingface.co/owkin/phikon-v2) |
| [MUSK](https://www.nature.com/articles/s41586-024-08378-w)V (VL) | [Li Lab (Stanford)](https://med.stanford.edu/lilab.html) | :white_check_mark: | Jan 2025[\*](https://www.nature.com/articles/s41586-024-08378-w) | Unified masked modeling (MLM, MIM) + contrastive learning | 33K | 50M | 12K | 2048 | 20 epochs | BEiT3 | | | 384 | TCGA | [](https://github.com/lilab-stanford/MUSK) [](https://huggingface.co/xiangjx/musk) |
| **[RudolfV2](https://arxiv.org/abs/2501.05409)** | [Mayo](https://www.mayoclinic.org), [Charité](https://www.charite.de), [Aignostics](https://www.aignostics.com) | :x: | Jan 2025[\*](https://arxiv.org/abs/2501.05409v1) | | **1.2M** | 3.4B | 490K cases | | | ViT-H | 632M | | | | |
| **[UNI](https://www.nature.com/articles/s41591-024-02857-3)2-h** | [Mahmood Lab](https://faisal.ai) | :white_check_mark: | Jan 2025[\*](https://github.com/mahmoodlab/UNI/commit/91e141ce3d8aca1be0de83a89d0caab45af6a470) | DINOv2 | **350K** | 200M | | | | ViT-H with 8 registers | 681M | 1536 | 224 | proprietary (Mass) | [](https://github.com/mahmoodlab/UNI) [](https://huggingface.co/MahmoodLab/UNI2-h) |
| **[UNI](https://www.nature.com/articles/s41591-024-02857-3)2-g-preview** | [Mahmood Lab](https://faisal.ai) | :x: | Jan 2025[\*](https://github.com/mahmoodlab/UNI/commit/91e141ce3d8aca1be0de83a89d0caab45af6a470) | DINOv2 | **350K** | 200M | | | | ViT-G | | | | proprietary (Mass) | [](https://github.com/mahmoodlab/UNI) |

Notes:

- Models marked with VL indicate language-vision pretraining (others are vision-only)
- Models trained on >100K slides may be considered foundation models and are marked in **bold**
- \# of WSIs, tiles, and patients are reported to 2 significant figures
- INE = ImageNet epochs
- Order is chronological
- Some of these feature extractors have been evaluated in a benchmarking study for whole slide classification [here](https://arxiv.org/abs/2311.11772).
- \*\* means inferred from other numbers provided in the paper

## Slide-level / patient-level models
This table includes models that produce slide-level or patient-level embeddings without supervision.
| Name | Group | Weights | Released | SSL | WSIs | Patients | Batch size | Iterations | Architecture | Parameters | Embed dim | Patch size | Dataset | Links |
| ------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------- | ------------------ | ---------------------------------------------------------------- | ------------------------------------- | ---------------------------- | -------- | ---------- | ---------------- | ------------------------ | ---------- | --------- | ---------- | ------------------------------------------------------------------------------ | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| [GigaSSL](https://arxiv.org/abs/2212.03273) | [CBIO](https://cbio.mines-paristech.fr) | :white_check_mark: | Dec 2022[\*](https://arxiv.org/abs/2212.03273v1) | SimCLR | 12K | | | 1K epochs | ResNet-18 | | 256 | 256 | TCGA | [](https://github.com/trislaz/gigassl) |
| [PRISM](https://arxiv.org/abs/2405.10254) (VL) | [Paige](https://paige.ai) / [Microsoft](https://www.microsoft.com/en-us/research/lab/microsoft-health-futures) | :white_check_mark: | May 2024[\*](https://arxiv.org/abs/2405.10254v1) | contrastive (with language) | **590K** (190K text reports) | 190K | 64 (x4) | 75K (10 epochs) | Perceiver + BioGPT | | 1280 | 224 | proprietary | [](https://huggingface.co/paige-ai/Prism) |
| [Prov-GigaPath](https://www.nature.com/articles/s41586-024-07441-w) | [Microsoft](https://www.microsoft.com/en-us/research/) / [Providence](https://www.providence.org) | :white_check_mark: | May 2024[\*](https://www.nature.com/articles/s41586-024-07441-w) | DINOv2 | **170K** | 30K | | | LongNet | 86M | 1536 | 224 | proprietary (Providence) | [](https://github.com/prov-gigapath/prov-gigapath) [](https://huggingface.co/prov-gigapath/prov-gigapath) |
| [MADELEINE](https://arxiv.org/abs/2408.02859) (VL) | [Mahmood Lab](https://faisal.ai) | :white_check_mark: | Aug 2024[\*](https://arxiv.org/abs/2408.02859v1) | contrastive (InfoNCE & OT) | 16K | 2K | 120 | 90 epochs | multi-head attention MIL | | 512 | 256 | [ACROBAT](https://acrobat.grand-challenge.org/data/), BWH Kidney (proprietary) | [](https://github.com/mahmoodlab/MADELEINE) |
| [CHIEF](https://www.nature.com/articles/s41586-024-07894-z) (VL) | [Yu Lab](https://yulab.hms.harvard.edu) | :white_check_mark: | Sep 2024[\*](https://www.nature.com/articles/s41586-024-07894-z) | | | | | | | | | | | [](https://github.com/hms-dbmi/CHIEF) |
| [COBRA](https://arxiv.org/abs/2411.13623) | [Kather Lab](https://jnkather.github.io) | :white_check_mark: | Nov 2024[\*](https://arxiv.org/abs/2411.13623v1) | COBRA (MoCo-v3 in FM embedding space) | 3K | 2.8K | 1024 | 2K epochs | Mamba-2 + ABMIL | 15M | 768 | 224 | TCGA (BRCA, CRC, LUAD, LUSC, STAD) | [](https://github.com/KatherLab/COBRA) [](https://huggingface.co/KatherLab/COBRA) |
| [TITAN](https://arxiv.org/abs/2411.19666)V (VL) | [Mahmood Lab](https://faisal.ai) | :white_check_mark: | Dec 2024[\*](https://arxiv.org/abs/2411.19666v1) | iBOT | **340K** | | 1024 | 91K (270 epochs) | ViT (smaller) | 42M | | 224 | Mass-340K (proprietary) | [](https://github.com/mahmoodlab/TITAN) [](https://huggingface.co/MahmoodLab/TITAN) |
| [THREADS](https://arxiv.org/abs/2501.16652) (WSI, RNA, DNA) | [Mahmood Lab](https://faisal.ai) | :x: | Jan 2025[\*](https://arxiv.org/abs/2501.16652v1) | | 47K | | 1200 | up to 101 epochs | ViT-L | | | 224 | MBTG-47k (MGH, BWH, TCGA, GTEx) | |