Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/kaiko-ai/eva

Evaluation framework for oncology foundation models (FMs)
https://github.com/kaiko-ai/eva

evaluation-framework foundation-models machine-learning oncology

Last synced: about 2 months ago
JSON representation

Evaluation framework for oncology foundation models (FMs)

Host: GitHub
URL: https://github.com/kaiko-ai/eva
Owner: kaiko-ai
License: apache-2.0
Created: 2024-01-16T12:05:03.000Z (12 months ago)
Default Branch: main
Last Pushed: 2024-05-23T07:35:48.000Z (7 months ago)
Last Synced: 2024-05-23T08:01:54.864Z (7 months ago)
Topics: evaluation-framework, foundation-models, machine-learning, oncology
Language: Python
Homepage: https://kaiko-ai.github.io/eva/
Size: 4.47 MB
Stars: 45
Watchers: 5
Forks: 2
Open Issues: 30
Metadata Files:
- Readme: README.md
- Contributing: docs/CONTRIBUTING.md
- License: LICENSE
- Code of conduct: docs/CODE_OF_CONDUCT.md
- Security: SECURITY.md

Awesome Lists containing this project

awesome-pathology - eva - Evaluation framework for oncology foundation models. (Software / Machine Learning)

README

        













_Oncology FM Evaluation Framework by kaiko.ai_

[![PyPI](https://img.shields.io/pypi/v/kaiko-eva.svg?logo=python)](https://pypi.python.org/pypi/kaiko-eva)

[![docs](https://img.shields.io/badge/📚_docs-latest-green)](https://kaiko-ai.github.io/eva/latest)

[![license](https://img.shields.io/badge/⚖️_License-Apache%202.0-blue.svg?labelColor=gray)](https://github.com/kaiko-ai/eva#license)


[![paper](http://img.shields.io/badge/OpenReview-MIDL_2024-B31B1B.svg)](https://openreview.net/forum?id=FNBQOPj18N&noteId=FNBQOPj18N)



  Installation •

  How To Use •

  Quick Start •

  Documentation •

  Datasets •

  Benchmarks 


  Contribute •

  Acknowledgements








_`eva`_ is an evaluation framework for oncology foundation models (FMs) by [kaiko.ai](https://kaiko.ai/).

Check out the [documentation](https://kaiko-ai.github.io/eva/) for more information.

### Highlights:

- Easy and reliable benchmark of Oncology FMs

- Supports path-level classification, slide-level classification and semantic segmentation downstream tasks

- Automatic embedding inference and evaluation of a downstream task

- Native support of popular medical [datasets](https://kaiko-ai.github.io/eva/dev/datasets/) and models

- Produce statistics over multiple evaluation fits and multiple metrics

## Installation

Simple installation from PyPI:

```sh

# to install the core version only

pip install kaiko-eva

# to install the expanded `vision` version

pip install 'kaiko-eva[vision]'

# to install everything

pip install 'kaiko-eva[all]'

```

To install the latest version of the `main` branch:

```sh

pip install "kaiko-eva[all] @ git+https://github.com/kaiko-ai/eva.git"

```

You can verify that the installation was successful by executing:

```sh

eva --version

```

## How To Use

_`eva`_ can be used directly from the terminal as a CLI tool as follows:

```sh

eva {fit,predict,predict_fit} --config url/or/path/to/the/config.yaml 

```

_`eva`_ uses [jsonargparse](https://jsonargparse.readthedocs.io/en/v4.31.0/) to

make it easily configurable by automatically generating command line interfaces (CLIs),

which allows to call *any* Python object from the command line. Moreover, the configuration structure is always in sync with the code. Thus, _`eva`_ can be used either directly from Python or as a CLI tool (recommended).

For more information, please refer to the [documentation](https://kaiko-ai.github.io/eva/dev/user-guide/tutorials/offline_vs_online/).

  Learn about Configs

The following interfaces are identical:

Python interface

Configuration file



```Python

# main.py

# execute with: `python main.py`

from torch import nn

from eva import core

from eva.vision import datasets, transforms

# initialize trainer

trainer = core.Trainer(max_steps=100)

# initialize model

model = core.HeadModule(

  backbone=nn.Flatten(),

  head=nn.Linear(150528, 4),

  criterion=nn.CrossEntropyLoss(),

)

# initialize data

data = core.DataModule(

  datasets=core.DatasetsSchema(

    train=datasets.BACH(

      root="data/bach",

      split="train",

      download=True,

      transforms=transforms.ResizeAndCrop(),

    ),

  ),

  dataloaders=core.DataloadersSchema(

    train=core.DataLoader(batch_size=32),

  ),

)

# perform fit

pipeline = core.Interface()

pipeline.fit(trainer, model=model, data=data)

```





```yaml

# main.yaml

# execute with: `eva fit --config main.yaml`

---

trainer:

  class_path: eva.Trainer

  init_args:

    max_steps: 100

model:

  class_path: eva.HeadModule

  init_args:

    backbone: torch.nn.Flatten

    head:

      class_path: torch.nn.Linear

      init_args:

        in_features: 150528

        out_features: 4

    criterion: torch.nn.CrossEntropyLoss

data:

  class_path: eva.DataModule

  init_args:

    datasets:

      train:

        class_path: eva.vision.datasets.BACH

        init_args:

          root: ./data/bach

          split: train

          download: true

          transforms: eva.vision.transforms.ResizeAndCrop

    dataloaders:

      train:

        batch_size: 32

```



The `.yaml` file defines the functionality of _`eva`_

by parsing and translating its content to Python objects directly.

Native supported configs can be found at the

[configs](https://github.com/kaiko-ai/eva/tree/main/configs) directory

of the repo, which can be both locally stored or remote.

## Quick Start

We define two types of evaluations: **online** and **offline**.

While online fit uses the backbone (FM) to perform forward passes

during the fitting process, offline fit first generates embeddings

with the backbone and then fits the model using these embeddings as

input, resulting in a faster evaluation.

Here are some examples to get you started:

- Perform a downstream offline **classification** evaluation of `DINO ViT-S/16`

on the `BACH` dataset with linear probing by first inferring the embeddings

and then performing 5 sequential fits:

  ```sh

  export DOWNLOAD_DATA=true

  eva predict_fit --config https://raw.githubusercontent.com/kaiko-ai/eva/main/configs/vision/dino_vit/offline/bach.yaml

  ```

- Perform a downstream online **segmentation** evaluation of `DINO ViT-S/16` on the

`MoNuSAC` dataset with the `ConvDecoderMS` decoder:

  ```sh

  export DOWNLOAD_DATA=true

  eva fit --config https://raw.githubusercontent.com/kaiko-ai/eva/main/configs/vision/dino_vit/online/monusac.yaml

  ```

For more examples, take a look at the [configs](https://github.com/kaiko-ai/eva/tree/main/configs)

and [tutorials](https://kaiko-ai.github.io/eva/dev/user-guide/advanced/replicate_evaluations/).

> [!NOTE]

> All the datasets that support automatic download in the repo have by default the option to automatically download set to false.

> For automatic download you have to manually set the environmental variable `DOWNLOAD_DATA=true` or in the configuration file `download=true`.

## Leaderboards

In this section you will find model benchmarks which were generated with _`eva`_.

### Table I: WSI patch-level classification tasks






| Model   | BACH  | CRC   | MHIST | PCam   | Camelyon16 | PANDA |

|---------|-------|-------|-------|--------|------------|-------|

| ViT-S/16 _(random)_	^[1] | 0.411|0.613|0.5|0.752|0.551|0.347|

| ViT-S/16 _(ImageNet)_ ^[1] | 0.675|0.936|0.827|0.861|0.751|0.676|

| DINO_(p=16) ^[2] | 0.77|0.936|0.751|0.905|0.869|0.737|

| Phikon ^[3]                | 0.715|0.942|0.766|0.925|0.879|0.784|

| UNI ^[4]                   | 0.797|0.95|0.835|0.939|0.933|0.774|

| ViT-S/16 _(kaiko.ai)_ ^[5] | 0.8|0.949|0.831|0.902|0.897|0.77|

| ViT-S/8 _(kaiko.ai)_ ^[5]  | 0.825|0.948|0.826|0.887|0.879|0.741|

| ViT-B/16 _(kaiko.ai)_	^[5] | 0.846|0.959|0.839|0.906|0.891|0.753|

| ViT-B/8 _(kaiko.ai)_ ^[5]  | 0.867|0.952|0.814|0.921|0.939|0.761|

| ViT-L/14 _(kaiko.ai)_ ^[5] | 0.862|0.935|0.822|0.907|0.941|0.769|

_Table I: Linear probing evaluation of FMs on patch-level downstream datasets.
 We report averaged balanced accuracy

over 5 runs. Results are reported on the "test" split if available and otherwise on the "validation" split._






_References_:

1. _"Emerging properties in self-supervised vision transformers”_, [arXiv](https://arxiv.org/abs/2104.14294)

2. _"Benchmarking self-supervised learning on diverse pathology datasets”_, [arXiv](https://arxiv.org/abs/2212.04690)

3. _"Scaling self-supervised learning for histopathology with masked image modeling”_, [medRxiv](https://www.medrxiv.org/content/10.1101/2023.07.21.23292757v1)

4. _"A General-Purpose Self-Supervised Model for Computational Pathology”_, [arXiv](https://arxiv.org/abs/2308.15474)

5. _"Towards Training Large-Scale Pathology Foundation Models: from TCGA to Hospital Scale”_, [arXiv](https://arxiv.org/pdf/2404.15217)

## Contributing

_`eva`_ is an open source project and welcomes contributions of all kinds. Please checkout the [developer](./docs/DEVELOPER_GUIDE.md)

and [contributing guide](./docs/CONTRIBUTING.md) for help on how to do so.

All contributors must follow the [code of conduct](./docs/CODE_OF_CONDUCT.md).

## Acknowledgements

Our codebase is built using multiple opensource contributions



[![python](https://img.shields.io/badge/-Python-blue?logo=python&logoColor=white)](https://github.com/pre-commit/pre-commit)

[![pytorch](https://img.shields.io/badge/PyTorch-ee4c2c?logo=pytorch&logoColor=white)](https://pytorch.org/get-started/locally/)

[![lightning](https://img.shields.io/badge/-⚡️_Lightning-792ee5?logo=pytorchlightning&logoColor=white)](https://pytorchlightning.ai/)


[![black](https://img.shields.io/badge/Code%20Style-Black-black.svg?labelColor=gray)](https://black.readthedocs.io/en/stable/)

[![isort](https://img.shields.io/badge/%20imports-isort-%231674b1?style=flat&labelColor=ef8336)](https://pycqa.github.io/isort/)

[![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json)](https://github.com/astral-sh/ruff)

[![Checked with pyright](https://microsoft.github.io/pyright/img/pyright_badge.svg)](https://microsoft.github.io/pyright/)


[![pdm-managed](https://img.shields.io/badge/pdm-managed-blueviolet)](https://pdm-project.org)

[![Nox](https://img.shields.io/badge/%F0%9F%A6%8A-Nox-D85E00.svg)](https://github.com/wntrblm/nox)

[![Built with Material for MkDocs](https://img.shields.io/badge/Material_for_MkDocs-526CFE?logo=MaterialForMkDocs&logoColor=white)](https://squidfunk.github.io/mkdocs-material/)



## Citation

If you find this repository useful, please consider giving a star ⭐ and adding the following citation:

```bibtex

@inproceedings{kaiko.ai2024eva,

    title={eva: Evaluation framework for pathology foundation models},

    author={kaiko.ai and Ioannis Gatopoulos and Nicolas K{\"a}nzig and Roman Moser and Sebastian Ot{\'a}lora},

    booktitle={Medical Imaging with Deep Learning},

    year={2024},

    url={https://openreview.net/forum?id=FNBQOPj18N}

}

```