Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/scitator/animus
Minimalistic framework to run machine learning experiments.
deep-learning jax keras machine-learning pytorch reinforcement-learning tensorflow
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/scitator/animus
- Owner: Scitator
- License: apache-2.0
- Created: 2022-01-12T05:48:47.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-06-14T17:23:31.000Z (over 2 years ago)
- Last Synced: 2024-10-13T00:42:19.396Z (3 months ago)
- Topics: deep-learning, jax, keras, machine-learning, pytorch, reinforcement-learning, tensorflow
- Language: Python
- Homepage:
- Size: 81.1 KB
- Stars: 27
- Watchers: 2
- Forks: 0
- Open Issues: 2
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
README
# Animus
> One framework to rule them all.
Animus is a "write it yourself"-based machine learning framework.
Please see `examples/` for more information.
The framework architecture is mainly inspired by [Catalyst](https://github.com/catalyst-team/catalyst).
### FAQ
What is Animus?
Animus is a general-purpose, for-loop-based experiment wrapper. It breaks an ML experiment down into straightforward nested loops:
```python
def run(experiment):
    # Three nested loops: epochs -> datasets -> batches.
    for epoch in experiment.epochs:
        for dataset in epoch.datasets:
            for batch in dataset.batches:
                handle_batch(batch)
```
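To make these loops customisable, each level can be wrapped in start and end hooks. The sketch below is a hypothetical illustration of that pattern, not Animus' actual code: the class name `SketchExperiment`, its constructor arguments, the `_losses` helper, and the placeholder `loss` metric are all assumptions made for this example. Only the hook naming scheme (`on_{for}_start`, `run_{for}`, `on_{for}_end`) and the per-level metrics storages come from this README.

```python
class SketchExperiment:
    """Hypothetical for-loop experiment wrapper (NOT the Animus source)."""

    def __init__(self, datasets, num_epochs=1):
        # Assumption: `datasets` maps a dataset name to a list of batches,
        # and each batch is a non-empty list of floats.
        self.datasets = datasets
        self.num_epochs = num_epochs
        self.experiment_metrics = {}  # epoch index -> epoch_metrics
        self.epoch_metrics = {}       # dataset name -> dataset_metrics
        self.dataset_metrics = {}     # aggregated over the dataset's batches
        self.batch_metrics = {}       # metrics of the current batch
        self._losses = []             # per-batch losses of the current dataset

    # --- batch level ---
    def on_batch_start(self, batch):
        self.batch_metrics = {}

    def run_batch(self, batch):
        # Placeholder "loss": the mean of the batch values.
        self.batch_metrics["loss"] = sum(batch) / len(batch)

    def on_batch_end(self, batch):
        self._losses.append(self.batch_metrics["loss"])

    # --- dataset level ---
    def on_dataset_start(self, name):
        self.dataset_metrics = {}
        self._losses = []

    def run_dataset(self, name):
        for batch in self.datasets[name]:
            self.on_batch_start(batch)
            self.run_batch(batch)
            self.on_batch_end(batch)

    def on_dataset_end(self, name):
        # Average the batch losses; assumes the dataset has at least one batch.
        self.dataset_metrics["loss"] = sum(self._losses) / len(self._losses)
        self.epoch_metrics[name] = dict(self.dataset_metrics)

    # --- epoch level ---
    def on_epoch_start(self, epoch):
        self.epoch_metrics = {}

    def run_epoch(self, epoch):
        for name in self.datasets:
            self.on_dataset_start(name)
            self.run_dataset(name)
            self.on_dataset_end(name)

    def on_epoch_end(self, epoch):
        self.experiment_metrics[epoch] = dict(self.epoch_metrics)

    # --- experiment level ---
    def on_experiment_start(self):
        self.experiment_metrics = {}

    def run_experiment(self):
        for epoch in range(self.num_epochs):
            self.on_epoch_start(epoch)
            self.run_epoch(epoch)
            self.on_epoch_end(epoch)

    def on_experiment_end(self):
        pass  # e.g. save checkpoints or close loggers in a real experiment

    def run(self):
        self.on_experiment_start()
        self.run_experiment()
        self.on_experiment_end()
        return self.experiment_metrics


experiment = SketchExperiment(
    datasets={"train": [[1.0, 3.0], [5.0, 7.0]], "valid": [[2.0, 2.0]]},
    num_epochs=2,
)
metrics = experiment.run()
print(metrics[0])  # {'train': {'loss': 4.0}, 'valid': {'loss': 2.0}}
```

In this sketch, customising an experiment means subclassing and overriding only the hooks you need, for example `run_batch` for the model step or `on_epoch_end` for logging.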
Each `for` is wrapped with `on_{for}_start`, `run_{for}`, and `on_{for}_end` hooks for customisation purposes. Moreover, each `for` has its own metrics storage: `{for}_metrics` (`batch_metrics`, `dataset_metrics`, `epoch_metrics`, `experiment_metrics`).
What are Animus' competitors?
Any high-level ML/DL library, such as [Catalyst](https://github.com/catalyst-team/catalyst), [Ignite](https://github.com/pytorch/ignite), [FastAI](https://github.com/fastai/fastai), [Keras](https://github.com/keras-team/keras), etc.
Why do we need Animus if we have high-level alternatives?
Although I find high-level DL frameworks an essential step for the community and the spread of Deep Learning (I have written one myself), they have a few weaknesses.
First of all, they are usually tightly bound to a single "low-level" DL framework ([JAX](https://github.com/google/jax), [PyTorch](https://github.com/pytorch/pytorch), [TensorFlow](https://github.com/tensorflow/tensorflow)). While ["low-level" frameworks grow closer each year](https://twitter.com/fchollet/status/1052228463300493312?s=20), high-level frameworks each introduce their own syntactic sugar, which makes a fair comparison, or complementary use, of "low-level" frameworks impossible.
Secondly, high-level frameworks introduce high-level abstractions, which:
- are built with some assumptions in mind, which could be wrong in your case,
- can cause additional bugs - even "low-level" frameworks have quite a lot of them,
- are really hard to debug/extend because of "user-friendly" interfaces and extra integrations.

While these issues may seem unimportant in common cases, like supervised learning with `(features, targets)`, they become more and more important during research and heavy pipeline customization (e.g. privacy-aware multi-node distributed training with custom backpropagation).
Thirdly, many high-level frameworks try to divide the ML pipeline into data, hardware, model, etc. layers, making it easier for practitioners to start ML experiments and giving teams a tool to split responsibility for the ML pipeline between different members. However, while this speeds up the creation of ML pipelines, it disregards the fact that ML experiment results depend heavily on the model hyperparameters, **and data preprocessing/transformations/sampling**, **and hardware setup**.
*I found this to be the main reason why ML experiments fail - you have to focus on the whole data transformation pipeline simultaneously, from raw data through the training process to distributed inference, which is quite hard. And that's the reason Animus has the Experiment abstraction ([Catalyst](https://github.com/catalyst-team/catalyst) analog - [IRunner](https://github.com/catalyst-team/catalyst/blob/master/catalyst/core/runner.py#L40)), which connects all parts of the experiment: hardware backend, data transformations, model training, and validation/inference logic.*
What is Animus' purpose?
Highlight common "breakpoints" in ML experiments and provide a unified interface for them.
What is Animus' main application?
Research experiments, where you have to define everything on your own to get the results right.
Does Animus have any requirements?
No, and that is the point: it uses only pure Python libraries.
PyTorch and Keras can be used for extensions.
Do you have plans for documentation?
No. Animus core is about 300 lines of code, so it's much easier to read than 3000 lines of documentation.
#### Demo
- [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/scitator/animus/blob/main/examples/notebooks/colab_ci_cd.ipynb) [Jax/Keras/Sklearn/Torch pipelines](./examples/notebooks/colab_ci_cd.ipynb)
- [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/scitator/animus/blob/main/examples/notebooks/XLA_jax.ipynb) [Jax XLA example](./examples/notebooks/XLA_jax.ipynb)
- [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/scitator/animus/blob/main/examples/notebooks/XLA_torch.ipynb) [Torch XLA example](./examples/notebooks/XLA_torch.ipynb)