awesome-seml
A curated list of articles that cover the software engineering best practices for building machine learning applications.
https://github.com/SE-ML/awesome-seml
Model Training
- On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach
- Software development best practices in a deep learning environment
- How do you manage your Machine Learning Experiments?
- Machine Learning Testing: Survey, Landscapes and Horizons
- On human intellect and machine failures: Troubleshooting integrative machine learning systems
- Pitfalls and Best Practices in Algorithm Configuration
- Pitfalls of supervised feature selection
- Preparing and Architecting for Machine Learning
- What Went Wrong and Why? Diagnosing Situated Interaction Failures in the Wild
- Fairness On The Ground: Applying Algorithmic Fairness Approaches To Production Systems
- Nitpicking Machine Learning Technical Debt
- Preliminary Systematic Literature Review of Machine Learning System Development Process
- Apples-to-apples in cross-validation studies: pitfalls in classifier performance measurement
- 10 Best Practices for Deep Learning
Deployment and Operation
- TFX: A tensorflow-based Production-Scale ML Platform
- Versioning for end-to-end machine learning pipelines
- ML Ops: Machine Learning as an engineered discipline
- Building Continuous Integration Services for Machine Learning
- Machine learning: Moving from experiments to production
- ModelOps: Cloud-based lifecycle management for reliable and trusted AI
- Scaling Machine Learning as a Service
- The ML Test Score: A Rubric for ML Production Readiness and Technical Debt Reduction
- Continuous Training for Production ML in the TensorFlow Extended (TFX) Platform
- Model Governance Reducing the Anarchy of Production
- Operational Machine Learning
- Fairness Indicators: Scalable Infrastructure for Fair ML Systems
- Continuous Delivery for Machine Learning
- Machine Learning Logistics
- Best Practices in Machine Learning Infrastructure
- Underspecification Presents Challenges for Credibility in Modern Machine Learning
Social Aspects
Broad Overviews
- AI Engineering: 11 Foundational Practices
- Best Practices for Machine Learning Applications
- Rules of Machine Learning: Best Practices for ML Engineering
- Hidden Technical Debt in Machine Learning Systems
- Engineering Best Practices for Machine Learning
- Software Engineering for Machine Learning: A Case Study
Data Management
- Automating Large-Scale Data Quality Verification
- How to organize data labelling for ML
- The Data Linter: Lightweight, Automated Sanity Checking for ML Data Sets
- A Survey on Data Collection for Machine Learning: A Big Data - AI Integration Perspective
- The curse of big data labeling and three ways to solve it
- The ultimate guide to data labeling for ML
- Data management challenges in production machine learning
- Data Validation for Machine Learning
Governance
- A Human-Centered Interpretability Framework Based on Weight of Evidence
- Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims
- Closing the AI Accountability Gap: Defining an End-to-End Framework for Internal Algorithmic Auditing
- Inherent trade-offs in the fair determination of risk scores
- Understanding Software-2.0
- An Architectural Risk Analysis Of Machine Learning Systems
- Beyond Debiasing
- Responsible AI practices
Tooling
- Aim - Aim is an open source experiment tracking tool.
- FairLearn - A toolkit to assess and improve the fairness of machine learning models.
- OpenML - An inclusive movement to build an open, organized, online ecosystem for machine learning.
- Alibi Detect - Python library focused on outlier, adversarial and drift detection.
- Tensorflow Data Validation (TFDV) - Library for exploring and validating machine learning data. Similar to Great Expectations, but for Tensorflow data (see the validation sketch after this list).
- Archai - Neural architecture search.
- Data Version Control (DVC) - DVC is a data and ML experiments management tool.
- Facets Overview / Facets Dive - Robust visualizations to aid in understanding machine learning datasets.
- Great Expectations - Data validation and testing with integration in pipelines.
- HParams - A thoughtful approach to configuration management for machine learning projects.
- Kubeflow - A platform for data scientists who want to build and experiment with ML pipelines.
- Label Studio - A multi-type data labeling and annotation tool with standardized output format.
- Model Card Toolkit - Streamlines and automates the generation of model cards; for model documentation.
- TensorBoard - TensorFlow's Visualization Toolkit.
- PyTorch Lightning - The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
- Robustness Metrics - Lightweight modules to evaluate the robustness of classification models.
- Seldon Core - An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models on Kubernetes.
- Spark Machine Learning - Spark’s ML library consisting of common learning algorithms and utilities.
- Tensorflow Extended (TFX) - An end-to-end platform for deploying production ML pipelines.
- Airflow - Programmatically author, schedule and monitor workflows.
- LiFT - The LinkedIn Fairness Toolkit.
- MLflow - Manage the ML lifecycle, including experimentation, deployment, and a central model registry (see the tracking sketch after this list).
- Neptune.ai - Experiment tracking tool bringing organization and collaboration to data science projects.
- Neuraxle - Sklearn-like framework for hyperparameter tuning and AutoML in deep learning projects.
- REVISE: REvealing VIsual biaSEs - Automatically detect bias in visual data sets.
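For a concrete sense of how some of the tools above are used, here is a minimal data-validation sketch with Tensorflow Data Validation. The CSV file names are placeholders for illustration, not files shipped with any of the listed projects.

```python
# Minimal TFDV sketch: infer a schema from training data, then check a new
# batch against it. "train.csv" and "new_batch.csv" are illustrative placeholders.
import tensorflow_data_validation as tfdv

# Summary statistics over the training set, and a baseline schema inferred from them.
train_stats = tfdv.generate_statistics_from_csv(data_location="train.csv")
schema = tfdv.infer_schema(statistics=train_stats)

# Statistics over a fresh batch, validated against the baseline schema.
new_stats = tfdv.generate_statistics_from_csv(data_location="new_batch.csv")
anomalies = tfdv.validate_statistics(statistics=new_stats, schema=schema)

# Report anomalies such as missing columns, unexpected values, or type drift.
if anomalies.anomaly_info:
    for feature_name, info in anomalies.anomaly_info.items():
        print(f"{feature_name}: {info.description}")
else:
    print("No anomalies found.")
```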
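Similarly, a minimal experiment-tracking sketch with MLflow; the experiment name, hyperparameters, metric values, and artifact file are all made up for illustration.

```python
# Minimal MLflow sketch: record a run's parameters, metrics, and an artifact.
# All names and values below are illustrative only.
import mlflow

mlflow.set_experiment("demo-experiment")

with mlflow.start_run(run_name="baseline"):
    # Hyperparameters chosen for this run.
    mlflow.log_param("learning_rate", 0.01)
    mlflow.log_param("n_estimators", 200)

    # ... train and evaluate a model here ...

    # Metrics computed on a held-out set.
    mlflow.log_metric("val_accuracy", 0.93)

    # Attach any file (plots, serialized models, configs) to the run,
    # assuming it exists on disk.
    mlflow.log_artifact("confusion_matrix.png")
```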