Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nannyml/the-little-book-of-ml-metrics
The book every data scientist needs on their desk.
https://github.com/nannyml/the-little-book-of-ml-metrics
book classification-metrics clustering-metrics computer-vision-metrics data-science machine-learning machine-learning-evaluation machine-learning-metrics nlp-metrics python ranking-metrics regression-metrics
Last synced: 3 days ago
JSON representation
The book every data scientist needs on their desk.
- Host: GitHub
- URL: https://github.com/nannyml/the-little-book-of-ml-metrics
- Owner: NannyML
- Created: 2024-07-23T18:27:22.000Z (5 months ago)
- Default Branch: main
- Last Pushed: 2024-12-09T18:21:17.000Z (14 days ago)
- Last Synced: 2024-12-09T19:25:45.349Z (14 days ago)
- Topics: book, classification-metrics, clustering-metrics, computer-vision-metrics, data-science, machine-learning, machine-learning-evaluation, machine-learning-metrics, nlp-metrics, python, ranking-metrics, regression-metrics
- Language: Jupyter Notebook
- Homepage: https://www.nannyml.com/metrics
- Size: 27 MB
- Stars: 798
- Watchers: 12
- Forks: 69
- Open Issues: 98
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# The Little Book of ML Metrics
Welcome to the open source repo of [The Little Book of ML Metrics](https://www.nannyml.com/metrics). The idea of the book is to be this little handbook that sits on every data scientist's desk for quick reference, from the most well-known metric, ahem, accuracy, to the most obscure ones (looking at you, P4 metric).
> [!TIP]
> You can preview the latest pre-release version [here](https://github.com/NannyML/The-Little-Book-of-ML-Metrics/releases/download/nightly/main.pdf)![The Little Book of ML Metrics](book/figures/The_Little_Book_of_Metrics_MAPE.png)
## Why are we writing this book?
Machine learning metrics are often overlooked in traditional data science courses and university degrees. This book aims to fill that gap by offering clear, concise, and actionable explanations of the metrics that matter most. Whether you're an aspiring data scientist or an experienced professional, this book will become your go-to reference for understanding and leveraging metrics effectively.
> **Disclaimer:** The book is open-source, which means you can freely access the digital version. However, we also offer a [high-quality printed edition](https://www.nannyml.com/metrics) for purchase. Revenue from printed copies helps support further development and maintenance of the book. Reviewers, contributors, and authors receive revenue sharing through their affiliate links.
## Book Contents
The book covers a broad range of metrics from different contexts:
- **Regression**
- **Classification**
- **Clustering**
- **Ranking**
- **Computer Vision**
- **NLP**
- **GenAI**
- **Probabilistic**
- **Bias and Fairness**
- **Business**## How to Contribute
We welcome contributions from the community! As a thank-you for your contributions, each contributor will receive an affiliate link with 10% commission on sales generated through their link. Plus, your name will be included in the book. Please check our [Contributing Guidelines](CONTRIBUTING.md) for more details.
## Interested in Being a Book Reviewer?
If you're an expert in any of the topics described in the book contents section and would like to review this book, please fill out [this form](https://docs.google.com/forms/d/e/1FAIpQLSejLhxhGowCimOG_1-RLvevB8czKZsW8PM7PwPoi0_8tfGqHw/viewform). As a thank-you, reviewers will receive an affiliate link with 15% commission on sales generated through their link. Plus, your name will be included in the book.
## About the Authors
**[Santiago Viquez](https://www.linkedin.com/in/santiagoviquez/)**
ML Developer Advocate at NannyML. Santiago has over five years of professional experience in ML and data science. He holds a Bachelor’s degree in Physics and a Master’s degree in Data Science.**[Wojtek Kuberski](https://www.linkedin.com/in/wojtek-kuberski/)**
Co-founder and CTO at NannyML. Wojtek is an AI professional and entrepreneur with a Master’s degree in AI from KU Leuven. He co-founded NannyML, an OSS Python library for ML monitoring and post-deployment data science. As the CTO, he leads the research and product teams, contributing to the development of novel algorithms in model monitoring.## Project Support
This project is backed by [NannyML](https://www.nannyml.com/), the only platform for monitoring machine learning models in production that can estimate model performance metrics without ground truth.