Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/pnnl/crosscheck
https://github.com/pnnl/crosscheck
Last synced: 30 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/pnnl/crosscheck
- Owner: pnnl
- License: other
- Created: 2020-01-31T16:12:50.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2023-01-05T13:37:49.000Z (almost 2 years ago)
- Last Synced: 2024-10-01T14:18:26.882Z (about 1 month ago)
- Language: JavaScript
- Size: 45.4 MB
- Stars: 0
- Watchers: 4
- Forks: 1
- Open Issues: 17
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# CrossCheck:
Rapid, Reproducible, and Interpretable Model Evaluation# [Live Demo](https://pnnl.github.io/crosscheck/)
# Abstract
Evaluation beyond aggregate performance metrics, e.g. F-score, is crucial to both establish an appropriate level of trust in machine learning models and identify avenues for future model improvements.
In this paper we demonstrate `CrossCheck`, an interactive capability for rapid cross-model comparison and reproducible error analysis.We describe the tool, discuss design and implementation details, and present three NLP use cases -- named entity recognition, reading comprehension, and clickbait detection that show the benefits of using the tool for model evaluation. `CrossCheck` enables users to make informed decisions when choosing between multiple models, identify when the models are correct and for which examples, investigate whether the models are making the same mistakes as humans, evaluate models' generalizability and highlight models' limitations, strengths and weaknesses. Furthermore, `CrossCheck` is implemented as a Jupyter widget, which allows for rapid and convenient integration into existing model development workflows.
# Authors
* Dustin Arendt
* Zhuanyi Shaw
* Prasha Shrestha
* Ellyn Ayton
* Maria Glenski
* Svitlana Volkova# Usage
See the following Jupyter notebooks:
* https://github.com/pnnl/crosscheck/blob/master/notebooks/Iris%20Demo.ipynb
* https://github.com/pnnl/crosscheck/blob/master/notebooks/twenty-newsgroups/newsgroups.ipynb# Installation
```sh
pip install crosscheck-widget
```# How to cite
```
@inproceedings{arendt2021crosscheck,
title={CrossCheck:
Rapid, Reproducible, and Interpretable Model Evaluation},
author={Arendt, Dustin and Shaw, Zhuanyi and Shrestha, Prasha and Ayton, Ellyn and Glenski, Maria and Volkova, Svitlana},
booktitle={Proceedings of the Second Workshop on Data Science with Human in the Loop: Language Advances},
year={2021}
}
```