Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/davanstrien/hugit-cli

Push ImageFolder-style image datasets to the 🤗 Hub from the command line

cli datasets huggingface-datasets

Last synced: 17 May 2024

https://github.com/onesuper/HuggingFace-Datasets-Text-Quality-Analysis

Retrieves Parquet files from Hugging Face and uses pandas to identify and quantify junk data, duplication, contamination, and biased content in a dataset

dataset huggingface-datasets llm machine-learning nlp streamlit text-processing

Last synced: 24 Mar 2024