An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with machine-learning-dataset

A curated list of projects in awesome lists tagged with machine-learning-dataset .

https://github.com/JohannesBuchner/spoken-command-recognition

A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition

audio audio-classification dataset machine-learning machine-learning-dataset spoken-english

Last synced: 11 Mar 2025

https://github.com/johannesbuchner/spoken-command-recognition

A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition

audio audio-classification dataset machine-learning machine-learning-dataset spoken-english

Last synced: 23 Nov 2024

https://github.com/reddyprasade/machine-learning-problems-datasets

We currently maintain 488 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please consult our donation policy. For any other questions, feel free to contact the Repository librarians.

machine-learning-dataset machine-learning-datasets uci uci-machine-learning

Last synced: 09 Apr 2025

https://github.com/engineeringsoftware/math-comp-corpus

Corpus of Coq code related to MathComp including several machine-readable representations

coq machine-learning-dataset mathcomp serapi

Last synced: 12 May 2025

https://github.com/krzjoa/komentarze

Korpus ręcznie sklasyfikowanych komentarzy do uczenia maszynowego (filtrowanie komentarzy obraźliwych)

corpus corpus-data dataset json-data machine-learning-dataset

Last synced: 19 Feb 2025

https://github.com/jay-johnson/network-pipeline-datasets

CSV datasets for ML/AI models from captured network traffic during ZAP scanning with web applications like Django, Flask, React, Vue and Spring - Anti-Nex training datasets

ai csv-data csv-datasets django flask flask-restful free-datasets machine-learning machine-learning-dataset machine-learning-defense network-analysis network-security owasp python3 react react-redux spring spring-boot vue vue2

Last synced: 06 Apr 2025

https://github.com/elijas/sentence-polarity-dataset-v1.0

sentence polarity dataset v1.0 (includes sentence polarity dataset README v1.0): 5331 positive and 5331 negative processed sentences / snippets. Introduced in Pang/Lee ACL 2005. Released July 2005.

dataset machine-learning-dataset polarity-dataset

Last synced: 22 Mar 2025

https://github.com/aalekhpatel07/captcha-generator

Generate captchas for ML tasks in parallel.

captcha captcha-generator machine-learning-dataset rust

Last synced: 16 Mar 2025

https://github.com/aitor-alvarez/mir-song-dataset-collection

Scripts to create Music Information Retrieval datasets from streaming services for singer identification tasks

audio-signal-processing dataset-generation deep-learning-dataset machine-learning-dataset music-information-retrieval singer-identification-tasks

Last synced: 20 Mar 2025

https://github.com/ahundt/costar_dataset

Installable python package for the costar dataset.

costar-dataset dataset machine-learning-dataset pytorch robotics tensorflow

Last synced: 15 Mar 2025

https://github.com/americast/actions-dataset

Dataset containing videos of a few actions

dataset machine-learning-dataset

Last synced: 26 Mar 2025

https://github.com/sferez/bybitmarketdata

This repository serves as a collection point for market data from Bybit. Aimed at facilitating machine learning model creation and finetuning.

ai ai-data ai-data-collection bybit bybit-websocket crypto crypto-data cryptocurrency cryptocurrency-datasets machine-learning-dataset market market-data trading trading-bot trading-strategies

Last synced: 03 Mar 2025