Projects in Awesome Lists tagged with machine-learning-dataset
A curated list of projects in awesome lists tagged with machine-learning-dataset .
https://github.com/JohannesBuchner/spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
audio audio-classification dataset machine-learning machine-learning-dataset spoken-english
Last synced: 11 Mar 2025
https://github.com/johannesbuchner/spoken-command-recognition
A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition
audio audio-classification dataset machine-learning machine-learning-dataset spoken-english
Last synced: 23 Nov 2024
https://github.com/reddyprasade/machine-learning-problems-datasets
We currently maintain 488 data sets as a service to the machine learning community. You may view all data sets through our searchable interface. For a general overview of the Repository, please visit our About page. For information about citing data sets in publications, please read our citation policy. If you wish to donate a data set, please consult our donation policy. For any other questions, feel free to contact the Repository librarians.
machine-learning-dataset machine-learning-datasets uci uci-machine-learning
Last synced: 09 Apr 2025
https://github.com/engineeringsoftware/math-comp-corpus
Corpus of Coq code related to MathComp including several machine-readable representations
coq machine-learning-dataset mathcomp serapi
Last synced: 12 May 2025
https://github.com/krzjoa/komentarze
Korpus ręcznie sklasyfikowanych komentarzy do uczenia maszynowego (filtrowanie komentarzy obraźliwych)
corpus corpus-data dataset json-data machine-learning-dataset
Last synced: 19 Feb 2025
https://github.com/chadsr/marktplaats-scraper
Marktplaats.nl (Dutch Classifieds) Listing Scraper
chromedriver dataset-creation dataset-generation dutch-language machine-learning machine-learning-dataset marktplaats scraper selenium web-scraper web-scraping
Last synced: 13 Mar 2025
https://github.com/vtalpaert/pytorch-geometric-visual-task
Simple task for mixed image-graph data
dataset deep-learning-datasets geometric-deep-learning graph-neural-networks machine-learning-dataset pytorch pytorch-geometric
Last synced: 13 Mar 2025
https://github.com/jay-johnson/network-pipeline-datasets
CSV datasets for ML/AI models from captured network traffic during ZAP scanning with web applications like Django, Flask, React, Vue and Spring - Anti-Nex training datasets
ai csv-data csv-datasets django flask flask-restful free-datasets machine-learning machine-learning-dataset machine-learning-defense network-analysis network-security owasp python3 react react-redux spring spring-boot vue vue2
Last synced: 06 Apr 2025
https://github.com/elijas/sentence-polarity-dataset-v1.0
sentence polarity dataset v1.0 (includes sentence polarity dataset README v1.0): 5331 positive and 5331 negative processed sentences / snippets. Introduced in Pang/Lee ACL 2005. Released July 2005.
dataset machine-learning-dataset polarity-dataset
Last synced: 22 Mar 2025
https://github.com/aalekhpatel07/captcha-generator
Generate captchas for ML tasks in parallel.
captcha captcha-generator machine-learning-dataset rust
Last synced: 16 Mar 2025
https://github.com/aitor-alvarez/mir-song-dataset-collection
Scripts to create Music Information Retrieval datasets from streaming services for singer identification tasks
audio-signal-processing dataset-generation deep-learning-dataset machine-learning-dataset music-information-retrieval singer-identification-tasks
Last synced: 20 Mar 2025
https://github.com/ahundt/costar_dataset
Installable python package for the costar dataset.
costar-dataset dataset machine-learning-dataset pytorch robotics tensorflow
Last synced: 15 Mar 2025
https://github.com/americast/actions-dataset
Dataset containing videos of a few actions
dataset machine-learning-dataset
Last synced: 26 Mar 2025
https://github.com/sferez/bybitmarketdata
This repository serves as a collection point for market data from Bybit. Aimed at facilitating machine learning model creation and finetuning.
ai ai-data ai-data-collection bybit bybit-websocket crypto crypto-data cryptocurrency cryptocurrency-datasets machine-learning-dataset market market-data trading trading-bot trading-strategies
Last synced: 03 Mar 2025