Projects in Awesome Lists tagged with dataset-analysis
A curated list of projects in awesome lists tagged with dataset-analysis.
https://github.com/mozilla/tts
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder
Last synced: 13 May 2025
https://github.com/mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder
Last synced: 14 Mar 2025
https://github.com/databricks/lilac
Curate better data for LLMs
artificial-intelligence data-analysis dataset-analysis unstructured-data
Last synced: 10 Mar 2025
https://github.com/dref360/spectral-metric
Code for the CVPR 2019 paper: Spectral Metric for Dataset Complexity Assessment
dataset-analysis spectral-clustering
Last synced: 01 Sep 2025
https://github.com/RemiRigal/DatasetExplorer
A web tool for local dataset browsing and processing, developed using the Flask + Angular stack.
ai angular data-processing data-science data-visualization dataset dataset-analysis docker docker-compose flask web-application
Last synced: 30 Jul 2025
https://github.com/mosesab/categorize-news-headlines-with-word-embeddings
A simple project that creates a dataset of news headlines with Primary Category, Secondary Category, Date, Day, Month, Year, Sentiment, SentimentPolarity, Emotion and Url. All news headlines are scraped from the Punch newspaper and sorted into a CSV file.
dataset dataset-analysis dataset-creation dataset-generation datasets gensim-word2vec news newsapi newsheadlines newspaper nigeria nigeria-api nigerian-data punch punchcard semantic-analysis sentiment-analysis word2vec
Last synced: 06 Jun 2026
https://github.com/andrea-mosk/play-with-ml
A Machine Learning app created with Streamlit (https://www.streamlit.io/).
automated-machine-learning automated-processing dataset dataset-analysis machine-learning streamlit
Last synced: 05 May 2026
https://github.com/manwithacap/dataset-analyzation-project
A repo for Project 1 of the OSU Bootcamp. This dataset analysis project focuses on housing, incomes, and employment in the United States.
dataset-analysis datasets ipynb-jupyter-notebook jupyter-notebook
Last synced: 06 Mar 2026
https://github.com/pncnmnp/timdb-analysis
Analyzing dataset: https://github.com/pncnmnp/TIMDB
dataset-analysis movie-recommendation
Last synced: 27 Jul 2025
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/liuxiaotong/data-recipe
Reverse-engineering framework for AI datasets: extract annotation specs, cost models & reproducibility from samples or requirement docs.
ai-agent ai-data-pipeline annotation-spec cost-estimation dataset-analysis huggingface llm mcp python reverse-engineering training-data workflow-automation
Last synced: 08 Feb 2026
https://github.com/bursasha/pandas-numpy-matplotlib-cavies-analysis
Complete statistical analysis of cavy lifetime dataset using Python, Pandas, NumPy, Matplotlib, and SciPy to explore, visualize, and infer the impact of bacilli infection on cavy lifetimes
cavy dataset-analysis distribution-fitting hypothesis-testing jupyter-notebook matplotlib numpy pandas python-analysis scipy statistical-analysis statistical-methods
Last synced: 09 Feb 2026
https://github.com/melisa-karatas/machine_learning_based_prediction_on_diabetes_dataset
This project aims to develop a diabetes prediction model using machine learning, given fundamental input values.
analysis database dataset dataset-analyse dataset-analysis diabetes machine-learning random-forest
Last synced: 18 May 2026
https://github.com/melisa-karatas/exploratory_data_analysis_on_bodyfat_dataset
This project performs exploratory data analysis on the bodyfat dataset.
bodyfat dataset dataset-analyse dataset-analysis exploratory-data-analysis
Last synced: 25 Aug 2025
https://github.com/bursasha/mongodb-bash-docker-f1-analysis
Scalable MongoDB setup for analyzing F1 2018 season dataset using Docker, Replication, and Sharding technologies
bash-script bigdata dataset-analysis docker docker-compose f1 mongo-express mongodb nosql replication sharding
Last synced: 09 Apr 2026