An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ledsouza/reg-linear-ii

Projeto de estudo de técnicas avançadas de modelagem em regressão linear

data-science data-visualization linear-regression pandas seaborn sklearn statsmodels vitrinedev

Last synced: 03 May 2026

https://github.com/yashsingh43/cdc-sleep-duration-health-analysis

Analysis of CDC BRFSS 2022 data exploring how sleep duration relates to mental and physical health outcomes.

beautifulsoup brfss cdc data-analysis data-visualization matplotlib pandas plotly public-health python

Last synced: 11 Jun 2026

https://github.com/allanreda/ga4-session-predictor-flask-app

Flask app that can predict future number of GA4 sessions, using the Prophet library.

data-visualization flask ga4-api matplotlib pandas prophet-library python time-series-forecasting

Last synced: 03 May 2026

https://github.com/prachipatel15/priceprediction

I've build a real estate price prediction project which predicts a house price in Banglore.

data-science data-visualization machine-learning pandas python

Last synced: 03 May 2026

https://github.com/imgios/dependensee

📊 Minimal relaxed functional dependencies set visualization webapp.

angular d3js data-visualization ionic rfd typescript visual-analytics webapp

Last synced: 03 May 2026

https://github.com/ibrahim1993-2/iraq-primary-enrollment

Data analysis of Iraq’s primary school enrollment (1960–2024). Includes SQL-style exploration, 14 insights with CSV exports, visualizations, and a polished Jupyter Notebook (PDF + HTML).

data-visualization eda education-data iraq jupyter-notebook matplotlib pandas portfolio-project python3 world-bank

Last synced: 03 May 2026

https://github.com/mohamedmetwalli5/breastcancerdiagnosis

Breast cancer diagnosis using machine learning via the XGBoost Algorithm after visualizing the data set & exploring it.

cancer data-visualization machine-learning

Last synced: 11 Jun 2026

https://github.com/pejpero/multi-class_classifier

This project demonstrates multiclass classification using Perceptron and Logistic Regression, implemented from scratch without using built-in libraries. It includes techniques like One-Versus-The-Rest and One-Versus-One for Perceptron, and Softmax for Logistic Regression, with a focus on understanding core ML concepts.

data-visualization logistic-regression machine-learning multiclass-classification one-vs-one one-vs-rest perceptron python scratch-implementation softmax

Last synced: 09 Jun 2026

https://github.com/mindlessmuse666/titanic-data-visualization

Проект по визуализации данных о пассажирах Титаника с использованием библиотек Python Matplotlib, Seaborn и Plotly.

data-analysis data-visualization matplotlib pandas plotly python seaborn titanic

Last synced: 04 May 2026

https://github.com/leosimoes/udacity-communicate-data-findings-titanic

Project 5 of the Udacity Data Analyst Nanodegree Program. Data analysis with views of the titanic dataset.

data-science data-visualization python

Last synced: 04 May 2026

https://github.com/nitrocosmstudios/temperature-sonification

Experimental sonification of temperature sensor data revealing structured, speech-like patterns coinciding with geomagnetic disturbances. Includes original dataset processing, synthetic signal attempts, and full write-up.

audio-analysis data-visualization electromagnetic-interference emf environmental-monitoring open-science raspberry-pi signal-processing sonification temperature temperature-monitoring temperature-sensor usb-devices weird-science

Last synced: 04 May 2026

https://github.com/mr-chang95/sf_data_visualization

In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.

business data-analysis data-visualization jupyter-notebook pandas python san-francisco

Last synced: 04 May 2026

https://github.com/fatihilhan42/book-recommendation-system-with-python

In this project, we are making a book recommendation system that recommends similar books according to the genres or ratings that the user enters, using a large book dataset. The link of the dataset is given below. Happy reading...

books data-analysis data-science data-visualization kaggle python recommendation-engine recommendation-system

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/jdanielgoh/abuso-escuelas

Codigo fuente del proyecto depredadores en las aulas. Construido con Nuxt.js, three.js y d3.js

d3js data-visualization ddj threejs

Last synced: 09 Jun 2026

https://github.com/mugilan1309/csv_analyzer

📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.

csv-processing data-analysis data-visualization machine-learning python streamlit

Last synced: 04 May 2026

https://github.com/shashwat9kumar/trends_in_a_country_on_twitter

Finding trending topics in each country on twitter and visualizing them in a WordCloud

data data-visualization trends tweepy twitter-api wordcloud

Last synced: 13 Jun 2026

https://github.com/stephenombuya/automation_scripts

A collection of Python scripts and tools designed to automate various tasks, improve productivity, and simplify repetitive actions. Each script is well-documented and serves a specific purpose, ranging from data visualization to smart home control.

automation-with-python data-visualization productivity python3 smart-home-automation webautomation

Last synced: 13 Jun 2026

https://github.com/albertofaraujo/pbi_data_travels

Melhorar a compreensão dos dados de vendas da empresa Data Travels para identificar oportunidades de crescimento e otimizar suas estratégias de marketing.

data-visualization dax-studio power-query powerbi

Last synced: 26 Jan 2026

https://github.com/rauhanahmed/auto-data-analyzer

AutoDataAnalyzer: Automate data ingestion, analysis, and visualization with AI/ML-powered pipelines. Features natural language query processing, interactive Plotly visualizations, and seamless deployment via Docker.

ai-powered-analysis automated-pipeline cicd data-analysis data-visualization docker end-to-end-project flask generative-ai langchain llama3-1 machine-learning natural-language-processing plotly python3 pywebio

Last synced: 12 Apr 2026

https://github.com/riyouuyt/investigate-hotel-business-using-data-visualization

Explore hospitality data, visualizing customer behavior in hotel reservations and its impact on cancellations for strategic insights.

businessreporting data-visualization hotel-booking jupyter-notebook matplotlib-pyplot presentation python

Last synced: 05 May 2026

https://github.com/soupu07/ibm-employee-attrition-prediction

The aim of this project analyzes factors driving IBM employee attrition and predicts those likely to leave, helping the organization understand turnover causes and improve retention and performance.

data-science data-science-projects data-visualization machine-learning python python-data-analysis python-programming-language python-project

Last synced: 05 May 2026

https://github.com/jatinnxn/diabetes-prediction

this repository showcases a machine learning model built to predict diabetes using Diabetes dataset. The project walks through data preprocessing, model training, and evaluation, offering a Decision Tree-based solution to classify individuals as diabetic or non-diabetic based on various health metrics. It also supports real-time predictions.

data-cleaning data-preprocessing data-visualization decision-tree-classifier machine-learning

Last synced: 13 Jun 2026

https://github.com/kevinwood15/python_ml_neuralnetwork_project

I build a neural network to evaluate the CIFAR-10 dataset and achieve above 50% accuracy

cifar10 data-science data-visualization deep-learning neural-network python pytorch

Last synced: 05 May 2026

https://github.com/robmcelhinney/roman-emperors

D3.js Bar Charts of Roman Emperors' Age and length of Reign from 63 BCE to 395 CE

d3 d3js data-visualization emperor roman rome

Last synced: 06 May 2026

https://github.com/hossamAhmedSalah/Computer-Vision-

contains my training projects in this field

computer-vision data-visualization detection edge opencv

Last synced: 10 Mar 2025

https://github.com/hashkanna/youtube-wrapped-2025

🎬 Spotify Wrapped-style dashboard for YouTube watch history. Built with React + Tessl spec-driven development for EF x Tessl x Anthropic Hackathon.

analytics data-visualization hackathon react recharts tessl vite wrapped youtube

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/satvikpraveen/seabornmasterpro

🎨 SeabornMasterPro is a comprehensive, modular project to master Seaborn for data visualization. Includes themed utilities, advanced plotting notebooks, dashboards, time series, Streamlit app, and Docker support — perfect for learners, analysts, and open-source enthusiasts.

categorical-plots correlation-heatmap custom-theme data-visualization docker interactive-dashboard jupyter-notebook matplotlib modular-code multi-panel-layouts open-source-project pandas plot-utils project-structure python reproducible-research seaborn streamlit time-series-visualization utility-functions

Last synced: 12 Apr 2026

https://github.com/swapnil-jain/whatsapp-chat-statistics

Web Application showing common statistics, graphs and various charts about the uploaded Whatsapp chat.

data-analytics data-science data-visualization heroku-deployment streamlit whatsapp

Last synced: 06 May 2026

https://github.com/harish-siva/analyzing-personal-expenses

This project aims to simulate an expense tracker for an individual using the Faker library. The project generates realistic monthly expense data, processes and stores it in a SQL database, and creates SQL queries to derive insights into spending behavior. Streamlit app is developed to visualize these insights and showcase the results of SQL queries

data-visualization expense-tracking exploratory-data-analysis financial-analysis matplotlib pandas python seaborn sql sqlalchemy streamlit

Last synced: 06 May 2026

https://github.com/tetchen9/mapa

A map of a trip to Europe. Using d3.js, Eurostat dataset in GeoJSON.

cartography d3 d3-visualization data-visualization eurostat-data geojson maps typescript

Last synced: 06 May 2026

https://github.com/sahilmate/ebm-breast-cancer-classifier

This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.

breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/soumya-thoutam/covid-19-impact-on-u.s.-states-and-colleges

Covid-19 analysis and impact on United States Colleges and States using SQL and Tableau.

covid-19 dashboard data-analysis data-visualization dataset sql sql-server tableau

Last synced: 04 Sep 2025

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/mindlessmuse666/train-test-splitter

Анализ данных о пассажирах Титаника и разбиение на обучающую и тестовую выборки. Практическое задание по дисциплине "Основы применения методов искусственного интеллекта в программировании".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 12 Apr 2026

https://github.com/prathmesh2507/global-stock-intelligence-dashboard

Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.

dashboard data-analysis data-visualization python stock-analysis streamlit

Last synced: 15 Jun 2026

https://github.com/harshsahu23/wabtec3_brakesgui_shared

💻 App to visualize error logs and dump files generated by the braking system of locomotives 🚅

csv data-visualization graphs pandas plotly python streamlit visualization

Last synced: 07 May 2026

https://github.com/bryanhe24/data_analysis_app

A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.

ai data data-analysis data-visualization fullstack-development javascript math python reactjs

Last synced: 07 May 2026

https://github.com/citizenlabsgr/openbudgetkw

Promoting a deeper understanding of the Kentwood, MI city budget process and city priorities.

csv data-visualization jade javascript react

Last synced: 07 May 2026

https://github.com/ayorick23/python-data-science-cheat-sheet

Guía rápida y práctica de sintaxis, comandos y funciones esenciales de Python para Ciencia de Datos. Perfecta para recordar cómo usar las librerías más comunes como NumPy, Pandas, Matplotlib y Scikit-learn en tus análisis diarios.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/miserman/splot

An R package to ease data visualization

data-visualization r

Last synced: 22 Jan 2026

https://github.com/flowsta/ods-educacion-aporta

ODS para educación, iniciativa APORTA 2021

data data-visualization ods sdg

Last synced: 27 Jan 2026

https://github.com/pngo1997/chicago-airbnb-cta

Interactive Chicago CTA train stations geospatial map.

data-visualization geospatial html python visualization

Last synced: 15 Oct 2025

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/josephbarbierdarnal/matoolkit

matoolkit is a python package containing a toolbox for creating visually appealing graphs/annotations in matplotlib

data-analysis data-visualization matplotlib

Last synced: 31 Mar 2025

https://github.com/hamburgj/survivor-stats

Interactive visualization of Survivor US contestant statistics and season data, as well as connection path finding.

data-visualization graph interactive-visualizations react reactjs statistics survivor

Last synced: 16 Apr 2026

https://github.com/grascya/heart-disease

The objective is to ascertain the probability of an individual being susceptible to a severe heart problem based on some features.

data-visualization explainable-machine-learning exploratory-data-analysis heart-disease svm-classifier

Last synced: 16 Oct 2025

https://github.com/luzmo-official/temperature-increase

A web app displaying Global temperature rises since 1961 based on the dataset made public by FAOSTAT

climate dashboard data-visualization temperature

Last synced: 19 Jan 2026

https://github.com/claudiahw/excel-sales-dashboard

Data-driven Excel dashboard visualizing sales trends, top products, and profit breakdowns with dynamic filtering options.

dashboard data-visualization excel excel-dashboard pivot-tables

Last synced: 15 Jun 2026

https://github.com/hase3b/flask-dash-interactive-dashboard

An interactive data visualization dashboard created using Flask and Dash. This project includes comprehensive data preparation, exploratory data analysis (EDA), and dynamic visualizations with Seaborn and Plotly. Explore the multi-page Dash app with features like dropdowns and callbacks for updated plots.

callbacks dash dashboard data-analysis data-visualization dropdown eda flask interactive plotly seaborn web-app

Last synced: 19 May 2026

https://github.com/tapas-gope/adventure-works

This project analyzes sales data for AdventureWorks, focusing on revenue, customer segments, and product performance. The dashboard provides insights into top-selling products, sales by region, and customer trends across multiple years. It helps in identifying sales opportunities and optimizing marketing strategies.

adventureworks business-intelligence data-cleaning data-transformation data-visualization dax mssql powerbi sales-analysis sql-queries

Last synced: 17 Oct 2025

https://github.com/sammdu/global-warming-hurricane-typhoon

The Effect of Global Warming on Hurricane and Typhoon Occurrence

data-science data-visualization global-warming hurricane-data

Last synced: 07 May 2026

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/minervarose/applysignal

Candidate-side recruitment intelligence dashboard for tracking job applications, scoring process signals, and prioritizing opportunities.

candidate-experience career-tools dashboard data-visualization decision-support job-search portfolio-project python recruitment streamlit

Last synced: 24 Jun 2026

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 19 Jan 2026

https://github.com/yulia-momotyuk/dla-data-analysis-practice

This repository contains my homework assignments completed during the "Data Analyst in IT" course at Data Loves Academy.

analytics data-analysis data-visualization excel mysql numpy pandas postgres powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/dzakwanalifi/stadata-x

Terminal UI untuk menjelajahi dan mengunduh data BPS Indonesia secara interaktif

bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui

Last synced: 20 Jan 2026

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/toodef/light-engine

Lightweight and fast 3D visualisation engine

cpp data-visualization linux python visualization windows

Last synced: 11 Feb 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/emirhansilsupur/hotel-booking-analytics-dashboard

Interactive Power BI dashboard visualizing hotel booking metrics for two Portuguese properties (Algarve resort & Lisbon city).

dashboard data-visualization power-bi

Last synced: 27 Jan 2026

https://github.com/vinay-jose/territorial-sales-dashboard

EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.

data-analysis data-visualization powerbi-desktop sql

Last synced: 11 Oct 2025

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 08 May 2026

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/itskshitija/hr-data-analysis

The HR Data Analytics Dashboard project uses Power BI to analyze employee data, visualizing key HR metrics and KPIs to support data-driven decisions for improving workforce management, employee satisfaction, and organizational growth.

analytics data-science data-visualization dataanalysis dataanalytics hrdataanalysis powerbi-desktop powerbidashboard

Last synced: 21 Jan 2026

https://github.com/tsear/reddit-discourse-project

Mapping emotional and conceptual discourse across Reddit philosophy communities.

data-visualization emotion-detection network-analysis nlp pandas reddit-api sentiment-analysis spacy text-mining tf-idf topic-modeling

Last synced: 08 May 2026

https://github.com/kamiviolet/d3_collections

As decribed, all kinds of chart and data visualisation with D3

charts d3 data-visualization

Last synced: 29 Apr 2026

https://github.com/timjjting/data-is-beautiful

Introductory slides to data visualization

data-visualization

Last synced: 23 Jan 2026

https://github.com/stopyransky/wdvp

World Government Data Visualisation Prize - submitted work

d3 d3js data-visualization dataviz react svg

Last synced: 18 May 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/jessicaevelin/estudos

Repositório com atividades, exercícios e projetos realizados durante meus estudos em Ciência de Dados, baseados em cursos, livros, vídeos e conteúdos da internet.

data-science data-visualization exercises jupyter machine-learning pandas projects python study

Last synced: 08 May 2026

https://github.com/rketson/controlflux

Projeto para controle de fluxo de pessoas em um ambiente indoor (Única entrada/saída)

automacao automation computer-network data-visualization esp esp8266 redes-de-computadores visualizacao-de-dados

Last synced: 28 Jan 2026

https://github.com/skhosla8/analytics-webpage

A webpage that uses JSON data to render product details, a line chart and table.

d3 data-visualization react redux

Last synced: 14 Apr 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/tommcn/ontario-hs-data

A simple visualization of Ontario High School data

data-visualization r

Last synced: 25 Oct 2025

https://github.com/antrita/predicting-house-prices-using-machine-learning

[WIP!] Task: Build a simple supervised machine learning regression model using the California Housing dataset from scikit-learn, evaluate it using appropriate metrics, and deploy it with a minimal Streamlit app for predicting housing prices.

data-science data-visualization house-price-prediction linear-regression machine-learning modelprediction python sklearn

Last synced: 19 Apr 2026