An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/warazkhan/airplane-crashes-and-fatalities-since-1908-

This project analyzes airplane crash data (1908-2008) ✈️📊 to uncover trends in aviation accidents, fatalities, and safety improvements. Using exploratory data analysis (EDA) and data visualization, we examine key factors influencing crashes, identify high-risk regions, and explore advancements in aviation safety.

data-analysis data-visualization exploratory-data-analysis

Last synced: 10 Jun 2026

https://github.com/sumit-sinha9/t20wc_2022-best-11

Selecting the best 11 players from the 2022 T20 World Cup using Python web scraping, Pandas, and Power BI

data-visualization pandas powerbi python webscraping

Last synced: 07 May 2026

https://github.com/mahmoudnamnam/fc-barcelona-reports

FC Barcelona Reports: An interactive web application to analyze and visualize FC Barcelona's match data. Built with Streamlit, it scrapes match data from WhoScored, stores it in MongoDB, and presents insights through interactive visualizations like pass networks, shot maps, and player statistics.

data-analysis data-visualization football-analytics mplsoccer pandas streamlit web-scraping

Last synced: 07 May 2026

https://github.com/antrikshy/personalmovieanalysis

Finds interesting patterns in an IMDb ratings export; written as a Jupyter notebook, viz using Seaborn

data-visualization imdb jupyter-notebook movie-ratings pandas python seaborn

Last synced: 07 May 2026

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook that analyzes the salaries of Federal District government public servants using Python, Pandas, and Plotly Express, as a solution to the exercise set in the "Data Discovery and Visualization" course.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/leandrocollares/employment-insurance-beneficiaries

A responsive line chart that shows regular Employment Insurance beneficiaries in Canada between 2019 and 2021

d3 data-visualization svelte

Last synced: 07 May 2026

https://github.com/amarlearning/exploring-67-years-of-lego

In this project, I explore a database of every LEGO set ever built.

data-manipulation data-visualization importing-and-cleaning-data jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/cnoret/hexa-watts

Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)

data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit

Last synced: 07 May 2026

https://github.com/sammdu/global-warming-hurricane-typhoon

The Effect of Global Warming on Hurricane and Typhoon Occurrence

data-science data-visualization global-warming hurricane-data

Last synced: 07 May 2026

https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation

GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim

content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream

Last synced: 07 May 2026

https://github.com/nishumehta/sales-analysis-project

This project aims to analyze sales performance using Excel, SQL, Python, Tableau, and Power BI. The goal is to extract insights from sales data, identify trends, and visualize key performance indicators (KPIs).

data-cleaning data-visualization eda excel matplotlib-pyplot pandas python3 tableau-dashboards

Last synced: 08 May 2026

https://github.com/ropaxyz/octobot-octopus-energy-discord-bot

A Discord bot for Octopus Energy users to track and visualize their energy consumption. Integrates with Octopus Energy's API to fetch and display personalized energy data, costs, and usage charts.

asyncio data-visualization discord-bot energy-monitoring graphql matplotlib octopus-energy octopus-energy-api python rest-api sqlite

Last synced: 08 May 2026

https://github.com/moiri-gamboni/vintedcalculator

📊 Smart pricing calculator for Vinted sellers - optimize your listings based on storage, seasonality, and market dynamics

clothing-resale data-visualization e-commerce inventory-management pricing-calculator react recharts sales-optimization seasonal-pricing shadcn-ui tailwindcss typescript vinted

Last synced: 08 May 2026

https://github.com/js-konda/naturaldisasterseda

The project repository for the exploratory data analysis of natural disasters, done as part of the ECE143 course at UCSD

data-science data-visualization pandas python visualization

Last synced: 08 May 2026

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 08 May 2026

https://github.com/vanshuchaudhary/flightpriceanalysis-

The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.

business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis

Last synced: 08 May 2026

https://github.com/jessicaevelin/estudos

Repository with activities, exercises, and projects completed during my Data Science studies, based on courses, books, videos, and online content.

data-science data-visualization exercises jupyter machine-learning pandas projects python study

Last synced: 08 May 2026

https://github.com/vinit714/player-retention-analysis

A complete Streamlit + Machine Learning + SHAP + NLP project to analyze, predict, and improve player retention in games. This project simulates a game environment, models churn behavior, and provides insights using SHAP, NLP word clouds, and strategy simulators.

churn-prediction classification data-visualization eda feature-engineering game-analytics game-data-analysis gaming-analytics machine-learning model-interpretability nlp pandas player-retention python retention-analysis sckiit-learn shap streamlit wordcloud

Last synced: 08 May 2026

https://github.com/iyashwantsaini/911_capstone

For this capstone project we will be analyzing some 911 call data from Kaggle.

capstone data-science data-visualization python3

Last synced: 10 Jun 2026

https://github.com/koushikphy/covid-19-visualizer

A python plotly-dash app showing different statistics regarding Coronavirus 2019

covid-19 covid19-data covid19-tracker dash data-visualization plotly-dash webapp

Last synced: 08 May 2026

https://github.com/drod75/burger_king_analysis

A simple analysis of a Burger King dataset.

data-analysis data-visualization jupyter-notebook pandas python seaborn

Last synced: 09 May 2026

https://github.com/erikad88/belly-button-challenge

This project is an interactive dashboard that visualizes the Belly Button Biodiversity dataset, which catalogs microbes found in human navels.

css d3js dashboard data-visualization html javascript json plotly

Last synced: 09 May 2026

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/prishabhanot/skin_cancer_classification_model

Classifies 7 types of skin cancer lesions using a deep learning CNN model. Processes and balances the dataset, trains the model, and evaluates its accuracy with visualizations.

cnn confusion-matrix data-visualization keras machine-learning medical-imaging python tensorflow

Last synced: 09 May 2026

https://github.com/smahala02/materials-science-image-analysis

Image analysis for materials science with a focus on particle diameter measurement and image scaling using Python.

data-visualization image-analysis materials-science particle-measurement python

Last synced: 09 May 2026

https://github.com/piras-s/braincancerclassifier

Classifying brain tumors using Gaussian Naive Bayes with MRI-derived features. Includes feature selection, model evaluation, prediction uncertainty, and probability calibration.

baysian-inference calibrated-classification classification data-visualization feature-selection machine-learning medical-imaging naive-bayes-classifier python scikit-learn uncertainty-estimation

Last synced: 09 May 2026

https://github.com/shyamkumarnagilla/big-sales-prediction

The "Big Sales Prediction" model is a machine learning project that aims to accurately forecast sales for a given period. The model utilizes the Random Forest Regressor algorithm, a powerful ensemble learning technique, to analyze historical sales data and make predictions. It can be valuable for businesses looking to optimize sales forecasting.

data-analytics data-preprocessing data-science data-visualization machine-learning model-evaluation model-training

Last synced: 09 May 2026

https://github.com/abhinav330/msc-project

AI-Powered Chatbot for University Websites: this project enhances the usability of university websites by providing an AI-driven chatbot powered by advanced Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG).

chatbot data-science data-visualization finetuning-llms gemma2 llama3 llama3-finetune llm llm-inference mistral-7b nlp ollama phi-3-mini rag research-project

Last synced: 09 May 2026

https://github.com/shridhar1504/rafik-s-kitchen-data-analysis

This project analyzes the sales and expenses data of a famous fast-food restaurant. It focuses on gaining insights that will boost future sales and on business strategies to improve profit margins. Tools used: SQL, Python, Power BI, and MS Office tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 10 May 2026

https://github.com/jbalooshie/stock-analysis

A VBA script that performs basic stock analysis. Created while participating in a Data Analytics Bootcamp.

data-science data-visualization excel microsoft vba vba-excel vba-macros vba-script

Last synced: 20 Jan 2026

https://github.com/saisurajmatta/cryptocurrency-market-analyzer-python-project

Cryptocurrency Market Analyzer: Python script utilizing CoinMarketCap API to fetch, analyze, and visualize real-time trends of top 15 cryptocurrencies over different time intervals.

data-analytics data-visualization matplotlib pandas python seaborn

Last synced: 05 May 2026

https://github.com/rembertdesigns/pit-stop-simulator

An interactive F1 race simulator using Reinforcement Learning (PPO, Q-learning) and Streamlit to optimize pit stop strategies based on dynamic conditions.

data-science data-visualization f1 formula-1 ppo python python3 q-learning reinforcement-learning sckiit-learn stable-baselines3 streamlit-application

Last synced: 22 Jun 2025

https://github.com/zaydabash/envirowatch

Real time environmental dashboard for live air quality monitoring with anomaly detection, interactive maps, and natural language commands. Built with Next.js, TypeScript, and OpenAQ.

air-quality data-visualization environmental-data environmental-monitoring maplibre nextjs openaq realtime-dashboard recharts shadcn-ui tailwindcss typescript vercel zustand

Last synced: 09 Apr 2026

https://github.com/theshashanksinha/deloitte-au

Analyzed telemetry and salary equality data using Tableau and Excel to identify machine downtime patterns and assess gender pay equity, translating raw data into actionable business insights.

data-analytics data-visualization microsoft-excel tableau

Last synced: 06 Mar 2026

https://github.com/jofaval/ionosphere

Binary Classification of Ionosphere signals at Goose Bay, Labrador in 1988

data-analysis data-science data-visualization deep-learning google-colab keras machine-learning python scikit-learn tensorflow uci xgboost

Last synced: 09 Apr 2026

https://github.com/nicolascampbell97/cobertura-de-salud-censo-2022-power-bi

Power BI dashboards on the state of health coverage in Argentina, based on the results of the 2022 census

dashboard data-analytics data-science data-visualization dax-functions excel hypothesis-testing power-bi power-query r-studio statistical-inference tidyverse

Last synced: 22 Jun 2025

https://github.com/joselvillaronga/wifi-scan-measure-raspberry-pi-4

Flask-based web tool that scans WiFi networks on 2.4 GHz and 5 GHz, estimates distances from RSSI, and offers interactive visualizations of channels, signal levels, and scan history. Supports storage in JSON or MongoDB and runs as a systemd service for continuous monitoring.

data-visualization debian flask graphs iot json mongodb network-monitoring network-tools python rssi-distance-estimation systemd wifi-scanner

Last synced: 09 Apr 2026

https://github.com/balajimohan18/loan-clustering-datascience-project

This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes information about the loan amount and balance, then applies a clustering algorithm to group similar loans together.

clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning

Last synced: 27 Jul 2025

https://github.com/vedantshi/tableau-bike-data-dashboard

London Bike Rides Analysis explores bike usage patterns using data visualization and machine learning. It identifies trends through a dynamic moving average, analyzes weather impact with heatmaps, and provides actionable insights via an interactive Tableau dashboard. Tools: Python, Tableau.

data-analysis data-visualization python tableau weather-data

Last synced: 16 May 2026

https://github.com/maxbiostat/diehl_ebola_cell_2016

Supplementary code and data for Diehl et al., 2016 (Cell)

data-analysis data-visualization disease-spread ebola mutation

Last synced: 11 Jul 2025

https://github.com/aglowraph/gromacs-xvg-plot-script

A Python script for automating the plotting of .xvg files from GROMACS simulations, with dynamic labeling, time unit detection, and colorful visualization. This script reads, plots, and saves each .xvg file in the same directory, making data analysis more efficient.

automation computational-chemistry data-visualization gromacs matplotlib molecular-dynamics numpy python scientific-computing xvg-plotting

Last synced: 18 May 2026

https://github.com/as16082023/manufacturing-downtime-analysis

In the Maven Analytics data challenge, analyzed manufacturing downtime for a soda production company using Excel, identifying key issues and root causes of delays. Insights were shared through tables, charts, and a concise report with actionable recommendations.

advanced-excel data-visualization excel

Last synced: 20 Jan 2026

https://github.com/simulacrum6/wh-career-graph

A visual exploration tool for career-paths in the Warhammer Fantasy Role Playing Game

data-visualization graph-algorithms javascript web

Last synced: 10 Sep 2025

https://github.com/leonardoberlatto/1000-startups-analytics

Data analytics on startups data using Tableau

analytics data-science data-visualization tableau

Last synced: 11 Jan 2026

https://github.com/rafinha0rafinha/web-analyzer-backend

(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.

azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer

Last synced: 10 Apr 2026

https://github.com/sakan811/gachascope

Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.

data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves

Last synced: 04 May 2026

https://github.com/chiefinnovator/daysonpurpose

A single-page, no-dependency web app that lets a user enter their birth date, country, and gender to estimate remaining days, weeks, months, and years based on life expectancy data, with caching, fallback dataset, and accessible, responsive UI.

accessible-ui data-visualization life-expectancy react single-page-application typescript web-app

Last synced: 04 Apr 2026

https://github.com/wurstbroteater/hometemp

Measure the temperature and humidity of a room, retrieve online weather data, visualize it, analyse it, and send it via email.

apartment-management-system data-visualization raspberry-pi scraped-data temperature temperature-monitoring temperature-sensor

Last synced: 11 Jul 2025

https://github.com/hirudikaanupama/predicting-term-deposit-subscriptions

The purpose of this project is to help banks and financial institutions identify potential customers for term deposit subscriptions, optimize marketing strategies, and improve conversion rates using data-driven insights.

data-cleaning data-imbalance-handling data-normalization data-transformation data-visualization exploratory-data-analysis hyperparameter-tuning neural-network random-forest

Last synced: 11 Jul 2025

https://github.com/saravanansuriya/streamlit

Streamlit Tutorial for machine learning and data science.

data-visualization python-script streamlit-webapp

Last synced: 18 May 2026

https://github.com/sharinas/mapped_travel_locations

A web-based Python mapping project of specific places around the world, with interactive pop-ups and color coded markers. Project uses folium, pandas, python, and a .csv file to store data.

csv data-visualization folium mapping pandas pipenv python

Last synced: 18 May 2026

https://github.com/adnanrahin/nlp-with-disaster-tweets

Kaggle Competition: Predict which Tweets are about real disasters and which ones are not. Natural Language Processing.

data-analysis data-science data-visualization kaggle-competition machine-learning natural-language-processing regular-expression tweets

Last synced: 21 Jun 2025

https://github.com/ianjure/simple-corr

A simple data correlation visualizer built in Streamlit.

data-visualization streamlit

Last synced: 18 May 2026

https://github.com/pkjjoshi/restaurants-analysis

Performed beginner-level EDA on a restaurant dataset using Python. Analyzed top cuisines, city-wise ratings, price ranges, and online delivery impact using Pandas and Matplotlib. Includes 4 well-structured notebooks with visual insights.

beginner-project data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas python restaurant-data seaborn

Last synced: 21 Jun 2025

https://github.com/yash22222/olympic-games-analytics-using-apache-spark

The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.

apache apache-kafka apache-spark big-data-analytics csv data data-analytics data-visualization databricks excel mysql olympics regions

Last synced: 03 May 2026

https://github.com/onlinebunker/iris-flower

Exploratory Data Analysis of Iris Flower Classification Data

data-visualization eda pandas

Last synced: 28 Apr 2026

https://github.com/smpotts/dash-live-updates

Figuring out how to do live updates in Dash Plotly.

dash-plotly data-visualization python

Last synced: 27 Jul 2025

https://github.com/mae776569/weratedogs-wrangling

Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations

data-analysis data-science data-visualization tweets twitter-api

Last synced: 25 Jan 2026

https://github.com/atharvkadammm/suicide-prediction-system

A machine learning project predicting suicide risk based on multiple socio-economic and environmental factors using data mining techniques.

csv data-analysis data-science data-visualization datamining exploratory-data-analysis feature-engineering machine-learnin matplotlib mental-health numpy pandas riskassesment seaborn sklearn suicide-prediction supervised-

Last synced: 01 Jul 2025

https://github.com/atharvkadammm/calmlytic

An end-to-end machine learning project that predicts anxiety severity using classification models (Naive Bayes, Decision Tree, SVM, Logistic Regression, XGBoost), based on lifestyle, health, and behavioral features.

anxiety-prediction classification csv data-analysis data-preprocessing-and-cleaning data-science data-visualization ensemble-learning logistic-regression machine-learning-algorithms matplotlib mental-health numpy pandas python sci-kit-learn seaborn supervised-learning svm xgboost

Last synced: 21 Jun 2025

https://github.com/willmeyers/usgs-groundwater-trends

Visualized USGS groundwater level trends

data-visualization

Last synced: 30 Oct 2025

https://github.com/rafay99-epic/metricmate

Metric Mate is a modern, Python-based GUI tool for visualizing and analyzing gaming performance metrics with a sleek Tokyo Night theme.

data-visualization python python-gui-tkinter python-script

Last synced: 11 May 2025

https://github.com/benzerinsio/onlineretail-tableau

📊 A basic interactive dashboard built in Tableau to explore an online store's sales, with visualizations of revenue by region and trends over time.

data-visualization eda sales-analysis tableau visualizacao-de-dados

Last synced: 09 Feb 2026

https://github.com/luka-j/csw5-eda

Materials for CS Week 5 lecture on exploratory data analysis

data-visualization r shiny tidyverse

Last synced: 26 Apr 2026

https://github.com/rezowanrahat/netflix_analysis

Data analysis of Netflix content using Python, Pandas, and Seaborn

data-analysis data-visualization netflix pandas python

Last synced: 07 May 2026

https://github.com/jessicaevelin/datascience

Repository with activities, exercises, and projects completed during my Data Science studies, based on courses, books, videos, and online content.

data-science data-visualization exercises jupyter machine-learning pandas projects python study

Last synced: 21 Jun 2025

https://github.com/arction/lcjs-example-0009-severalaxisxy

A demo application showcasing using multiple axes in LightningChart JS.

axis chart data-visualization lcjs lightningchart-js

Last synced: 12 Mar 2025

https://github.com/m-dadej/excess_deaths_poland

Estimation of excess deaths during COVID-19 pandemic in Poland

covid-19 data-science data-visualization rstats time-series

Last synced: 14 May 2026

https://github.com/alpkanoz/ibm_data_science_professional_certificate

The repository contains projects and training materials carried out throughout the IBM data science professional course.

classification clustering data-analysis data-science data-visualization dataframe ibm ibm-watson machine-learning mathplotlib pandas predictive-modeling python scikit-learn

Last synced: 07 Mar 2026

https://github.com/easonlai/covid19_hk_analysis

This is a code sample of data analysis (with visualization) for COVID-19 cases in Hong Kong. Data is obtained from the official data.gov.hk.

covid-19 data-analytics data-science data-visualization matplotlib pandas python seaborn seaborn-plots

Last synced: 12 Apr 2026

https://github.com/mfakhriazhar/ecom-qtt-prediction

In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.

data-analysis data-science data-visualization e-commerce-project eda machine-learning python

Last synced: 19 May 2026

https://github.com/guomaimang/magic-vaccine

A study of the spread of COVID-19 with and without a vaccine; also the group project for COMP1433 (Introduction of data analysis).

data-science data-visualization r-language

Last synced: 11 Jan 2026

https://github.com/hirudikaanupama/student-score-prediction-linear-regression

Student scores are predicted and analyzed from selected features using only the linear regression machine learning algorithm. This project covers all methods of linear regression theory.

cross-validation data-cleaning data-visualization hyperparameter-tuning jupiter-notebook lasso-regression linear-regression machine-learning-algorithms multiple-linear-regression prediction-model python regularization ridge-regression student-score-prediction

Last synced: 26 Apr 2026

https://github.com/kate8382/frontend-module

Frontend module for a web application with user authentication, real-time dashboard, and data management

authentication dashboards data-visualization frontend

Last synced: 21 Jun 2025