An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sagarprajapat2004/data-analysis-visualization

Downloaded and analyzed a dataset from Kaggle using NumPy and Pandas created visualizations with Matplotlib and Seaborn developed a Flask web application to showcase data insights and conclusions.

data-analysis data-modeling data-visualization exploratory-data-analysis flask python statical-analysis

Last synced: 04 May 2026

https://github.com/hyperplasma/olympic-visualization-analysis

Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.

data-analysis data-visualization matplotlib numpy pandas python wordcloud

Last synced: 04 May 2026

https://github.com/halyusa16/e-commerce-analysis

This project analyzes a public e-commerce dataset to uncover valuable insights and answer critical business questions. The dataset contains customer, product, order, and transaction details, providing a comprehensive view of the e-commerce platform's operations.

data-analysis data-cleaning data-exploration data-visualization self-project

Last synced: 09 Jun 2026

https://github.com/mugilan1309/csv_analyzer

📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.

csv-processing data-analysis data-visualization machine-learning python streamlit

Last synced: 04 May 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/drod75/nyc-arrests-analysis

This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.

data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/flytomarsz/bike-sharing-system-analysis

This analysis project aim to identify bike rental's behavior in 2012 from Capital Bikeshare system, Washington D.C., USA. This project is part of my Data Analysis study at Dicoding.

data-analysis data-visualization jupyter-notebook python streamlit

Last synced: 04 May 2026

https://github.com/benedart/interactive-information-visualization

Interactive Information Visualization Project

d3 data-visualization information-visualization

Last synced: 05 May 2026

https://github.com/riyouuyt/investigate-hotel-business-using-data-visualization

Explore hospitality data, visualizing customer behavior in hotel reservations and its impact on cancellations for strategic insights.

businessreporting data-visualization hotel-booking jupyter-notebook matplotlib-pyplot presentation python

Last synced: 05 May 2026

https://github.com/dona-eric/projet-etudiant

Objectif : Étudier et Analyser les facteurs qui influencent l'engagement des étudiants et les niveaux de risque dans un contexte éducatif .

data-science data-visualization fastapi machine-learning sante-etudiant streamlit

Last synced: 05 May 2026

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/saineshnakra/data-model-visualization-jupyter

A collection of Jupyter notebooks for visualizing machine learning models, echelon graphs, and data structures.

classification-visualization clustering-visualization-notebook data-models data-science data-visualization jupyter-notebook machine-learning python python3

Last synced: 05 May 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/shruthin4/ipl-cricket-analysis-2007-2024

In-depth IPL Cricket Data Analysis (2007–2024) with visual insights on teams, players, and match outcomes.

analysis cricket data-visualization eda ipl python sports-analytics

Last synced: 05 May 2026

https://github.com/kevinwood15/python_ml_neuralnetwork_project

I build a neural network to evaluate the CIFAR-10 dataset and achieve above 50% accuracy

cifar10 data-science data-visualization deep-learning neural-network python pytorch

Last synced: 05 May 2026

https://github.com/kammarah/data-sample

I designed a database website 🌐 that can be uploaded easily for use 📤. You can check my website 👀.

data-analysis data-visualization database deploy deployment library-management-system panaversity streamlit webapp

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/mito-ds/mitosheet_helper_config

The mitosheet_helper_config package used by enterprises to configure the mitosheet package.

data data-analytics data-science data-visualization jupyter pandas python

Last synced: 05 May 2026

https://github.com/femincan/d3-bar-chart

My solution for the Visualize Data with a Bar Chart project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 06 May 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/priyapuranik/diwali-sales-analysis_with-data-visualization

Diwali sales data analysis with visualizations, highlighting customer demographics, product categories, and purchase trends.

data-visualization matplotlib-pyplot pandas python seaborn

Last synced: 06 May 2026

https://github.com/benjaminrose/data-analysis-book

A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis

book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course

Last synced: 06 May 2026

https://github.com/rishisolanke/twitter-sentiment-analysis-using-machine-learning-

A research project that classifies tweets as positive, negative, or neutral using ML algorithms (Logistic Regression, Naïve Bayes, SVM) with NLP preprocessing.

data-science data-visualization logistic-regression machine-learning ml-models naive-bayes natural-language-processing nlp scikit-learn sentiment-analysis svm text-classification twitter-data

Last synced: 06 May 2026

https://github.com/rehanvhora778/bibtex-extraction

📄 Extract BibTeX entries from PDFs automatically, generating a complete bibliography without manual input or reliance on external APIs.

academic-writing analysis automation bibliometric-analysis bibliometrics bibtex data-visualization langchain latex metadata-extraction pdf pyhton pypdf reference-management research-tools

Last synced: 06 May 2026

https://github.com/hashkanna/youtube-wrapped-2025

🎬 Spotify Wrapped-style dashboard for YouTube watch history. Built with React + Tessl spec-driven development for EF x Tessl x Anthropic Hackathon.

analytics data-visualization hackathon react recharts tessl vite wrapped youtube

Last synced: 06 May 2026

https://github.com/iamrajmani/sentimental-analysis

Sentimental Analysis - Final Year College Project

data-analysis data-visualization machine-learning python pytorch

Last synced: 06 May 2026

https://github.com/ibrahimceyisakar/hotel-finder

Hotel finder system with Python includes data gathering, analyzing, and visualization.

data-analysis data-gathering data-visualization pandas plotly python selenium streamlit

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/gerhynes/d3-median-age-histogram

A D3 histogram displaying UN data on median ages. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 06 May 2026

https://github.com/swapnil-jain/whatsapp-chat-statistics

Web Application showing common statistics, graphs and various charts about the uploaded Whatsapp chat.

data-analytics data-science data-visualization heroku-deployment streamlit whatsapp

Last synced: 06 May 2026

https://github.com/harryrlk/data_analysis_showcase

This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.

data-analysis data-science data-visualization excel portfolio python r tableau

Last synced: 06 May 2026

https://github.com/harish-siva/analyzing-personal-expenses

This project aims to simulate an expense tracker for an individual using the Faker library. The project generates realistic monthly expense data, processes and stores it in a SQL database, and creates SQL queries to derive insights into spending behavior. Streamlit app is developed to visualize these insights and showcase the results of SQL queries

data-visualization expense-tracking exploratory-data-analysis financial-analysis matplotlib pandas python seaborn sql sqlalchemy streamlit

Last synced: 06 May 2026

https://github.com/arsh-jafri/econostats

A real-time economic data visualization platform that helps track and analyze key economic indicators through interactive charts, custom datasets, and FRED API integration.

aws data-visualization economic-data economics elasticbeanstalk federal-reserve flask fred-api

Last synced: 06 May 2026

https://github.com/ddeepanshu-997/datascience---youth-tobacco-survey-yts-data

Analysis of Cigarette Smoking Prevalence Among Middle and High School Students (1999-2017)

analysis data-science data-visualization graph matplotlib pandas python survey visualization

Last synced: 06 May 2026

https://github.com/verinverdian/smart-factory

Smart Factory Dashboard – A web-based factory management dashboard to monitor employees, inventories, and productions with real-time data visualization.

admin-dashboard bootstrap dashboard data-visualization factory-management laravel manufacturing php production-management smart-factory

Last synced: 06 May 2026

https://github.com/heyhaiden/mcp-ag-grid

Headless AG Grid server for advanced data visualization, manipulation, and export, seamlessly integrated with Claude Desktop.

ag-grid claude-desktop data-grid data-visualization headless-browser mcp open-source puppeteer

Last synced: 06 May 2026

https://github.com/urbanekda/upwork_dashboard

A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.

data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit

Last synced: 07 May 2026

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/sitek94/react-interactive-scatter-plot

An interactive scatter plot made with React and D3

d3 data-visualization react

Last synced: 07 May 2026

https://github.com/harshsahu23/wabtec3_brakesgui_shared

💻 App to visualize error logs and dump files generated by the braking system of locomotives 🚅

csv data-visualization graphs pandas plotly python streamlit visualization

Last synced: 07 May 2026

https://github.com/jgohel9902/property-analytics-u.s.-owned-and-leased-properties

This project focuses on analyzing the U.S. Inventory of Owned and Leased Properties using datasets from Data.gov. It includes SQL queries for data cleaning and trend analysis, Excel for manipulation and reporting, Python for automated workflows and exploratory data analysis, and Power BI for creating interactive dashboards to visualize key insights

data-visualization dataanalysis excel jupyter-notebook pandas powerbi python sql

Last synced: 07 May 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/sumit-sinha9/t20wc_2022-best-11

Finalizing best 11 players from 2022 T20 world cup using Python Web scraping, Pandas and Power BI

data-visualization pandas powerbi python webscraping

Last synced: 07 May 2026

https://github.com/citizenlabsgr/openbudgetkw

Promoting a deeper understanding of the Kentwood, MI city budget process and city priorities.

csv data-visualization jade javascript react

Last synced: 07 May 2026

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021

Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.

advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python

Last synced: 07 May 2026

https://github.com/danyal-faheem/project-logs-analyzer

This repo contains scripts to analyze project logs and display some charts related to the data

data data-visualization matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/cnoret/hexa-watts

Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)

data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit

Last synced: 07 May 2026

https://github.com/sammdu/global-warming-hurricane-typhoon

The Effect of Global Warming on Hurricane and Typhoon Occurrence

data-science data-visualization global-warming hurricane-data

Last synced: 07 May 2026

https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation

GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim

content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream

Last synced: 07 May 2026

https://github.com/satyam4229/identify-employee-attrition

This is the model where we predict the attrition of the employees of the company by checking there records and all. In the given dataset, we have the features like salary, environment, age, gender and their experience.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 08 May 2026

https://github.com/moustafamohamed01/mall-customer-segmentation-data

Customer segmentation using K-Means clustering based on annual income and spending score.

data-science data-visualization k-means-clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 08 May 2026

https://github.com/ropaxyz/octobot-octopus-energy-discord-bot

A Discord bot for Octopus Energy users to track and visualize their energy consumption. Integrates with Octopus Energy's API to fetch and display personalized energy data, costs, and usage charts.

asyncio data-visualization discord-bot energy-monitoring graphql matplotlib octopus-energy octopus-energy-api python rest-api sqlite

Last synced: 08 May 2026

https://github.com/bnvulpe/regression-and-time-series

This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.

colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting

Last synced: 08 May 2026

https://github.com/moiri-gamboni/vintedcalculator

📊 Smart pricing calculator for Vinted sellers - optimize your listings based on storage, seasonality, and market dynamics

clothing-resale data-visualization e-commerce inventory-management pricing-calculator react recharts sales-optimization seasonal-pricing shadcn-ui tailwindcss typescript vinted

Last synced: 08 May 2026

https://github.com/dsaikiran01/seismomap

An interactive React + Leaflet web app that visualizes real-time global earthquakes from the USGS API with live map markers, magnitude filtering, dark mode, and responsive UI.

data-visualization earthquakes geoscience material-ui react react-leaflet tailwindcss usgs-api vite

Last synced: 08 May 2026

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/alz-11/ftoc

Fahrenheit to Celsius conversion analysis, comparing accurate and approximate methods with visualizations and error analysis.

approximation celsius conversion data-visualization error-analysis fahrenheit fahrenheit-to-celsius jupyter jupyter-notebook matplotlib numpy python python-3 python3

Last synced: 08 May 2026

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 08 May 2026

https://github.com/rightfulcode/retail-sales-breakdown

Time Series Analysis of Walmart Retail Sales – Internship project analyzing sales trends, seasonal patterns, and revenue breakdowns using Pandas, Matplotlib, and Seaborn.

data-analytics data-visualization elevvo-internship matplotlib pandas python retail-sales seaborn time-series-analysis

Last synced: 08 May 2026

https://github.com/jessicaevelin/estudos

Repositório com atividades, exercícios e projetos realizados durante meus estudos em Ciência de Dados, baseados em cursos, livros, vídeos e conteúdos da internet.

data-science data-visualization exercises jupyter machine-learning pandas projects python study

Last synced: 08 May 2026

https://github.com/vinit714/player-retention-analysis

A complete Streamlit + Machine Learning + SHAP + NLP project to analyze, predict, and improve player retention in games. This project simulates a game environment, models churn behavior, and provides insights using SHAP, NLP word clouds, and strategy simulators.

churn-prediction classification data-visualization eda feature-engineering game-analytics game-data-analysis gaming-analytics machine-learning model-interpretability nlp pandas player-retention python retention-analysis sckiit-learn shap streamlit wordcloud

Last synced: 08 May 2026

https://github.com/hasan9519/dhaka_apartment_sale_listings_analysis

Apartment sale listings analysis based on Dhaka city visualized by Tableau Dashboard

data-cleaning data-visualization python selenium tableau webscraping

Last synced: 09 May 2026

https://github.com/tsbarr/toronto-open-data

Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses

api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics

Last synced: 09 May 2026