An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/mugilan1309/csv_analyzer

📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.

csv-processing data-analysis data-visualization machine-learning python streamlit

Last synced: 04 May 2026

https://github.com/balajimohan18/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-science data-testing data-visualization forecasting forecasting-models machine-learning model-evaluation predictive-modeling python regression-algorithms salesforecast scipy sklearn-library supervised-learning

Last synced: 04 May 2026

https://github.com/flytomarsz/bike-sharing-system-analysis

This analysis project aim to identify bike rental's behavior in 2012 from Capital Bikeshare system, Washington D.C., USA. This project is part of my Data Analysis study at Dicoding.

data-analysis data-visualization jupyter-notebook python streamlit

Last synced: 04 May 2026

https://github.com/dhruvsrikanth/basic-data-science

A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!

analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn

Last synced: 05 May 2026

https://github.com/s4rrar/israel-gaza-war

Human losses of the Israeli genocide in Gaza and the West Bank in numbers (October 7th War Victims)

data-engineering data-science data-visualization datascience jupyter jupyter-notebook python

Last synced: 05 May 2026

https://github.com/riyouuyt/investigate-hotel-business-using-data-visualization

Explore hospitality data, visualizing customer behavior in hotel reservations and its impact on cancellations for strategic insights.

businessreporting data-visualization hotel-booking jupyter-notebook matplotlib-pyplot presentation python

Last synced: 05 May 2026

https://github.com/dona-eric/projet-etudiant

Objectif : Étudier et Analyser les facteurs qui influencent l'engagement des étudiants et les niveaux de risque dans un contexte éducatif .

data-science data-visualization fastapi machine-learning sante-etudiant streamlit

Last synced: 05 May 2026

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/codewithmayank-py/box-office-analysis-with-seaborn-and-python

This repository contains Python code and datasets for analyzing box office data. Explore trends, patterns, and factors influencing movie performance.

analysis box-office-data-analysis data-analysis data-visualization dataset jupyter-notebook matplotlib pandas python3 seaborn

Last synced: 05 May 2026

https://github.com/soupu07/ibm-employee-attrition-prediction

The aim of this project analyzes factors driving IBM employee attrition and predicts those likely to leave, helping the organization understand turnover causes and improve retention and performance.

data-science data-science-projects data-visualization machine-learning python python-data-analysis python-programming-language python-project

Last synced: 05 May 2026

https://github.com/sundarmd/digital_twin_for_li-ion_batteries

Digital Twin for Li-ion batteries on AWS built using S3, EC2, SageMaker, Redshift, Terraform, QuickSight

aws-ec2 aws-s3 data-visualization iot python3 pytorch sql terraform

Last synced: 05 May 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/shruthin4/ipl-cricket-analysis-2007-2024

In-depth IPL Cricket Data Analysis (2007–2024) with visual insights on teams, players, and match outcomes.

analysis cricket data-visualization eda ipl python sports-analytics

Last synced: 05 May 2026

https://github.com/ledsouza/covid19

Projeto de análise de dados dos casos de Covid19

data-science data-visualization matplotlib pandas seaborn vitrinedev

Last synced: 05 May 2026

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/lkasym/smart-dynamic-pricing

An AI-powered dynamic pricing system using Dueling DQN and customer behavior simulation, with a full-stack React + Flask dashboard for real-time insights and performance benchmarking.

ai-project data-visualization deep-learning dqn-tensorflow ecommerce full-stack-ai machine-learning reinforcement-learning tensorflow

Last synced: 05 May 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/robmcelhinney/roman-emperors

D3.js Bar Charts of Roman Emperors' Age and length of Reign from 63 BCE to 395 CE

d3 d3js data-visualization emperor roman rome

Last synced: 06 May 2026

https://github.com/benjaminrose/data-analysis-book

A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis

book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course

Last synced: 06 May 2026

https://github.com/rehanvhora778/bibtex-extraction

📄 Extract BibTeX entries from PDFs automatically, generating a complete bibliography without manual input or reliance on external APIs.

academic-writing analysis automation bibliometric-analysis bibliometrics bibtex data-visualization langchain latex metadata-extraction pdf pyhton pypdf reference-management research-tools

Last synced: 06 May 2026

https://github.com/iamrajmani/sentimental-analysis

Sentimental Analysis - Final Year College Project

data-analysis data-visualization machine-learning python pytorch

Last synced: 06 May 2026

https://github.com/ibrahimceyisakar/hotel-finder

Hotel finder system with Python includes data gathering, analyzing, and visualization.

data-analysis data-gathering data-visualization pandas plotly python selenium streamlit

Last synced: 06 May 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/filip-kustura/exchange-rates-visualizer

This project is a currency exchange rate trend visualizer, enabling interactive exploration of historical exchange rate data. Built with JavaScript, jQuery, AJAX and Canvas, it features dynamic data retrieval, interactive graphs and responsive controls.

ajax api canvas css currency-exchange data-visualization exchange-rates html javascript jquery

Last synced: 06 May 2026

https://github.com/swapnil-jain/whatsapp-chat-statistics

Web Application showing common statistics, graphs and various charts about the uploaded Whatsapp chat.

data-analytics data-science data-visualization heroku-deployment streamlit whatsapp

Last synced: 06 May 2026

https://github.com/harryrlk/data_analysis_showcase

This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.

data-analysis data-science data-visualization excel portfolio python r tableau

Last synced: 06 May 2026

https://github.com/tetchen9/mapa

A map of a trip to Europe. Using d3.js, Eurostat dataset in GeoJSON.

cartography d3 d3-visualization data-visualization eurostat-data geojson maps typescript

Last synced: 06 May 2026

https://github.com/sahilmate/ebm-breast-cancer-classifier

This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.

breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn

Last synced: 06 May 2026

https://github.com/bartosz-ziolkowski/social_data

Data analysis of San Francisco's crime data set from 2003 to 2018

data-science data-visualization jupyter-notebook numpy pandas python

Last synced: 06 May 2026

https://github.com/urbanekda/upwork_dashboard

A data analysis project examining trends and patterns in the data science job market on Upwork. This project analyzes job postings, requirements, and market demands to provide insights into the freelance data science ecosystem.

data-analysis data-science data-science-projects data-visualization freelance jupyter-notebook python streamlit

Last synced: 07 May 2026

https://github.com/karlyndiary/coffee-shop-sales-analysis

Comprehensive analysis of coffee shop sales utilizing Pandas for data cleaning and exploratory data analysis (EDA), complemented by Streamlit for creating interactive data visualization dashboards.

data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard

Last synced: 07 May 2026

https://github.com/badr-moufad/dashboard-agri-edge-frontend

Dashboard of Moroccan weather data adapted to the wheat calendar. This is part of my research internship.

clustering dashboard data-visualization morocco-regions plotlyjs reactjs redux tailwindcss weather

Last synced: 07 May 2026

https://github.com/ganesh774218/eda-book-store

Exploratory data analysis on a book store dataset to uncover sales trends, popular genres, and top publishers.

data-visualization datacleaning datamanipulation eda matplotlib numpy pandas python pythonp pythonproject seaborn

Last synced: 07 May 2026

https://github.com/alekiie/streamlit-dashboard

A dashboard that utilizes the power of streamlit charts to create intuitive and easy to understand charts for data visualization.

data-visualization matplotlib numpy pandas python3 streamlit

Last synced: 07 May 2026

https://github.com/warazkhan/airplane-crashes-and-fatalities-since-1908-

This project analyzes airplane crash data (1908 - 2008)✈📊 to uncover trends in aviation accidents, fatalities, and safety improvements. Using exploratory data analysis (EDA) and data visualization, we examine key factors influencing crashes, identify high-risk regions, and explore advancements in aviation safety.

data-analysis data-visualization exploratory-data-analysis

Last synced: 10 Jun 2026

https://github.com/jgohel9902/property-analytics-u.s.-owned-and-leased-properties

This project focuses on analyzing the U.S. Inventory of Owned and Leased Properties using datasets from Data.gov. It includes SQL queries for data cleaning and trend analysis, Excel for manipulation and reporting, Python for automated workflows and exploratory data analysis, and Power BI for creating interactive dashboards to visualize key insights

data-visualization dataanalysis excel jupyter-notebook pandas powerbi python sql

Last synced: 07 May 2026

https://github.com/bryanhe24/data_analysis_app

A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.

ai data data-analysis data-visualization fullstack-development javascript math python reactjs

Last synced: 07 May 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/mahmoudnamnam/fc-barcelona-reports

FC Barcelona Reports: An interactive web application to analyze and visualize FC Barcelona's match data. Built with Streamlit, it scrapes match data from WhoScored, stores it in MongoDB, and presents insights through interactive visualizations like pass networks, shot maps, and player statistics.

data-analysis data-visualization football-analytics mplsoccer pandas streamlit web-scraping

Last synced: 07 May 2026

https://github.com/antrikshy/personalmovieanalysis

Finds interesting patterns in an IMDb ratings export; written as a Jupyter notebook, viz using Seaborn

data-visualization imdb jupyter-notebook movie-ratings pandas python seaborn

Last synced: 07 May 2026

https://github.com/ddihora1604/advanced_business_analytics_on_world_bank_global_financial_inclusion_data_2021

Bridging the Gaps in Financial Inclusion: Understanding the Cash-Credit Paradox, Divide between Cash and Digital Payments, and Financial Resilience.

advanced-excel business-analytics data-analysis data-engineering data-mining data-visualization database exploratory-data-analysis machine-learning preprocessing-data python

Last synced: 07 May 2026

https://github.com/athenyx04/arion

Smart animal weighing module for Demeter

data-visualization firebase livestock nextjs

Last synced: 07 May 2026

https://github.com/amarlearning/exploring-67-years-of-lego

In this project, I have explored database of every LEGO set ever built.

data-manipulation data-visualization importing-and-cleaning-data jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/cnoret/hexa-watts

Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)

data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit

Last synced: 07 May 2026

https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation

GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim

content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream

Last synced: 07 May 2026

https://github.com/moustafamohamed01/mall-customer-segmentation-data

Customer segmentation using K-Means clustering based on annual income and spending score.

data-science data-visualization k-means-clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 08 May 2026

https://github.com/nishumehta/sales-analysis-project

This project aims to analyze sales performance using Excel, SQL, Python, Tableau, and Power BI. The goal is to extract insights from sales data, identify trends, and visualize key performance indicators (KPIs).

data-cleaning data-visualization eda excel matplotlib-pyplot pandas python3 tableau-dashboards

Last synced: 08 May 2026

https://github.com/bnvulpe/regression-and-time-series

This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.

colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting

Last synced: 08 May 2026

https://github.com/femincan/d3-scatterplot-graph

My solution for the Visualize Data with a Scatterplot Graph project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 08 May 2026

https://github.com/moiri-gamboni/vintedcalculator

📊 Smart pricing calculator for Vinted sellers - optimize your listings based on storage, seasonality, and market dynamics

clothing-resale data-visualization e-commerce inventory-management pricing-calculator react recharts sales-optimization seasonal-pricing shadcn-ui tailwindcss typescript vinted

Last synced: 08 May 2026

https://github.com/dsaikiran01/seismomap

An interactive React + Leaflet web app that visualizes real-time global earthquakes from the USGS API with live map markers, magnitude filtering, dark mode, and responsive UI.

data-visualization earthquakes geoscience material-ui react react-leaflet tailwindcss usgs-api vite

Last synced: 08 May 2026

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/js-konda/naturaldisasterseda

The project repository for the Exploratory Data analysis of natural disasters done as part of ECE143 course at UCSD

data-science data-visualization pandas python visualization

Last synced: 08 May 2026

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 08 May 2026

https://github.com/tsear/reddit-discourse-project

Mapping emotional and conceptual discourse across Reddit philosophy communities.

data-visualization emotion-detection network-analysis nlp pandas reddit-api sentiment-analysis spacy text-mining tf-idf topic-modeling

Last synced: 08 May 2026

https://github.com/rightfulcode/retail-sales-breakdown

Time Series Analysis of Walmart Retail Sales – Internship project analyzing sales trends, seasonal patterns, and revenue breakdowns using Pandas, Matplotlib, and Seaborn.

data-analytics data-visualization elevvo-internship matplotlib pandas python retail-sales seaborn time-series-analysis

Last synced: 08 May 2026

https://github.com/jessicaevelin/estudos

Repositório com atividades, exercícios e projetos realizados durante meus estudos em Ciência de Dados, baseados em cursos, livros, vídeos e conteúdos da internet.

data-science data-visualization exercises jupyter machine-learning pandas projects python study

Last synced: 08 May 2026

https://github.com/satvikpraveen/numpymasterpro

A hands-on, production-ready toolkit to master NumPy — from first principles to real-world applications. Includes modular Jupyter notebooks, reusable utility scripts, cheatsheets, and advanced projects like K-Means clustering from scratch.

broadcasting data-analysis data-science data-source data-visualization jupyter-notebook kmeans-clustering linear-algebra machine-learning matrix-algebra numerical-computation numpy numpy-broadcasting numpy-examples numpy-tutorial open-source python scientific-computing standardization vectorization

Last synced: 08 May 2026

https://github.com/iyashwantsaini/911_capstone

For this capstone project we will be analyzing some 911 call data from Kaggle.

capstone data-science data-visualization python3

Last synced: 10 Jun 2026

https://github.com/hasan9519/dhaka_apartment_sale_listings_analysis

Apartment sale listings analysis based on Dhaka city visualized by Tableau Dashboard

data-cleaning data-visualization python selenium tableau webscraping

Last synced: 09 May 2026

https://github.com/drod75/burger_king_analysis

A simple analysis on a burger king dataset.

data-analysis data-visualization jupyter-notebook pandas python seaborn

Last synced: 09 May 2026

https://github.com/tsbarr/toronto-open-data

Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses

api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics

Last synced: 09 May 2026

https://github.com/arif-rachmat/powershell-wpf-serialmonplot

A modern, WPF-based Serial Port Monitor and Real-time Data Plotter Powershell script

data-visualization powershell powershell-script serial-communication windows wpf

Last synced: 09 May 2026

https://github.com/cgoliver/nlplotlib

Web-server for natural language based data visualization.

data-visualization flask matplotlib ml nlp reinforcement-learning web-app

Last synced: 09 May 2026

https://github.com/msukmanowsky/datadrawer

A handy little utility for when you want some time series data to prototype and don't want to write code.

d3 data-visualization prototype prototyping vue vuejs

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/prishabhanot/skin_cancer_classification_model

Classifies 7 types of skin cancer lesions using a deep learning CNN model. Processes and balances the dataset, trains the model, and evaluates its accuracy with visualizations.

cnn confusion-matrix data-visualization keras machine-learning medical-imaging python tensorflow

Last synced: 09 May 2026

https://github.com/smahala02/materials-science-image-analysis

Image analysis for materials science with a focus on particle diameter measurement and image scaling using Python.

data-visualization image-analysis materials-science particle-measurement python

Last synced: 09 May 2026

https://github.com/priyanshul28/exercise_pandas

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib.

data-cleaning data-visualization matplotlib numpy pandas

Last synced: 09 May 2026

https://github.com/samuelson777/iris-flower-classification

Iris Flower Classification: A machine learning project that classifies iris flowers into three species based on sepal and petal dimensions. Includes data exploration, visualization, and model evaluation using Python and scikit-learn.

classification data-science data-visualization iris-dataset jupyter-notebook machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/piras-s/braincancerclassifier

Classifying brain tumors using Gaussian Naive Bayes with MRI-derived features. Includes feature selection, model evaluation, prediction uncertainty, and probability calibration.

baysian-inference calibrated-classification classification data-visualization feature-selection machine-learning medical-imaging naive-bayes-classifier python scikit-learn uncertainty-estimation

Last synced: 09 May 2026

https://github.com/shyamkumarnagilla/big-sales-prediction

The "Big Sales Prediction" model is a machine learning project that aims to accurately forecast sales for a given period. The model utilizes the Random Forest Regressor algorithm, a powerful ensemble learning technique, to analyze historical sales data and make predictions. It can be valuable for businesses looking to optimize sales forecasting.

data-analytics data-preprocessing data-science data-visualization machine-learning model-evaluation model-training

Last synced: 09 May 2026

https://github.com/datalopes1/manufacturing_defects

Projeto de EDA utilizando o Manufacturing Defects que pode ser encontrado no Kaggle

data-analysis data-visualization eda exploratory-data-analysis python

Last synced: 09 May 2026