An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/faraazarsath/guvi-task-4

This repository contains Python scripts for assessing and categorizing student performance data from two CSV files. The tasks include categorizing students based on their CodeKata scores.

data-visualization matplotlib numpy pandas

Last synced: 18 Apr 2026

https://github.com/zoraizmohammad/restaurantsimulation

Simulating a restaurant under various conditions in Durham, NC

data-visualization r restaurant-management statistics toy-models

Last synced: 18 Apr 2026

https://github.com/niteshchawla/logistics-nn-regression

The case study is about India's Largest Marketplace for Intra-City Logistics. This dataset has the required data to train a regression model that will do the delivery time estimation, based on all those features.

adam-optimizer data-visualization encoding exploratory-data-analysis feature-engineering hidden-layers hyperparameter-tuning keras-tensorflow kerastuner metrics neural-network numpy pandas regression relu scaling sequential-models

Last synced: 10 Apr 2026

https://github.com/kezouke/grossing-films-dataviz

Project that extracts information about the highest-grossing films from Wikipedia, stores it in a database, and presents it through an interactive, modern web page hosted on GitHub Pages

data-visualization data-wrangling json sqlite wikipedia

Last synced: 18 Apr 2026

https://github.com/laiba-iqrar/hands-on-machine-learning

Repository containing solutions to CS-324 (Machine Learning) practice problems assigned by my professors.

data-visualization feature-engineering gradient-descent regression-models

Last synced: 18 Apr 2026

https://github.com/p1n2o/ngx-oracle-dv

Angular component to embed visualizations from Oracle Analytics Cloud into your Angular application.

angular data-visualization embed oracle-analytics-cloud oracle-visualization

Last synced: 18 Apr 2026

https://github.com/shridhar1504/rafik-s-kitchen-data-analysis

The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 10 May 2026

https://github.com/thytranx/datalens

Datalens aims to enable interactive 3D exploration of complex, multi-dimensional datasets. Designed for intuitive usability, it is accessible to users without advanced programming skills or specialized hardware.

3d-visualization data-visualization glfw3 opengl

Last synced: 18 Apr 2026

https://github.com/prakashjha1/loan-eligibility-prediction

This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.

data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest

Last synced: 19 Apr 2026

https://github.com/robinmillford/sales-metrics-dashboard-streamlit

This Streamlit dashboard provides an interactive and comprehensive analysis of customer behavior, regional sales trends, and revenue insights. The dashboard enables businesses to identify key performance metrics, customer segments, and revenue drivers, supporting data-driven decision-making.

dashboard data-analysis data-visualization duckdb sales-analysis sales-dashboard streamlit-dashboard

Last synced: 19 Apr 2026

https://github.com/reallyabdullah/machine-learning-portfolio

Explore my Machine Learning Portfolio Repository for impactful projects showcasing my expertise in fraud detection, MLOps, and cloud deployment. Dive into innovative solutions and let's shape the future of AI together! 🚀

artificial-intelligence cloud-deployment data-analytics data-science data-visualization llms ml mlops nlp python

Last synced: 19 Apr 2026

https://github.com/hhrh/real-time-fraud-detection

A production-style real-time fraud detection pipeline using Kafka, FastAPI, XGBoost/CatBoost/LightGBM, Prometheus, and Grafana.

apache-kafka catboost data-visualization fastapi grafana lightgbm mlops prometheus python xgboost

Last synced: 10 May 2026

https://github.com/kirankumar-ak777/vaccine_usage_analysis_and_prediction

This Vaccine Prediction Project predicts individual likelihood of receiving H1N1 or seasonal flu vaccines using demographic, medical, and behavioral data. Through data processing, feature selection, and logistic regression, it builds a model to better understand vaccine uptake behaviors.

data-visualization machine-learning pandas python

Last synced: 10 Apr 2026

https://github.com/georgiosioannoucoder/2023-fall-data-science-ta

These are my code examples for the 2023-fall-data-science-ta as a Data Science Teaching Assistant at CUNY Tech Prep (CTP) Cohort 9. 📊

dashboard data-visualization decision-tree eda huggingface image-classification machine-learning ml neural-network nlp pandas random-forest regression teaching-assistant transformer

Last synced: 10 May 2026

https://github.com/akash-v7/telecom_customer_churn_prediction

A machine learning project to predict customer churn in the telecom industry using data analysis and classification models. The project includes data preprocessing, exploratory data analysis (EDA), model building, and insights to help telecom companies improve customer retention strategies.

data-analysis data-science data-visualization jupyter-notebook machine-learning predictive-modeling python

Last synced: 20 Apr 2026

https://github.com/aplgr/grovegrid

Interactive growth maps for rows of trees. CSV in; single HTML out.

alpinejs cli csv data-visualization echarts forestry go heatmap

Last synced: 25 May 2026

https://github.com/kmbuki/uk_police_data

R programming - Using open data about crime and policing in England, Wales and Northern Ireland.

data-analysis data-visualization r

Last synced: 04 Jun 2026

https://github.com/karlyndiary/spotify-excel-dashboard

Data Analysis on the Spotify Dataset using Microsoft Excel and VBA.

charts data-analysis data-cleaning data-visualization excel excel-export excel-vba pivot-tables

Last synced: 04 Jan 2026

https://github.com/anjaliwork20/moodify

Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists and playlists using Machine learning

artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs

Last synced: 20 Apr 2026

https://github.com/arda-guler/binmotion

Convert ANY data to a video file. Sister project of binGallery.

data data-visualization proof-of-concept video

Last synced: 04 Jun 2026

https://github.com/xre22zax/roller-coaster

Explore award-winning wood and steel coasters from 2013-2018 Golden Ticket Awards & Captain Coaster, all powered by Python and interactive visualizations.

analytics data-analysis data-visualization pandas python python-lambda python3 visualization

Last synced: 20 Apr 2026

https://github.com/mnkanout/patients_medication_prediction

The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.

data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3

Last synced: 29 Jun 2025

https://github.com/bitkeks/hn-vis

Visualization of the Hacker News frontpage with Python and R.

data-visualization ffmpeg scraping

Last synced: 20 Apr 2026

https://github.com/robinmillford/hr-analytics-employee-performance-analysis

HR Analytics: Unveiling Employee Performance - A comprehensive exploration of employee data using SQL and Power BI, uncovering key insights for strategic HR decision-making.

data-analysis data-visualization jupyter-notebook powerbi python3 sql

Last synced: 20 Apr 2026

https://github.com/msrsaditya/distrometer

Data analysis and insights on Linux distribution's subreddit statistics.

data-science data-visualization linux python streamlit

Last synced: 20 Apr 2026

https://github.com/dickyalfauzi/f1-dash

<p align="center"> <picture> <source media="(prefers-color-scheme: dark)" srcset="./dash/public/tag-logo.png" width="200"> <img alt="f1-dash" src="./dash/public/tag-logo.png" width="200"> </picture></p><h1 align="center">Real-time Formula 1 telemetry and timing</h1>## f1-dashA real-time F1 dashboard that shows the leader board, t

bun cars dashboard data-visualization f1 formula1 nextjs plotly-dash races realtime rust telemetry typescript

Last synced: 15 Mar 2026

https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio

This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time

data data-analysis data-visualization pandas seaborn

Last synced: 10 May 2026

https://github.com/mituskillologies/ds-ref-cdac-aug24

Program of refresher course on Data Science conducted for CDAC officials at CDAC Headquarters, Pune in August 2024.

data-science data-visualization machine-learning mysql python-programming r-programming sql

Last synced: 10 May 2026

https://github.com/kristinbaumann/voting-spots-imperfect-circles

Animated visualization for voting results with imperfect circles

canvas d3 data-visualization dataart

Last synced: 21 Apr 2026

https://github.com/chauxvive/fccscatter

An interactive D3.js scatter plot visualizing doping allegations in professional cycling. Built as part of FreeCodeCamp’s Data Visualization certification

d3 d3js data-visualization dataviz

Last synced: 21 Apr 2026

https://github.com/eracodex/d3js-heatmap

Global Heatmap presentation with D3js and Reactjs

d3js data-science data-visualization heatmap javascript reactjs

Last synced: 07 Apr 2025

https://github.com/datasqlsantosh/electric_vechicle_data_anlaysis

In this personal Electric Vehicle Data Analysis Portfolio project, an exploratory data analysis was performed on the Electric Vehicle Data available on Kaggle. The main aim of the project is to uncover insights into the Electric Vehicle Good and profits trends.

data-cleaning data-visualization electric-vehicle power-bi sql sql-server

Last synced: 05 Jun 2026

https://github.com/aureliennicosiaulaval/ggcircular

A ggplot2 extension for circular, axial and directional data.

circular-statistics data-visualization directional-data ggplot2 r

Last synced: 05 Jun 2026

https://github.com/robinmillford/optimizing-treatment-plans-through-data-analysis

The primary focus was on understanding customer health, treatment, and associated charges over multiple years.

data-analysis data-visualization healthcare mysql powerbi sql

Last synced: 22 Apr 2026

https://github.com/eracodex/d3js-treemap

Data visualization with D3js Treemap

d3js data-science data-visualisation data-visualization reactjs

Last synced: 07 Apr 2025

https://github.com/rajesh9943/sentiment-analysis-of-consumer-opinions-on-amazon-products

Developed a comprehensive Sentiment Analysis System aimed at classifying Amazon product reviews into positive, neutral, and negative sentiments. The project leveraged advanced Natural Language Processing (NLP) techniques alongside machine learning algorithms to deliver accurate and actionable insights from customer feedback

amazon data-analysis data-manipulation data-preprocessing data-presentation data-visualization machine-learning nlp nlp-library nltk product-reviews-analysis sentiment-analysis sklearn-library word-cloud-generator-in-python-3

Last synced: 05 Jun 2026

https://github.com/dineshkumarkotha/pycityschools-analysis

PyCitySchools Analysis is a data analysis project that examines standardized test performance across a city's school district. Using Python and Pandas, the project provides insights into how different factors—such as school type, size, and spending—impact student performance in math and reading.

data-visualization pandas

Last synced: 22 Apr 2026

https://github.com/jordansrowles/rowles.winforms.vizmorph

A versatile WinForms chart control that smoothly transitions between line and bar chart modes, featuring customizable colors, axes, animations, and persistent value labels. Ideal for dynamic data visualization with support for negative values and real-time updates.

data-visualisation data-visualization winforms winforms-controls

Last synced: 23 Apr 2026

https://github.com/tranngoca5039/bigquery-a5y

📊 Streamline your data analysis with bigquery-a5y, a powerful tool for optimizing BigQuery performance and improving query efficiency.

analytics api big-data bigquery cloud-computing data-analysis data-integration data-management data-pipeline data-visualization data-warehouse google-cloud machine-learning serverless sql

Last synced: 05 Jun 2026

https://github.com/barraharrison/spotify-listening-trends

Using EDA to look at song longevity, regional preferences, and streaming behavior in the charts and on Spotify.

data-analysis data-visualization jupyter-notebook kaggle-dataset

Last synced: 03 Feb 2026

https://github.com/ppatrzyk/heatmap

Display CSV as a heatmap in terminal

csv data data-visualization terminal

Last synced: 24 Apr 2026

https://github.com/gowhale/daily-spend-analysis

Python script to analyse spending habits.

data-visualization pandas python

Last synced: 24 Apr 2026

https://github.com/arunabhagit/inventory-misalignment-and-revenue-loss-in-multi-store-bike-retail

This project focuses on identifying the inventory and demand mismatch causing stagnant sales and lost revenue in a bike retail chain. By analyzing store-level performance and regional customer preferences, the project aims to detect underperforming products.

data-analysis data-visualization powerbi python

Last synced: 24 Apr 2026

https://github.com/datalopes1/bank_marketing

Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez

data-analysis data-science data-visualization eda python

Last synced: 24 Apr 2026

https://github.com/knutsynstad/tic-tac-toe-poster

Visualizing the 765 unique gameboards of tic-tac-toe as a matrix using react and create-react-app for a solution space poster.

data-visualization matrix react svg visualization

Last synced: 10 May 2026

https://github.com/andreazoccatelli/europe_emissions_shinyapp

A shiny app which shows the gas emissions of european countries from 2011 to 2020

data-science data-visualization emissions-co2

Last synced: 24 Apr 2026

https://github.com/avnigoyal25/ipl_eda

Exploratory Data Analysis on IPL datasets

data-visualization eda python

Last synced: 24 Apr 2026

https://github.com/yuvrajsaraogi/-iris-flower-classification

Iris flower has three species; setosa, versicolor, and virginica, which differs according to their measurements. Now assume that you have the measurements of the iris flowers according to their species, and the task is to train a machine learning model that can learn from the measurements of the iris species and classify them.

classification data data-analysis data-science data-visualization flower flower-classification iris iris-classification iris-flower iris-flower-classification knn knn-classification machine-learning machine-learning-algorithms ml natural-language-processing nlp python

Last synced: 24 Apr 2026

https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project

My drive to know which products, regions, categories and customer segments a company should target or avoid, I search and selected an appropriate dataset on kaggle which will match a standard superstore requirement.

data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3

Last synced: 10 Apr 2026

https://github.com/tuni56/ventas_streamlit

interactive sells and KPI's dashboard

dashboard data-visualization kpi python streamlit webapp

Last synced: 24 Apr 2026

https://github.com/flazefy2/ds-mobilesdataset

https://www.kaggle.com/datasets/abdulmalik1518/mobiles-dataset-2025

csv data-visualization jupiter-notebook python

Last synced: 24 Apr 2026

https://github.com/drkbluescience/ibm-datascience-spacex

In this project, we predict whether the Falcon 9 first stage will land successfully by following the data science methodology.

data-visualization data-wrangling machine-learning-algorithms sql-query sqlite webscraping-data

Last synced: 10 May 2026

https://github.com/huacenxu/predict-loan-status

Using the Cross-Industry Standard Process of Data Mining (CRISP-DM), this project analyzes loan data from Prosper to identify key factors that predict loan status.

bootcamp-project data-science data-visualization data-w loan-prediction-analysis

Last synced: 25 Apr 2026

https://github.com/marielachirinosr/cyclistic-data-analytics-project

This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.

data data-visualization pandas powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/pedrohdosanjos/economic-data-analysis

This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.

api data-analysis data-visualization jupyter-notebook python

Last synced: 24 Apr 2026

https://github.com/ianjure/average-precipitation-map

A 3D data visualization of average precipitation using R.

data-visualization philippines r

Last synced: 16 Aug 2025

https://github.com/mehmetkahya0/gallstone_dataset_analysis_project

Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)

analysis analytics data data-analysis data-science data-visualization database graph matplotlib python

Last synced: 25 Apr 2026

https://github.com/chetanmalviya513/assembly-election-analysis-data-insights-engagement

Scraped and analyzed real-time election data to build interactive dashboards showcasing seat trends, vote share distribution, and postal ballot stats. The analysis uncovered insights on voting patterns, winning margins, and candidate forfeitures. Visual storytelling and timely data updates helped the project gain strong engagement on social media.

assembly data-visualization dataanalytics descriptive-statistics election-analysis election-data msexcel news tableau-dashboards webscraping

Last synced: 25 Apr 2026

https://github.com/satvikpraveen/pandasplayground

📊 A comprehensive pandas mastery project with 10 modular Jupyter notebooks covering data loading, cleaning, grouping, merging, time series, visualization, and performance profiling. Includes real-world workflows, Docker, Streamlit, and reusable utils. Ideal for data scientists and analysts to learn, practice, and refer. Practice-ready and modular.

analytics cheatsheet data-analysis data-cleaning data-pipeline data-science data-visualization docker etl exploratory-data-analysis jupyter-notebook jupyterlab learning-resource memory-profiling open-source pandas performance-tuning python streamlit time-series

Last synced: 10 Apr 2026

https://github.com/tanyakuznetsova/world-happiness-report-2023-in-europe

Happiness Insight '23: Navigating global joy. Exploring trust's role in life satisfaction with my World Happiness Report analysis.

citizen-science data-storytelling data-visualization global-indicators life-satisfaction social-trust world-happiness-report

Last synced: 25 Apr 2026

https://github.com/rohitinu6/indian-used-car-price-prediction

The aim of this project to predict the price of the used cars in indian metro cities by analyzing the car's features such as company, model, variant, fuel type, quality score and many more.

algorithms analytics car-price-prediction data-science data-visualization eda machine-learning machine-learning-algorithms python regression used-cars

Last synced: 04 May 2026

https://github.com/gerhynes/d3-character-frequencies

A character frequency analyzer built using D3.js. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 25 Apr 2026

https://github.com/ganesh774218/students-health-predictor

An end-to-end machine learning project that applies logistic regression to identify students at risk of depression based on demographic, academic, and lifestyle features. This repository includes data preprocessing, feature engineering, model training, evaluation metrics, and visualizations to provide actionable insights.

data-science data-visualization jupyter-notebook logistic-regression machine-learning machine-learning-algorithms python regression-models

Last synced: 19 May 2026

https://github.com/virajbhutada/diamond-price-estimator

This project develops a predictive model to estimate diamond prices based on characteristics like carat, cut, color, and clarity. It covers data preprocessing, feature engineering, model selection, training, and evaluation. The final product is a web app where users can input diamond attributes to get accurate and instant price predictions.

cross-validation css data-analysis data-science-projects data-visualization eda feature-engineering html hyperparameter-tuning jupyter-notebooks machine-learning ml-algorithms model-deployment model-selection performance-optimization predictive-modeling python python-app user-interface

Last synced: 14 Apr 2026

https://github.com/ddihora1604/iit_patna

A multifaceted project involving applying ML models like Ridge Classifier, RNN, RIDOR, Rotation Forest and RUSBoost, integrating SMOTE for class balancing, and handling diverse datasets including those for seating arrangement tasks.

data-analysis data-visualization datamodelling machine-learning-algorithms python

Last synced: 25 Apr 2026

https://github.com/fyonietz/infera

IDE For Data Science Or Data Analysis

cpp data-science data-visualization lightweight

Last synced: 25 Apr 2026

https://github.com/ladaegorova18/data_analysis

Learning the basics of data analysis in Python

analytics data-analysis data-visualization steam-games

Last synced: 24 Jun 2026

https://github.com/dingaaling/webcam-mirror

Use a webcam as a mirror to view your NYC/FB data identities

data-identities data-visualization facial-detection facial-keypoints flask-application opencv

Last synced: 25 Apr 2026

https://github.com/dulajkavinda/matplotlib-ml

📊Data visualisation with matplotlib library.

data-visualization jupyter-notebook matplotlib python seaborn

Last synced: 25 Apr 2026

https://github.com/waleedgeorgy/ml_sklearn

Implementation of various machine learning algorithms for regression and classification & feature engineering.

data-visualization jupyter-notebook machine-learning python

Last synced: 26 Apr 2026

https://github.com/danilowskic/queue-initialisation

An app shows data structure called queue. Describes how it works and what it is based on.

data-structures data-visualization java-8 queue queue-simulation

Last synced: 26 Apr 2026

https://github.com/lasyakonduru/web-app-for-sentiment-analyzer-a-comprehensive-tool-for-analyzing-text-and-dataset-sentiments

A powerful and user-friendly tool for analyzing sentiments in text and datasets. This app leverages advanced sentiment analysis techniques to provide real-time insights, helping users classify text as Positive, Negative, or Neutral, and visualize sentiment trends for better decision-making.

data-visualization machine-learning natural-language-processing python sentiment-analysis streamlit vader-sentiment-analysis webapp

Last synced: 26 Apr 2026

https://github.com/minervarose/exoplanet-discovery-observatory

Interactive Tableau exploration of exoplanet discoveries, planetary systems, and discovery trends using NASA Exoplanet Archive data.

analytics astronomy business-intelligence dashboard data-visualization exoplanets nasa space tableau tableau-public

Last synced: 24 Jun 2026

https://github.com/deliprofesor/cinematic-data-analytics-and-recommendation-platform

This project analyzes a movie dataset using machine learning algorithms to predict success, explore revenue-popularity relationships, and develop recommendation systems. It employs techniques like K-Means, DBSCAN, GMM, decision trees, PCA, and NLP for insights and personalized suggestions.

clustering content-based-recommendation data-analysis data-visualization decision-tree gmm k-means machine-learning natural-language-processing nlp pca predictive-modeling python recommendation-system scikit-learn user-based-recommendation

Last synced: 26 Apr 2026

https://github.com/blankscreen-exe/data-mining-lab

Course Code: CS626, MCS Batch-2019 (Final Year) Evening

data-visualization datamining datamining-algorithms datascience

Last synced: 26 Feb 2025

https://github.com/monarch1108/customerinsights-kmeans

understanding customers using KMeans and RFM(recency, frequency & monetary) analysis

data-analysis data-visualization kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn

Last synced: 11 May 2026

https://github.com/odinleepro/airbnbnewyorkcityanalysis

AirbnbNewYorkCityAnalysis is a comprehensive data analysis and visualization project exploring short-term Airbnb rental trends across New York City (2008–2022). Using open source Airbnb data, the project combines data cleaning, statistical summaries, and Tableau dashboards to uncover pricing patterns, borough level distribution, and insights.

airbnb analytics-project data-analysis data-cleaning data-science data-visualization new-york-city real-estate-analytics tableau urban-analysis

Last synced: 27 Apr 2026

https://github.com/mohdumair8896/stock-market-analysis-and-forecasting

This is a project of Stock Market Analysis And Forecasting Using Deep Learning(pytorch,gru).

data-visualization machine-learning prediction python

Last synced: 27 Apr 2026

https://github.com/parnika798/election-result-dashboard

Streamlit dashboard exploring correlations between ad spending and voter turnout in the 2024 Indian General Elections. Includes dynamic filters, intuitive charts, and election campaign impact exploration.

2024-indian-general-elections data-visualization election-data-analysis interactive-dashboard streamlit

Last synced: 22 May 2026

https://github.com/arda-guler/koerimei

KOERI Mapping Extension Interface. Maps latest earthquakes detected by Kandilli Observatory and Earthquake Research Institude.

data-visualisation data-visualization earthquake earthquake-visualization earthquakes geography map mapping

Last synced: 07 Jun 2026