An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ccolpasm/demographic-data-analyzer

This project analyzes a demographic dataset to gain insights into various population characteristics. It uses data analysis tools like Python, Pandas for data manipulation, and Matplotlib for visualization, providing a clear view of age, salary, education, and occupation trends.

analytics data-visualization matplotlib-pyplot pandas-python

Last synced: 03 Apr 2025

https://github.com/rvalla/covid-19-caba

Some code to analyze open data from the city of Buenos Aires related to the COVID-19 pandemic.

covid-19 data-visualization python3

Last synced: 30 Oct 2025

https://github.com/kuanjiahong/covid19-analysis

A simple project to familiarize myself with data analysis

data data-science data-visualization pandas python

Last synced: 02 Apr 2025

https://github.com/jbalooshie/stock-analysis

A VBA script that performs basic stock analysis. Created while participating in a Data Analytics Bootcamp.

data-science data-visualization excel microsoft vba vba-excel vba-macros vba-script

Last synced: 20 Jan 2026

https://github.com/gerhynes/d3-birth-scatterplot

A scatterplot representing UN data on births in 2011. Built using D3 for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 27 Apr 2026

https://github.com/saisurajmatta/cryptocurrency-market-analyzer-python-project

Cryptocurrency Market Analyzer: Python script utilizing CoinMarketCap API to fetch, analyze, and visualize real-time trends of top 15 cryptocurrencies over different time intervals.

data-analytics data-visualization matplotlib pandas python seaborn

Last synced: 05 May 2026

https://github.com/al-ghaly/hotel-revenue-excel-analysis

Excel Dashboard to analyze data of a hotel over the past three years.

dashboard data-analysis data-visualization excel excel-analysis

Last synced: 02 Jan 2026

https://github.com/vedikasnehil/sql-50

This project focuses on solving 50 SQL problems every weekend from LeetCode to strengthen SQL skills, master advanced techniques, and build consistency. Each solution is documented with clear explanations, creating a valuable resource for learning and application.

data-visualization database-management sql

Last synced: 06 Jan 2026

https://github.com/leosolar8/stock-price-prediction-ai-model

This project shows how to use a special type of AI called Long Short-Term Memory (LSTM) to predict stock prices. The project is split into two main parts: Training the AI Model and Making Predictions (Inference)

ai csv-dataset data-science data-visualization deep-learning finance financial-data forecasting keras lstm machine-learning python rnn stock-market stock-prediction tensorflow time-series time-series-forecasting

Last synced: 08 Apr 2026

https://github.com/omerdduran/riskfactor-heart

This ML project predicts heart disease using logistic regression on the Cleveland Heart Disease UCI dataset, featuring advanced preprocessing and medical feature engineering, achieving 82.1% accuracy with strong cross-validation.

cardiovascular-health data-science data-visualization heart-disease-prediction logistic-regression machine-learning medical-ai scikit-learn

Last synced: 14 May 2026

https://github.com/fazej99/u.s-climate-and-temperature-analysis

This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.

data-analysis data-science data-visualization gis machine-learning streamlit

Last synced: 22 May 2026

https://github.com/samridhisainii/airbnb-data-analysis

Data analysis of an Airbnb dataset

analysis data data-visualization eda models

Last synced: 16 May 2026

https://github.com/jofaval/boston-housing

Regression Analysis into the Boston Housing in-demand pricing in 1978

boston-housing data-analysis data-science data-visualization machine-learning python regression

Last synced: 16 May 2026

https://github.com/dmytrori/himalayan_expeditions

Himalayan expedition stats, 1905–2020

alpinism data-analysis data-visualization pandas-python

Last synced: 21 Jun 2025

https://github.com/neuraladitya/electric_vehicle_sales_predictor

EV Sales Prediction in India using Machine Learning. Forecasts electric vehicle sales across Indian states with interactive visualizations and a modern web UI.

dashboard data-science data-visualization electric-vehicles ev-sales flask india machine-learning matplotlib prediction-model python random-forest sales-analysis

Last synced: 16 May 2026

https://github.com/aglowraph/gromacs-xvg-plot-script

A Python script for automating the plotting of .xvg files from GROMACS simulations, with dynamic labeling, time unit detection, and colorful visualization. This script reads, plots, and saves each .xvg file in the same directory, making data analysis more efficient.

automation computational-chemistry data-visualization gromacs matplotlib molecular-dynamics numpy python scientific-computing xvg-plotting

Last synced: 18 May 2026

https://github.com/betkh/datascieneinpython

Jupyter Notebook files

data-analysis data-visualization

Last synced: 16 Jun 2025

https://github.com/as16082023/manufacturing-downtime-analysis

An Excel analysis of manufacturing downtime for a soda production company, done for the Maven Analytics data challenge, identifying key issues and root causes of delays. Insights were shared through tables, charts, and a concise report with actionable recommendations.

advanced-excel data-visualization excel

Last synced: 20 Jan 2026

https://github.com/dcostachar/cyclistic-case-study

An analysis of Cyclistic bike-share data with SQL and Tableau to uncover usage trends and generate marketing strategies to boost annual memberships.

consumer-behaviour-analysis data-visualization exploratory-data-analysis marketing-analytics mysql sql tableau

Last synced: 27 Mar 2025

https://github.com/hemangsharma/bookingdataanalysisreport

The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.

analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard

Last synced: 14 May 2026

https://github.com/rafinha0rafinha/web-analyzer-backend

(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.

azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer

Last synced: 10 Apr 2026

https://github.com/ebrizzzz/data-visualization-project-using-tableau

A data visualization project for the Visual Data Analysis course (Spring Term 2025) at the University of Skövde. This project explores the factors influencing national happiness scores across different global regions from 2005 to 2022.

analytics data data-analysis data-science data-visualization python regression tableau

Last synced: 16 Jun 2025

https://github.com/mituskillologies/ds-dypsoe-jan25

Programs conducted at D.Y.Patil School of Engineering, Pune in training on Data Science during January 2025.

artificial-intelligence data-analytics data-science data-visualization machine-learning supervised-learning unsupervised-learning

Last synced: 18 Mar 2025

https://github.com/juanchiparra/30daychartchallenge

#30DayChartChallenge visualizations using D3.js

d3 data-visualization

Last synced: 03 Apr 2025

https://github.com/saravanansuriya/streamlit

Streamlit Tutorial for machine learning and data science.

data-visualization python-script streamlit-webapp

Last synced: 18 May 2026

https://github.com/sharinas/mapped_travel_locations

A web-based Python mapping project of specific places around the world, with interactive pop-ups and color-coded markers. The project uses folium, pandas, Python, and a .csv file to store data.

csv data-visualization folium mapping pandas pipenv python

Last synced: 18 May 2026

https://github.com/dmarks84/coursework_project_ml-classifier-eval-selection

Project for University of Michigan Applied Data Science Specialization -- Predicted viewer engagement based on features related to video metrics; evaluated a large set of classifiers under different scoring metrics to select the "optimal" one.

classification cross-validation data-modeling data-reporting data-visualization databases dataframes eda grid-search matplotlib numpy pandas python scikit-learn statistics supervised-ml

Last synced: 02 Apr 2026

https://github.com/ianjure/simple-corr

A simple data correlation visualizer built in Streamlit.

data-visualization streamlit

Last synced: 18 May 2026

https://github.com/whisplnspace/insightgenie

InsightGenie is an AI-powered data analyst that lets you upload files, ask questions, and get insights with visualizations

data-analysis data-science data-visualization deployment gemini-api huggingface nlp

Last synced: 19 Jun 2025

https://github.com/yash22222/olympic-games-analytics-using-apache-spark

The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.

apache apache-kafka apache-spark big-data-analytics csv data data-analytics data-visualization databricks excel mysql olympics regions

Last synced: 03 May 2026

https://github.com/nehul1149/olympic-data-analysis

This project is an interactive data visualization and analytics platform for exploring historical Olympic Games data. Built with Python and Streamlit, it offers an in-depth analysis of medal tallies, athlete statistics, and country-wise performance trends, providing users with powerful insights into the world's biggest sporting event.

analysis data-analysis data-science data-visualization matplotlib python streamlit

Last synced: 18 May 2026

https://github.com/snototter/viren2d

Visualization Toolbox for Computer Vision

computer-vision-tools cpp data-visualization python

Last synced: 15 May 2026

https://github.com/mae776569/weratedogs-wrangling

Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations

data-analysis data-science data-visualization tweets twitter-api

Last synced: 25 Jan 2026

https://github.com/toluwaa-o/stears-lite-overview

Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.

africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview

Last synced: 14 May 2026

https://github.com/alifeee/occupation-data

Plotting Industry and occupation data from the ONS 2021 Census

census data-visualization employment occupation office-for-national-statistics ons pie-chart

Last synced: 26 Jun 2025

https://github.com/willmeyers/usgs-groundwater-trends

Visualized USGS groundwater level trends

data-visualization

Last synced: 30 Oct 2025

https://github.com/rafay99-epic/metricmate

Metric Mate is a modern, Python-based GUI tool for visualizing and analyzing gaming performance metrics with a sleek Tokyo Night theme.

data-visualization python python-gui-tkinter python-script

Last synced: 11 May 2025

https://github.com/benzerinsio/onlineretail-tableau

📊 A basic interactive dashboard built in Tableau to explore an online store's sales, with visualizations of revenue by region and trends over time.

data-visualization eda sales-analysis tableau visualizacao-de-dados

Last synced: 09 Feb 2026

https://github.com/luka-j/csw5-eda

Materials for CS Week 5 lecture on exploratory data analysis

data-visualization r shiny tidyverse

Last synced: 26 Apr 2026

https://github.com/malakasupun/crime-data-analysis-of-lapd

This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).

data-analysis data-visualization llm nlp sql

Last synced: 29 Jul 2025

https://github.com/arction/lcjs-example-0009-severalaxisxy

A demo application showcasing using multiple axes in LightningChart JS.

axis chart data-visualization lcjs lightningchart-js

Last synced: 12 Mar 2025

https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm

The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.

algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script

Last synced: 17 May 2026

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/mfakhriazhar/ecom-qtt-prediction

In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.

data-analysis data-science data-visualization e-commerce-project eda machine-learning python

Last synced: 19 May 2026

https://github.com/robwiederstein/kytc_loc

Plot Kentucky licensing locations

data-visualization ggmap leaflet r xml2

Last synced: 31 Jul 2025

https://github.com/aran203/cricanalytics

ADSC Fall 24 Project for cricket analytics with hawkeye data

data-engineering data-visualization python streamlit

Last synced: 14 May 2026

https://github.com/yaser-123/movie_recommendation

The Movie Recommendation App is a Streamlit-based application that suggests movies based on user preferences, providing personalized suggestions, trailers, and essential details through an intuitive and interactive interface. The app uses data from the TMDB dataset and APIs like YouTube and OMDb.

data-visualization imdb jupiter-notebook kaggle omdb-api python streamlit tmdb-api youtube-api

Last synced: 06 May 2026

https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data

Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.

data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping

Last synced: 30 May 2026

https://github.com/travisbreaks/sovereign-matrix

Ruthless project prioritization system. Multi-dimensional weighted scoring with real-time visualization. Kill what doesn't matter. React 19 + Tailwind + Framer Motion.

dashboard data-visualization decision-framework framer-motion prioritization react tailwindcss typescript

Last synced: 01 Mar 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026

https://github.com/annaanastasy/classification-project-student-grades

A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.

catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling

Last synced: 29 Mar 2025

https://github.com/manuelgil/vscode-data-pack

This extension pack includes the essential extensions for data analysts.

data-analysis data-science data-structures data-visualization vscode-extension

Last synced: 07 Apr 2026

https://github.com/sparkerdata/hockeyshotmap

Interactive Streamlit app for NHL shot maps & player analysis. Pulls live (or demo) play-by-play data, normalizes rink coordinates, and visualizes shots with context filters (strength, period, player).

data-analysis data-visualization duckdb hockey hockey-analytics ice-hockey nhl nhl-data python sports sports-analytics

Last synced: 18 May 2026

https://github.com/robinmillford/analyzing-e-commerce-transactions---data-cleaning-cohort-analysis-and-sql

In this project, I aimed to analyze the profitability of products in an e-commerce dataset. I performed various SQL queries to extract valuable insights about product profitability, including the identification of the top 5 products with the highest profit margin, and unique combinations of brands and product lines with the highest profitability.

cohort-analysis data-analysis data-visualization excel jupyter-notebook powerbi python3 sql

Last synced: 18 May 2026

https://github.com/aryanpillai2007/credit-card-fraud-detection

The primary goal of this project is to develop a comprehensive fraud detection system that enhances the security and trustworthiness of financial transactions.

anomaly-detection classification credit-card-fraud data-preprocessing data-science data-visualization fraud-detection imbalanced-data logistic-regression machine-learning outlier-detection pca pca-analysis python roc-curve scikit-learn

Last synced: 18 May 2026

https://github.com/jibbs1703/austin-house-prices

This repository contains the exploratory data analysis and prediction model for house prices in Austin, Texas using data collected between 2018 and 2021. The data analyses and model results would be of importance to all stakeholders in the Austin housing market.

business-insights data-science data-visualization exploratory-data-analysis house-price-prediction

Last synced: 25 Jun 2025

https://github.com/kammarah/studentdata

I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓

connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp

Last synced: 18 May 2026

https://github.com/stefagnone/-ames-housing-analysis-feature-engineering-and-model-tuning

Data-driven analysis of the Ames Housing Dataset, combining advanced feature engineering and Stochastic Gradient Descent (SGD) regression model tuning. This repository showcases predictive modeling, hyperparameter optimization, and actionable insights for real estate analytics.

ames-housing-dataset data-visualization feature-engineering machine-learning predictive-modeling python real-estate-analytics regression-analysis sgd

Last synced: 18 May 2026

https://github.com/stefagnone/unsupervised-analysis-project

This project investigates the impact of video content on social media engagement using advanced analytics techniques like PCA, k-means clustering, and logistic regression. It provides actionable insights for optimizing social media strategies for Thai fashion and cosmetics retailers.

data-analysis data-visualization engagement-metrics facebook-live-sellers k-means-clustering logistic-regression marketing-insights pca-analysis python social-media-analytics

Last synced: 05 Apr 2025

https://github.com/stefagnone/data_storyboarding_visualization

Data Storyboarding and Visualization Techniques for Effective Communication

data-analysis data-visualization ggplot2-analysis r tableau-dashboards

Last synced: 05 Apr 2025

https://github.com/rorrell/rightwhaledata

A Jupyter Notebook where I wrangle some data on right whale sightings and create a visualization

data-analysis data-visualization jupyter-notebook python3

Last synced: 11 May 2026

https://github.com/hudson-newey/ecoacoustic-analysis-pipeline

A generalised pre-processing, metadata extraction, and analysis pipeline

data-visualization environment-variables pipeline

Last synced: 29 Apr 2025

https://github.com/nour-zayed/shopping-trends-analytics-sql-python-power-bi

"End-to-end Shopping Trends analytics project using SQL, Python, Excel & Power BI — data cleaning, EDA, KPI generation, and interactive dashboards with DAX for actionable business insights."

business-intelligence data-analysis data-visualization dax powerbi python sql

Last synced: 18 May 2026

https://github.com/vikasraparthi/human-chain

The AI Safety Incident Dashboard is an interactive frontend application designed to enhance your frontend development skills. This project focuses on creating a user-friendly interface to view and log hypothetical AI safety incidents, aligning with HumanChain's mission of promoting AI safety

ai-safety alerts dashboard data-visualization incident-management real-time-monitoring user-management

Last synced: 11 May 2025

https://github.com/andrew-dev-p/chartjs-showcase

Interactive data visualizations using Chart.js with smooth animations and dynamic updates

bar-chart chartjs charts css data-visualization html interactive-graphs javascript line-chart pie-chart

Last synced: 18 Feb 2026

https://github.com/arya920/stockpriceforecasting

The project combines NumPy, Seaborn, Matplotlib, Keras, and other technologies to integrate data manipulation, visualization, and machine learning.

data-visualization keras-tensorflow lstm-neural-networks modelling neural-network stock-market stock-price-prediction streamlit-webapp webapp

Last synced: 26 Mar 2025

https://github.com/sanogomamadou/projet-pipeline-complet-analyses-streaming

A pipeline for processing streaming video logs with Spark, Airflow, and PostgreSQL, visualized in Power BI to analyze user engagement and content popularity.

airflow data-visualization dataengineering powerbi spark

Last synced: 18 May 2026

https://github.com/kshitiz1302/pizza-sales-report

The report provides insights into pizza sales trends for 2015, focusing on peak periods, customer preferences for large pizzas, and the best-performing menu items.

data-cleaning data-management data-manipulation data-modeling data-storytelling data-visualization dax dax-expression dax-query mysql mysql-database mysqlworkbench powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals sql sql-server

Last synced: 18 May 2026

https://github.com/oshinrathor/Data-Science-Systems-and-Analytics-Projects

Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.

dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics

Last synced: 12 Sep 2025

https://github.com/casperkristiansson/finance-tracker

A project that solved a problem of mine: tracking my finances. This finance tracking application gives overviews of expenses and income, offering users an easy way to explore their data.

dashboard data-visualization finance-management firebase-auth react

Last synced: 29 Dec 2025

https://github.com/lovasoa/presidentielle

Charts of the latest IFOP polls for the 2017 French presidential election.

chart chartjs data-visualization elections francais france presidential

Last synced: 03 Apr 2025

https://github.com/benmar2406/rent-in-germany

Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.

charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte

Last synced: 26 Mar 2025

https://github.com/huynhtanphatt/diagnosing-uk-railway-performances

This project analyzes UK railway ticket and operation data to show how revenue, passenger demand, and on-time performance are connected.

data-analysis data-visualization datastorytelling python railway sql ticketing transportation

Last synced: 24 Apr 2026

https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics

Cleaned the data, performed EDA, and stored it in MySQL. Queried and analyzed the data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.

data-analysis data-visualization eda powerbi python sql

Last synced: 21 May 2026

https://github.com/salmaakhalifa199/iris_classification

Classify Iris species using K-Nearest Neighbors and explore dataset visualization and evaluation metrics.

classification data-visualization iris-dataset knn machine-learning-models python-3

Last synced: 26 Apr 2026

https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal

Holistic analysis of healthcare services for rare and orphan diseases and transplants covered by FISSAL in Peru.

data-analysis data-visualization python

Last synced: 24 Feb 2025

https://github.com/yrohitha/customer-segmentation

Cleans real online transaction data and applies the RFM technique to rank and group customers into clusters, identifying the best customers for targeted marketing campaigns.

data-cleaning data-science data-visualization datetime marketing-analytics pandas python3 user-segmentation

Last synced: 13 Mar 2025

https://github.com/sadratehranian/pem-fuel-cell

The methodology section details the use of Python for data processing and analysis, employing statistical and machine learning-based anomaly detection techniques to identify potential issues in fuel cell stacks. It emphasizes data preprocessing, feature engineering, exploratory data analysis (EDA), and anomaly detection.

anomaly-detection data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fuel-cell machine-learning preprocessing python statistical-analysis visual-studio-code

Last synced: 26 Mar 2025

https://github.com/zmyzheng/stack_overflow_qa_assistant

Big Data Analysis project with recommendation, cluster analysis and graph database

big-data-analytics cluster-analysis data-visualization graph-database hadoop mahout recommendation-system

Last synced: 30 Mar 2025