An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/callmemaverick/game-of-thrones-investigating-episodes

Data Science project to analyze the duration of Game of Thrones episodes

data-science data-visualization matplotlib pandas-python python

Last synced: 19 Dec 2025

https://github.com/kate8382/frontend-module

Frontend module for a web application with user authentication, real-time dashboard, and data management

authentication dashboards data-visualization frontend

Last synced: 21 Jun 2025

https://github.com/bayunova28/healthcare_analytics

This repository contains a data analytics project from the healthcare industry.

data-analytics data-engineering data-visualization healthcare pyspark sql

Last synced: 21 Jun 2025

https://github.com/dbolotov/ts_smoothing_visualizer

Streamlit app for visualizing and comparing time series smoothing methods on real and synthetic datasets.

data-science data-visualization streamlit time-series

Last synced: 24 Jul 2025

https://github.com/dmytrori/himalayan_expeditions

Himalayan expedition stats, 1905–2020

alpinism data-analysis data-visualization pandas-python

Last synced: 21 Jun 2025

https://github.com/anuj7411/bankofbaroda-candlestick-dashboard

An interactive stock market visualization project using Python, Pandas, and Plotly to analyze Bank of Baroda price movement through a candlestick dashboard.

candlestick-chart dashboard data-visualization financial-data jupyter-notebook pandas plotly python stock-market time-series

Last synced: 17 May 2026

https://github.com/pjaiswalusf/heart-failure-prediction

A machine learning project predicting heart failure risk using Random Forest and XGBoost. It involves data cleaning, feature engineering, and EDA before training. The best model is saved using Joblib. Key techniques: outlier detection, feature scaling, and optimization.

data-processing data-visualization feature-engineering machine-learning model-training optimization random-forest-classifier saving-model xgboost

Last synced: 07 Mar 2026

https://github.com/hirudikaanupama/email-spam-detection-logistic-regression

This model can predict whether an email is spam or not. The logistic regression machine learning algorithm is used to train this model.

accuracy-score classification classification-report confusionmatrix data-visualization logistic-regression machine-learning roc-curve

Last synced: 11 Sep 2025

https://github.com/rajesh9943/decoding-sales-patterns-strategic-insights-from-data

This project aims to identify the key drivers of sales and uncover patterns for strategic decision-making. This involves analyzing purchasing behavior by area, town, and commodity type, while also tracking customer choices over time. Descriptive statistics and time series analysis were used to reveal key sales trends across a four-year period.

data-analytics data-processing data-visualization data-wrangling reporting sales-analysis sales-growth

Last synced: 11 Jul 2025

https://github.com/samaalharbi2/virtual-work-experience---data-analysis-at-stc

Virtual Work Experience in Data Analysis at STC

analysis data data-visualization misk stc

Last synced: 20 Jun 2025

https://github.com/hariprasath-v/machinehack-music_genre_classification_weekend_hackathon_edition_2

Predict the genre of songs from tunable audio track features such as energy, tempo, key, mode, valence, and others.

data-visualization exploratory-data-analysis machine-learning

Last synced: 17 Apr 2026

https://github.com/theshashanksinha/deloitte-au

Analyzed telemetry and salary equality data using Tableau and Excel to identify machine downtime patterns and assess gender pay equity, translating raw data into actionable business insights.

data-analytics data-visualization microsoft-excel tableau

Last synced: 06 Mar 2026

https://github.com/tolumie/web-scraping-rest-api-stock-data-operations

Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.

api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping

Last synced: 19 May 2026

https://github.com/rmrt1n/chess_analysis_project

Webscraping and analysing games of Hikaru Nakamura

chess data-analytics data-visualization eda rvest tidyverse web-scraping

Last synced: 15 Jan 2026

https://github.com/kgelli/news-sentiment-analysis-pipeline-with-microsoft-fabric

End-to-end news sentiment analysis pipeline built with Microsoft Fabric, analyzing Bing News API data with sentiment analysis, visualization in Power BI, and real-time alerts via Teams

azure bing-api data-activator data-engineering data-pipeline data-visualization fabric microsoft-fabric one-lake-synapse power-bi sentiment-analysis

Last synced: 10 May 2026

https://github.com/rvalla/covid-19-caba

Some code to analyze open data from the city of Buenos Aires related to the COVID-19 pandemic.

covid-19 data-visualization python3

Last synced: 30 Oct 2025

https://github.com/leonardoberlatto/1000-startups-analytics

Data analytics on startups data using Tableau

analytics data-science data-visualization tableau

Last synced: 11 Jan 2026

https://github.com/jbalooshie/stock-analysis

A VBA script that performs basic stock analysis. Created while participating in a Data Analytics Bootcamp.

data-science data-visualization excel microsoft vba vba-excel vba-macros vba-script

Last synced: 20 Jan 2026

https://github.com/theshefer/covid-map

Interactive map showing COVID data, implemented in R

big-data data-visualization r r-studio

Last synced: 20 Jun 2025

https://github.com/saisurajmatta/cryptocurrency-market-analyzer-python-project

Cryptocurrency Market Analyzer: Python script utilizing CoinMarketCap API to fetch, analyze, and visualize real-time trends of top 15 cryptocurrencies over different time intervals.

data-analytics data-visualization matplotlib pandas python seaborn

Last synced: 05 May 2026

https://github.com/ahmetzamanis/clusteringcountry

Non-hierarchical k-medoids clustering on a dataset of country statistics.

clustering data-science data-visualization k-medoids machine-learning r rmarkdown unsupervised-learning

Last synced: 16 Dec 2025

https://github.com/stanleynguyen/so.cube

World map visualisation of World Cube Association data 🌏

cas cube data-visualization leaftlet map

Last synced: 24 Jul 2025

https://github.com/guomaimang/magic-vaccine

A study of the spread of COVID-19 with and without a vaccine; also the group project for COMP1433 (Introduction of data analysis).

data-science data-visualization r-language

Last synced: 11 Jan 2026

https://github.com/madrury/hot-sauce

Simulation of a Hot Sauce Spiciness Dataset

data-analysis data-science data-visualization dataset machine-learning

Last synced: 16 May 2026

https://github.com/shinjimc/simulated_annealing_tsp

This project applies Simulated Annealing to solve the Traveling Salesman Problem using Peru's departments as nodes. Through iterative refinement, it finds the shortest route visiting each department once. Visual feedback enhances understanding and debugging, resulting in an optimal solution displayed with total distance.

data-visualization geospatial-analysis simulated-annealing simulated-annealing-algorithm simulated-annealing-edge-detection traveling-salesman-problem traveling-salesman-problem-solver

Last synced: 24 Jul 2025

https://github.com/whisplnspace/insightgenie

InsightGenie is an AI-powered data analyst that lets you upload files, ask questions, and get insights with visualizations

data-analysis data-science data-visualization deployment gemini-api huggingface nlp

Last synced: 19 Jun 2025

https://github.com/lucasfloresc/final_project

This is the final project of the Ironhack Bootcamp. In this project I applied the methods and techniques learned in the Bootcamp, such as web scraping and API extraction, data cleaning and processing with Python, Python logic, machine learning, and data visualization, all displayed in Streamlit for a more user-friendly interface.

data-analysis data-visualization machine-learning python streamlit webscraping

Last synced: 08 May 2026

https://github.com/aglowraph/gromacs-xvg-plot-script

A Python script for automating the plotting of .xvg files from GROMACS simulations, with dynamic labeling, time unit detection, and colorful visualization. This script reads, plots, and saves each .xvg file in the same directory, making data analysis more efficient.

automation computational-chemistry data-visualization gromacs matplotlib molecular-dynamics numpy python scientific-computing xvg-plotting

Last synced: 18 May 2026

https://github.com/as16082023/manufacturing-downtime-analysis

In the Maven Analytics data challenge, analyzed manufacturing downtime for a soda production company using Excel, identifying key issues and root causes of delays. Insights were shared through tables, charts, and a concise report with actionable recommendations.

advanced-excel data-visualization excel

Last synced: 20 Jan 2026

https://github.com/mkk-1817/cvip-ds-exploratory_data_analysis-terrorism

This repository explores global terrorism trends by analyzing the Global Terrorism Database to uncover temporal patterns, identify top terrorist groups, examine attack types, and gain insights into geographical and success/failure dynamics.

coderscave data-analysis data-science data-visualization eda exploratory-data-analysis python terrorism-analysis

Last synced: 19 Jun 2025

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset: I worked on the Superstore Sales Dataset, performing data cleaning, preparation, and exploratory data analysis in Part 1. The main task, covered in Part 2, was to predict future sales using time-series analysis.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/rafinha0rafinha/web-analyzer-backend

(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.

azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer

Last synced: 10 Apr 2026

https://github.com/kaczmarj/car-safety-shiny

An R Shiny app -- final project for BMI 530

cars data-visualization nhtsa shiny visualization

Last synced: 02 Feb 2026

https://github.com/tanyakuznetsova/music_mental_health

Harnessing music's power for better mental health: genre recommendations and data-driven analysis of listeners' trends

data-visualization decision-tree decision-tree-classifier exploratory-data-analysis k-means-clustering pca-analysis recommendation-system recommender-system surprise-python

Last synced: 11 Jul 2025

https://github.com/mah-22/room-occupancy-prediction-using-environmental-sensor-data

This project uses environmental sensor data to predict room occupancy, providing valuable insights for efficient energy management and space utilization in buildings. By analyzing factors like temperature, humidity, and light levels, the model aims to accurately forecast when rooms will be occupied, optimizing resources and enhancing overall buildi

classification data-science data-visualization exploratory-data-analysis machine-learning numpy pandas python seaborn time-series

Last synced: 07 May 2026

https://github.com/saravanansuriya/streamlit

Streamlit Tutorial for machine learning and data science.

data-visualization python-script streamlit-webapp

Last synced: 18 May 2026

https://github.com/sharinas/mapped_travel_locations

A web-based Python mapping project of specific places around the world, with interactive pop-ups and color coded markers. Project uses folium, pandas, python, and a .csv file to store data.

csv data-visualization folium mapping pandas pipenv python

Last synced: 18 May 2026

https://github.com/marco210210/football-analytics

Football Analytics is a project that collects, analyzes, and visualizes performance data for football teams and players during the Serie A 2017/18 season, using database structures and machine learning models to provide insights into match events and player actions.

data-analytics data-preprocessing data-visualization football-analytics football-performance-analysis machine-learning mongodb mplsoccer python sports-data

Last synced: 27 Feb 2026

https://github.com/ianjure/simple-corr

A simple data correlation visualizer built in Streamlit.

data-visualization streamlit

Last synced: 18 May 2026

https://github.com/yaph/gh-browser-cloud

A word cloud based on browser mentions in GitHub commit messages.

big-query data-processing data-visualization github webbrowser wordcloud

Last synced: 16 May 2026

https://github.com/yash22222/olympic-games-analytics-using-apache-spark

The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.

apache apache-kafka apache-spark big-data-analytics csv data data-analytics data-visualization databricks excel mysql olympics regions

Last synced: 03 May 2026

https://github.com/nafisrayan/crypto-trading-platform

This React Crypto Exchange Template is designed to provide a solid foundation for building a comprehensive cryptocurrency exchange platform. With its sleek and modern design, this template is perfect for anyone looking to create a user-friendly and intuitive trading experience.

crypto dashboard data-analysis data-visualization react template

Last synced: 16 May 2026

https://github.com/matte34/auto-insurance-analysis

Conducted a comprehensive exploratory data analysis (EDA) on an auto insurance dataset found on Kaggle. I performed a permutation test and generated data visualizations.

data-analysis data-visualization permutation-test python3 scipy seaborn

Last synced: 06 May 2026

https://github.com/mae776569/weratedogs-wrangling

Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations

data-analysis data-science data-visualization tweets twitter-api

Last synced: 25 Jan 2026

https://github.com/guoweish/amo.gl

toolkit for native webgl

data-visualization glsl webgl

Last synced: 16 Feb 2026

https://github.com/danitilahun/exploratory-data-analysis-projects

This repository contains a collection of my personal Exploratory Data Analysis (EDA) projects. Each project involves exploring various datasets to gain insights, uncover patterns, and visualize trends.

data-analysis data-science data-visualization exploratory-data-analysis python

Last synced: 16 May 2026

https://github.com/shihjen/startup_grant_dashboard

Dashboard for Monitoring Research Laboratory Expenses

dashboard data-visualization python streamlit

Last synced: 07 May 2026

https://github.com/willmeyers/usgs-groundwater-trends

Visualized USGS groundwater level trends

data-visualization

Last synced: 30 Oct 2025

https://github.com/rafay99-epic/metricmate

Metric Mate is a modern, Python-based GUI tool for visualizing and analyzing gaming performance metrics with a sleek Tokyo Night theme.

data-visualization python python-gui-tkinter python-script

Last synced: 11 May 2025

https://github.com/benzerinsio/onlineretail-tableau

📊 A basic interactive dashboard built in Tableau to explore an online store's sales, with visualizations of revenue by region and trends over time.

data-visualization eda sales-analysis tableau visualizacao-de-dados

Last synced: 09 Feb 2026

https://github.com/luka-j/csw5-eda

Materials for CS Week 5 lecture on exploratory data analysis

data-visualization r shiny tidyverse

Last synced: 26 Apr 2026

https://github.com/bretsw/eme6356-su26-module5

Slide deck for EME6356, Module 5: Data Visualization (Summer 2026)

analytics data-analytics data-visualization slides

Last synced: 02 Jul 2026

https://github.com/arction/lcjs-example-0009-severalaxisxy

A demo application showcasing using multiple axes in LightningChart JS.

axis chart data-visualization lcjs lightningchart-js

Last synced: 12 Mar 2025

https://github.com/Cherukuri-Thanu/Oodles-of-Noodles-Market-Trend-Analysis

This repository contains configuration files for analysing data obtained from Oodles of Noodles

customer-analytics dashboard data-visualization powerbi revenue-analysis service-analytics

Last synced: 02 Jul 2026

https://github.com/srjchsv/datacamp-projects

Projects from DataCamp as part of my Data Science learning journey.

data-science data-visualization datacamp-projects jupyter-notebook matplotlib pandas python statistics

Last synced: 11 May 2026

https://github.com/shyamkumarnagilla/diabetes-prediction-using-k-nearest-neighbors-classification

This project predicts diabetes risk in patients using the K-Nearest Neighbors (KNN) classification algorithm. By analyzing health data, the model assists in early diabetes detection, providing insights that support preventive care.

data-visualization exploratory-data-analysis machine-learning standardscaler statistical-analysis

Last synced: 18 Mar 2025

https://github.com/mfakhriazhar/ecom-qtt-prediction

In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.

data-analysis data-science data-visualization e-commerce-project eda machine-learning python

Last synced: 19 May 2026

https://github.com/davidterpay/youtube-api

A simple Python program that lets you view the view count of any keyword in a specified time period

data-science data-visualization python3 youtube-api

Last synced: 16 May 2026

https://github.com/prathmesh2507/ctc-hackthon

A data-driven system designed to reduce overcrowding and optimize urban public transport using real-world geospatial data and intelligent simulation.

dashboard data-analysis data-visualization python streamlit

Last synced: 16 May 2026

https://github.com/yaser-123/movie_recommendation

The Movie Recommendation App provides users with personalized movie suggestions, trailers, and essential details, all through an intuitive and interactive interface. The Movie Recommendation App is a Streamlit-based application that suggests movies based on user preferences. The app uses data from the TMDB dataset and APIs like YouTube and OMDb

data-visualization imdb jupiter-notebook kaggle omdb-api python streamlit tmdb-api youtube-api

Last synced: 06 May 2026

https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data

Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.

data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping

Last synced: 30 May 2026

https://github.com/travisbreaks/sovereign-matrix

Ruthless project prioritization system. Multi-dimensional weighted scoring with real-time visualization. Kill what doesn't matter. React 19 + Tailwind + Framer Motion.

dashboard data-visualization decision-framework framer-motion prioritization react tailwindcss typescript

Last synced: 01 Mar 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026

https://github.com/annaanastasy/classification-project-student-grades

A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.

catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling

Last synced: 29 Mar 2025

https://github.com/manuelgil/vscode-data-pack

This extension pack includes the essential extensions for data analysts.

data-analysis data-science data-structures data-visualization vscode-extension

Last synced: 07 Apr 2026

https://github.com/beatrice-b-m/bea-tools

🐝 𝓉𝑜𝑜𝓁𝓈 𝓂𝒶𝒹𝑒 𝒷𝓎, 𝒶𝓃𝒹 𝒻𝑜𝓇, 𝒷𝑒𝒶 🐝 . ݁₊ ⊹ . ݁ ⟡ ݁ . ⊹ ₊ ݁ ⊹ . ݁ ⟡ ݁ . ⊹ ₊ ݁. ⊹ . ݁ ⟡ ݁ .⊹ . ݁ ⟡ A Python package of random functions and tools that I use regularly. Data science / analysis focused since, ya know, I'm a data scientist c:

data-analysis data-science data-visualization

Last synced: 15 Jan 2026

https://github.com/sparkerdata/hockeyshotmap

Interactive Streamlit app for NHL shot maps & player analysis. Pulls live (or demo) play-by-play data, normalizes rink coordinates, and visualizes shots with context filters (strength, period, player).

data-analysis data-visualization duckdb hockey hockey-analytics ice-hockey nhl nhl-data python sports sports-analytics

Last synced: 18 May 2026

https://github.com/robinmillford/analyzing-e-commerce-transactions---data-cleaning-cohort-analysis-and-sql

In this project, I aimed to analyze the profitability of products in an e-commerce dataset. I performed various SQL queries to extract valuable insights about product profitability, including the identification of the top 5 products with the highest profit margin, and unique combinations of brands and product lines with the highest profitability.

cohort-analysis data-analysis data-visualization excel jupyter-notebook powerbi python3 sql

Last synced: 18 May 2026

https://github.com/aryanpillai2007/credit-card-fraud-detection

The primary goal of this project is to develop a comprehensive fraud detection system that enhances the security and trustworthiness of financial transactions.

anomaly-detection classification credit-card-fraud data-preprocessing data-science data-visualization fraud-detection imbalanced-data logistic-regression machine-learning outlier-detection pca pca-analysis python roc-curve scikit-learn

Last synced: 18 May 2026

https://github.com/armahdavi/data_pipeline_analytics_statistics_ML_PM_PSD_residential_QFF

Sharing all the data pipelines and processing code, statistical modelling, descriptive statistics, plot visualizations, and machine learning from Mahdavi & Siegel (2021) (Indoor Air). Project milestone: 2017 - 2020. Full-length article: https://onlinelibrary.wiley.com/doi/abs/10.1111/ina.12782

data-science data-visualization dust hvac indoor-air-quality jupyter-notebook machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats spyder spyder-python-ide statistics

Last synced: 17 Sep 2025

https://github.com/jibbs1703/austin-house-prices

This repository contains the exploratory data analysis and prediction model for house prices in Austin, Texas using data collected between 2018 and 2021. The data analyses and model results would be of importance to all stakeholders in the Austin housing market.

business-insights data-science data-visualization exploratory-data-analysis house-price-prediction

Last synced: 25 Jun 2025

https://github.com/kammarah/studentdata

I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓

connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp

Last synced: 18 May 2026

https://github.com/stefagnone/-ames-housing-analysis-feature-engineering-and-model-tuning

Data-driven analysis of the Ames Housing Dataset, combining advanced feature engineering and Stochastic Gradient Descent (SGD) regression model tuning. This repository showcases predictive modeling, hyperparameter optimization, and actionable insights for real estate analytics.

ames-housing-dataset data-visualization feature-engineering machine-learning predictive-modeling python real-estate-analytics regression-analysis sgd

Last synced: 18 May 2026

https://github.com/stefagnone/unsupervised-analysis-project

This project investigates the impact of video content on social media engagement using advanced analytics techniques like PCA, k-means clustering, and logistic regression. It provides actionable insights for optimizing social media strategies for Thai fashion and cosmetics retailers.

data-analysis data-visualization engagement-metrics facebook-live-sellers k-means-clustering logistic-regression marketing-insights pca-analysis python social-media-analytics

Last synced: 05 Apr 2025

https://github.com/stefagnone/data_storyboarding_visualization

Data Storyboarding and Visualization Techniques for Effective Communication

data-analysis data-visualization ggplot2-analysis r tableau-dashboards

Last synced: 05 Apr 2025

https://github.com/rorrell/rightwhaledata

A Jupyter Notebook where I wrangle some data on right whale sightings and create a visualization

data-analysis data-visualization jupyter-notebook python3

Last synced: 11 May 2026

https://github.com/sueszli/uwaterloos-sunshines

d3 data visualization: research output of uwaterloo's sunshines

d3js data-visualization

Last synced: 24 Jan 2026

https://github.com/nour-zayed/shopping-trends-analytics-sql-python-power-bi

"End-to-end Shopping Trends analytics project using SQL, Python, Excel & Power BI — data cleaning, EDA, KPI generation, and interactive dashboards with DAX for actionable business insights."

business-intelligence data-analysis data-visualization dax powerbi python sql

Last synced: 18 May 2026

https://github.com/vikasraparthi/human-chain

The AI Safety Incident Dashboard is an interactive frontend application designed to enhance your frontend development skills. This project focuses on creating a user-friendly interface to view and log hypothetical AI safety incidents, aligning with HumanChain's mission of promoting AI safety

ai-safety alerts dashboard data-visualization incident-management real-time-monitoring user-management

Last synced: 11 May 2025