An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/rightfulcode/customer-segmentation-rfm

This project performs customer segmentation using Recency, Frequency, and Monetary (RFM) metrics to identify key customer groups and provide actionable marketing insights.

data-analysis-python data-visualization elevvo-internship jupyter-notebook matplotlib pandas python rfm-analysis seaborn

Last synced: 08 May 2026

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/ratna-babu/generating-graphs

Generate, color, and visualize random graphs using Python's NetworkX and Matplotlib. Includes compression and storage of graph data with .gz and pickle. Ideal for exploring graph coloring and greedy algorithms in graph theory.

data-visualization erdos-renyi graph-coloring graph-theory greedy-algorithm matplotlib networkx python random-graph random-graph-generation

Last synced: 10 Oct 2025

https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python seaborn

Last synced: 04 May 2026

https://github.com/allanreda/automated-k-means-clustering-engine

An interactive K-Means clustering tool built with Flask and Scikit-Learn, supporting Excel file uploads, cluster analysis, and data export, deployed on Google Cloud Run via Docker with CI/CD integration.

cicd css data-visualization deployment docker flask google-cloud-run html javascript k-means-clustering machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 19 Jan 2026

https://github.com/controldata23/automobiles-data-exploration

An Exploratory Data Analysis done on an Automobiles dataset from kaggle

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Jan 2026

https://github.com/rahmamohammad/retail_project

Retail & Data analytics: KPIs, sales trends, Excel planning pack, forecasting & inventory tracking.

data-analysis data-visualization ecommerce excel jupyter-notebook matplotlib python retail-analytics storytelling

Last synced: 17 May 2026

https://github.com/shadz23/smart-energy-dashboard

Power BI dashboard analyzing household electricity consumption to reveal usage patterns, peak hours, and estimated costs for smarter energy management and reduced bills. 🐙

chart data-analysis data-visualization dax energy-consumption hs110 hs300 ibm ibm-cloud influxdb jupyter-notebook kasa kp115 linuxone observability photovoltaics-dashboard plotly sense

Last synced: 19 Aug 2025

https://github.com/itskshitija/hr-data-analysis

The HR Data Analytics Dashboard project uses Power BI to analyze employee data, visualizing key HR metrics and KPIs to support data-driven decisions for improving workforce management, employee satisfaction, and organizational growth.

analytics data-science data-visualization dataanalysis dataanalytics hrdataanalysis powerbi-desktop powerbidashboard

Last synced: 21 Jan 2026

https://github.com/alexmcvay/uber-data

UBER sql clone

data data-visualization sql

Last synced: 19 Jan 2026

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/vinay-jose/territorial-sales-dashboard

EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.

data-analysis data-visualization powerbi-desktop sql

Last synced: 11 Oct 2025

https://github.com/mitgar14/etl-workshop-2

Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.

airflow data-engineer data-engineering data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 14 Apr 2026

https://github.com/palakjainanalyst/ecommerce-customer-spending-analysis

An end-to-end Ecommerce analytics project uncovering customer spending trends using Excel, Python, SQL, and Power BI. From raw data to interactive dashboards, this project delivers deep insights on spending patterns, high-value customer segments - showcasing a complete data-to-decisions workflow.

data-analysis data-visualization database ecommerce excel jupyter-notebook powerbi python spending sql

Last synced: 06 May 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/jpgiant/nyc_energy_prediction

A comprehensive code for predicting energy usage in NYC using Machine Learning Algorithms.

data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/tclzcja/china-greenhouse-gas-mitigation

This is an environmental data visualization project that shows how much greenhouse gas China can decrease from emission by switching to natural gas energy from coal.

client-project data-visualization

Last synced: 05 Oct 2025

https://github.com/mvinyard/vinplots

Michael E. Vinyard's python plotting assistant

data-visualization plotting python

Last synced: 05 Oct 2025

https://github.com/drod75/nyc-arrests-analysis

This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.

data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn

Last synced: 04 May 2026

https://github.com/devanshsahu47/prime-content-analytics

Prime Data Explorer analyzes Amazon Prime's content and credits data to uncover trends in release years, genres, and ratings. It cleans, merges, and visualizes the data to provide actionable insights for optimizing content strategy and boosting audience engagement.

data-analysis data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 13 May 2026

https://github.com/1dagord/spectrogram

Generates a spectrogram based on live audio input or a .wav file

audio-analysis data-visualization fft-analysis python python3 spectrogram

Last synced: 18 Aug 2025

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/moustafamohamed01/san-francisco-salaries-data-analysis

Salary Data Analysis 💰📊 An analysis of salary data to uncover trends, distributions, and anomalies in employee compensation. This project involves data cleaning, visualization, and statistical analysis to explore salary trends, job roles, and correlations. Insights are presented using Python and Jupyter Notebook. 🚀

data-cleaning data-visualization jupyter-notebook matplotlib pandas python seaborn

Last synced: 08 May 2026

https://github.com/saravanansuriya/phonepe-pulse-data-visualization-and-exploration

Creating a dashboard by using streamlit application. In this app visualizing the data taken from Phonepe pulse Github repository.

data-visualization github-cloning mysql-database pandas-dataframe plotly-express python streamlit-webapp

Last synced: 10 Apr 2026

https://github.com/tashi-2004/Apache-Spark-Geospatial-Air-Quality-Analysis

This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.

aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis

Last synced: 13 Oct 2025

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/subhadipsinha722133/credit-card-fraud-dection

Web application for detecting fraudulent credit card transactions using machine learning

data-visualization fraud-detection machine-learning matplotlib numpy pandas seborn sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-

This study is based on confirmed cases and deaths collected from Pakistan. Results demonstrate the promising potential of TIME SERIES model in forecasting COVID-19 cases and highlight the superior performance of the time series compared to the LSTM.we apply AI-based forecasting models such time series ARIMA, LSTM, prophet and VAR.

arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression

Last synced: 19 Jun 2026

https://github.com/harshindcoder/online_retail_data_clustering_project

This marketing analytics project uses RFM (Recency, Frequency, Monetary) features for customer classification, inspired by the online retail mining paper. The RFM model helps segment customers, identify high-value ones, and optimize marketing strategies.

customer-segmentation data-analysis data-visualization market-analytics

Last synced: 17 Aug 2025

https://github.com/guilherme-marcello/r-data-analysis-barplots

Reading RDS files, processing and presentation in bar plots

bar-plot data-visualization r

Last synced: 05 Oct 2025

https://github.com/benedart/interactive-information-visualization

Interactive Information Visualization Project

d3 data-visualization information-visualization

Last synced: 05 May 2026

https://github.com/an4pdm/relatorio-de-vendas

O presente projeto foi feito através das ferramentas oferecidas pelo Power BI afim de aprimorar meus conhecimentos sobre ETL. Os dados utilizados foram de origem do site "Kaggle".

data-analysis data-visualization database etl powerbi

Last synced: 20 Jun 2026

https://github.com/kplanisphere/analysis-of-political-texts

Analysis and Classification System of Political Texts Using Natural Language Processing - Final Project for the Information Retrieval Course

classification-model data-science data-visualization deep-learning information-retrieval machine-learning natural-language-processing neural-network nlp nlp-machine-learning

Last synced: 05 Oct 2025

https://github.com/nitheshgoutham/phonepe-pulse-data

Phonepe Pulse Data Visualization and Exploration: A User-Friendly Tool Using Streamlit and Plotly

data-science data-visualization plotly python sql streamlit

Last synced: 09 Apr 2026

https://github.com/matthewandretaylor/csc207project

Forest Visualization 3D MVC. Using Data collected from the city of Kitchener

3d-graphics data-visualization

Last synced: 19 Apr 2026

https://github.com/ycli0536/csemnva

A web application for visualizing and analyzing Controlled Source Electromagnetic (CSEM) data collection and navigation.

data-visualization geophysics time-series timeseries visualization

Last synced: 24 Feb 2026

https://github.com/markusbegerow/powerbi-navigation-menu

Interactive navigation menu visual for Power BI with slide-out filtering and hierarchical data support

business-intelligence d3js data-visualization filter hamburger-menu navigation powerbi powerbi-custom-visuals powerbi-visuals typescript

Last synced: 14 Oct 2025

https://github.com/alexgenovese/react-charts-covid-19-data

Examples on COVID-19 data using different library charts: G2, G2Plot, Plotly, ApexCharts

data-analysis data-science data-visualization react reactjs

Last synced: 13 May 2026

https://github.com/silkiemoth/eds-240-class-examples

Repository for in-class work assignments and notes in EDS-240 Data Visualization and Communication at UCSB.

classwork data-visualization r ucsb-meds

Last synced: 13 May 2026

https://github.com/ianjure/average-precipitation-map

A 3D data visualization of average precipitation using R.

data-visualization philippines r

Last synced: 16 Aug 2025

https://github.com/aman-codde/credit-card-analytics

A full-stack dashboard for credit card users to analyze spending, track rewards, and download statements securely.

analytics dashboard data-visualization express fullstack jwt-authentication mongodb nodejs react recharts tailwindcss

Last synced: 09 Apr 2026

https://github.com/anushkundu/crime-pattern-analysis

Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)

data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/ledsouza/covid19

Projeto de análise de dados dos casos de Covid19

data-science data-visualization matplotlib pandas seaborn vitrinedev

Last synced: 05 May 2026

https://github.com/aathithya-shanmuga-sundaram/cyber-threat-intelligence-dashboard

Interactive Streamlit dashboard for visualizing and analyzing cyber threats, featuring real-time data insights, severity classification, geolocation mapping, and customizable dark-cyber UI.

cyber-threat-intelligence cyber-threat-tool cybersecurity cybersecurity-tools data-visualization dataset numpy pandas plotly python streamlit threat-intelligence

Last synced: 10 Nov 2025

https://github.com/bessouat40/photo-gallery-viewer

This project is a photo gallery app 🎨 It leverages a CLIP model for powerful image search based on text keywords. You can easily filter through your images using AI-driven queries!

artificial-intelligence data-visualization elasticsearch embeddings image-gallery image-search mvc-architecture offline photo-gallery python

Last synced: 09 Apr 2026

https://github.com/a26nine/msc-dissertation-bitcoin-dashboard

An interactive data visualisation dashboard built using Tableau Desktop to research and analyse the relationship between the price volatility and adoptability of bitcoin.

data-analysis data-science data-visualization tableau tableau-desktop tableau-prep

Last synced: 17 Feb 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/alexeyraspopov/vizmath

Set of useful functions for data visualization

data-visualization

Last synced: 04 Oct 2025

https://github.com/syre/strava-stats

Strava Stats is a simple Python app for providing insights into your Strava riding metrics.

data-visualization metrics plotly-dash python strava tailwindcss

Last synced: 22 Jan 2026

https://github.com/sunnyrao07/education-wage-trends-usa-1973-2022

A Tableau-based data visualization project analyzing wage trends by education, gender, and race in the USA (1973–2022).

dashboard data-visualization tableau

Last synced: 05 Feb 2026

https://github.com/raghul-m/stock-price

Simple Stock Price App Using Streamlit and Yfinance

data-science data-visualization streamlit-webapp yfinance-library

Last synced: 04 Oct 2025

https://github.com/aishanipach/data-visualizer

Visualize data according to month and year build using reactjs.

data-visualization frontend react reactjs

Last synced: 14 Apr 2026

https://github.com/fatihilhan42/nba-players-data-1950-to-2021

In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.

data data-analysis data-engineering data-science data-visualization

Last synced: 16 Oct 2025

https://github.com/mukeshlilawat1/netflix-data-visualization

Netflix Data Visualization – This project explores the Netflix dataset using Pandas for data manipulation and Matplotlib for creating meaningful visualizations. It highlights trends in movies and TV shows, distribution by release year, ratings, duration, and categories, making the data easy to understand through graphical insights.

data-visualization matplotlib pandas pip python

Last synced: 09 Apr 2026

https://github.com/gia-lexa/real-time-data-visualizer

An opportunity to tinker with data viz, this Python app tracks live stock prices using the Yahoo Finance API and surfaces the price changes in real-time using live-updating data visualizations.

data-visualization python

Last synced: 17 Oct 2025

https://github.com/gaurav0502/survey-analysis-visualizations

This repository consists of the various data visualization techniques used to analyze the responses of a survey conducted as a part of the project for the course MGT1002-Principles of Management

analysis data-visualization matplotlib seaborn visualization

Last synced: 17 Oct 2025

https://github.com/siddhant4srivastava/numeric-and-visual-summary

Exploring Data with Numeric and Visual Summaries of a Bank Loan Dataset

data-science data-visualization

Last synced: 10 Nov 2025

https://github.com/hms75/movie_rating_analysis

A movie rating analysis which identifies trends amongst a dataset of 5000 movies.

data-analysis data-visualization matplotlib-pyplot numpy pandas python

Last synced: 05 May 2026

https://github.com/maazie-khan/power-bi-projects

Welcome to my personal Power BI portfolio repository! Here you will find a collection of Power BI projects and dashboards that demonstrate my skills and expertise in data visualization, business intelligence, and analytics using Power BI.

dashboard data-analysis data-science data-visualization database excel powerbi

Last synced: 02 Jan 2026

https://github.com/codeslash21/communicate_data_findings

Analyze and visualize Bay Wheel system data which contains 2.5M individual trips data. And communicate the data findings from the dataset in the form notebook slide.

bay-wheel data-analysis data-visualization explanatory-data-visualization exploratory-data-analysis

Last synced: 22 Jan 2026

https://github.com/donmaruko/python-eda-toolkit

CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.

data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud

Last synced: 06 May 2026

https://github.com/fbarffmann/cryptoclustering

Clustered over 100 cryptocurrencies using K-Means and PCA to identify market patterns. Optimized clustering retained 89.5% explained variance.

clustering crypto-analysis data-visualization hvplot k-means machine-learning pandas pca python sklearn

Last synced: 09 Apr 2026

https://github.com/andreighinea1/bdv-project

Exploratory analysis of the Global Terrorism Database (GTD) using Python and Jupyter. This project visualizes global terrorist activity trends from 1970 to 2017, highlighting patterns across time, regions, and groups using ML techniques.

data-visualization exploratory-data-analysis jupyter-notebook kaggle-dataset terrorism-analysis

Last synced: 19 Oct 2025

https://github.com/benjaminrose/data-analysis-book

A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis

book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course

Last synced: 06 May 2026

https://github.com/rehanvhora778/bibtex-extraction

📄 Extract BibTeX entries from PDFs automatically, generating a complete bibliography without manual input or reliance on external APIs.

academic-writing analysis automation bibliometric-analysis bibliometrics bibtex data-visualization langchain latex metadata-extraction pdf pyhton pypdf reference-management research-tools

Last synced: 06 May 2026

https://github.com/jacekkala/tennis_data_visualization

Tennis Data Visualization & Analysis - R Markdown Report

data-storytelling data-visualization dplyr ggplot2 r

Last synced: 09 Feb 2026

https://github.com/iamrajmani/sentimental-analysis

Sentimental Analysis - Final Year College Project

data-analysis data-visualization machine-learning python pytorch

Last synced: 06 May 2026

https://github.com/nathanaelmutua/british-airways-data-science-challenge

My solutions for the Forage program: web scraping, data cleaning, analysis, and visualization to extract business insights. Demonstrating practical data science skills for real-world problem-solving.

british-airways british-airways-virtual-program data-science data-visualization dataanalysis forage internship-project internship-task jupyter-notebook python sentiment-analysis webscraping

Last synced: 12 Aug 2025

https://github.com/r12habh/canada-imigration-data-analysis

Dataset: Immigration to Canada from 1980 to 2013 - International migration flows to and from selected countries - The 2015 revision from United Nation's website. (Cognitive Class Data Analysis with Python)

canada data-analysis data-science data-visualization datascience python python3

Last synced: 23 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/mindlessmuse666/eda-pandas

Проект по разведочному анализу данных (EDA) о пассажирах Титаника с использованием библиотеки Pandas. Включает в себя загрузку данных, предобработку, статистический анализ, визуализацию и создание сводных таблиц. Цель проекта - демонстрация основных методов и инструментов EDA для анализа и понимания данных.

data-analysis data-processing data-science data-visualization eda exploratory-data-analysis matplotlib pandas python titanic

Last synced: 18 Apr 2026

https://github.com/abhinav330/customer-behavior-analysis-linear-regression

This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.

data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis

Last synced: 06 May 2026

https://github.com/gerhynes/d3-median-age-histogram

A D3 histogram displaying UN data on median ages. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 06 May 2026

https://github.com/timjjting/data-is-beautiful

Introductory slides to data visualization

data-visualization

Last synced: 23 Jan 2026

https://github.com/ashish-kr-srivastava/olympic-games-eda---python

About Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.

data-visualization datacleaning eda matpotlib numpy pandas python seaborn seaborn-python

Last synced: 09 Apr 2026

https://github.com/jigyasag18/bird-strikes-in-aviation-project

This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.

bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database

Last synced: 09 May 2026