An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/valinsogna/data_visualization_project

Analyzing scores from 17 major international skating events (Oct 2016-Dec 2017). This project delves into judge biases, athlete rankings based on difficult elements, and the significance of elements versus components in final rankings. Built using Python, it offers insights derived from publicly-released International Skating Union Protocols

data-visualization skating

Last synced: 07 Oct 2025

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/omarsolieman/socialgiveawaydataanalysis

This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis

data-analysis data-science data-visualization instagram scraping threejs

Last synced: 14 May 2026

https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset

This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.

data-analysis data-visualization excel project

Last synced: 18 Jan 2026

https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report

This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.

dashboard data-analysis data-visualization financial-analysis powerbi statistics

Last synced: 21 Jan 2026

https://github.com/jlee9503/telecommunication-churn

Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.

data-analysis data-visualization matplotlib pandas python scikit-learn

Last synced: 18 Jan 2026

https://github.com/abinjohn8138-commits/churn-analysis

This project focuses on analyzing customer churn behavior within a telecommunication company using visual insights. The goal is to understand what factors lead to customer attrition and help the business take proactive steps to retain customers.

colab-notebook data-visualization excel insights jupyter-notebook pandas python

Last synced: 05 May 2026

https://github.com/damisparks/machine-learning

Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention

data-science data-visualization deeplearning machine-learning machine-learning-algorithms machinelearning-python

Last synced: 09 Oct 2025

https://github.com/priyanshubiswas-tech/priyanshubiswas-tech

SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB

apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql

Last synced: 21 Jan 2026

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/hirkojoba/fintrack

Full-stack financial tracking app with ML forecasting and AI insights. Built with Rails, PostgreSQL, Python/scikit-learn, and OpenAI API.

artificial-intelligence data-visualization fintech full-stack machine-learning openai postgresql python ruby-on-rails scikit-learn

Last synced: 14 Apr 2026

https://github.com/chandkund/housing-price-prediction

Predict housing prices using the Boston Housing Dataset. Covers data loading, cleaning, preprocessing, EDA, normalization, standardization, and regression models (Linear Regression, Decision Tree, Random Forest, Extra Trees). Evaluated with Mean Squared Error (MSE). Tech: Python, Pandas, NumPy, Scikit-learn, Seaborn, Matplotlib.

data-science data-visualization matplotlib numpy pandas pyhton sklearn sklearn-library sklearn-metrics

Last synced: 21 Jan 2026

https://github.com/ratna-babu/generating-graphs

Generate, color, and visualize random graphs using Python's NetworkX and Matplotlib. Includes compression and storage of graph data with .gz and pickle. Ideal for exploring graph coloring and greedy algorithms in graph theory.

data-visualization erdos-renyi graph-coloring graph-theory greedy-algorithm matplotlib networkx python random-graph random-graph-generation

Last synced: 10 Oct 2025

https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python seaborn

Last synced: 04 May 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-telemetry-analysis-task-1

Tableau dashboard analyzing Daikibo telemetry data. Tracks downtime by factory/device with interactive filters. Deloitte task solution with JSON processing.

data-analysis data-visualization deloitte json tableau tableau-public

Last synced: 11 Oct 2025

https://github.com/bhavinpatel4199/machine-learning-programming

This repository serves as a central hub for various machine learning projects and experiments. It contains multiple sub-repositories, each focusing on different aspects of machine learning, from data preprocessing to advanced deep learning techniques.

data-structures data-visualization machine-learning machine-learning-algorithms pandas-dataframe python3 sklearn

Last synced: 19 Jan 2026

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/jackiboi307/simpleplot

Simple plotting tool made with pygame

data-visualization pygame python

Last synced: 13 Oct 2025

https://github.com/tashi-2004/Apache-Spark-Geospatial-Air-Quality-Analysis

This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.

aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis

Last synced: 13 Oct 2025

https://github.com/arunabhagit/data-driven-marketing-optimization-enhancing-engagement-conversions-and-customer-satisfaction

Analyzed ShopEasy’s marketing data using SQL, Python, and Power BI to identify low engagement and conversion issues, performed sentiment analysis, and delivered data-driven strategies for measurable performance improvement.

business-intelligence data-visualization dataanalytics marketingstrategy powerbi python sql

Last synced: 13 Oct 2025

https://github.com/mishraanuraagx/finsight

FinSight is a machine learning-driven financial analytics tool designed to explore, cluster, and visualize different financial assets based on their risk and return behaviors.

clustering cryptocurrency data-science data-visualization deep-learning fastapi finance financial-analysis gru hierarchical-clustering investment-analysis lstm machine-learning portfolio-analysis predictive-modeling quantitative-finance risk-management stock-market time-series yfinance

Last synced: 24 Apr 2026

https://github.com/jasonleelunn/regex-visualiser

RegEx engine implementation, with an interactive frontend :eight_spoked_asterisk:

0de5 data-visualization regex

Last synced: 14 Oct 2025

https://github.com/ycli0536/csemnva

A web application for visualizing and analyzing Controlled Source Electromagnetic (CSEM) data collection and navigation.

data-visualization geophysics time-series timeseries visualization

Last synced: 24 Feb 2026

https://github.com/pngo1997/chicago-airbnb-listings

Interactive Chicago Airbnb listings geospatial map.

data-visualization geospatial html python visualization

Last synced: 31 May 2026

https://github.com/saisurajmatta/healthcare-data-analytics

Power BI project analyzing Emergency Department data, demonstrating skills in data transformation, DAX, and visualization. It focuses on patient flow, wait times, demographics, and satisfaction, providing actionable insights for healthcare improvement. Includes documentation, data dictionary, and code samples.

data-analysis data-modeling data-visualization dax power-bi powerbi-visuals powerquery

Last synced: 22 Jan 2026

https://github.com/saisurajmatta/e-commerce-sales-advanced-data-analysis

Excel-based e-commerce analytics for FNP, a gift company. It covers data extraction, modeling, and visualization, providing actionable insights on revenue, customer behavior, and operations. Key skills include Excel, Power Query, Power Pivot, and DAX. The analysis culminates in data-driven business recommendations.

data-analysis data-visualization dax excel power-pivot power-query

Last synced: 22 Jan 2026

https://github.com/5hraddha/ice-video-games-sales-analysis

Analysis of video game sales data of an online store Ice, which sells video games all over the world & identify patterns that determine whether a game succeeds or not.

data-visualization matplotlib numpy pandas requests-module scipy seaborn

Last synced: 14 Apr 2026

https://github.com/harshindcoder/sleeping_time_survey_analysis

This survey analysis aims to identify lifestyle factors—such as screen time, exercise, alcohol consumption, beverages, and sleep direction—that affect sleep quality and duration. By analyzing these factors, we seek to predict sleep patterns and provide actionable insights to improve sleep health and overall well-being.

data-cleaning data-collection data-visualization exploratory-data-analysis regression-models survey-analysis

Last synced: 15 Oct 2025

https://github.com/kunalpisolkar24/winequalityprediction

Predicting wine quality using machine learning with matplotlib, numpy, pandas, and seaborn for insightful data analysis. 🍇🤖📊

data-analysis data-science data-visualization machine-learning prediction-model

Last synced: 16 Oct 2025

https://github.com/aishanipach/data-visualizer

Visualize data according to month and year build using reactjs.

data-visualization frontend react reactjs

Last synced: 14 Apr 2026

https://github.com/fatihilhan42/nba-players-data-1950-to-2021

In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.

data data-analysis data-engineering data-science data-visualization

Last synced: 16 Oct 2025

https://github.com/gia-lexa/real-time-data-visualizer

An opportunity to tinker with data viz, this Python app tracks live stock prices using the Yahoo Finance API and surfaces the price changes in real-time using live-updating data visualizations.

data-visualization python

Last synced: 17 Oct 2025

https://github.com/mindlessmuse666/iris-ml-based-on-decision-trees

Проект демонстрирует применение моделей машинного обучения на основе деревьев решений и случайного леса для классификации набора данных Iris. Включает в себя загрузку данных, обучение моделей, оценку производительности и визуализацию результатов. Предназначен для изучения основ машинного обучения и анализа данных.

classification data-analysis data-visualization decision-trees iris-dataset machine-learning model-evaluation python random-forest scikit-learn

Last synced: 17 Oct 2025

https://github.com/srosalino/data_wrangling_investigations

Series of 3 investigation works, regarding the subject of Data Wrangling (Acquire data from different sources; Understand how to clean and pre-process data; Transform data for analytics purposes; Perform feature engineering; Visualize data)

data-cleaning-and-preprocessing data-extraction-and-pre-processing data-visualization feature-engineering

Last synced: 19 Oct 2025

https://github.com/tejaswirupa/united-airlines-flight-gain-analysis

Analyzed 30K+ United Airlines flights to evaluate time gained or lost during flight. Used hypothesis testing to compare on-time vs. late departures, identifying routes with the highest average time gain.

data-science data-visualization hypothesis-testing r statistical-analysis

Last synced: 27 Jan 2026

https://github.com/estebanrucan/personal-book

Algunos materiales de apoyo que he ido realizando en el tiempo

data-science data-visualization dplyr ggplot2 here magrittr r vroom

Last synced: 20 Oct 2025

https://github.com/iamsachin2135/power-bi-certificate

Achieved certification in advanced Power BI concepts, specializing in data visualization, report creation, interactive dashboards, and data modeling. Skilled in leveraging these capabilities to generate actionable insights and create dynamic reports that drive enhanced data analysis and informed decision-making.

data-modeling data-visualization dax powerbi powerbi-desktop powerbi-report powerbi-service

Last synced: 06 Feb 2026

https://github.com/timjjting/data-is-beautiful

Introductory slides to data visualization

data-visualization

Last synced: 23 Jan 2026

https://github.com/keerthanapalanikumar/prodigy-infotech

This repository contains data science projects from my Prodigy Infotech internship, including data visualization, cleaning and EDA on the Titanic dataset, a decision tree classifier for the Bank Marketing dataset, and Twitter sentiment analysis.

data-cleaning-and-eda data-visualization decision-tree-classifier sentiment-analysis

Last synced: 23 Oct 2025

https://github.com/aayush-683/introspex

An Android application built with Kotlin and Android Studio for monitoring real-time device performance metrics and accessing detailed information about installed applications.

android app-development data-visualization device-info-android device-monitoring jitpack kotlin mpandroidchart open-source performance-metrics real-time-data

Last synced: 23 Jan 2026

https://github.com/mishra-krishna/olympics-2024-analysis

Interactive dashboard for the Paris 2024 Olympics using Streamlit and Plotly. Explore medal counts, athlete stats, and event data. Containerized with Docker and deployed on Azure.

data-visualization olympics olympics-visualization streamlit streamlit-dashboard streamlit-webapp

Last synced: 08 Mar 2026

https://github.com/satyacoder29/crm-analytics-power-bi

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/adhamsalama/nyc_vehicle_collisions

A web app to analyze and visualize the data of New York City vehicle collisions.

data-science data-visualization python web-application

Last synced: 01 Jun 2026

https://github.com/rezaafaisal/game-recommendation

By using a dataset sourced from IMDb taken from the kaggle.com site. This system can provide video game recommendations based on their genre.

data-visualization exploratory-data-analysis machine-learning reccomendation-system

Last synced: 26 Oct 2025

https://github.com/aakk23/professional-survey-powerbi

This Power BI dashboard analyzes survey data from data professionals, highlighting salary trends, job roles, and career satisfaction. It provides insights into work-life balance, programming language preferences, and industry demographics.

data-analysis data-visualization dax excel powerbi powerquery

Last synced: 23 Feb 2026

https://github.com/jmofuture/movies_datdata

Microsoft Power BI - Introduction to Power BI Tutorial

csv data-visualization dataset dax powerbi powerquery

Last synced: 27 Oct 2025

https://github.com/lauralivers/beyond-the-binary

comprehensive data storytelling about the shortage of female ICT specialists in Switzerland

bar-chart data-visualisation data-visualization data-viz dviz geopandas jupyter-notebook python sankey-plots switzerland women-in-tech

Last synced: 28 Jan 2026

https://github.com/wassimhd/pwc-switzerland-power-bi-in-data-analytics-virtual-case-experience

The Project helps to build a foundation in data analysis and Power BI software which is provided by PWC virtual internship

data-analysis data-visualization datastorytelling powerbi

Last synced: 28 Jan 2026

https://github.com/saisrivatsat/vizcraft-data-science-app

VizCraft is an intuitive web app for uploading, exploring, and visualizing CSV and Excel datasets. It offers powerful features like dataset summaries, column value counting, and groupby analysis with interactive visualizations. Designed for data enthusiasts, VizCraft simplifies data analysis and visualization for quick, actionable insights.

data-science data-visualization eda pandas plotly python streamlit

Last synced: 15 Apr 2026

https://github.com/aykutsahinn/covid19_cases-data_analysis

Covid_19 Vakalarının Veri Analizi ve Görselleştirilmesi

analysis data-visualization html jupyter-notebook python

Last synced: 29 Jan 2026

https://github.com/vbs360/sql_mysql_for_data_analytics_and_business_intelligence

This repository contains SQL exercises and projects based on the "SQL - MySQL for Data Analytics and Business Intelligence" course.

data-engineering data-management data-manipulation data-science data-visualization

Last synced: 29 Jan 2026

https://github.com/jujulis18/olympicsmedalsdashboard

Olympic Dashboard – Paris 2024 est un tableau de bord interactif permettant d’explorer les performances des athlètes médaillés des Jeux Olympiques d’été de Paris 2024.

dashboard data-analysis data-visualization eda olympic python streamlit

Last synced: 31 Jan 2026

https://github.com/l-gre/data_analytics_for_finance

Comprehensive course materials for the Data Analytics for Finance - Master Programme, covering data manipulation, statistical analysis, visualisation, automation, and real-world case studies using industry-standard tools.

automation data-cleaning data-manipulation data-visualization excel hypothesis-testing industry-applications matplotlib numpy pandas python real-world-case-studies regression-analysis seaborn sql statistical-analysis tableau workflow-automation

Last synced: 15 Apr 2026

https://github.com/shafaq-aslam/pandas-lab

A comprehensive collection of Jupyter notebooks exploring Pandas, from Series and DataFrames to data cleaning, aggregation, merging, and visualization. A complete hands-on guide for mastering data manipulation and analysis with Python.

analytics data-analysis data-cleaning data-science data-visualization dataframe jupyter-notebook machine-learning pandas pandas-dataframe pandas-library pandas-series python python3 series

Last synced: 15 Apr 2026

https://github.com/busesimsek/retail-sales-dashboard

An interactive Excel dashboard analyzing retail sales, customer demographics, and product trends using pivot tables, charts, and KPIs to deliver actionable insights.

data-visualization excel excel-dashboard retail-sales-dashboard

Last synced: 07 Feb 2026

https://github.com/bineet-ratna-shakya/data-science-salary-analysis

analyzing a dataset containing salaries of data science professionals from 2020 to 2023.

data-analysis data-science data-visualization jupyter numpy pandas python

Last synced: 01 Feb 2026

https://github.com/mituskillologies/data-science-may24

Programs of Data Science batch @ MITU Skillologies, May 2024

data-analytics data-science data-visualization machine-learning

Last synced: 15 Apr 2026

https://github.com/amotivv-inc/snowflake-memory-box

Snowflake Memory Box: AI analytics with persistent memory using Cortex, Claude AI and Memory Box

ai analytics data-visualization memory snowflake

Last synced: 01 Feb 2026

https://github.com/sarathchandranpm/bike-store-dashboard

This project combines SQL and Power BI to analyze and visualize sales data from a bike retail shop across 2021-2022. The analysis integrates data from multiple sources to provide insights into sales patterns, customer types, and profitability through an interactive dashboard featuring temporal analysis and key performance metrics.

data-visualization dax mysql power-query powerbi sql

Last synced: 25 Feb 2026

https://github.com/archanakokate/world_population_analysis_2023_using_powerbi

World Population Analysis 2023: PowerBI interactive dashboard to visualize the World Population 2023 by Country

analytics data-science data-visualization powerbi

Last synced: 07 Feb 2026

https://github.com/nirmit27/coursera-data-viz-dash

Some dashboards created using Dash and Plotly.

dash dashboard data-visualization plotly plotly-dash python3

Last synced: 15 Apr 2026

https://github.com/estebanrucan/covid-interactive-exploration

Covid Interactive Exploration, made with Shiny

covid-19 data-visualization flexdashboard shiny

Last synced: 01 Feb 2026

https://github.com/lalitmee/data-visualization

Data Visualization using d3js library

csv-files data-visualization

Last synced: 01 Feb 2026

https://github.com/vishnu-vamshii/data-science-jobs-salaries

Created an interactive dashboard to analyze data science jobs salaries in different regions of the world, experience levels, average salaries in USD and type of employment along with a geographical visual.

data-analysis data-science data-visualization tableau tableau-dashboard

Last synced: 01 Feb 2026

https://github.com/amoghkori/data-analysis-using-algebraic-and-geometric-methods-in-statistics

This project utilizes algebraic and geometric methods in statistics to analyze real-life data using R. It involves conducting various statistical tests, creating Bayesian network graphs, employing linear regression models, and evaluating model fit.

data-interpretation data-visualization graphical-models linear-regression model-evaluation rprogramming statistical-analysis

Last synced: 02 Feb 2026

https://github.com/shubham200137/customer-churn-analysis

In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.

data-analysis data-visualization numpy-python pandas-python sqlite tableau

Last synced: 15 Apr 2026

https://github.com/athari22/extracting-and-visualizing-stock-data

The idea of the project is to work for a new startup investment firm that helps customers invest their money in stocks.

amazon dashboard data-science data-visualization database ibm ibm-watson pandas python sql tesla visualization

Last synced: 13 Apr 2026

https://github.com/ninadpatil09/heart_disease_detection_analysis

The Heart Disease Detection Analysis aims to create a predictive model for identifying individuals at risk of heart disease. Using a dataset with attributes like age, sex, and health metrics, the project focuses on distinguishing patients with and without heart disease.

data-analysis data-cleaning data-science data-visualization machine-learning

Last synced: 15 Apr 2026