An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/harshindcoder/salifort_motors_project

This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.

data-analysis data-science data-visualization hr-analytics machine-learning tree-models

Last synced: 02 May 2026

https://github.com/quocduyenanhnguyen/twitter-despicable-me-4-hashtag-engagement-analysis

In this project, I explored the Despicable Me 4 hashtag on Twitter to gather engagement metrics for data analysis over a one-week period.

csv-files data-analytics data-cleaning data-visualization mysql python3 tableau tableau-dashboards tableau-public twitter twitter-hashtag

Last synced: 16 May 2026

https://github.com/faithererer/haokanvideo_spider

Haokan Video (好看视频) scraping and data analysis.

data-analysis data-visualization python spider

Last synced: 02 May 2026

https://github.com/cassiofb-dev/covid-grafico

COVID-19 charts (deaths and cases) with Chart.js.

chartjs covid-19 data-visualization

Last synced: 25 Jan 2026

https://github.com/gerhynes/d3-mobile-subscription-literacy-scatterplot

A D3 scatterplot showing mobile phone subscriptions against literacy rates. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 02 May 2026

https://github.com/anderson-andre-p/uber-data-analysis

This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.

data-analysis data-science data-visualization python

Last synced: 15 Jun 2026

https://github.com/tushar2704/employee-distribution

This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.

data-analysis data-visualization excel postgresql powerbi sql tushar2704

Last synced: 04 Nov 2025

https://github.com/rorrell/employmentdata

A Jupyter Notebook where I use group by to analyze the average unemployment rate by year

data-analysis data-visualization jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/holy-angel-university/global-cost-index-analysis

This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Using a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the most and least expensive countries worldwide.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/debjyotisaha/web-application-projects-streamlit-phase-2

This repository showcases interactive web applications built using the Streamlit framework.

dashboard data-visualization python streamlit

Last synced: 02 May 2026

https://github.com/hanifheinrich/population-data-visualization

Implementation of data visualization for the population data of Nagari Tanjung Balik, Solok Regency, West Sumatra, using Streamlit.

data-visualization python streamlit-dashboard

Last synced: 16 Jun 2026

https://github.com/peter-gy/autovistype

Probing vision-language model alignment with human expert visual grouping over a stratified sample of the VIS30K dataset.

data-visualization google-genai langchain llm-benchmarking marimo meta-llama mistral multi-label-classification openai polars qwen uv vis30k vision-language-model visual-stimuli visualization-categorization vlm

Last synced: 02 May 2026

https://github.com/gerhynes/d3-notes-app

A simple notes app built to practice D3 selection methods. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/gkar90/gdp-vs-life-expectancy

Statistical analysis on GDP vs Life Expectancy

data-science data-visualization statistical-analysis

Last synced: 09 Jun 2026

https://github.com/davityak03/stock-value-prediction-and-forecasting

This project predicts stock market trends using an LSTM neural network, focusing on Apple Inc.'s historical data for accurate future price forecasting. It includes data retrieval, preprocessing, model training, and evaluation.

data-visualization datareader lstm pandas python tensorflow tiingo

Last synced: 03 May 2026

https://github.com/fatihilhan42/tourist_analysis_in_turkey_with_python

In this project, the number of tourists visiting Turkey between 2008 and 2021 was analyzed. The dataset, which you can find in the repository, was first cleaned and organized. The cleaned data were then presented graphically using data visualization techniques.

data-analysis data-cleaning data-science data-visualization jupyter-notebook python

Last synced: 03 May 2026

https://github.com/lut-ful/credit-card-financial-report-dashboard

Visualizing and unlocking different KPIs of a financial institution based on credit card data using Power BI.

data-analytics data-visualization dax-query postgresql powe-bi python statics

Last synced: 03 May 2026

https://github.com/corey-richardson/microbit-data-logger

In preparation for Work Experience Students coming in, I am using this project to familiarise myself with the BBC micro:bits which we will provide them with. I am also using it as a chance to expand on my data visualisation with Python experience.

data-visualization matplotlib microbit pandas pyplot signal-processing

Last synced: 03 May 2026

https://github.com/baggiponte/ta-business-statistics-2023

🎓 Business Statistics [UniMi • AY 2022/2023]

data-mining data-science data-visualization r tidyverse

Last synced: 03 May 2026

https://github.com/dvmorozov/arxiv

ArxivExpress - arxiv.org client for Android and iOS, ArxivNavigator - interactive arxiv.org metadata visualization. I would appreciate any way of contributing: GitHub issue, email or pull request.

android-application arxiv arxiv-api arxiv-org csharp d3-js d3-visualization d3js data-science data-visualization datascience ios-app javascript pyhton3 xamarin xamarin-forms

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analyzed the TMDB dataset, which contains details of around 11,000 movies, to find some interesting facts about it.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/akhil-krishnan-r/super_market_analysis

The number of supermarkets in the most populated cities is growing, and market competition is also high. This dataset contains the historical sales of a supermarket company, recorded in 3 different branches over 3 months. Predictive data analytics methods are easy to apply with this dataset.

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/allanreda/ga4-session-predictor-flask-app

Flask app that can predict the future number of GA4 sessions, using the Prophet library.

data-visualization flask ga4-api matplotlib pandas prophet-library python time-series-forecasting

Last synced: 03 May 2026

https://github.com/prachipatel15/priceprediction

I've built a real estate price prediction project that predicts house prices in Bangalore.

data-science data-visualization machine-learning pandas python

Last synced: 03 May 2026

https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning

Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page

data data-visualization dataanalytics excel postgresql powerbi python3

Last synced: 03 May 2026

https://github.com/ibrahim1993-2/iraq-primary-enrollment

Data analysis of Iraq’s primary school enrollment (1960–2024). Includes SQL-style exploration, 14 insights with CSV exports, visualizations, and a polished Jupyter Notebook (PDF + HTML).

data-visualization eda education-data iraq jupyter-notebook matplotlib pandas portfolio-project python3 world-bank

Last synced: 03 May 2026

https://github.com/devesh8423/machine_learning

Machine Learning practice projects, Jupyter notebooks, and datasets for learning regression, classification, and data analysis.

classification data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib ml-project numpy-library pandas python regression sckit-learn seaborn

Last synced: 03 May 2026

https://github.com/donmaruko/flask-data-analysis

Flask API for statistical calculations. Data analysis, cleansing, visualization, and manipulation. Documented by Swagger.

api api-rest data-analysis data-science data-visualization datascience flasgger matplotlib pandas seaborn sqlite wordcloud

Last synced: 03 May 2026

https://github.com/fnfkkengine/website-performance-data-analysis-project

🌐 Analyze website performance data to gain insights on user engagement, traffic trends, and optimize your digital presence effectively.

analysis analytics dashboard data-analyst data-collection data-preprocessing data-visualization dax funnel-analysis jupyter-notebook matplotlib mysql numpy paid-search power-query project-management python storytelling

Last synced: 04 May 2026

https://github.com/parmaster/logserver

Collecting telemetry data posted by IoT devices into MQTT, filtering, processing and visualizing results

data-visualization golang-template iot-application mqtt mqtt-client template

Last synced: 04 May 2026

https://github.com/xiaohan2012/myunisport

Visualize your Unisport annual training records

data-analysis data-visualization pandas pygal sports-stats tikzposter

Last synced: 04 May 2026

https://github.com/furkalb/global-power-plant-analysis

Global Power Plant Database Analysis is a Streamlit-based interactive web app designed to explore and analyze worldwide energy generation data

analysis data-science data-visualization energy-analysis exploratory-data-analysis gis global-energy machine-learning mapbox plotly predictive-modeling python streamlit streamlit-dashboard

Last synced: 04 May 2026

https://github.com/sagarprajapat2004/data-analysis-visualization

Downloaded and analyzed a dataset from Kaggle using NumPy and Pandas, created visualizations with Matplotlib and Seaborn, and developed a Flask web application to showcase data insights and conclusions.

data-analysis data-modeling data-visualization exploratory-data-analysis flask python statical-analysis

Last synced: 04 May 2026

https://github.com/joonarafael/ids-exercises

Repository to store the exercise submissions for the Introduction to Data Science course (University of Helsinki).

course-work data-science data-visualization jupyter-notebook university-assignment

Last synced: 16 Jun 2026

https://github.com/ljadhav25/logistic-regression-data-science-

Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.

data-analysis data-science data-visualization logestic-regression machine-learning

Last synced: 04 May 2026
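
The logistic regression entry above describes estimating the probability of a binary event from independent variables. A minimal sketch of that idea, not taken from the repository: the coefficients, the bias, and the two features (age in decades, attended a rally) are hypothetical values chosen only for illustration.

```python
import math

def predict_probability(features, weights, bias):
    """Return P(event) for one observation under a logistic model."""
    # Linear score: weighted sum of the independent variables plus an intercept.
    score = bias + sum(w * x for w, x in zip(weights, features))
    # The logistic (sigmoid) function maps any real score into (0, 1).
    return 1.0 / (1.0 + math.exp(-score))

# Hypothetical, hand-picked coefficients (a real model would learn them from data).
weights = [0.3, 1.2]
bias = -1.5

# Hypothetical observation: [age in decades, attended a rally (0 or 1)].
p_voted = predict_probability([4.0, 1.0], weights, bias)

# A common convention is to threshold the probability at 0.5.
label = "voted" if p_voted >= 0.5 else "didn't vote"
print(f"P(voted) = {p_voted:.3f} -> {label}")
```

In practice the weights would be fitted by maximizing the likelihood of labeled data, for example with a library such as scikit-learn; the sketch only shows how a fitted model turns inputs into a probability.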

https://github.com/mugilan1309/csv_analyzer

📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.

csv-processing data-analysis data-visualization machine-learning python streamlit

Last synced: 04 May 2026

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to scrape data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 17 Jun 2026

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/flytomarsz/bike-sharing-system-analysis

This analysis project aims to identify bike rental behavior in 2012 from the Capital Bikeshare system, Washington, D.C., USA. This project is part of my Data Analysis study at Dicoding.

data-analysis data-visualization jupyter-notebook python streamlit

Last synced: 04 May 2026

https://github.com/tanmayborse/institionistic_fuzzy_approx_space

This model introduces a hybrid approach that utilizes rough sets on intuitionistic fuzzy approximation spaces for pre-processing and soft sets for post-processing, resulting in an effective decision-making solution.

data-cleaning-and-preprocessing data-science data-visualization decision-making fuzzy-logic

Last synced: 17 Jun 2026

https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn

This is the last project in the Udacity Nanodegree program. It's about data visualization.

data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree

Last synced: 09 May 2026

https://github.com/maruf-hossen/kaggle-projects-and-learning

Comprehensive data science learning journey through Kaggle courses and exercises. Documenting progress in SQL, Python, ML, and data visualization with practical projects and business applications.

business-intelligence data-cleaning data-science data-visualization kaggle learning-journey machine-learning pandas python sql

Last synced: 05 May 2026

https://github.com/nimomach/skateboarding-in-olympics

Skateboarding made its Olympic debut at the 2020 Summer Olympics. This is a dashboard focused on "Skateboarding in the Olympics" representing a comprehensive overview of the sport's performance, popularity, and key metrics during the Olympic Games.

data-analysis data-visualization olympics paris skateboarding tokyo

Last synced: 10 Mar 2026

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding very large estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 09 Nov 2025

https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi

Activities from the Data Science Academy courses Data Analysis with Microsoft Power BI (Análise de Dados com Microsoft Power BI) and BI Clinic (Clínica de BI).

dashboards data-analysis data-visualization microsoft-power-bi power-bi

Last synced: 05 Jan 2026

https://github.com/sehaj003/telco-churn-analysis

This repository contains files (dataset and Jupyter notebooks) for a project that aims to build machine learning models to predict customer churn based on given parameters.

data-science data-visualization exploratory-data-analysis machine-learning machine-learning-models predictive-modeling principal-component-analysis python

Last synced: 20 May 2026

https://github.com/jiyanshgarg/delhivery-logistics-data-analysis

This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.

business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment

Last synced: 12 Jun 2026

https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office

My Project to learn the Basics of Analysis & Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python

Last synced: 11 Apr 2026

https://github.com/bhavinpatel4199/machine-learning-framework

This repository showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects utilize a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

classification clustering data-science data-structures data-visualization exploratory-data-analysis machine-learning machine-learning-algorithms machine-learning-models pandas-dataframe predictive-modeling preprocessing-data sklearn supervised-learning unsupervised-learning

Last synced: 20 Jan 2026

https://github.com/dheyhasan/echo-trends

EchoTrends is a data visualization app that analyzes your Spotify playlists and reveals insightful patterns—such as track duration, popularity, and statistical correlations—using interactive charts and statistical tests. Built with React (frontend) and FastAPI (backend), it offers both functional analysis and a demo landing

correlation-analysis data-visualization fastapi javascript music-analysis python react recharts spotify-api tailwindcss

Last synced: 11 Apr 2026

https://github.com/shafaq-aslam/data-analytics-dairy

A comprehensive repository for Data Analytics learning and projects. It includes MySQL, Python, Power BI, Tableau, and Excel. The goal is to analyze data, generate insights, and create compelling visualizations for real-world datasets.

data-analysis data-visualization excel excel-based-data-analysis powerbi python-scripts sql sql-queries sql-queries-for-data-manipulation sql-query-for-data-visualization tableau

Last synced: 20 Jan 2026

https://github.com/jimohola/streamlit_ml

How to build a Web App using Streamlit for Machine Learning Algorithms

data-visualization exploratory-data-analysis machine-learning streamlit webapp

Last synced: 09 May 2026

https://github.com/sayamalt/life-expectancy-prediction

Successfully established a machine learning model that can accurately predict human life expectancy based on several demographic features such as alcohol consumption per capita, average BMI of the entire population, etc.

cross-validation data-cleaning-and-preprocessing data-visualization docker end-to-end-pipeline exploratory-data-analysis feature-engineering github-actions-workflow hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation

Last synced: 04 May 2026

https://github.com/rudra-g-23/power-bi-custom-visual

A custom Power BI visual that displays customizable, interactive charts with advanced capabilities.

custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization

Last synced: 02 Jan 2026

https://github.com/riyouuyt/investigate-hotel-business-using-data-visualization

Explore hospitality data, visualizing customer behavior in hotel reservations and its impact on cancellations for strategic insights.

businessreporting data-visualization hotel-booking jupyter-notebook matplotlib-pyplot presentation python

Last synced: 05 May 2026

https://github.com/gunjanmimo/d3-visualization

D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework

d3js data data-science data-visualization reactjs

Last synced: 29 Apr 2026

https://github.com/alfiyafatima09/heuristic_algorithms

This project compares pathfinding algorithms (A*, Greedy Best-First, and Hill Climbing) by visualizing their paths and comparing performance metrics (nodes explored, memory, execution time) on a grid with obstacles.

algorithms data-visualization

Last synced: 20 Jan 2026
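
The pathfinding entry above compares A* and Greedy Best-First on a grid with obstacles, using metrics such as nodes explored. A minimal sketch of such a comparison, not taken from the repository: the grid, the 4-connected movement, the unit step cost, and the Manhattan heuristic are all assumptions, and Hill Climbing is left out for brevity.

```python
import heapq

def manhattan(a, b):
    """Manhattan distance between two (row, col) cells."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])

def best_first_search(grid, start, goal, use_path_cost):
    """Search a 4-connected grid where '#' marks an obstacle.

    use_path_cost=True  -> A* (priority = cost so far + heuristic)
    use_path_cost=False -> Greedy Best-First (priority = heuristic only)
    Returns (path as a list of cells or None, number of nodes expanded).
    """
    rows, cols = len(grid), len(grid[0])
    frontier = [(manhattan(start, goal), 0, start)]  # (priority, cost, cell)
    came_from = {start: None}
    cost_so_far = {start: 0}
    expanded = set()
    while frontier:
        _, cost, current = heapq.heappop(frontier)
        if current in expanded:
            continue  # stale heap entry for an already expanded cell
        expanded.add(current)
        if current == goal:
            path = []
            while current is not None:  # walk parents back to the start
                path.append(current)
                current = came_from[current]
            return path[::-1], len(expanded)
        r, c = current
        for nxt in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            nr, nc = nxt
            if not (0 <= nr < rows and 0 <= nc < cols):
                continue
            if grid[nr][nc] == "#" or nxt in expanded:
                continue
            new_cost = cost + 1  # every step costs 1 (assumption)
            if nxt not in cost_so_far or new_cost < cost_so_far[nxt]:
                cost_so_far[nxt] = new_cost
                came_from[nxt] = current
                h = manhattan(nxt, goal)
                priority = new_cost + h if use_path_cost else h
                heapq.heappush(frontier, (priority, new_cost, nxt))
    return None, len(expanded)

# Hypothetical 3x5 grid with a wall in the middle row.
grid = [
    ".....",
    ".###.",
    ".....",
]
start, goal = (0, 0), (2, 4)

astar_path, astar_expanded = best_first_search(grid, start, goal, True)
greedy_path, greedy_expanded = best_first_search(grid, start, goal, False)
print("A*:     path length", len(astar_path), "expanded", astar_expanded)
print("Greedy: path length", len(greedy_path), "expanded", greedy_expanded)
```

With a consistent heuristic such as Manhattan distance on a unit-cost grid, A* returns a shortest path, while Greedy Best-First may expand fewer nodes but gives no such guarantee.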

https://github.com/apelullo/cobalt_health_wellness_platform_ops

Cobalt is a mental health and wellness platform created for Penn Medicine employees that serves as a hub for support services such as therapy, wellness coaching, topic- and population-specific group sessions, and a variety of self-help resources.

academic-research data-cleaning-pipeline data-validation data-visualization decision-support feature-development healthcare-data hipaa key-performance-metrics mental-health-services operations-research product-analytics reporting-pipeline

Last synced: 23 Mar 2025

https://github.com/badranalyst/residential-unit-prices-data-analysis-application

Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution and correlation analysis between factors like size, location, and pricing.

data-analysis data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 05 May 2026

https://github.com/filip-kustura/statistics-olympics-analysis

A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.

correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project

Last synced: 05 Jan 2026

https://github.com/hemangsharma/dataanalysis

This repo contains analyses of NASDAQ data, such as a dashboard and a time series forecast.

analysis data data-analysis data-visualization python

Last synced: 10 Mar 2026

https://github.com/ledsouza/curso_de_estatistica_parte_3

Statistics course project on distributions and hypothesis testing.

data-science data-visualization pandas scipy seaborn statsmodels vitrinedev

Last synced: 29 Apr 2026

https://github.com/cube-m4st3r/sphere.discord-bot

The main Discord frontend project for sphere, intended for general-purpose commands and visualization.

data-visualization discord-bot discord-py discord-py-bot python3

Last synced: 16 Jan 2026

https://github.com/navp7/roadaccident_powerbi

An interactive Power BI dashboard designed to analyze road accident data

dashboards data-analysis data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/rakeshdabbikar4/sales-performance-dashboard-powerbi

Interactive Sales Performance Dashboard built using Power BI to analyze revenue, orders, profit, trends, and regional performance.

business-analytics business-intelligence data-analytics data-visualization dax powerbi sales-dashboard

Last synced: 13 Jan 2026

https://github.com/vishwas-r/canvasjs-data-parser

CanvasJS Data Parser - Parse Data to CanvasJS accepted Format

canvasjs charts data-parser data-visualization javascript

Last synced: 31 Mar 2025

https://github.com/BAMresearch/SOFA

SOftware for Force Analysis - A graphical user interface to analyze Atomic Force Microscopy Force Spectroscopy data

atomic-force-microscopy data-science data-visualization

Last synced: 16 Jan 2026

https://github.com/kernelshreyak/kaggle-notebooks

Collection of my Kaggle notebooks for data analysis and machine learning on a variety of datasets

data-analysis data-science data-visualization kaggle kaggle-competition machine-learning

Last synced: 27 Apr 2026

https://github.com/batthulavinay/which-anime-dominates-the-world

This Kaggle notebook analyzes the global popularity of anime across different countries. Using data visualization and statistical insights, we explore which anime dominate in various regions and uncover trends in global anime consumption.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/danhnnguyen0606/bitcoin-navigator

Bitcoin Navigator: A data-driven dashboard designed to analyze Bitcoin trends, empowering investors to refine their strategies and identify optimal investment opportunities.

bitcoin btc crypto cryptocurrency data-analysis data-analytics data-science data-visualization investment looker looker-studio

Last synced: 15 Mar 2025

https://github.com/steviecurran/dashboards

Compilation of links to the dashboards in the other repositories.

dashboard data-analysis data-science data-visualization pandas powerbi python-dash tableau

Last synced: 21 Feb 2026

https://github.com/saineshnakra/data-model-visualization-jupyter

A collection of Jupyter notebooks for visualizing machine learning models, echelon graphs, and data structures.

classification-visualization clustering-visualization-notebook data-models data-science data-visualization jupyter-notebook machine-learning python python3

Last synced: 05 May 2026

https://github.com/walid0912/rfm_analysis

RFM Analysis is employed to comprehend and categorize customers according to their purchasing patterns. RFM, an acronym for recency, frequency, and monetary value, comprises three essential metrics that offer insights into customer involvement, allegiance, and significance to a business.

data-analysis data-visualization python rfm-analysis

Last synced: 02 Sep 2025
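
The RFM entry above defines three metrics per customer: recency, frequency, and monetary value. A minimal sketch of how they could be computed from raw transactions, not taken from the repository: the transaction tuples, customer names, and snapshot date are hypothetical.

```python
from datetime import date

# Hypothetical transactions: (customer_id, purchase_date, amount).
transactions = [
    ("alice", date(2024, 1, 5), 120.0),
    ("alice", date(2024, 3, 1), 80.0),
    ("bob", date(2023, 11, 20), 40.0),
    ("carol", date(2024, 2, 10), 300.0),
    ("carol", date(2024, 2, 25), 150.0),
    ("carol", date(2024, 3, 10), 50.0),
]

# Hypothetical reference date against which recency is measured.
snapshot = date(2024, 3, 15)

def rfm_table(rows, as_of):
    """Aggregate transactions into recency, frequency, and monetary value."""
    table = {}
    for customer, day, amount in rows:
        entry = table.setdefault(
            customer, {"last": day, "frequency": 0, "monetary": 0.0}
        )
        entry["last"] = max(entry["last"], day)  # most recent purchase
        entry["frequency"] += 1                  # number of purchases
        entry["monetary"] += amount              # total amount spent
    return {
        customer: {
            "recency_days": (as_of - entry["last"]).days,
            "frequency": entry["frequency"],
            "monetary": entry["monetary"],
        }
        for customer, entry in table.items()
    }

rfm = rfm_table(transactions, snapshot)
for customer, metrics in sorted(rfm.items()):
    print(customer, metrics)
```

A full RFM analysis would typically go on to bin each metric into scores (for example quintiles) and segment customers by the combined score; the sketch stops at the raw metrics.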

https://github.com/yaser-123/energy-consumption-dashboard

A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.

dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi

Last synced: 05 Jan 2026

https://github.com/akhdandann/productdashboard-powerbi

A Power BI dashboard to monitor engineering effectiveness at PT. XL Axiata Tbk. It tracks release frequency, production defects, cycle time, developer activity, and team happiness. Note: All names have been changed and the data has been modified - this is dummy data for demonstration purposes only.

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026