An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/carlosagalicia/sars-cov-2-variant-analysis

The project includes data visualization, variant sequence analysis, and a report detailing the spread and characteristics of COVID-19 variants worldwide.

biostrings data-science data-visualization ggplot2 r

Last synced: 03 Mar 2025

https://github.com/alexmcvay/uber-data

Uber SQL clone

data data-visualization sql

Last synced: 19 Jan 2026

https://github.com/lkasym/smart-dynamic-pricing

An AI-powered dynamic pricing system using Dueling DQN and customer behavior simulation, with a full-stack React + Flask dashboard for real-time insights and performance benchmarking.

ai-project data-visualization deep-learning dqn-tensorflow ecommerce full-stack-ai machine-learning reinforcement-learning tensorflow

Last synced: 05 May 2026

https://github.com/mitgar14/etl-workshop-2

Workshop #2 for the ETL course: an ETL process that uses Apache Airflow to build a data pipeline.

airflow data-engineer data-engineering data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 14 Apr 2026

https://github.com/cluzier/crypto-price-dashboard

Shows current crypto prices and trade history

charts cryptocurrency data-visualization

Last synced: 13 Oct 2025

https://github.com/kernelshreyak/kaggle-notebooks

Collection of my Kaggle notebooks for data analysis and machine learning on a variety of datasets

data-analysis data-science data-visualization kaggle kaggle-competition machine-learning

Last synced: 27 Apr 2026

https://github.com/manisharora96/instagram-reach-analysis

This project provides a detailed approach to analyzing Instagram reach and engagement metrics. By leveraging the code and tools shared here, you can gain valuable insights into your Instagram content's performance and optimize your strategy to grow your audience effectively.

data-analysis data-visualization instagram-reach python-tools

Last synced: 23 Mar 2025

https://github.com/robmcelhinney/roman-emperors

D3.js bar charts of Roman emperors' ages and lengths of reign from 63 BCE to 395 CE

d3 d3js data-visualization emperor roman rome

Last synced: 06 May 2026

https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python

This project analyzes earthquakes in Turkey from 1915 to 2021. The data, taken from the dataset included in the repo, was first cleaned and organized, then rendered as graphics and animations.

data-analysis data-cleaning data-visualization jupyter-notebook

Last synced: 23 Mar 2025

https://github.com/tzerk/esr

R package 'ESR' for plotting and analysing ESR spectra in dating applications

data-analysis data-visualization electron-spin-resonance geochronology r

Last synced: 13 Mar 2026

https://github.com/hemangsharma/dataanalysis

This repo contains analyses of NASDAQ data, including a dashboard and a time series forecast.

analysis data data-analysis data-visualization python

Last synced: 10 Mar 2026

https://github.com/camsai/notebooks

CAMSAI Notebooks provides interactive Jupyter notebooks for AI-driven materials science research. These notebooks demonstrate the use of CAMSAI tools, schemas, and workflows, offering hands-on examples for data validation, materials design, and AI integration to accelerate scientific discovery.

artificial-intelligence chemistry data-science data-standards data-structures data-visualization density-functional-theory machine-learning materials materials-design materials-informatics materials-science modeling-and-simulation molecular-dynamics

Last synced: 27 Oct 2025

https://github.com/rishisolanke/twitter-sentiment-analysis-using-machine-learning-

A research project that classifies tweets as positive, negative, or neutral using ML algorithms (Logistic Regression, Naïve Bayes, SVM) with NLP preprocessing.

data-science data-visualization logistic-regression machine-learning ml-models naive-bayes natural-language-processing nlp scikit-learn sentiment-analysis svm text-classification twitter-data

Last synced: 06 May 2026

https://github.com/jackiboi307/simpleplot

Simple plotting tool made with pygame

data-visualization pygame python

Last synced: 13 Oct 2025

https://github.com/iamrajmani/sentimental-analysis

Sentimental Analysis - Final Year College Project

data-analysis data-visualization machine-learning python pytorch

Last synced: 06 May 2026

https://github.com/petitatelier/data-generators

A collection of data generators, to play with in visualization experiments

data-generator data-visualization

Last synced: 13 Oct 2025

https://github.com/flowsta/ods-educacion-aporta

SDGs (Sustainable Development Goals) for education, APORTA 2021 initiative

data data-visualization ods sdg

Last synced: 27 Jan 2026

https://github.com/juanchiparra/du-bois-challenge

Du Bois Challenge visualizations using D3.js

d3 data-visualization

Last synced: 14 Oct 2025

https://github.com/jdfoster11/northwest_territories_collision_factors

Using Python and Tableau to perform a statistical and regression analysis on a Northwest Territories vehicle collision dataset

clustering-algorithm co-lab data-science data-visualization heatmap-visualization html python3 tableau

Last synced: 15 Mar 2025

https://github.com/ycli0536/csemnva

A web application for visualizing and analyzing Controlled Source Electromagnetic (CSEM) data collection and navigation.

data-visualization geophysics time-series timeseries visualization

Last synced: 24 Feb 2026

https://github.com/markusbegerow/powerbi-navigation-menu

Interactive navigation menu visual for Power BI with slide-out filtering and hierarchical data support

business-intelligence d3js data-visualization filter hamburger-menu navigation powerbi powerbi-custom-visuals powerbi-visuals typescript

Last synced: 14 Oct 2025

https://github.com/adamouization/python-machine-learning-data-science-notes

Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/pngo1997/chicago-airbnb-listings

Interactive Chicago Airbnb listings geospatial map.

data-visualization geospatial html python visualization

Last synced: 31 May 2026

https://github.com/mahambilalandahaan/week8

K-Means Deep Dive: Clustering analysis with Elbow and Silhouette methods in Python

clustering data-visualization jupyter-notebook k-means machine-learning python scikit-learn unsupervised-learning

Last synced: 20 Apr 2026

https://github.com/kinolag/traffic

A geospatial visualisation app showing road traffic information for all areas of Inner London. Built in TypeScript, combining React with D3.

d3 data-visualization geojson geospatial-visualization mapping react responsive-design svg topojson typescript

Last synced: 13 May 2026

https://github.com/chahelgupta/dep-videogames-dataset

The data extraction and processing involved thorough exploration, preprocessing, and visualization of the "Video Game Sales with Ratings" dataset.

data-analysis data-exploration data-extraction data-preparation data-preprocessing data-processing data-science data-visualization

Last synced: 15 Oct 2025

https://github.com/saisurajmatta/e-commerce-sales-advanced-data-analysis

Excel-based e-commerce analytics for FNP, a gift company. It covers data extraction, modeling, and visualization, providing actionable insights on revenue, customer behavior, and operations. Key skills include Excel, Power Query, Power Pivot, and DAX. The analysis culminates in data-driven business recommendations.

data-analysis data-visualization dax excel power-pivot power-query

Last synced: 22 Jan 2026

https://github.com/a26nine/msc-dissertation-bitcoin-dashboard

An interactive data visualisation dashboard built using Tableau Desktop to research and analyse the relationship between the price volatility and adoptability of bitcoin.

data-analysis data-science data-visualization tableau tableau-desktop tableau-prep

Last synced: 17 Feb 2026

https://github.com/debjyotisaha/power-bi-projects-phase-1

Portfolio projects related to data visualisation in Power BI

data-analysis data-visualization dax-expression powerbi powerquery

Last synced: 18 Jan 2026

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

A machine learning model built with PySpark that predicts the energy consumption of the steel industry, reaching an R² score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 10 Mar 2026

https://github.com/syre/strava-stats

Strava Stats is a simple Python app for providing insights into your Strava riding metrics.

data-visualization metrics plotly-dash python strava tailwindcss

Last synced: 22 Jan 2026

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/erishen/langgraph-csv-analyst

CSV → Multi-Agent Analysis Pipeline → Visual HTML Report. Built with LangGraph StateGraph: data profiling, trend analysis, anomaly detection, and investment portfolio analysis.

asset-lens csv-analysis data-visualization investment-analysis langchain langgraph multi-agent plotly python

Last synced: 23 Jun 2026

https://github.com/sayamalt/house-price-prediction

A regression model that predicts house prices, excluding very large estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 09 Nov 2025

https://github.com/swapnil-jain/whatsapp-chat-statistics

Web application showing common statistics, graphs, and charts for an uploaded WhatsApp chat.

data-analytics data-science data-visualization heroku-deployment streamlit whatsapp

Last synced: 06 May 2026

https://github.com/harryrlk/data_analysis_showcase

This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.

data-analysis data-science data-visualization excel portfolio python r tableau

Last synced: 06 May 2026

https://github.com/gia-lexa/real-time-data-visualizer

An opportunity to tinker with data viz, this Python app tracks live stock prices using the Yahoo Finance API and surfaces the price changes in real-time using live-updating data visualizations.

data-visualization python

Last synced: 17 Oct 2025

https://github.com/tetchen9/mapa

A map of a trip to Europe, built with d3.js and a Eurostat dataset in GeoJSON.

cartography d3 d3-visualization data-visualization eurostat-data geojson maps typescript

Last synced: 06 May 2026

https://github.com/verinverdian/smart-factory

Smart Factory Dashboard – A web-based factory management dashboard to monitor employees, inventories, and productions with real-time data visualization.

admin-dashboard bootstrap dashboard data-visualization factory-management laravel manufacturing php production-management smart-factory

Last synced: 06 May 2026

https://github.com/andreighinea1/bdv-project

Exploratory analysis of the Global Terrorism Database (GTD) using Python and Jupyter. This project visualizes global terrorist activity trends from 1970 to 2017, highlighting patterns across time, regions, and groups using ML techniques.

data-visualization exploratory-data-analysis jupyter-notebook kaggle-dataset terrorism-analysis

Last synced: 19 Oct 2025

https://github.com/hanzopgp/lolanalysis

League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.

data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university

Last synced: 27 May 2026

https://github.com/dantasl/map-covid-brazil

A map of Brazil showing confirmed COVID-19 cases and deaths, powered by the Google Charts API.

covid-19 data-visualization google-charts map

Last synced: 11 Aug 2025

https://github.com/parnika-singh/oncovision

An intelligent machine learning model for classifying breast cancer cells as benign or malignant using the UCI Breast Cancer Wisconsin dataset.

breast-cancer-prediction cancer-detection classification data-visualization decision-tree healthcare knn logistic-regression machine-learning medical-ai-project python3 sklearn svm-model xgboost

Last synced: 07 May 2026

https://github.com/vidyadnina/cyclistic-sql-tableau-project

Trip data analysis for a bike-sharing service company using SQL and Tableau.

bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql

Last synced: 02 Jan 2026

https://github.com/emirhansilsupur/hotel-booking-analytics-dashboard

Interactive Power BI dashboard visualizing hotel booking metrics for two Portuguese properties (Algarve resort & Lisbon city).

dashboard data-visualization power-bi

Last synced: 27 Jan 2026

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/spear97/montecarlo-python

This was a project for my Programming Language Concepts class, where we were assigned to create a Monte Carlo simulation using Python.

data-science data-visualization matplotlib-figures matplotlib-python montecarlo pandas-library pandas-python python python-3

Last synced: 23 Mar 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/tsopermon/comparison-ml-algorithms

This repository compares the performance of Adaline, Logistic Regression, and Perceptron models on binary classification tasks using linearly, non-linearly, and marginally separable datasets from the Iris dataset. It includes MATLAB implementations, 10-fold cross-validation, and visualizations of decision boundaries and MSE histories.

adaline binary-classification classification-accuracy cross-validation data-visualization decision-boundaries iris-dataset logistic-regression machine-learning matlab mse neural-networks perceptron

Last synced: 15 Mar 2025

https://github.com/vedantshi/coffee-sales-dashboard

This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.

business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends

Last synced: 05 Jan 2026

https://github.com/jatin-s16/hr_mysql_powerbi

This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.

data-analysis data-science data-visualization mysql powerbi

Last synced: 29 May 2026

https://github.com/dianaow/ploty-dash-fifa

Dash/Plotly (Python): Visualizing high-level score attributes and connections of players in the FIFA 22 game

data-visualization fifa fifa-2022 plotly plotly-dash python soccer sports-analytics sports-visualisation

Last synced: 23 Oct 2025

https://github.com/aayush-683/introspex

An Android application built with Kotlin and Android Studio for monitoring real-time device performance metrics and accessing detailed information about installed applications.

android app-development data-visualization device-info-android device-monitoring jitpack kotlin mpandroidchart open-source performance-metrics real-time-data

Last synced: 23 Jan 2026

https://github.com/ianjure/martial-law-in-data

A data visualization of how martial law shaped the Philippine economy.

data-visualization

Last synced: 05 Jan 2026

https://github.com/salman-khan-mohammed/music-album-popularity-prediction

"Music Album Popularity Prediction" is a project focused on building a model to forecast the success of music albums. By analyzing streaming data, social media engagement, and other relevant factors, the project aims to predict the popularity of albums across various genres and artists.

data-visualization eda machine-learning outlier outlier-detection outlier-treatment

Last synced: 08 Aug 2025

https://github.com/mishra-krishna/olympics-2024-analysis

Interactive dashboard for the Paris 2024 Olympics using Streamlit and Plotly. Explore medal counts, athlete stats, and event data. Containerized with Docker and deployed on Azure.

data-visualization olympics olympics-visualization streamlit streamlit-dashboard streamlit-webapp

Last synced: 08 Mar 2026

https://github.com/shaolans/projet_algav_trie

Implementation of the Patricia Trie and the Hybrid Trie in Java

algorithms data-structures data-visualization graphviz-dot hybrid-training java patricia-tree tree trie

Last synced: 11 Jun 2026

https://github.com/satyacoder29/crm-analytics-power-bi

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 14 Apr 2026

https://github.com/adhamsalama/nyc_vehicle_collisions

A web app to analyze and visualize the data of New York City vehicle collisions.

data-science data-visualization python web-application

Last synced: 01 Jun 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/tommcn/ontario-hs-data

A simple visualization of Ontario High School data

data-visualization r

Last synced: 25 Oct 2025

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/a26nine/kortext-usage-dashboard

An interactive data visualisation dashboard built using Tableau to understand the value of digital resources issued on the Kortext platform at Middlesex University, London.

data-analysis data-science data-visualization knime tableau

Last synced: 01 Feb 2026

https://github.com/shrutiijoshi/apple_greenhouse_gas_emissions

A breakdown of Apple's greenhouse gas emissions from 2015 to 2022 as they aim to reach net zero emissions by 2030.

dashboard data-analysis data-visualization powerbi

Last synced: 06 Feb 2026

https://github.com/simongoring/giphyr

A wrapper for getting giphy GIFs in R

data-visualization gif gifs giphy package r

Last synced: 26 Oct 2025

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 27 Apr 2026

https://github.com/chingu-voyages/v30-geckos-team-04

An educational application dedicated to informing people about the issues involved in air quality | Voyage-30-Geckos-Team-04 | https://clean-the-air.herokuapp.com/

air-quality data-visualization

Last synced: 26 Oct 2025

https://github.com/unidata/gempak-scripts

A collection of GEMPAK scripts

data-visualization gempak meteorology

Last synced: 26 Oct 2025

https://github.com/msikorski93/ukraine-choropleth

This small repository contains a geospatial data visualization of the percentage of people with Ukrainian as their native (or first) language, according to the 2001 census. A customized choropleth was developed with the folium library. The map is interactive and lets the user perform basic actions: zoom in and out, choose layers, and display tooltips. The choropleth was saved to an HTML file and is available for download.

data-visualization folium-choropleth-map ukraine

Last synced: 26 Oct 2025

https://github.com/felinjob/ibm-applied-data-science-capstone

This project, part of the IBM Data Science Professional Certificate specialization, predicts whether a SpaceX Falcon 9 landing will succeed. Using data from the SpaceX API and web scraping, the project includes data analysis and machine learning to generate insights about the launches.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/abhay-sinha-0/carpricepredictionproject

A machine learning project that predicts the selling price of a car based on its features such as year, mileage, fuel type, transmission, and more. This model can assist individuals and dealerships in estimating fair market prices for used cars.

artificial-intelligence data-analysis data-science data-visualization exploratory-data-analysis machine-learning-algorithms matplotlib-pyplot mysql-database numpy-library pandas-library python skit-learn sklearn-library

Last synced: 15 May 2025

https://github.com/hhlitval/siemens-cashflow-analysis

Financial data engineering and analysis project extracting cash flow metrics from Siemens annual reports and presenting insights through a static, data-driven web dashboard.

cashflow chartsjs data-engineering data-visualization duckdb etl financial-analysis javascript pdf-extraction python

Last synced: 26 May 2026

https://github.com/tashi-2004/apache-hadoop-spark-hive-cyberanalytics

This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.

ai apache-hadoop apache-hive big-data-analytics big-data-processing data-analysis data-engineering data-science data-security data-visualization hdfs machine-learning network-analysis network-security pyspark python3 threat-detection unsw-nb15-dataset

Last synced: 02 May 2026