An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/szapp/candyanalysis

Case study: Analyze the candy power ranking to identify and recommend popular candy characteristics

data-analysis data-visualization feature-selection interaction-terms

Last synced: 28 Apr 2026

https://github.com/alexquilis1/news-sentiment-analyzer

A Flask web app that analyzes sentiment in news articles and generates word clouds to visualize emotional trends in current events

data-visualization flask natural-language-processing news-api nlp nltk python sentiment-analysis vader-sentiment wordcloud

Last synced: 19 May 2026

https://github.com/stephenombuya/automation_scripts

A collection of Python scripts and tools designed to automate various tasks, improve productivity, and simplify repetitive actions. Each script is well-documented and serves a specific purpose, ranging from data visualization to smart home control.

automation-with-python data-visualization productivity python3 smart-home-automation webautomation

Last synced: 13 Jun 2026

https://github.com/chanmeng666/customer-insight

AI-powered customer review analysis platform — sentiment analysis, keyword extraction, topic modeling, and anomaly detection

chinese-nlp customer-feedback customer-insights data-visualization machine-learning nlp python review-analysis sentiment-analysis streamlit text-analysis text-mining topic-modeling

Last synced: 29 Apr 2026

https://github.com/mauriciovazquezm/data_visualization_course_project

This project implements an interactive data visualization dashboard using R and Shiny. It leverages World Bank development indicators to explore key economic, social, and demographic metrics over time across countries and regions. The web app enables users to select specific indicators, filter by countries or years, and visualize trends through dyn

data-science data-visualization ggplot2 r-programming shiny web-app

Last synced: 29 Apr 2026

https://github.com/salvof88/raspberry-sensor-kit-demo

A lightweight Raspberry Pi sensor logger in Python for HC-SR04 (ultrasound) and DHT11 (temperature/humidity), exporting data to CSV or Google Sheets. Perfect for IoT experiments, smart home logging, or Raspberry Pi Zero DIY kits.

automation csv-logger data-logging data-visualization dht11 google-sheets gpio hc-sr04 hc-sr04-ultrasonic-sensor iot python python-sensors python3 raspberry-pi raspberry-pi-3 raspberry-pi-4 raspberry-pi-gpio raspberry-pi-zero sensor-data

Last synced: 29 Apr 2026

https://github.com/luizassimoes/q5ga-latency-and-throughput

Quick 5G Analyser: PyQt5 software developed to help with simple graphical analysis and chart generation for ping and iperf3 tests.

data-analysis data-visualization pyqt5 python

Last synced: 13 Jun 2026

https://github.com/jatinnxn/diabetes-prediction

This repository showcases a machine learning model built to predict diabetes using a diabetes dataset. The project walks through data preprocessing, model training, and evaluation, offering a Decision Tree-based solution to classify individuals as diabetic or non-diabetic based on various health metrics. It also supports real-time predictions.

data-cleaning data-preprocessing data-visualization decision-tree-classifier machine-learning

Last synced: 13 Jun 2026

https://github.com/stanleynguyen/so.cube

World map visualisation of World Cube Association data 🌏

cas cube data-visualization leaftlet map

Last synced: 24 Jul 2025

https://github.com/mfakhriazhar/python-data-analyst-tutorial

A collection of my Python learning files for data analysis. Covers fundamental to advanced topics such as data exploration, visualization, statistical analysis, and the use of popular libraries like Pandas, NumPy, Matplotlib, and Seaborn. Suitable for personal documentation or shared learning references.

data-analysis data-science data-visualization exploratory-data-analysis portfolio python

Last synced: 29 Apr 2026

https://github.com/hazz-i/e-commerce-analysis

Dicoding FP: data analysis with Python

data-visualization jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/rohitblaze10/microsoft_stock_analysis-2025-kaggle

A comprehensive analysis of Microsoft's (MSFT) stock data from 1986 to 2025, covering trends, volatility, and interactive visualizations using Python

data-science data-visualization eda python

Last synced: 08 Nov 2025

https://github.com/prithviraj-2003/cognifyz-data-science-internship

🎓 Data Science Internship at Cognifyz Technologies 📅 Duration: 2 Months 🧠 Worked on real-world restaurant data 🗂️ Completed structured tasks across 3 levels 📌 Tasks focused on EDA, data preprocessing, visualization, and analysis 📎 Task descriptions provided in an attached PDF

data-analysis data-science data-visualization matplotlib numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/mominurr/amazon-best-sellers-data-analysis

Exploring trends and product insights in Amazon Best Sellers data.

data-analysis data-visualization python scraping selenium tableau

Last synced: 29 Apr 2026

https://github.com/laipching/sprint6_module1

Exploratory Data Analysis with Python (Pandas/Matplotlib/Seaborn). Business questions, metrics and clear visualizations.

data-visualization eda matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/r-mahesh45/fraud-detection-and-sales-analysis-using-random-forest

This project uses Random Forest to classify fraud risk based on taxable income and analyze key factors driving high sales for a cloth manufacturing company.

classification data-visualization extract-transform-load python3 random-forest

Last synced: 30 Apr 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/devprnvk/realestateml

This Python program analyzes a dataset (HousePricePrediction.xlsx) containing information about house prices. It utilizes pandas for data manipulation, matplotlib for plotting, and seaborn for visualizing correlations and distributions.

data-science data-visualization datasets houses npm plotting prediction-model seaborn

Last synced: 30 Apr 2026

https://github.com/dina-hosny/import-preprocess-and-visualize-a-dataset-project

A simple project to practice importing a dataset, cleaning and preparing the data, and visualizing the results to answer some given questions.

data-cleaning data-engineering data-science data-visualization jupyter-notebook matplotlib numpy pandas python

Last synced: 30 Apr 2026

https://github.com/saagpatel/signal-noise

An interactive essay teaching Bayesian reasoning through direct manipulation of live visualizations

bayesian d3 data-visualization education interactive-essay nextjs statistics typescript

Last synced: 28 Jun 2026

https://github.com/cagandemirmr/airbnb_available_houses

In this repo, I create a dashboard using Tableau. In the process, I use SQL and Python.

dashboard data-visualization dataprocessing python sql tableau

Last synced: 30 Apr 2026

https://github.com/mayankfreelancer/advanced-sales-analytics-dashboard-power-bi-

This interactive Power BI dashboard provides a comprehensive analysis of sales data across regions, categories, and time periods. The project aims to uncover key trends in total sales, profit, quantity sold, and product performance, using advanced visualizations and forecasting techniques. 🛠 Tools & Techniques Used: Power BI

dashboard data-science data-visualization excel numpy pandas powerbi python sales-analysis sql

Last synced: 30 Apr 2026

https://github.com/tolumie/aviva-insurance-statistics-hypothesis-abtesting-modelling

This project explores the impact of demographic and lifestyle factors on insurance charges using statistical hypothesis testing (ANOVA, Chi-Square, T-tests) and predictive modeling (Elastic Net, Random Forest, Gradient Boosting). The analysis is deployed using Streamlit.

anova chi-square-test data-visualization eda gradient-boosting hypothesis-testing insurance-dataset machine-learning predictive-modeling python random-forest statistical-analysis streamlit

Last synced: 30 Apr 2026

https://github.com/gerhynes/d3-births-pie-chart

A D3 pie chart showing UN birth data grouped by month and quarter. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 30 Apr 2026

https://github.com/sanveed-adnan/supermarket-sales-sql-project

SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.

business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite

Last synced: 08 Nov 2025

https://github.com/realvuk/r-for-data-science-by-vuk

My exercises from the book R for Data Science: Import, Tidy, Transform, Visualize, and Model Data, 2nd Edition

data-science data-visualization r rstats

Last synced: 13 Jun 2026

https://github.com/gitchaell/computer-scrapping

Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.

data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx

Last synced: 01 May 2026

https://github.com/falakrana/data-analysis-visualization

This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.

data-analysis data-visualization python tableau-public

Last synced: 01 May 2026

https://github.com/cdeweyx/bryce-harper-2016-analysis

Notebook analyzing Bryce Harper's disappointing 2016 campaign in historical context through data analytics.

data-analysis data-visualization python

Last synced: 01 May 2026

https://github.com/april-jk/stoke-your-code

Trading-terminal style Git history viewer that turns repository activity into candlestick charts and volume bars.

analytics candlestick-chart data-visualization developer-tools git-history git-visualization github lightweight-charts react typescript

Last synced: 15 Jun 2026

https://github.com/prathmesh2507/global-stock-intelligence-dashboard

Interactive Global Stock Market Analytics Dashboard built using Python, YFinance, Pandas, Streamlit, and Plotly. Analyze 20+ countries and 400+ top stocks with advanced visualizations and financial insights.

dashboard data-analysis data-visualization python stock-analysis streamlit

Last synced: 15 Jun 2026

https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020

Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).

bigquery data data-analysis data-visualization python sql tableau

Last synced: 15 Jun 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/treyhamilton/baccarat-betting-simulator

Streamlit app simulating a baccarat betting strategy with full results visualization and CSV output.

baccarat betting data-visualization gambling machine-learning simulator streamlit

Last synced: 01 May 2026

https://github.com/samia35-2973/world-university-ranking-2023-prediction

This repository builds models for predicting the World University Rankings 2023. The World University Rankings 2023 dataset includes 1,799 universities across 104 countries and regions, making it the largest and most diverse university ranking to date. A clean dataset is generated through data preprocessing.

data-cleaning data-preprocessing data-visualization decision-trees machine-learning machine-learning-algorithms model-training prediction world-university-rankings world-university-rankings-2023

Last synced: 01 May 2026

https://github.com/anderson-andre-p/uber-data-analysis

This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.

data-analysis data-science data-visualization python

Last synced: 15 Jun 2026

https://github.com/ujjwalll/get-that-flair

A repository for a project that detects the flair of Reddit posts from their links. A working model is available at https://get-that-flair.herokuapp.com/

data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling

Last synced: 01 May 2026

https://github.com/mrwilbroad/react-data-visualizaion

Data visualization with reactjs and react-chartjs-2

chartjs data-visualization graphs react-chartjs-2 reactjs

Last synced: 02 May 2026

https://github.com/rbreeze/dashboard

My personal health dashboard, with daily stats on food and sleep. It has undergone several redesigns since 2015.

css dashboard data data-visualization design front-end google-sheets google-sheets-api health html javascript personal-health-record personal-website running static static-site visualization

Last synced: 02 May 2026

https://github.com/hanifheinrich/population-data-visualization

Implementation of data visualization for the population data of Nagari Tanjung Balik, Solok Regency, West Sumatra, using Streamlit

data-visualization python streamlit-dashboard

Last synced: 16 Jun 2026

https://github.com/hafs96/prediction_consommation-de-carburant

In this project, the goal is to develop a model that predicts whether a car has high or low fuel consumption based on its technical characteristics.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/ronitjariwala/prodigy_ds_04

Prodigy InfoTech Data Science Internship Task-4

data-analysis data-science data-visualization python

Last synced: 02 May 2026

https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor

AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.

data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics

Last synced: 02 May 2026

https://github.com/saagpatel/sovereign

Browser-based geopolitical simulator — apply policy levers to 18 countries and watch cascading effects over 60 months

d3 data-visualization geopolitics monte-carlo nextjs simulation typescript web-worker

Last synced: 28 Jun 2026

https://github.com/jakobtroidl/barrio

A visual tool to compare and analyze nanoscale brain structures.

comparison data-visualization neuroscience scientific-visualization

Last synced: 09 Apr 2026

https://github.com/holy-angel-university/global-cost-index-analysis

This analysis explores the cost of living across various countries, aiming to provide insights into economic disparities and living standards on a global scale. Utilizing a dataset that includes indices for overall cost of living, groceries, restaurant prices, and rent, we investigate the top and least expensive countries worldwide.

data-science data-visualization exploratory-data-analysis jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/s1dewalker/electric-future

Visual Analysis: Future of Automotive Industry

data data-visualization machine-learning python3 regression-analysis tableau

Last synced: 02 May 2026

https://github.com/gerhynes/d3-notes-app

A simple notes app built to practice D3 selection methods. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 03 May 2026

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

A solution for detecting and preventing fraud in financial transactions, combining machine learning, business rules, and advanced statistical analysis. The system provides an interactive dashboard for real-time monitoring, data analysis, and fraud alert management.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to scrape data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 17 Jun 2026

https://github.com/corey-richardson/microbit-data-logger

In preparation for Work Experience Students coming in, I am using this project to familiarise myself with the BBC micro:bits which we will provide them with. I am also using it as a chance to expand on my data visualisation with Python experience.

data-visualization matplotlib microbit pandas pyplot signal-processing

Last synced: 03 May 2026

https://github.com/vipulbunny/restaurant-insight-analysis

A comprehensive data analysis project exploring restaurant ratings, locations, and customer sentiments. This project includes data preprocessing, descriptive analysis, geospatial mapping, sentiment analysis, and price-rating correlations using Python and visualization tools.

data-analysis data-preprocessing data-visualization folium geospatial geospatial-analysis geospatial-visualization machine-learning nlp pandas python restaurant-insights seaborn sentiment-analysis

Last synced: 03 May 2026

https://github.com/nathadriele/diabetes-clinical-etl-pipeline

This public health data engineering project implements a complete pipeline to collect, clean, standardize, validate, integrate, and visualize public SUS data related to diabetes mellitus in Brazil, filtering by ICD-10 codes E10 to E14.

cid data-analysis data-extraction data-pipeline data-science data-structures data-visualization datasus diabetes-detection diabetes-prediction epidemiology-analysis etl-pipeline healthcare-analytics ibge logger pytest sih streamlit sus

Last synced: 09 Jun 2026

https://github.com/akhil-krishnan-r/super_market_analysis

The number of supermarkets in the most populated cities is growing, and market competition is high. This dataset contains the historical sales of a supermarket company, recorded in 3 different branches over 3 months. Predictive data analytics methods are easy to apply to this dataset.

data-visualization matplotlib numpy pandas seaborn

Last synced: 03 May 2026

https://github.com/sinsunsan/earth-survival-kit

Global warming data visualisation app to help everyone understand global warming and take actions that matter

angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts

Last synced: 05 May 2026

https://github.com/shivabajelan/belly-button-biodiversity-dashboard

Belly Button Biodiversity Dashboard is an open-source interactive dashboard that visualises the Belly Button Biodiversity dataset. Built with JavaScript, D3.js, Plotly.js, HTML, and CSS, the dashboard's features include a dropdown menu, a horizontal bar chart, a bubble chart, and a demographic information display.

css d3 dashboard data-visualization html javascript plotly visualization

Last synced: 03 May 2026

https://github.com/imgios/dependensee

📊 Minimal relaxed functional dependencies set visualization webapp.

angular d3js data-visualization ionic rfd typescript visual-analytics webapp

Last synced: 03 May 2026

https://github.com/pejpero/multi-class_classifier

This project demonstrates multiclass classification using Perceptron and Logistic Regression, implemented from scratch without using built-in libraries. It includes techniques like One-Versus-The-Rest and One-Versus-One for Perceptron, and Softmax for Logistic Regression, with a focus on understanding core ML concepts.

data-visualization logistic-regression machine-learning multiclass-classification one-vs-one one-vs-rest perceptron python scratch-implementation softmax

Last synced: 09 Jun 2026
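
The multi-class_classifier entry above mentions Softmax for Logistic Regression implemented from scratch. As a rough illustration of that idea only (not code from the repository; the function names `softmax` and `predict` are hypothetical), a library-free softmax might look like this:

```python
import math


def softmax(scores):
    """Turn raw class scores into probabilities that sum to 1."""
    # Subtracting the maximum score keeps exp() from overflowing;
    # it does not change the resulting probabilities.
    shift = max(scores)
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict(scores):
    """Return the index of the class with the highest probability."""
    probs = softmax(scores)
    return max(range(len(probs)), key=probs.__getitem__)
```

Calling `predict` on a list of per-class scores returns the position of the largest one, since softmax preserves the ordering of its inputs.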

https://github.com/parmaster/logserver

Collecting telemetry data posted by IoT devices into MQTT, filtering, processing and visualizing results

data-visualization golang-template iot-application mqtt mqtt-client template

Last synced: 04 May 2026

https://github.com/joaopalmeiro/altair-playground-app

An application powered by Streamlit to test different configurations on an Altair chart.

altair data-visualization streamlit

Last synced: 04 May 2026

https://github.com/nitrocosmstudios/temperature-sonification

Experimental sonification of temperature sensor data revealing structured, speech-like patterns coinciding with geomagnetic disturbances. Includes original dataset processing, synthetic signal attempts, and full write-up.

audio-analysis data-visualization electromagnetic-interference emf environmental-monitoring open-science raspberry-pi signal-processing sonification temperature temperature-monitoring temperature-sensor usb-devices weird-science

Last synced: 04 May 2026

https://github.com/ljadhav25/logistic-regression-data-science-

Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.

data-analysis data-science data-visualization logestic-regression machine-learning

Last synced: 04 May 2026
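
The logistic regression entry above describes estimating the probability of an event from independent variables. A minimal sketch of that calculation, assuming the weights and bias have already been fitted (the names `sigmoid`, `predict_probability`, `weights`, and `bias` are illustrative and not taken from the repository):

```python
import math


def sigmoid(z):
    """Map any real number to a value strictly between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_probability(features, weights, bias):
    """Estimate the event probability as sigmoid(bias + w . x)."""
    # Weighted sum of the independent variables plus the intercept.
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)
```

A probability above a chosen threshold (0.5 is a common default) would then be read as the event occurring, for example "voted".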

https://github.com/farseenmanekhan1232/analyse-economic-cycle

A Python-based CLI tool for analyzing economic cycles and making data-driven investment decisions in the Indian stock market using the Kite Connect API.

data-visualization investment matplotlib portfolio-optimization python stock-market

Last synced: 30 Jul 2025

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 04 May 2026

https://github.com/flytomarsz/bike-sharing-system-analysis

This analysis project aims to identify bike rental behavior in 2012 in the Capital Bikeshare system, Washington, D.C., USA. This project is part of my Data Analysis study at Dicoding.

data-analysis data-visualization jupyter-notebook python streamlit

Last synced: 04 May 2026

https://github.com/mattsebastianh/Making-a-Visual-Argument

Data Visualization with Matplotlib | Making a Visual Argument in Matplotlib

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/s4rrar/israel-gaza-war

Human losses of the Israeli genocide in Gaza and the West Bank in numbers (October 7th War Victims)

data-engineering data-science data-visualization datascience jupyter jupyter-notebook python

Last synced: 05 May 2026

https://github.com/mattsebastianh/Make-a-Line-Chart

Data Visualization with Matplotlib | Matplotlib Fundamentals

data-visualization matplotlib pandas-dataframe python

Last synced: 18 Jun 2026

https://github.com/zborovskaanna/e-commerce-web-events-analysis

SQL project based on the BigQuery public database 'The Look e-Commerce' and a dashboard in Looker Studio

analysis bigquery dashboard data-visualization looker-studio sql

Last synced: 03 Jan 2026

https://github.com/femincan/d3-bar-chart

My solution for the Visualize Data with a Bar Chart project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 06 May 2026

https://github.com/rishisolanke/twitter-sentiment-analysis-using-machine-learning-

A research project that classifies tweets as positive, negative, or neutral using ML algorithms (Logistic Regression, Naïve Bayes, SVM) with NLP preprocessing.

data-science data-visualization logistic-regression machine-learning ml-models naive-bayes natural-language-processing nlp scikit-learn sentiment-analysis svm text-classification twitter-data

Last synced: 06 May 2026

https://github.com/iamrajmani/sentimental-analysis

Sentimental Analysis - Final Year College Project

data-analysis data-visualization machine-learning python pytorch

Last synced: 06 May 2026

https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda

Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.

data-analysis data-science data-visualization eda matplolib numpy pandas python

Last synced: 06 May 2026

https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds

Includes recommendations to the UK Department for Education on 10 Local Authorities where the National Tutoring Programme (NTP) should be intensified, and a response to the UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target that seems far-fetched.

data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics

Last synced: 21 Sep 2025

https://github.com/harish-siva/analyzing-personal-expenses

This project aims to simulate an expense tracker for an individual using the Faker library. The project generates realistic monthly expense data, processes and stores it in a SQL database, and creates SQL queries to derive insights into spending behavior. A Streamlit app is developed to visualize these insights and showcase the results of SQL queries

data-visualization expense-tracking exploratory-data-analysis financial-analysis matplotlib pandas python seaborn sql sqlalchemy streamlit

Last synced: 06 May 2026

https://github.com/sahilmate/ebm-breast-cancer-classifier

This repository implements an Explainable Boosting Machine (EBM) model for breast cancer classification using scikit-learn and interpret. The project includes data preprocessing, model training, accuracy evaluation, and feature importance visualization.

breast-cancer-classification data-visualization explainable-boosting-machine feature-importance interpret machine-learning scikit-learn

Last synced: 06 May 2026