An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/romerorodriguezd/housing-market-tracker

Jupyter Notebook to store and evaluate the long-run evolution of house sales, based on data from the Idealista website.

data-visualization matplotlib pandas python scraping scraping-python sqlite3

Last synced: 06 May 2026

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file.

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost

Titanic Survival Prediction Project (93% Accuracy) 🛳️ In this notebook, the goal is to correctly predict whether someone survived the Titanic shipwreck using different machine learning models and hyperparameter tuning.

classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost

Last synced: 06 May 2026

https://github.com/scarblase/portfolioprojects

A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊

csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql

Last synced: 06 May 2026

https://github.com/himanchalchandra/science-canvas

Repo containing projects I did during a four-month bootcamp on Data Science and Machine Learning organized by Science Canvas India.

data-mining data-science data-visualization machine-learning-algorithms mysql nlp-machine-learning

Last synced: 06 May 2026

https://github.com/vietdoo/real-estate-marketplace

A web app that visualizes homes for sale on a cluster map. Data is continuously fetched from MongoDB, with filtering functions and APIs for finding homes.

api bigdata data-visualization flask maps mongodb reactjs webscraping

Last synced: 07 May 2026

https://github.com/ssreeramj/binod-detector

Scrapes the comments of a YouTube video and shows the distribution of comments containing 'binod'.

data-visualization heroku python streamlit youtube-api

Last synced: 16 May 2026

https://github.com/backdoorali/insider-threat-detection-project

Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.

cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence

Last synced: 07 May 2026

https://github.com/y-india/retail-sales-analysis-project

Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.

ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit

Last synced: 07 May 2026

https://github.com/1ayanabil1/iris-visualization

This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.

analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python

Last synced: 07 May 2026

https://github.com/oldhero5/talent_track

TalentTrack is an open‐source recruitment analytics web application built with Flask and Python. It leverages advanced machine learning techniques—such as Product Quantization (PQ) for candidate ranking and SHAP for model interpretability—to help HR teams and recruitment professionals identify high-quality candidates efficiently.

active-learning analytics candidate-ranking data-visualization faiss flask hrtech machine-learning open-source python recruitment shap talent-analytics

Last synced: 07 May 2026

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/darkmenacer/google-geolocation-bias

Tool to scrape web links for different queries from different cities across the world, all from your home

data-visualization postgresql python3 selenium webscraping

Last synced: 08 May 2026

https://github.com/iguptashubham/ott-churn-eda-ml

Understanding why customers discontinue their subscriptions is crucial for optimizing the user experience, reducing churn, and maximizing customer lifetime value. This project uses a machine learning model to predict customer churn.

data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python

Last synced: 08 May 2026

https://github.com/md-emon-hasan/4-eda-football-ml-app

An ML application focused on exploratory data analysis and football analytics, featuring data visualization and insights using Python and relevant libraries.

data-science-projects data-visualization eda exploratory-data-analysis football-analytics sports-analytics webapp

Last synced: 08 May 2026

https://github.com/sniperwolf/vis-gem

Wraps vis.js dependencies for use in a Rails project.

charts data-visualization javascript network rails ruby timeline visualization

Last synced: 09 May 2026

https://github.com/fatihilhan42/covid-19-data-analysis-visualization

The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.

covid-19 data-science data-visualization pandas python visualization

Last synced: 09 May 2026

https://github.com/talha-1010/imdb-data-analysis

A data analysis project made with Python using pandas.

data-analysis data-visualization jupyter-notebook pandas pandas-dataframe

Last synced: 09 May 2026

https://github.com/douglasvolcato/fiis-analysis-brasilian-market

Brazilian investment fund analysis focused on dividend yield and minimum price variation.

data-science data-visualization pandas python scraping scraping-websites

Last synced: 09 May 2026

https://github.com/barrarrr/fly-in

A dynamic, terminal-based drone network simulation application.

42 42school a-star algorithms breadth-first-search data-visualization drone fly-in

Last synced: 10 Jun 2026

https://github.com/deva-246/business-insights-on-realtime-swiggy-data-using-python

Data analysis for business decision-making, with insights from a real-time segment of Swiggy data.

data-visualization jupyter pandas python seaborn

Last synced: 10 May 2026

https://github.com/kalelmartinho/7daysofcode

This project aims to give a taste of the day-to-day work of a data scientist. It is a one-week challenge proposed by Alura.

7daysofcode alura data-cleaning data-science data-visualization forecasting machine-learning

Last synced: 10 May 2026

https://github.com/neelanjan-chakraborty/custoclarity

CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.

clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn

Last synced: 11 May 2026

https://github.com/scarblase/russian-military-losses-analysis

This repository provides an in-depth analysis of Russian equipment losses using PySpark and data visualization techniques.

data data-science data-visualization jyputer-notebook matplotlib pyspark python3 seaborn seaborn-plots ukraine ukraine-invasion

Last synced: 12 May 2026

https://github.com/mtrebi/d3_cars

Cars Dataset Visualization (PCP) using d3.js

d3js data-visualization javascript

Last synced: 12 May 2026

https://github.com/is-leeroy-jenkins/sherpa

A budget execution and data analysis tool for EPA analysts, built on WinForms and .NET 6 and written in C#.

budget-management data-analysis data-science data-visualization federal-government

Last synced: 13 May 2026

https://github.com/mpolinowski/victory-data-chart

React.js components for modular charting and data visualization

chart css-grid-layout data-visualization react styled-components victory

Last synced: 13 May 2026

https://github.com/matesoft2033/temperature-and-soil-moisture-sensor

The Humidity and Temperature Monitoring System is an Arduino project that measures and displays humidity and temperature levels in real-time. It uses LEDs to indicate environmental conditions, making it a great tool for learning about sensors and data visualization.

arduino data-visualization electronics-engineering enviromental-sciences sensor-technology troubleshooting-and-debugging

Last synced: 13 May 2026

https://github.com/vjo/d3-punchcard

D3 Punchcard chart 📊●•●

chart d3js data-visualization library punchcard visualization

Last synced: 13 Jun 2026

https://github.com/soufianboukir/ecom-analytics-platform

End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.

data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries

Last synced: 14 Jun 2026

https://github.com/kaushik0911/jubilant-guide

A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.

data-analysis data-visualization geospatial-analysis python streamlit

Last synced: 16 Jun 2026

https://github.com/akshadk7/exploratory-data-analysis

Implementing EDA and Machine Learning Algorithms on Kaggle Car Dataset

data-visualization exploratory-data-analysis machine-learning-algorithms predictive-modeling

Last synced: 17 Jun 2026

https://github.com/dorukalkan/3a-superstore-analysis

End-to-end data analysis, machine learning, and visualization project on a Turkish supermarket dataset

data-science data-visualization dbt machine-learning power-bi python sql

Last synced: 20 Jun 2026

https://github.com/alicankaya192/world-happiness-report-2025

Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).

correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report

Last synced: 21 Jun 2026

https://github.com/matusf/glasgow_wifi

Script that plots Wi-Fi access points on a map and labels them by their protection.

data data-visualization folium python python3

Last synced: 24 Jun 2026

https://github.com/jeugregg/coronavirusmodel

Coronavirus Visualization & Modeling

coronavirus covid-19 data-visualization

Last synced: 24 Jun 2026

https://github.com/judahpaul16/plume

Map how user information flows through any codebase or infrastructure into a readable graphic

charts cli data-flow data-visualization documentation flow observability pii reports tooling

Last synced: 25 Jun 2026

https://github.com/karthikmprakash/911-call-dataanalysis

Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA

911-call-analysis data-analysis data-visualization python3 united-states-data

Last synced: 10 May 2026

https://github.com/cwendorf/polyplot

Visualizing Distributional Statistics [R Package]

data-visualization distribution-shape polyplot r r-package statistics

Last synced: 15 Mar 2025

https://github.com/mitevpi/vue-d3-bar-chart

Reusable, reactive, animated bar chart using D3 + Vue.js. Written in idiomatic Vue, rather than D3 syntax.

d3 data data-visualization frontend interactive svg vue web

Last synced: 18 May 2026

https://github.com/harshals499/ecosecure-visualization

Data visualization project using Qlik to analyze sales performance for EcoSecure Systems.

business-intelligence data-analysis data-visualization qlik-sense sales-analysis

Last synced: 12 Jun 2026

https://github.com/hamdaniqhmqd/project-predict-saham-bbri

Repository Project-Predict-Saham-BBRI is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/eduardotlc/materials_chempy

Python modules targeting the Materials Chemistry area, with different submodules, focused on data analysis and visualization.

chemistry data-mining data-visualization mass-spectrometry materials-science spectrophotometry spectroscopy

Last synced: 16 Jan 2026

https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-

🎧 Top Hits Spotify (2000 - 2019): Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻

analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits

Last synced: 01 Jul 2025

https://github.com/ascender1729/cds-iisc-p1-datasci-predoc

A data science project utilizing machine learning to predict movie release years and genres based on directors' previous works.

data-cleaning data-visualization film-industry machine-learning movie-metadata multi-label-classification predictive-modeling regression-analysis-feature-engineering

Last synced: 31 Mar 2025

https://github.com/flower-of-the-bridges/va-project

Project for the Visual Analytics course from the Engineering in Computer Science master's degree at Sapienza

d3-js d3-visualization data-visualization data-viz visual-analytics

Last synced: 21 Feb 2026

https://github.com/tim-richter/spinne

Spins a web of component relationships for react projects

abstract-syntax-tree components data-visualization design-system jsx react stats tsx usage

Last synced: 10 Mar 2026

https://github.com/luke-feng/MAP

This is a master's project carried out in the CSG at UZH.

cybersecurity data-visualization

Last synced: 08 Apr 2025

https://github.com/g0bel1n/gravlaw-model

Gravlaw-model is a data visualization project for the econophysics concept of the gravity law, used for modeling inter-city traffic.

data-visualization econophysics gravity-model

Last synced: 15 Mar 2025

https://github.com/rayyan9477/calorie-burnage-exploratory-data-analysis

Calorie Burnage is the measure of calories burned during physical activity or exercise, crucial for weight management and fitness goals. This project focuses on analyzing a dataset that includes information on duration, pulse rates, and calories burned during exercise sessions.

data-analytics data-science data-visualization exploratory-data-analysis linear-regression r-language r-programming

Last synced: 08 Jul 2025

https://github.com/katiesaund/market_size_app

An interactive dashboard to plot a company's estimated market size.

data-visualization finance r rshiny shiny shinyapp

Last synced: 24 Mar 2025

https://github.com/jorgeatgu/apaga-luz

💡 How much does electricity cost? 💶

data data-visualization flat-data

Last synced: 04 Feb 2026

https://github.com/guruakaashjn/te_project_microsoft_ai

Statistical analysis of land-use plastic pollution in India using AI/ML techniques.

artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi

Last synced: 27 Feb 2025

https://github.com/stephanabitter534/flowy.blazor

🌐 Build responsive web applications effortlessly with VIOVNL.Flowy.Blazor, a Blazor component for smooth drag-and-drop functionality.

blazor component-library csharp data-visualization dotnet event-driven flowchart frontend interactive-graphics open-source single-page-application ui-framework user-interface web-application web-development

Last synced: 01 May 2026

https://github.com/sreejabethu/sales-data-analysis-forecasting

Welcome to the Sales Data Analysis & Forecasting project! 🚀 This repository showcases my data analysis skills through exploratory data analysis (EDA), data cleaning, and visualization of sales and customer feedback data. The goal is to extract actionable insights to drive business decisions.

analysis barchart data-visualization datacleaning exploratory-data-analysis forecasting histogram matplotlib-pyplot numpy-library pandas-library pycharm-ide sales-analysis salesdata salesdataanalysis seaborn-plots transformation

Last synced: 19 May 2026

https://github.com/noturlee/titanic-datamodel

This project demonstrates the process of data preprocessing, model training, evaluation, and tuning in building a predictive model for a classic dataset. The Random Forest model, with its ability to handle complex relationships and interactions between features, proved to be the most effective in this case.

data-modeling data-science data-visualization python

Last synced: 08 Apr 2025

https://github.com/gutyoh/netflix-dashboard

Dashboard created with Python 🐍 and the Dash 📊 framework to visualize Netflix’s content distribution and classification.

dash dashboard data-visualization plotly python

Last synced: 08 Apr 2025

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

This repo holds the course contents of the Python for Data Science training for Siemens, held in cooperation with Academy Peak Information Technologies Training and Consultancy from 28 June to 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 23 Mar 2025

https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project

This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.

apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres

Last synced: 03 Jul 2025

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 30 Mar 2025

https://github.com/nrobledosagredo/lda-topic-analysis-news-chile

Topic analysis on news from Chile using LDA for extracting and visualizing relevant patterns.

data-visualization latent-dirichlet-allocation

Last synced: 16 Mar 2025

https://github.com/aubainmbk/optimisation-ventes-supermarche

Our goal is to extract insights from a supermarket's sales data.

data-visualization dataanalysis excel powerbi

Last synced: 04 Feb 2026

https://github.com/ksmooi/mscs_data_visualization

This series of reports demonstrates the process of creating, refining, and evaluating interactive data visualizations using Altair, focusing on Seattle weather data, with an emphasis on accessibility, usability, and effectively communicating insights.

altair data-visualization

Last synced: 01 Apr 2025

https://github.com/samuelbarbosadev/roof_imoveis_data_analysis

The company hired you because they want to know what would be the 5 properties they should invest in and why, and which 5 you would not recommend investing in at all.

data-preparation data-understanding data-visualization pandas python

Last synced: 12 Apr 2026

https://github.com/mohagungnursalim/da-covid-data-cleansing

Identifying and correcting errors, inconsistencies, and inaccuracies within datasets related to the COVID-19 pandemic. This is essential for obtaining reliable and actionable insights from the data.

data-analytics data-visualization pandas-dataframe

Last synced: 25 Mar 2025

https://github.com/robthepcguy/ahk-mouse-heatmap

An AutoHotkey script that records left, right, and middle mouse clicks, logging the date, time, and x and y coordinates. It features automatic GUI updates and generates a visual heatmap via a Python script, accessible from the system tray. This tool is ideal for analyzing user interaction and creating detailed mouse activity maps.

autohotkey-script click-counter click-log click-map data-analysis data-visualization heatmap heatmap-visualization mouse-events mouse-tracking python

Last synced: 01 Apr 2025

https://github.com/yusuf-abol/alumni-interaction-and-conversation-dynamics-nlp

This Natural Language Processing (NLP) project took a dive into chat engagement dynamics within the University of Ilorin’s Class of 2018 Statistics alumni group. By applying Latent Dirichlet Allocation (LDA) for topic modeling and network analysis, I uncovered communication patterns, topic distributions, and member interactions.

alumni-network anonymization conversation data-science data-visualization engagement machine-learning network-analysis nlp python-3 sentiment-analysis statistics whatsapp

Last synced: 05 May 2026

https://github.com/getkey/stereotype-map

Map of national stereotypes

data-visualization google-suggestions vuejs vuex

Last synced: 12 Apr 2026

https://github.com/rahul-404/full_stack_data_science_with_generative_ai

Welcome to the repository for the course "Full Stack Data Science with Generative AI". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science and generative AI techniques.

data-analysis data-science data-visualization database deep-learning exploratory-data-analysis feature-engineering generative-ai machine-learning nlp python statistics

Last synced: 12 Apr 2026

https://github.com/alexeyraspopov/vepr

🚧 Visualization protocol for complex and sizable data visualizations

data-visualization zero-dependency

Last synced: 04 Jul 2025

https://github.com/geetisha/sales_insight_data_analysis_using_sql_and_tableau-etl-

Sales Insights: a data analysis project performed with Tableau and SQL.

dashboard data-analysis data-visualization mysql project sales-analysis sql tableau

Last synced: 07 Jan 2026

https://github.com/md-emon-hasan/3-eda-basketball-ml-app

An ML application focused on EDA and basketball analytics, showcasing data visualization and insights using Python and relevant libraries.

basketball-analysis csv data-visualization eda exploratory-data-analysis exploratory-data-analysis-eda ml-app

Last synced: 12 Apr 2026

https://github.com/superchordate/data-viz-talk

Resources from the "Data Visualization in Business Communication" presentation at the 2023 Gamma Iota Sigma Regional Conference in Fort Worth, TX.

data-visualization eda exploratory-data-analysis public-data

Last synced: 29 Jul 2025

https://github.com/aromoh/data-visualization-nobel-laureates

Data visualization using an interactive dashboard created in Tableau. The dataset was obtained from Kaggle.

dashboard data-visualization kaggle-dataset nobel-laureates tableau

Last synced: 07 Jan 2026

https://github.com/isatyamks/chatsense

This is my first machine learning model, designed to predict the mood and behavior of users by analyzing their WhatsApp chat archives.

analysis artificial-intelligence behavior data-visualization machine-learning matplotlib pandas prediction seaborn vercel-deployment whatsapp-chat wordcloud

Last synced: 07 Jan 2026

https://github.com/anurag-kumar-molankala/blinkit-grocery-sales-dashboard

The BlinkIT Grocery Sales Dashboard is an interactive Power BI dashboard that provides insights into grocery sales performance. It includes key KPIs, sales trends, and outlet performance analysis.

business-intelligence dashboards data-analysis data-visualization dax excel kpi-dashboard power-bi powerquery slicers-kpi-card-multirow-card sql-server ssis ssms ssrs-reports

Last synced: 09 Apr 2025