An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ipanalytics/vpn-infrastructure-atlas

Interactive VPN infrastructure atlas: aggregate provider footprint by reported endpoint country, ASN, and hosting network. No raw configs or per-IP feed.

asn-analysis cybersecurity data-visualization fraud-detection github-pages ip-reputation network-intelligence proxy-detection threat-intelligence vpn-intelligence

Last synced: 25 May 2026

https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning

An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.

class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote

Last synced: 02 May 2026

https://github.com/ryanfranklin237/data-visualization-python

A tool that allows you to visualize data from a csv or excel file in a graph or charts form

data-analysis data-science data-visualization matplotlib pandas-dataframe python

Last synced: 11 Jun 2026

https://github.com/odeyiany2/flit-apprenticeship-data-science-projects

This repo contains all my projects for my FLiT Apprenticeship

data-analysis data-science data-visualization machine-learning sql

Last synced: 17 May 2026

https://github.com/ipanalytics/vpn-infrastructure-intelligence-lab

Aggregate VPN infrastructure intelligence dataset and interactive dashboard for provider, country, ASN, relationship, archetype, and hosting dependency analysis without publishing raw endpoint data.

asn cybersecurity data-visualization geoip github-pages hosting-analysis infrastructure-intelligence internet-measurement network-analysis osint privacy-research static-dashboard threat-intelligence vpn vpn-research

Last synced: 25 May 2026

https://github.com/samjtro/tri-map

EPA-TRI Data Visualization & Analysis

data-visualization emissions environment python

Last synced: 01 Sep 2025

https://github.com/sankalp130/call-center_data-analysis

This repository contains Tableau dashboard, visualization, and data analysis projects I have created.

data-cleaning data-schema data-visualization tableau

Last synced: 05 Jan 2026

https://github.com/katiesaund/market_size_app

An interactive dashboard to plot a company's estimated market size.

data-visualization finance r rshiny shiny shinyapp

Last synced: 24 Mar 2025

https://github.com/owengombas/r_place

🖼 Data analysis on r/place 2022

data-science data-visualization rplace

Last synced: 18 Jan 2026

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/shantoroy/data-visualization-streamlit-app

Simple csv data analytics platform built using Python Streamlit

data-analytics data-visualization machine-learning python streamlit

Last synced: 13 Apr 2026

https://github.com/sugapriya-k/final-year-project

An AI-powered personal finance tracker to manage expenses, predict spending, and plan budgets using machine learning.

ai budget-tracker data-visualization final-year-project finbert gradient-boosting-decision-tree lstm ml personal-finance-manager postgresql prophet-model python stanza web-speech-api

Last synced: 25 Feb 2026

https://github.com/ahmednasef3/udemy-courses-full-eda

Exploratory Data Analysis on the factors that can affect the promotions and earnings in Udemy Courses and the perfect way to make a good saled course in Udemy.

data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project

Last synced: 01 May 2026

https://github.com/xilinjia/csv-analyser

Visualize and analyze data in csv files. A multiplatform project runnable on Linux, Android, Windows, MacOS

android compose csv data-science data-visualization kotlin kotlin-multiplatform linux macos multiplatform windows

Last synced: 09 Apr 2026

https://github.com/ashwathnakate/sales-dashboard

A sales dashboard made purely in Python and deployed using shinyapps

dashboard data-visualization python shiny-apps

Last synced: 05 Oct 2025

https://github.com/czoido/conan-graph3d

Generate a Conan dependency graph visualization in 3d

conan cpp data-visualization

Last synced: 20 Jan 2026

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 10 May 2026

https://github.com/sayamalt/black-friday-sales-prediction

Successfully established a machine learning regression model which can estimate the gross Black Friday sales for a particular customer, based on a distinct set of related and meaningful features, to a fair level of accuracy.

artificial-neural-networks data-visualization deep-learning exploratory-data-analysis feature-engineering machine-learning model-deployment model-training-and-evaluation regression-analysis regression-modelling regression-testing supervised-learning

Last synced: 09 Nov 2025

https://github.com/ksmooi/mscs_data_visualization

This series of reports demonstrates the process of creating, refining, and evaluating interactive data visualizations using Altair, focusing on Seattle weather data, with an emphasis on accessibility, usability, and effectively communicating insights.

altair data-visualization

Last synced: 01 Apr 2025

https://github.com/neoxio/charts

Neoxio.Charts is a data visualization library for .NET WPF applications.

chart data-visualization line plot time-range timeline wpf

Last synced: 14 Jan 2026

https://github.com/ascender1729/cds-iisc-p1-datasci-predoc

A data science project utilizing machine learning to predict movie release years and genres based on directors' previous works.

data-cleaning data-visualization film-industry machine-learning movie-metadata multi-label-classification predictive-modeling regression-analysis-feature-engineering

Last synced: 31 Mar 2025

https://github.com/gracysapra/r-in-data-science

This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.

data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science

Last synced: 19 Apr 2026

https://github.com/guidodipietro/fmcranks

Raw text input to nicely formatted XLSX file + PNG image of the range

data-analysis-python data-visualization python

Last synced: 26 Jan 2026

https://github.com/ondrejhruby/countries-of-the-world

Explore global data with this repository, featuring insights, visualizations, and Python code examples on countries worldwide—perfect for enhancing your data analysis and visualization skills.

data-analysis data-science data-visualization geography jupyter-notebook machine-learning matplotlib pandas python statistics

Last synced: 16 Apr 2026

https://github.com/flazefy2/ds-global_cybersecurity_threats

https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

data-science data-visualization numpy python squarify statistics

Last synced: 05 May 2026

https://github.com/14richa/forecastflow

ForecastFlow is a web application built with Streamlit that visualizes solar and wind power forecasts from Elexon.

analysis data-visualization python streamlit

Last synced: 30 Apr 2026

https://github.com/ahmetcansolak/developer-insights

New project of ClubRockers from Sarıyer Hills

bitbucket data data-science data-visualization github python3

Last synced: 28 Apr 2026

https://github.com/sunnybibyan/random_data_generation

A project that generates a dataset using various statistical distributions (Normal, Uniform, Exponential, Random Integers, and Binomial) and performs data analysis. Includes visualizations and an option to export the data as a CSV file.

data-analysis data-visualization python random-data-generation statistics streamlit-webapp

Last synced: 13 Jun 2026

https://github.com/mkk-1817/hr-attrition

This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.

data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python

Last synced: 03 May 2026

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 05 May 2026

https://github.com/mohini1403/road_accident_data_analytics

This project aims to analyze road accident data to gain insights into the factors contributing to accidents, identify patterns, and propose data-driven recommendations for improving road safety. The dataset used in this project contains information about various aspects of road accidents, such as location, time, weather conditions, and severity.

analytics data-visualization pandas powerbi

Last synced: 14 Jun 2026

https://github.com/affec-ds/dashboard-ventas-vinilos

Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.

business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records

Last synced: 30 Apr 2026

https://github.com/flazefy2/ds-customer_shopping

https://www.kaggle.com/datasets/bhadramohit/customer-shopping-latest-trends-dataset

data-science data-visualization jupiter-notebook numpy python sales shopping squarify statistics

Last synced: 05 May 2026

https://github.com/kaushik0911/jubilant-guide

A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.

data-analysis data-visualization geospatial-analysis python streamlit

Last synced: 16 Jun 2026

https://github.com/akshadk7/exploratory-data-analysis

Implementing EDA and Machine Learning Algorithms on Kaggle Car Dataset

data-visualization exploratory-data-analysis machine-learning-algorithms predictive-modeling

Last synced: 17 Jun 2026

https://github.com/bhavik444/techistanbul_python_bootcamp

👨💻 Master Python programming through practical exercises in this 80-hour bootcamp, designed for beginners to advanced learners.

algorithms api-development automation coding-bootcamp data-analysis data-visualization django flask git machine-learning python software-engineering testing web-development web-scraping

Last synced: 28 Apr 2026

https://github.com/msohaill/wrappedify

Full stack implementation of Wrappedify

data-visualization express music socket-io spotify sveltekit

Last synced: 05 May 2026

https://github.com/anastasius21/imdb-movie-analysis

Analysis of IMDb's Top 1000 Movies dataset using Pandas, Matplotlib, and Seaborn. It provides visualizations and insights into various aspects of movies, such as ratings, genres, directors, and release years.

data-analysis data-exploration data-science data-visualization imdb imdb-dataset jupyter-notebook python

Last synced: 25 Apr 2026

https://github.com/dorukalkan/3a-superstore-analysis

End-to-end data analysis, machine learning, and visualization project on a Turkish supermarket dataset

data-science data-visualization dbt machine-learning power-bi python sql

Last synced: 20 Jun 2026

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/alicankaya192/world-happiness-report-2025

Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).

correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report

Last synced: 21 Jun 2026

https://github.com/macagua/entrenamiento.data_scientist_python

Repositorio de manuales y recursos del entrenamiento "Data Scientist en Python" realizado por Leonardo J. Caballero G.

data-analytics data-scientist data-visualization numpy pandas-dataframe python37 streamlit

Last synced: 01 May 2026

https://github.com/pradipece/interactive_analytics_data_automation

This project allows users to control complex dataset processing with Python programming by merging Streamlit with Plotly for visualizations.

automation data-science data-visualization pandas plotly streamlit

Last synced: 01 May 2026

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/alexandertiopan1212/arbidex

ArbiDex is a smart prediction market arbitrage dashboard that detects mismatches between similar Manifold markets using fuzzy logic, complete with actionable insights, weekly trend tracking, and historical database.

ai-trading altair data-visualization finance fuzzy-matching manifold-markets prediction-markets python sqlalchemy sqlite streamlit

Last synced: 21 Apr 2026

https://github.com/rudra-g-23/find-my-joint

A utility to find potential join keys (matching columns) across multiple DataFrames.

data-analysis data-visualization join network-graph pandas pandas-dataframe

Last synced: 24 Jun 2026

https://github.com/mulliru/estudo-data-visualization

Este repositório contém os códigos que desenvolvi e utilizei durante o curso de Visualização de Dados da Alura. Aqui, você encontrará diversos scripts e notebooks que abordam os principais conceitos e técnicas de Data Science, aplicados em diferentes contextos e projetos.

alura data-visualization

Last synced: 08 Jun 2026

https://github.com/camara94/text-from-image

Dans ce projet j'ai utilisé la librairie EasyOCR qui est un projet open source de deep learning, qui permet d'extraire facilement du text à partir des images.

algorithms data-visualization datanalysis deep-learning machine-learning python3 rnn-tensorflow

Last synced: 25 Apr 2026

https://github.com/alejo1630/ibm_capstone_project

This project aims to leverage predictive analytics to forecast the outcomes of rocket launches for Space Y, a new player in the commercial space industry.

data-collection data-science data-visualization data-wrangling exploratory-data-analysis machine-learning predictive-modeling python spacex

Last synced: 01 May 2026

https://github.com/kitestring/scidata

Extracts and cleanses flat data exported from a Chemical Analyzer instrument (Time of Flight Mass Spectrometer), then loads data into a SQL database using a Star schema. A basic CLI is implemented to query SQL database to create data visualizations which describe instrument performance in a simple and digestible manner. The following python libraries are utilized: numpy, pandas, matplotlib, sklearn, statistics, sqlite3, os, time, & datetime.

data-visualization extract matplotlib pandas python3 sql-database

Last synced: 25 Apr 2026

https://github.com/com-480-data-visualization/project-2023-the-vizards

Lausanne Transportation : a data visualization of the Lausanne Transportation network. Developed by the Vizards team as part of the EPFL Data Visualization course project (COM-480).

buses data-analysis data-science data-visualization epfl lausanne map metro public-transport public-transportation switzerland webgl

Last synced: 01 May 2026

https://github.com/amruthadevops/suspicious_web_threat_interactions

To detect and analyze patterns in web interactions for identifying suspicious or potentially harmful activities

cyber-security data-analysis data-science data-visualization jupyter-notebook machine-learning powerbi python

Last synced: 29 Jun 2026

https://github.com/vyjayanthipolapragada/early_sepsis_detection_ml

An end-to-end project leveraging clinical datasets (PhysioNet, MIMIC-IV, MIMIC-IV-ED) to develop and compare ML and LSTM-based models for early sepsis prediction.

data-analysis data-visualization deep-learning healthcare jupyter-notebook keras-tensorflow lstm-neural-networks machine-learning neural-network python

Last synced: 28 Apr 2026

https://github.com/bladealex9848/presion_arterial

Aplicación para el seguimiento de la presión arterial, permitiendo el registro y visualización de mediciones de presión sistólica y diastólica.

blood-pressure-monitor chronic-disease-management data-visualization e-health e-healthcare healthcare-application medical-data-analysis patient-management python sqlite streamlit

Last synced: 06 May 2026

https://github.com/gitjeff05/nih-grant-awards

Analysis of NIH grant awards from ExPORTER data files

data-visualization jupyter-notebook jupyterlab matplotlib pandas python3 seaborn

Last synced: 06 May 2026

https://github.com/flazefy2/ds-global_health_statistics

https://www.kaggle.com/datasets/malaiarasugraj/global-health-statistics

data-science data-visualization dummy health jupiter-notebook python statistics

Last synced: 25 Apr 2026

https://github.com/asifdotexe/quickvu

Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.

automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit

Last synced: 06 Jun 2026

https://github.com/amirhosseinhonardoust/ai-personal-study-tracker

An AI-driven productivity tracking app built with Python, Streamlit, SQLite, and Machine Learning. It logs and analyzes study sessions, predicts productivity using Random Forest models, and visualizes key insights to help learners improve focus, habits, and overall academic efficiency.

ai data-analytics data-visualization education learning-analytics machine-learning productivity python random-forest self-improvement sqlite streamlit student-success study-tracker time-management

Last synced: 07 May 2026

https://github.com/sathyasris27/environmental-classification-based-on-gaming-patterns

The aim of this project is to create a more nuanced understanding of the interactions between socio-demographic characteristics, in-game behaviours, and global-scale environmental consciousness.

data-engineering data-mining data-science data-visualization python-3

Last synced: 28 Apr 2026

https://github.com/sivas-2/coffee-sales-visualization

This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.

data data-analysis data-science data-visualization python visualization

Last synced: 07 May 2026

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 28 Apr 2026

https://github.com/aravind-selvam/bikeshare-company-analysis

Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company

analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server

Last synced: 22 Apr 2026

https://github.com/teohrt/roulette-martingale-visualizer

Visually weigh the risk of the Martingale Roulette strategy

api data-visualization flask gambling matplotlib roulette-wheel-algorithm

Last synced: 28 Apr 2026

https://github.com/backdoorali/insider-threat-detection-project

Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.

cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence

Last synced: 07 May 2026

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 07 May 2026

https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice

A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.

climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather

Last synced: 02 May 2026

https://github.com/y-india/retail-sales-analysis-project

Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.

ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit

Last synced: 07 May 2026

https://github.com/oldhero5/talent_track

TalentTrack is an open‐source recruitment analytics web application built with Flask and Python. It leverages advanced machine learning techniques—such as Product Quantization (PQ) for candidate ranking and SHAP for model interpretability—to help HR teams and recruitment professionals identify high-quality candidates efficiently.

active-learning analytics candidate-ranking data-visualization faiss flask hrtech machine-learning open-source python recruitment shap talent-analytics

Last synced: 07 May 2026

https://github.com/md-emon-hasan/2-simple-bioinformatics-dna-ml-app

A simple bioinformatics application for DNA sequence analysis using machine learning techniques, implemented in Python.

bioinformatics data-science data-visualization deployment dna-sequences dna-sequencing supervised-machine-learning

Last synced: 09 Jun 2026

https://github.com/melogabriel/nubank-expenses-analysis

This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.

data-analysis data-visualization googlesheets lookerstudio pandas python

Last synced: 02 May 2026

https://github.com/ahmad-ali-rafique/pyviznotebook

PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.

analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization

Last synced: 06 Jun 2026

https://github.com/clubgamma/machine-learning-and-data-analysis-tasks

This repository is a part of Hacktoberfest 2022. I have created few basic task of machine learning and Exploratory data analysis. Have a look at it

contributions-welcome data-visualization exploratory-data-analysis hacktoberfest hacktoberfest2022 machine-learning-algorithms open-source python

Last synced: 25 Apr 2026

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/fengxiaoxiao-001/data_preprocessing

提供处理缺失值,处理异常值,处理特征工程以及多种数据绘图功能;适合大型数据,以及配备处理超多不同数据类型分布的方法

data-science data-visualization processing

Last synced: 29 Apr 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/gerhynes/d3-data-dashboard

A D3 data dashboard showing CO2 emissions per country. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization jaavscript

Last synced: 02 May 2026

https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration

Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.

data-analysis data-visualization dataframe mysql pandas plotly python streamlit

Last synced: 02 May 2026