An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/backdoorali/insider-threat-detection-project

Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.

cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence

Last synced: 07 May 2026

https://github.com/ali-el-badry/machine-learning-algorithm

A repo that contains different types of machine learning algorithms, such as regression, classification, and clustering, which will be added soon.

ai data-science data-visualization decision-tree feature-selection knn linear-regression logestic-regression machine-learning modelling random-forest svc svm titanic-kaggle xgboost xgboost-classifier

Last synced: 12 Jun 2026

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies a variety of statistical techniques, such as ARIMA, to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 07 May 2026

https://github.com/sunnybibyan/random_data_generation

A project that generates a dataset using various statistical distributions (Normal, Uniform, Exponential, Random Integers, and Binomial) and performs data analysis. Includes visualizations and an option to export the data as a CSV file.

data-analysis data-visualization python random-data-generation statistics streamlit-webapp

Last synced: 13 Jun 2026
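
The random_data_generation entry above describes sampling from several distributions and exporting the result as CSV. A minimal sketch of that idea, using only the Python standard library, might look like the following. It is not the repository's code: the function names, column names, and distribution parameters are all assumptions chosen for illustration.

```python
import csv
import io
import random


def generate_rows(n, seed=0):
    """Return n rows with one column per distribution (hypothetical layout)."""
    rng = random.Random(seed)  # seeded so repeated runs give the same data
    rows = []
    for _ in range(n):
        rows.append({
            "normal": rng.gauss(0.0, 1.0),        # mean 0, std dev 1
            "uniform": rng.uniform(0.0, 1.0),     # between 0 and 1
            "exponential": rng.expovariate(1.0),  # rate parameter 1
            "random_int": rng.randint(1, 100),    # inclusive on both ends
            # Binomial(10, 0.5): count successes in 10 Bernoulli trials.
            "binomial": sum(rng.random() < 0.5 for _ in range(10)),
        })
    return rows


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = generate_rows(5)
csv_text = to_csv(rows)
```

Writing to an in-memory buffer keeps the sketch free of file paths; passing an open file to `csv.DictWriter` instead would produce a CSV file on disk.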

https://github.com/vjo/d3-punchcard

D3 Punchcard chart 📊●•●

chart d3js data-visualization library punchcard visualization

Last synced: 13 Jun 2026

https://github.com/vishvamporwal/pharmassist

A progressive web app made with Flask for industrial pharmaceutical management and analysis, to improve efficiency and make management easier.

data-visualization flask html-css-javascript python

Last synced: 07 May 2026

https://github.com/soufianboukir/ecom-analytics-platform

End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.

data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries

Last synced: 14 Jun 2026

https://github.com/scarblase/sales_insights

A data-driven analysis of 15,000 sales records using Python, Pandas, and visualizations to uncover trends, optimize strategies, and enhance business performance. 🚀📊

data-analysis data-visualization dataset matplotlib-pyplot pandas python3 sales-analysis seaborn

Last synced: 05 May 2026

https://github.com/y-india/retail-sales-analysis-project

Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.

ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit

Last synced: 07 May 2026

https://github.com/divyanshu-rawat/data-visualization-highmaps

Built using the Highcharts JavaScript API to visualize data! 🎓

bootstrap data-visualization geolocation-api highcharts javascript jquery

Last synced: 07 May 2026

https://github.com/1ayanabil1/iris-visualization

This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.

analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python

Last synced: 07 May 2026

https://github.com/rayyan9477/house-price-prediction-model

This project aims to predict house prices using a machine learning model. The project involves data cleaning, feature engineering, model selection, training, and evaluation. The dataset is uploaded by the user, and the model is trained to predict house prices based on various features.

data-science data-visualization gridsearchcv machine-learning machine-learning-algorithms notebook python random-forest

Last synced: 05 May 2026

https://github.com/kaushik0911/jubilant-guide

A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.

data-analysis data-visualization geospatial-analysis python streamlit

Last synced: 16 Jun 2026

https://github.com/topfunky/learning-r-stats

Scripts and data from learning to use the R statistics and charting software

data-visualization r statistics

Last synced: 16 Jun 2026

https://github.com/oldhero5/talent_track

TalentTrack is an open‐source recruitment analytics web application built with Flask and Python. It leverages advanced machine learning techniques—such as Product Quantization (PQ) for candidate ranking and SHAP for model interpretability—to help HR teams and recruitment professionals identify high-quality candidates efficiently.

active-learning analytics candidate-ranking data-visualization faiss flask hrtech machine-learning open-source python recruitment shap talent-analytics

Last synced: 07 May 2026

https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice

A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.

climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather

Last synced: 02 May 2026

https://github.com/nicholas-miklaucic/rho_plus

The Python data viz nitro canister you didn't know you needed

aesthetics bokeh colormap data-visualization matplotlib plotly python

Last synced: 05 May 2026

https://github.com/flazefy2/ds-global_cybersecurity_threats

https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

data-science data-visualization numpy python squarify statistics

Last synced: 05 May 2026

https://github.com/vara-co/python-api-challenge

Weather and Perfect Vacationing Spots Worldwide, by using APIs

api apis data-analysis data-visualization hvplot jupyter-notebook matplotlib pandas vacation weather

Last synced: 05 May 2026

https://github.com/kirkalyn13/open-signal-report-generator

Script used to generate results/summary, including the trends of flagged provinces, from the raw Excel data file

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 19 Jun 2026

https://github.com/zoliqua/venn-diagram-lab

🟢 🟤 Interactive Venn diagram viewer & editor — 44 SVG models (2–9 sets), pre-computed region paths, React + TypeScript + Vite

adelaide bioinformatics data-visualization edwards-venn euler hamilton interactive manawatu massey palmerton-north python-package react set-theory svg typescript upsetplot venn venn-diagram venndiagram victoria

Last synced: 05 May 2026

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 05 May 2026

https://github.com/dorukalkan/3a-superstore-analysis

End-to-end data analysis, machine learning, and visualization project on a Turkish supermarket dataset

data-science data-visualization dbt machine-learning power-bi python sql

Last synced: 20 Jun 2026

https://github.com/baggiponte/ta-statistics-for-big-data-2022

🎓 Introduction to Python and Machine Learning [UniMi • AY 2021/2022]

clustering data-science data-visualization machine-learning python scikit-learn

Last synced: 03 May 2026

https://github.com/pm25/youbike-station-finder

🚲 Display and visualize real-time information for Taipei YouBikes.

css3 d3js data-visualization html5 javascript visualization website youbike

Last synced: 08 May 2026

https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries

🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈

ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics

Last synced: 21 Jun 2026

https://github.com/riddhis2226/titanic-survival-data-analysis

Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.

data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3

Last synced: 01 May 2026

https://github.com/alcestide/scianalytics

Playground for data analysis and visualization for research and scientific purposes with Pandas and Plotly.

csv data-analysis data-science data-visualization pandas plotly python science-research statistics

Last synced: 30 Apr 2026

https://github.com/rohithay/customer-segmentation

Clean real online transaction data and apply the RFM technique to rank and group customers into clusters, identifying the best customers for targeted marketing campaigns

data-cleaning data-science data-visualization datetime marketing-analytics pandas python3 user-segmentation

Last synced: 03 May 2026
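
The customer-segmentation entry above mentions the RFM technique, which scores each customer on recency (days since the last purchase), frequency (number of purchases), and monetary value (total spend). A rough standard-library sketch is shown here. It is not the repository's implementation: the sample transactions, the reference date, and the simple rank-based scoring are assumptions made for illustration, and ties are not handled specially.

```python
from datetime import date

# Hypothetical transactions: (customer, purchase date, amount).
transactions = [
    ("alice", date(2024, 1, 5), 40.0),
    ("alice", date(2024, 3, 1), 60.0),
    ("bob", date(2023, 11, 20), 15.0),
    ("carol", date(2024, 2, 10), 200.0),
    ("carol", date(2024, 2, 25), 120.0),
    ("carol", date(2024, 3, 3), 80.0),
]


def rfm_table(transactions, as_of):
    """Aggregate recency (days), frequency, and monetary value per customer."""
    table = {}
    for customer, day, amount in transactions:
        entry = table.setdefault(
            customer, {"last": day, "frequency": 0, "monetary": 0.0}
        )
        entry["last"] = max(entry["last"], day)
        entry["frequency"] += 1
        entry["monetary"] += amount
    return {
        customer: {
            "recency": (as_of - entry["last"]).days,
            "frequency": entry["frequency"],
            "monetary": entry["monetary"],
        }
        for customer, entry in table.items()
    }


def rank_scores(values, lower_is_better):
    """Give each customer a score from 1 (worst) to n (best)."""
    ordered = sorted(values, key=values.get, reverse=lower_is_better)
    return {customer: i + 1 for i, customer in enumerate(ordered)}


table = rfm_table(transactions, as_of=date(2024, 3, 10))
r = rank_scores({c: v["recency"] for c, v in table.items()}, lower_is_better=True)
f = rank_scores({c: v["frequency"] for c, v in table.items()}, lower_is_better=False)
m = rank_scores({c: v["monetary"] for c, v in table.items()}, lower_is_better=False)

# Sum the three scores and list customers from best to worst.
total = {c: r[c] + f[c] + m[c] for c in table}
ranking = sorted(total, key=total.get, reverse=True)
```

In practice the scores are often quantile-based (for example, 1 to 5 per dimension) rather than plain ranks, and the combined score is then used to group customers into segments.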

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/rogernet/desafio-profissional-produto-data-driven

Helps train Product Analysts, PMs, and Business Managers capable of making strategic, data-driven decisions.

data-analysis data-science data-visualization product

Last synced: 23 Jun 2026

https://github.com/fluxswitch/lwscope

A lightweight UART oscilloscope/data visualizer made with DearPyGui

data-visualization dearpygui debugging-tool lightweight oscilloscope python uart

Last synced: 02 May 2026

https://github.com/wizardoftrap/football-team-analytics

This Jupyter notebook, created on Kaggle, analyzes football player and team statistics for the 2024-2025 season. It provides insights into player performance, team metrics, and playing styles across major European leagues using data from the dataset players_data-2024_2025.csv.

data-analysis data-visualization jupyter-notebook pandas python

Last synced: 05 May 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/matusf/glasgow_wifi

Script that plots Wi-Fi access points on a map and labels them by their protection

data data-visualization folium python python3

Last synced: 24 Jun 2026

https://github.com/jeugregg/coronavirusmodel

Coronavirus Visualization & Modeling

coronavirus covid-19 data-visualization

Last synced: 24 Jun 2026

https://github.com/tushard48/product-cluster-analysis

This project performs clustering analysis on a product dataset to identify and group similar products. The analysis includes data preprocessing, application of various clustering algorithms, and visualization of results to gain insights into product patterns. Key techniques used are K-Means, Mini Batch K-Means, evaluated using metr

data-visualization excel machine-learning powerbi streamlit unsupervised-learning

Last synced: 03 May 2026

https://github.com/flazefy2/ds-customer_shopping

https://www.kaggle.com/datasets/bhadramohit/customer-shopping-latest-trends-dataset

data-science data-visualization jupiter-notebook numpy python sales shopping squarify statistics

Last synced: 05 May 2026

https://github.com/darkmenacer/google-geolocation-bias

Tool to scrape web links for different queries from different cities across the world, all from your home

data-visualization postgresql python3 selenium webscraping

Last synced: 08 May 2026

https://github.com/judahpaul16/plume

Map how user information flows through any codebase or infrastructure into a readable graphic

charts cli data-flow data-visualization documentation flow observability pii reports tooling

Last synced: 25 Jun 2026

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes information about the loan amount and balance, then applies a clustering algorithm to group similar loans together.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 08 May 2026

https://github.com/hyoaru/philippine-poverty-area-estimates-choropleth

A web application providing a visual representation using a choropleth map of the estimated magnitude of poor families in the Philippines from the years 2006, 2009, 2012, and 2015.

data-visualization plotly python streamlit web-application

Last synced: 08 May 2026

https://github.com/kiranmayi5/python-projects

A collection of Python projects showcasing skills in data analysis and visualization.

data-analysis data-visualization machine-learning nlp python

Last synced: 05 May 2026

https://github.com/newking9088/marketing_campaign_ml_prediction_dashboard

Transform your marketing strategy with our intuitive ML Prediction Dashboard, providing real-time, data-driven insights to optimize campaign success.

data-visualization finance-application streamlit-webapp

Last synced: 29 Jun 2026

https://github.com/saeedkohansal/chart.js-tutorial-with-examples

Chart.js is a lightweight, open-source JavaScript library for creating stunning and interactive charts using HTML5 Canvas. It supports various chart types like bar, line, and pie charts, is easy to use, and offers customization options to suit any data visualization needs. Perfect for modern web applications!

barchart canvas chart chart-js chartjs charts css data-visualization datavisualization gilgeekify html html-chart html5 javascript linechart piechart programming scatterchart web-development webdevelopment

Last synced: 05 May 2026

https://github.com/iguptashubham/ott-churn-eda-ml

Understanding why customers discontinue their subscriptions is crucial to optimizing the user experience, reducing churn, and maximizing customer lifetime value. This project uses a machine learning model to predict customer churn.

data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python

Last synced: 08 May 2026

https://github.com/amruthadevops/suspicious_web_threat_interactions

To detect and analyze patterns in web interactions for identifying suspicious or potentially harmful activities

cyber-security data-analysis data-science data-visualization jupyter-notebook machine-learning powerbi python

Last synced: 29 Jun 2026

https://github.com/jethronap/jstat-gui

Web-based GUI application for data analysis

data-analysis data-visualization java jstat mongodb

Last synced: 08 May 2026

https://github.com/md-emon-hasan/2-simple-bioinformatics-dna-ml-app

A simple bioinformatics application for DNA sequence analysis using machine learning techniques, implemented in Python.

bioinformatics data-science data-visualization deployment dna-sequences dna-sequencing supervised-machine-learning

Last synced: 09 Jun 2026

https://github.com/trigeminal/hospital-respiratory-forensics

(DS) A comprehensive repository dedicated to the analysis of weekly hospital respiratory data and metrics reported to the Centers for Disease Control and Prevention’s (CDC) National Healthcare Safety Network (NHSN) from August 2020 through October 2024.

data-visualization jupyter-notebooks python3

Last synced: 30 Apr 2026

https://github.com/themuhd/world-cup-analysis

Analysis of the FIFA World Cup from its inception to the recently completed tournament in 2023

data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python

Last synced: 08 May 2026

https://github.com/alejo1630/ibm_capstone_project

This project aims to leverage predictive analytics to forecast the outcomes of rocket launches for Space Y, a new player in the commercial space industry.

data-collection data-science data-visualization data-wrangling exploratory-data-analysis machine-learning predictive-modeling python spacex

Last synced: 01 May 2026

https://github.com/md-emon-hasan/ml-projects-telcom-customer-churn-prediction

📱 Predicts which customers are likely to leave a telecom service, enabling companies to take retention measures and build accurate churn prediction models.

boostrap5 customer-churn customer-segmentation data-engineering data-science data-visualization logestic-regression machine-learning telco-customer-churn-prediction telcom-churn

Last synced: 05 May 2026

https://github.com/zeynepcol/data-analysis-visualization

Data visualization and interactive analytics - Olympics Dataset

data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit

Last synced: 03 May 2026

https://github.com/msohaill/wrappedify

Full stack implementation of Wrappedify

data-visualization express music socket-io spotify sveltekit

Last synced: 05 May 2026

https://github.com/sniperwolf/vis-gem

Wrap vis.js dependencies in Rails project.

charts data-visualization javascript network rails ruby timeline visualization

Last synced: 09 May 2026

https://github.com/melogabriel/nubank-expenses-analysis

This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.

data-analysis data-visualization googlesheets lookerstudio pandas python

Last synced: 02 May 2026

https://github.com/flazefy/customanalytic

Created using Next.js, MySQL, and a Laravel API

apexcharts api data-visualization nextjs vercel

Last synced: 09 May 2026

https://github.com/gutyoh/narrative_visualization

Visualizations of the evolution of Netflix's content over the years using the D3 JavaScript library.

d3 data-visualization javascript

Last synced: 30 Apr 2026

https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings

This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.

boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams

Last synced: 01 May 2026

https://github.com/fatihilhan42/covid-19-data-analysis-visualization

This COVID-19 data analysis project is the first project of our data visualization studies. In it, we analyzed country-level data on the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world. The project was carried out in 3 stages; brief details are in the readme file, and we have tried to explain the project step by step below. We wish you healthy days.

covid-19 data-science data-visualization pandas python visualization

Last synced: 09 May 2026

https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis

The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.

codebasics data-analysis data-science data-visualization mathematics python python3 statistics

Last synced: 30 Apr 2026

https://github.com/avijit-jana/redbus-data-scraper-dashboard

A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.

automation dashboard data-analysis data-visualisation data-visualization datadrivendecisions filtering python3 redbus selenium selenium-python streamlit streamlit-application travel web-scraping webscrapping

Last synced: 09 May 2026

https://github.com/romerorodriguezd/housing-market-tracker

Jupyter Notebook to store and evaluate the long-run evolution of house sales, based on the Idealista website.

data-visualization matplotlib pandas python scraping scraping-python sqlite3

Last synced: 06 May 2026

https://github.com/najibninaba/jobs-sg

Interactive visualization of AI exposure across 562 Singapore occupations, adapted from karpathy/jobs using MOM wage data and SSOC classifications.

ai data-visualization dataviz jobs labour-market labour-market-statistics llm occupational-data singapore treemap

Last synced: 10 Jun 2026

https://github.com/macagua/entrenamiento.data_scientist_python

Repository of manuals and resources from the "Data Scientist en Python" training given by Leonardo J. Caballero G.

data-analytics data-scientist data-visualization numpy pandas-dataframe python37 streamlit

Last synced: 01 May 2026

https://github.com/msikorski93/visualizing-lastest-usgs-earthquakes

This notebook contains an introduction to the use of Python and cartopy to visualize earthquake data. We first read a file with earthquake locations (latitudes and longitudes), magnitudes on the Richter scale, depths, and other descriptors, and then overlay it on a worldwide map.

cartopy data-visualization folium map

Last synced: 09 Jun 2026

https://github.com/darshan1924/stock-price-pridiction

Stock Price Predictor App: A machine learning-powered application built with Streamlit to analyze historical stock data, visualize trends (e.g., moving averages), and predict future stock prices using an LSTM model. It fetches real-time data via Yahoo Finance and provides interactive visualizations for informed decision-making.

data-visualization jupyter-notebook kera lstm-neural-networks machine-learning python3 stock-price-prediction streamlit yahoo-finance-api

Last synced: 09 May 2026

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/dogan-the-analyst/developer_survey_analysis

Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.

data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python

Last synced: 09 May 2026

https://github.com/scarblase/homeless-animals-analysis

A data-driven exploration of homeless animal statistics 🐶🐱. Analyze age distribution, shelter dynamics, and adoption patterns using Python, Pandas, and Seaborn.

animals data-analysis data-mining data-science data-science-projects data-visualization matplotlib matplotlib-pyplot numpy pandas plotly python python3 ukraine

Last synced: 06 May 2026

https://github.com/chauxvive/uschildpoverty

An interactive choropleth map visualizing U.S. state-level child poverty data using D3.js. Compare child poverty rates over time with data from KIDS COUNT and the US Census Bureau.

choropleth-map d3 d3js data-visualization dataviz

Last synced: 09 May 2026

https://github.com/talha-1010/imdb-data-analysis

A data analysis project made with Python using pandas

data-analysis data-visualization jupyter-notebook pandas pandas-dataframe

Last synced: 09 May 2026

https://github.com/muralisuvvani/airbnb-exploratory-analysis

Exploratory Data Analysis (EDA) on the Airbnb NYC dataset to uncover trends, patterns, and insights. Data cleaning and visualization of Airbnb listings in New York City. Analyzing the Airbnb NYC 2019 dataset using Python (Pandas, NumPy, Matplotlib, Seaborn).

analytics data-visualization exploratory-data-analysis numpy pandas python3

Last synced: 04 May 2026

https://github.com/douglasvolcato/brazilian-stock-market-analysis

Brazilian stock analysis focused on dividend yield, diversification, and minimum price variation

data-science data-visualization pandas python scraping scraping-websites

Last synced: 09 May 2026

https://github.com/athari22/house_sales_in_king_count_usa

The idea of the project is to perform data analysis for a Real Estate Investment Trust. The Trust would like to start investing in residential real estate.

analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library

Last synced: 01 May 2026

https://github.com/gerhynes/d3-data-dashboard

A D3 data dashboard showing CO2 emissions per country. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization jaavscript

Last synced: 02 May 2026