An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/pavankethavath/dataspark-illuminating-insights-for-global-electronics

DataSpark is a retail analytics project for Global Electronics leveraging Python, SQL, and Power BI. It uncovers customer insights, sales trends, and store performance to optimize marketing, inventory, and operations. Features include clean datasets, SQL-driven analysis, and interactive dashboards, driving data-driven growth and decision-making.

data-engineering data-visualization dataanalytics powerbi python retail-data sql

Last synced: 27 Apr 2026

https://github.com/lilivalgo/coal-production-colombia

Data analysis covering annual coal production, royalties generated, and climate variables, using descriptive and visual analysis techniques.

analysis data-visualization dataframes insights manipulation matplotlib python seaborn transformation

Last synced: 27 Apr 2026

https://github.com/bhavik444/techistanbul_python_bootcamp

👨‍💻 Master Python programming through practical exercises in this 80-hour bootcamp, designed for beginners to advanced learners.

algorithms api-development automation coding-bootcamp data-analysis data-visualization django flask git machine-learning python software-engineering testing web-development web-scraping

Last synced: 28 Apr 2026

https://github.com/kritimbist/365-days-of-github-challenge-ai-machine-learning

This repository is part of my 365 Days Challenge: AI × Machine learning, where I combine my passion for Machine Learning 🤖 to learn, build, and document projects every single day for one year.

data-science data-visualization deep-learning machine-learning matplotlib numpy python scikit-learn

Last synced: 28 Apr 2026

https://github.com/mulliru/estudo-data-visualization

This repository contains the code I developed and used during Alura's Data Visualization course. Here you will find various scripts and notebooks covering the main concepts and techniques of Data Science, applied in different contexts and projects.

alura data-visualization

Last synced: 08 Jun 2026

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 28 Apr 2026

https://github.com/teohrt/roulette-martingale-visualizer

Visually weigh the risk of the Martingale Roulette strategy

api data-visualization flask gambling matplotlib roulette-wheel-algorithm

Last synced: 28 Apr 2026
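
To make the risk behind the Martingale strategy in the roulette-martingale-visualizer entry above concrete, here is a minimal simulation sketch. It is not taken from that repository: the function name `simulate_martingale`, its parameters, and the choice of an even-money bet on a single-zero wheel (win probability 18/37) are all assumptions made for illustration.

```python
import random


def simulate_martingale(bankroll, base_bet, rounds, win_prob=18 / 37, seed=None):
    """Simulate Martingale betting on an even-money roulette bet.

    Hypothetical helper for illustration only. Returns the bankroll
    history: the starting bankroll followed by one entry per round played.
    """
    rng = random.Random(seed)
    bet = base_bet
    history = [bankroll]
    for _ in range(rounds):
        if bet > bankroll:
            break  # the next doubled bet cannot be covered, so play stops
        if rng.random() < win_prob:
            bankroll += bet
            bet = base_bet  # after a win, go back to the base bet
        else:
            bankroll -= bet
            bet *= 2  # after a loss, double the bet
        history.append(bankroll)
    return history


if __name__ == "__main__":
    # Example run with a fixed seed so the path is reproducible.
    path = simulate_martingale(bankroll=100, base_bet=1, rounds=200, seed=1)
    print(f"rounds played: {len(path) - 1}, final bankroll: {path[-1]}")
```

The returned list could be passed to any plotting library to draw the bankroll over time; a run that ends early is one where a losing streak pushed the required bet above the remaining bankroll.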

https://github.com/dsrodrigovieira/houserocketsales

This repository contains a project developed to practice data analysis skills using Python.

data-analysis data-visualization heroku kaggle-dataset python

Last synced: 29 Apr 2026

https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis

The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.

codebasics data-analysis data-science data-visualization mathematics python python3 statistics

Last synced: 30 Apr 2026

https://github.com/alrza2003/alrza2003.github.io

This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.

data data-analysis data-visualization portfolio portfolio-website python

Last synced: 30 Apr 2026

https://github.com/mkk-1817/hr-attrition

This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.

data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python

Last synced: 03 May 2026

https://github.com/scarblase/salary-comparison

Submission for the DataCamp Salary Competition (1 level). 🏆

data data-analysis data-science data-visualization engineering python sql structured-data

Last synced: 01 May 2026

https://github.com/anarya22/heart-disease-classification

Predicting heart disease using machine learning. This notebook looks into various Python-based ML and data science libraries in an attempt to build a machine learning model capable of predicting whether or not someone has heart disease based on their medical attributes.

data-cleaning data-visualization machine-learning matplotlib numpy pandas scikit-learn

Last synced: 01 May 2026

https://github.com/pradipece/interactive_analytics_data_automation

This project allows users to control complex dataset processing with Python programming by merging Streamlit with Plotly for visualizations.

automation data-science data-visualization pandas plotly streamlit

Last synced: 01 May 2026

https://github.com/riddhis2226/titanic-survival-data-analysis

Titanic-Survival-Data-Analysis: Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.

data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3

Last synced: 01 May 2026

https://github.com/com-480-data-visualization/project-2023-the-vizards

Lausanne Transportation: a data visualization of the Lausanne Transportation network. Developed by the Vizards team as part of the EPFL Data Visualization course project (COM-480).

buses data-analysis data-science data-visualization epfl lausanne map metro public-transport public-transportation switzerland webgl

Last synced: 01 May 2026

https://github.com/macagua/entrenamiento.data_scientist_python

Repository of manuals and resources for the "Data Scientist en Python" training delivered by Leonardo J. Caballero G.

data-analytics data-scientist data-visualization numpy pandas-dataframe python37 streamlit

Last synced: 01 May 2026

https://github.com/archie-cm/credit_risk_model_vix_id-x_partners

The objective of this project is to reduce the company's losses from bad loans by up to 30% by creating a machine learning system to assist in automating loan assessments.

credit-risk data-analysis data-visualization machine-learning scorecard

Last synced: 01 May 2026

https://github.com/bsperezb/python_sql_manage

This project reads an Excel file, saves and deletes database objects, and exports logs and data errors in Excel format.

data-science data-visualization postgresql python sql

Last synced: 02 May 2026

https://github.com/ineelhere/shinydwight

A dashboard depicting the great battle of Dwight Schrute and the Computer | The Office

bslib data-visualization imola r rshiny rshiny-application sass server shiny-apps ui-components

Last synced: 02 May 2026

https://github.com/dangerousfish/uk-climate-trends-dashboard-metoffice

A data pipeline and Streamlit dashboard that aggregates, cleans and visualises historical UK Met Office station data - interactive charts, heatmaps and maps for temperature, rainfall and sunshine.

climate climate-analysis climate-change climate-data climate-science data-analysis data-visualization metoffice metofficeweather streamlit temperature weather

Last synced: 02 May 2026

https://github.com/melogabriel/nubank-expenses-analysis

This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.

data-analysis data-visualization googlesheets lookerstudio pandas python

Last synced: 02 May 2026
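
The consolidation step described in the nubank-expenses-analysis entry above (many monthly statement files merged into a single CSV) can be sketched roughly as follows. This is an illustrative sketch, not that project's code: the entry is tagged with pandas, while this version swaps in the standard-library `csv` module to stay dependency-free. The function name `consolidate_statements`, the file pattern, and the assumption that every monthly file shares the same header row are all hypothetical.

```python
import csv
from pathlib import Path


def consolidate_statements(input_dir, output_file, pattern="*.csv"):
    """Merge every CSV matching `pattern` in `input_dir` into one file.

    Hypothetical helper: assumes all monthly files share the same header.
    Returns the number of data rows written.
    """
    output_path = Path(output_file).resolve()
    # Sort so rows come out in file-name order; never read the output itself.
    paths = [
        p for p in sorted(Path(input_dir).glob(pattern))
        if p.resolve() != output_path
    ]
    rows_written = 0
    with open(output_path, "w", newline="", encoding="utf-8") as out:
        writer = None
        for path in paths:
            with open(path, newline="", encoding="utf-8") as src:
                reader = csv.DictReader(src)
                if reader.fieldnames is None:
                    continue  # skip completely empty files
                if writer is None:
                    # The first non-empty file defines the output header.
                    writer = csv.DictWriter(out, fieldnames=reader.fieldnames)
                    writer.writeheader()
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

A call such as `consolidate_statements("statements", "all_statements.csv")` (both paths are placeholders) would produce one file that a spreadsheet or dashboard tool can then import.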

https://github.com/msikorski93/visualizing-lastest-usgs-earthquakes

This notebook introduces the use of Python and cartopy to visualize earthquake data. It first reads a file with earthquake locations (latitudes and longitudes), magnitudes on the Richter scale, depths, and other descriptors, and then overlays the data on a worldwide map.

cartopy data-visualization folium map

Last synced: 09 Jun 2026

https://github.com/fybex/chatgpt-conversations-analysis

Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.

chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis

Last synced: 02 May 2026

https://github.com/kplanisphere/plotted-3d-environment

Plotted 3D Environment is a graphical project inspired by Minecraft, designed to demonstrate 3D object creation, animation, and interaction using OpenGL. It features first-person navigation, texture mapping, and collision detection within a dynamic 3D environment filled with obstacles and enemies - Final project for the Graphing course.

3d-graphics animation camera-movement collision-detection computer-graphics cpp data-visualization educational-project opengl texture-mapping

Last synced: 03 May 2026

https://github.com/baggiponte/ta-statistics-for-big-data-2022

🎓 Introduction to Python and Machine Learning [UniMi • AY 2021/2022]

clustering data-science data-visualization machine-learning python scikit-learn

Last synced: 03 May 2026

https://github.com/zeynepcol/data-analysis-visualization

Data visualization and interactive analytics - Olympics Dataset

data-analysis data-science data-visualization matplotlib pandas plotly python scipy seaborn streamlit

Last synced: 03 May 2026

https://github.com/musfiquejim/enacp-a-hybrid-machine-learning-framework-for-detecting-anticancer-peptides

EnACP: An Ensemble Learning model used for detecting anticancer peptides.

adamoptimizer ai cnn-classification data-visualization deep-learning densenet inceptionv3 machine-learning matplotlib-pyplot numpy pandas randomforestclassifier randomforestregressor restnet sklearn u-net-pytorch vgg16 vgg19

Last synced: 04 May 2026

https://github.com/angelgardt/wlm-sdarp-old

World of Linear Models: Statistics & Data Analysis in R for Psychologists

data-analysis data-visualization gh-pages manim-animations quarto r rstudio statistics

Last synced: 04 May 2026

https://github.com/rayyan9477/house-price-prediction-model

This project aims to predict house prices using a machine learning model. The project involves data cleaning, feature engineering, model selection, training, and evaluation. The dataset is uploaded by the user, and the model is trained to predict house prices based on various features.

data-science data-visualization gridsearchcv machine-learning machine-learning-algorithms notebook python random-forest

Last synced: 05 May 2026

https://github.com/flazefy2/ds-global_cybersecurity_threats

https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

data-science data-visualization numpy python squarify statistics

Last synced: 05 May 2026

https://github.com/kiranmayi5/python-projects

A collection of Python projects showcasing skills in data analysis and visualization.

data-analysis data-visualization machine-learning nlp python

Last synced: 05 May 2026

https://github.com/md-emon-hasan/ml-projects-telcom-customer-churn-prediction

📱 Identifies customers who are likely to leave a telecom service, enabling companies to take retention measures and build accurate churn prediction models.

boostrap5 customer-churn customer-segmentation data-engineering data-science data-visualization logestic-regression machine-learning telco-customer-churn-prediction telcom-churn

Last synced: 05 May 2026

https://github.com/romerorodriguezd/housing-market-tracker

Jupyter Notebook to store and evaluate long-run evolution of house sales, based on Idealista website.

data-visualization matplotlib pandas python scraping scraping-python sqlite3

Last synced: 06 May 2026

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/scarblase/homeless-animals-analysis

A data-driven exploration of homeless animal statistics 🐶🐱. Analyze age distribution, shelter dynamics, and adoption patterns using Python, Pandas, and Seaborn.

animals data-analysis data-mining data-science data-science-projects data-visualization matplotlib matplotlib-pyplot numpy pandas plotly python python3 ukraine

Last synced: 06 May 2026

https://github.com/namratha2301/best-selling-books

Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance. This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.

data-analysis data-visualization matplotlib pandas sckiit-learn seaborn

Last synced: 06 May 2026

https://github.com/bladealex9848/presion_arterial

Application for tracking blood pressure, allowing systolic and diastolic pressure measurements to be recorded and visualized.

blood-pressure-monitor chronic-disease-management data-visualization e-health e-healthcare healthcare-application medical-data-analysis patient-management python sqlite streamlit

Last synced: 06 May 2026

https://github.com/himanchalchandra/science-canvas

Repo containing projects I did during a four-month bootcamp on Data Science and Machine Learning organized by Science Canvas India.

data-mining data-science data-visualization machine-learning-algorithms mysql nlp-machine-learning

Last synced: 06 May 2026

https://github.com/mxagar/statistics_with_python_coursera

My personal notes taken while following the Coursera Specialization "Statistics with Python", from the University of Michigan, hosted by Dr. Brenda Gunderson.

data-modeling data-science data-visualization hypothesis-testing machine-learning pandas python statistics

Last synced: 06 May 2026

https://github.com/ssreeramj/binod-detector

Scrapes the comments of a YouTube video and shows the distribution of comments containing 'binod'.

data-visualization heroku python streamlit youtube-api

Last synced: 16 May 2026

https://github.com/amirhosseinhonardoust/ai-personal-study-tracker

An AI-driven productivity tracking app built with Python, Streamlit, SQLite, and Machine Learning. It logs and analyzes study sessions, predicts productivity using Random Forest models, and visualizes key insights to help learners improve focus, habits, and overall academic efficiency.

ai data-analytics data-visualization education learning-analytics machine-learning productivity python random-forest self-improvement sqlite streamlit student-success study-tracker time-management

Last synced: 07 May 2026

https://github.com/sivas-2/coffee-sales-visualization

This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.

data data-analysis data-science data-visualization python visualization

Last synced: 07 May 2026

https://github.com/rembertdesigns/smart-vinyl-catalog

AI-powered vinyl cataloging and music discovery platform leveraging BigQuery’s generative AI. Processes mixed-format data to deliver personalized recommendations, collection analytics, and intelligent search. Created for the Kaggle BigQuery AI Challenge to showcase real-world, scalable AI solutions for music lovers.

ai bigquery data-science data-visualization generative-ai hackathon kaggle kaggle-competition machine-learning music-analytics music-recommendation-algorithm python recommender-system vinyl

Last synced: 07 May 2026

https://github.com/backdoorali/insider-threat-detection-project

Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.

cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence

Last synced: 07 May 2026

https://github.com/vishvamporwal/pharmassist

A progressive web app made with Flask for Industrial pharmaceutical management and Analysis, to improve efficiency and make management easier.

data-visualization flask html-css-javascript python

Last synced: 07 May 2026

https://github.com/divyanshu-rawat/data-visualization-highmaps

Built using the Highcharts JavaScript API to visualize data! 🎓

bootstrap data-visualization geolocation-api highcharts javascript jquery

Last synced: 07 May 2026

https://github.com/oldhero5/talent_track

TalentTrack is an open‐source recruitment analytics web application built with Flask and Python. It leverages advanced machine learning techniques—such as Product Quantization (PQ) for candidate ranking and SHAP for model interpretability—to help HR teams and recruitment professionals identify high-quality candidates efficiently.

active-learning analytics candidate-ranking data-visualization faiss flask hrtech machine-learning open-source python recruitment shap talent-analytics

Last synced: 07 May 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/darkmenacer/google-geolocation-bias

Tool to scrape web links for different queries from different cities across the world, all from your home

data-visualization postgresql python3 selenium webscraping

Last synced: 08 May 2026

https://github.com/hyoaru/philippine-poverty-area-estimates-choropleth

A web application providing a visual representation using a choropleth map of the estimated magnitude of poor families in the Philippines from the years 2006, 2009, 2012, and 2015.

data-visualization plotly python streamlit web-application

Last synced: 08 May 2026

https://github.com/themuhd/world-cup-analysis

Analysis of the FIFA World Cup from its inception to the recently completed tournament in 2023

data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python

Last synced: 08 May 2026

https://github.com/elaaatif/data-visualisation

This project aims to visualize the popularity of programming languages on GitHub from 2011 to 2021. We use data obtained from BigQuery's public `github_repos` and `githubarchive` datasets, focusing on public repositories, pull requests (PRs), and issues.

css d3-visualization d3js data-visualization

Last synced: 08 May 2026

https://github.com/sniperwolf/vis-gem

Wraps vis.js dependencies for use in a Rails project.

charts data-visualization javascript network rails ruby timeline visualization

Last synced: 09 May 2026

https://github.com/flazefy/customanalytic

Created using Next.js, MySQL, and a Laravel API.

apexcharts api data-visualization nextjs vercel

Last synced: 09 May 2026

https://github.com/fatihilhan42/covid-19-data-analysis-visualization

The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.

covid-19 data-science data-visualization pandas python visualization

Last synced: 09 May 2026

https://github.com/najibninaba/jobs-sg

Interactive visualization of AI exposure across 562 Singapore occupations, adapted from karpathy/jobs using MOM wage data and SSOC classifications.

ai data-visualization dataviz jobs labour-market labour-market-statistics llm occupational-data singapore treemap

Last synced: 10 Jun 2026

https://github.com/chauxvive/uschildpoverty

An interactive choropleth map visualizing U.S. state-level child poverty data using D3.js. Compare child poverty rates over time with data from KIDS COUNT and the US Census Bureau.

choropleth-map d3 d3js data-visualization dataviz

Last synced: 09 May 2026

https://github.com/talha-1010/imdb-data-analysis

A data analysis project made with Python using pandas

data-analysis data-visualization jupyter-notebook pandas pandas-dataframe

Last synced: 09 May 2026

https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk

The aim of this analysis is to understand the smoking patterns among adults in the UK.

data data-analysis data-visualization python3

Last synced: 09 May 2026

https://github.com/barrarrr/fly-in

A dynamic, terminal-based drone network simulation application.

42 42school a-star algorithms breadth-first-search data-visualization drone fly-in

Last synced: 10 Jun 2026

https://github.com/dimitryzub/walmart-stores-coffee-analysis

Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡

analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn

Last synced: 10 May 2026

https://github.com/gabrielmpinho/cs50-sql

Solutions and notes from CS50’s Introduction to Databases with SQL. Covers CRUD operations, data modeling, normalization, joins, views, indexes, and connecting SQL with Python and Java. Begins with SQLite for portability and introduces PostgreSQL and MySQL for scalability.

data-analysis data-structures data-visualization database databases javascript python sql

Last synced: 10 May 2026

https://github.com/vikhram-s/indian-budget-2025-dashboard

Interactive dashboard visualizing the Indian Government's Annual Budget 2025 using Next.js, Chart.js, and responsive design. Built with a focus on performance, accessibility, and clean UI/UX for data-driven insights.

budget data-visualization india nextjs open-data tailwindcss webdevelopment

Last synced: 10 May 2026

https://github.com/neelanjan-chakraborty/custoclarity

CUSTO CLARITY is a customer segmentation model built in Python. Using clustering on real retail datasets, it identifies 5 customer segments that unlocked strategic retail partnerships. Powered by scikit-learn, pandas, seaborn, and Matplotlib.

clustering-algorithm clustering-algorithms customer-analytics customer-segmentation data-visualization kmeans kmeans-clustering pandas python scikit-learn

Last synced: 11 May 2026

https://github.com/mtrebi/d3_cars

Cars Dataset Visualization (PCP) using d3.js

d3js data-visualization javascript

Last synced: 12 May 2026

https://github.com/is-leeroy-jenkins/sherpa

A budget execution and data analysis tool for EPA analysts, built on WinForms and .NET 6 and written in C#

budget-management data-analysis data-science data-visualization federal-government

Last synced: 13 May 2026

https://github.com/mpolinowski/victory-data-chart

React.js components for modular charting and data visualization

chart css-grid-layout data-visualization react styled-components victory

Last synced: 13 May 2026

https://github.com/nick-peter-marcus/marketing-data-analysis

Analyzing Marketing Analytics Data on Purchase Behavior and Campaign Responses - Customer Segmentation, Data Visualization, Regression Analysis, Random Forest

data-visualization k-means-clustering linear-regression logistic-regression pca random-forest segmentation

Last synced: 09 Sep 2025

https://github.com/shuklayash02/complete_data_analysis_project

A full data analysis project in which sales data is taken through the ask, prepare, process, analyze, share, and act phases of the data analysis process.

data data-visualization dataanalysis database datacleaning powerbi sql

Last synced: 16 Jul 2025

https://github.com/ahmednurabdii/data-analytics-portfolio-superstore

My first portfolio project showcasing data cleaning, analysis, and visualization of Superstore sales data.

data-analysis data-visualization jupyter-notebook matplotlib numpy pandas portfolio-project python sales-analysis scipy seaborn superstore-dataset

Last synced: 07 Apr 2026

https://github.com/iamashwin99/jj-ic-finder

A website to quickly plot PPMS data and find the critical current (Ic) of an IV curve. Deployed at https://share.streamlit.io/iamashwin99/jj-ic-finder/main/app.py and https://jj-ic-finder.herokuapp.com/

data-visualization josephson-junctions streamlit-webapp superconductors

Last synced: 15 Jan 2026