An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/johannaschmidle/bookauthors

Explored a book sales database. Cleaned data using Excel and created an interactive dashboard to analyze author popularity, ratings, and sales trends. The project highlighted key insights such as sales performance and rating distributions [Excel]

author-sales book-sales books data-analysis data-visualization excel

Last synced: 04 Feb 2026

https://github.com/berkeley-gif/caladapt-website-2021

Redesign and rewrite of the website for Cal-Adapt.org

cal-adapt california climate-change climate-models data-visualization svelte

Last synced: 26 Jan 2026

https://github.com/misaghmomenib/shop-revenue-analysis

A Data Analysis Project Aimed at Analyzing and Forecasting Shop Revenue Based on Sales and Other Business Metrics. It Helps to Identify Trends, Patterns, and Key Factors Influencing Revenue to Make Data-driven Decisions for Business Growth.

data-analysis data-visualization python

Last synced: 24 Mar 2025

https://github.com/prekshivyas/datastreamingetl

Utilizing my background and love for Apache Airflow and Data to build a real-time data streaming pipeline

apache-airflow apache-kafka apache-spark apache-zookeeper cassandra data-engineering data-ingestion data-pipeline data-processing data-visualization docker docker-compose

Last synced: 20 Jan 2026

https://github.com/analysisbyvivek/road-accident

Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.

data-analysis data-visualization eda jupyter-notebook kaggle tableau-public

Last synced: 19 Jun 2026

https://github.com/samaalharbi2/project-data-science-blog-post

A data science project from Udacity’s Nanodegree — exploring what drives developer success

crisp-dm data-analysis data-science data-visualization nanodegree udacity

Last synced: 26 Jan 2026

https://github.com/felinjob/ibm-applied-data-science-capstone

Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/petitatelier/data-sets

A collection of data sets, to play with in visualization experiments

data-visualization dataset

Last synced: 02 Jul 2025

https://github.com/abhay-sinha-0/carpricepredictionproject

A machine learning project that predicts the selling price of a car based on its features such as year, mileage, fuel type, transmission, and more. This model can assist individuals and dealerships in estimating fair market prices for used cars.

artificial-intelligence data-analysis data-science data-visualization exploratory-data-analysis machine-learning-algorithms matplotlib-pyplot mysql-database numpy-library pandas-library python skit-learn sklearn-library

Last synced: 15 May 2025

https://github.com/mr-chang95/webpage_abtest_analysis_udacity

A/B Testing Project for Udacity's Data Analyst Nanodegree Program. Using Python in Jupyter Notebook.

abtesting data-science data-visualization matplotlib pandas python webpage

Last synced: 11 Apr 2026

https://github.com/samanhur/data_visualization_pcc

First experiences in data visualization with python

data-analysis data-science data-visualization python3

Last synced: 23 Mar 2025

https://github.com/stopyransky/wdvp

World Government Data Visualisation Prize - submitted work

d3 d3js data-visualization dataviz react svg

Last synced: 18 May 2026

https://github.com/petzi53/learning-plotly

Personal notes and trials during reading "Interactive web-based data visualization with R, plotly, and shiny" by Carson Sievert

data-visualization plotly visualization

Last synced: 16 Mar 2025

https://github.com/hannahgsimon/halmodeling2024graphs

Created code to develop and analyze statistical graphs for the spatial radiotherapy model, which can be found at https://github.com/hannahgsimon/HALModeling2024. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/bala-1409/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint power-bi powerbi powerbi-reports powerbi-visuals visualization

Last synced: 04 Jan 2026

https://github.com/katrinleinweber/leaving-the-bar

A less-code variant of Joachim Goedhart's "Leaving the bar in five steps"

barchart boxplot boxplots data-visualisation data-visualization ggplot

Last synced: 20 Aug 2025

https://github.com/ved-coder-king/wheat_ai_project

This project, Smart Wheat Farming AI System, was developed as part of the coursework for the Artificial Intelligence program at Esprit School of Engineering.

agriculture data-analysis data-visualization deep-learning image-classification machine-learning object-detection python wheat

Last synced: 15 Apr 2025

https://github.com/gaurav0502/router-traffic-analysis

Exploratory Analysis of the different kinds of traffics being experienced by a router.

data-analytics data-visualization network-analysis python

Last synced: 06 Apr 2025

https://github.com/josephbarbierdarnal/matoolkit

matoolkit is a python package containing a toolbox for creating visually appealing graphs/annotations in matplotlib

data-analysis data-visualization matplotlib

Last synced: 31 Mar 2025

https://github.com/jaewonson37/data_visualization2

Topic : Revealing and analyzing the distribution, frequency, and impact of significant earthquakes that happened across various regions and periods of time with several visualization techniques.

bar-plot binning-plot data-visualization ggplot2 mosaic-plots scatter-plot world-map

Last synced: 11 Jun 2025

https://github.com/mumtaz4118/nlp-course

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning

Last synced: 24 Nov 2025

https://github.com/mindlessmuse666/train-test-splitter

Анализ данных о пассажирах Титаника и разбиение на обучающую и тестовую выборки. Практическое задание по дисциплине "Основы применения методов искусственного интеллекта в программировании".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 12 Apr 2026

https://github.com/johannaschmidle/road-collisions-project

Analyzed road accident data in the UK from 2019 to 2022 to identify patterns and trends in road accidents, for Effective Road Management [Excel]

data-analysis data-visualization excel pivot-tables traffic-analysis

Last synced: 01 Mar 2026

https://github.com/hossamAhmedSalah/Computer-Vision-

contains my training projects in this field

computer-vision data-visualization detection edge opencv

Last synced: 10 Mar 2025

https://github.com/rauhanahmed/auto-data-analyzer

AutoDataAnalyzer: Automate data ingestion, analysis, and visualization with AI/ML-powered pipelines. Features natural language query processing, interactive Plotly visualizations, and seamless deployment via Docker.

ai-powered-analysis automated-pipeline cicd data-analysis data-visualization docker end-to-end-project flask generative-ai langchain llama3-1 machine-learning natural-language-processing plotly python3 pywebio

Last synced: 12 Apr 2026

https://github.com/arthurdanjou/studies

💼 This is the repository containing all my projects done during my studies in Python and R.

ai data data-science data-visualization jupyter jupyter-notebook ml python r

Last synced: 08 Apr 2025

https://github.com/yuchenq/comp90055-project

This is the lastest version of my project belong to Comp90055.

couchdb crawler data-visualization python3 textblob tweepy

Last synced: 16 Jul 2025

https://github.com/prgermux/image-scrapper

"Image Scrapper" is a Python application that recursively scrapes images from directories and displays them on an interactive, zoomable, and scrollable canvas. Ideal for organizing and navigating large image datasets.

data-visualization desktop-application file-explorer graphics-view gui-tool image-organization image-processing image-scraper image-viewer interactive-visualization pyqt5 python recursive-directory zoom-and-pan

Last synced: 24 Mar 2025

https://github.com/naomiwolfe/golden-isles-dashboard2

Interactive tourism analytics dashboard for Georgia's Golden Isles

analytics chartjs dashboard data-visualization georgia golden-isles tailwindcss tourism

Last synced: 05 Oct 2025

https://github.com/aphp/jupyter-eds-notebooks

jupyter-eds-notebooks provides Docker images with preconfigured Jupyter environments for clinical and health data analysis, tailored for AP‑HP Datalabs and the HELIX platform.

data-analysis data-science data-visualization healthcare lab

Last synced: 13 Jan 2026

https://github.com/urvee1810/eda-time-series

A comprehensive time series analysis of French retail quarterly sales data from 2012 to 2017. The project focuses on analyzing sales patterns, seasonal decomposition, and trend analysis using various statistical techniques and visualizations.

arima-modeling data-visualization exploratory-data-analysis matplotlib numpy pandas pmdarima python scikit-learn seaborn statsmodels time-series-analysis trend-analysis

Last synced: 12 Apr 2026

https://github.com/architj6/cancerguardian

CancerGuardian is a machine learning-powered web app that helps predict breast cancer diagnoses based on cytology measurements. 🩺✨ Built with Streamlit, Scikit-Learn, and Plotly, this tool visualizes tumor characteristics and provides predictions using a trained model. 🚀

binary-classification breast-cancer-prediction classification-models data-science data-visualization deep-learning healthcare healthcare-ai machine-learning medical-ai medical-diagnostics predictive-analytics python streamlit supervised-learning

Last synced: 01 May 2026

https://github.com/apsinghanalytics/bearmarketfollowthroughs

Python Script for Calculating and Visualization Follow Through Rallies during Bear Markets

bear-market data-visualization powerbi recession stock-market time-series

Last synced: 12 Jun 2026

https://github.com/tashi-2004/apache-hadoop-spark-hive-cyberanalytics

This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.

ai apache-hadoop apache-hive big-data-analytics big-data-processing data-analysis data-engineering data-science data-security data-visualization hdfs machine-learning network-analysis network-security pyspark python3 threat-detection unsw-nb15-dataset

Last synced: 02 May 2026

https://github.com/usk2003/vnrvjiet-lab-work

This repository contains my lab work for the B.Tech CSE-AIML program (2022-2026) under the R22 regulation at VNR Vignana Jyothi Institute of Engineering and Technology. It includes various subjects like Machine Learning, OS, Data Structures, C Programming, and more, showcasing my practical learning and implementations.

c-programming compiler-design computer-networks data-engineering data-structures data-visualization dbms engineering-drawing java machine-learning operating-system python software-engineering

Last synced: 11 Apr 2026

https://github.com/datastalker/survival-cox

This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.

breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis

Last synced: 02 Apr 2025

https://github.com/hari7261/data-visualization

Python-based application built using CustomTkinter for the graphical user interface (GUI) and Matplotlib for data visualization. It allows users to import datasets, perform real-time data visualization, and analyze data using various chart types and machine learning techniques.

data-analysis data-visualization export hari7261 import python realtime-visualization

Last synced: 17 Jun 2025

https://github.com/Sparsh7082/Data-Analysis-Portfolio

This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.

canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau

Last synced: 10 Mar 2025

https://github.com/m4tice/qm_project

Bicycle project crowd evaluation.

data-analysis data-engineering data-visualization

Last synced: 16 Mar 2025

https://github.com/vbhatsaccnt/softdrinktrendsanalysis

A Tableau dashboard project providing comprehensive insights into soft drink sales trends, allowing for detailed analysis and informed decision-making within the beverage industry.

dashboard data-visualization food-products marketing tableau trend-analysis

Last synced: 01 Mar 2026

https://github.com/ashwin331133/hospital_allpatients_waitinglist_data

This Power BI project analyzes patient waiting lists across various medical specialties and case types (Day Case, Inpatient, Outpatient). The goal is to gain insights to improve healthcare management and resource allocation.

data-analysis data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/living-with-machines/machines-interactive

This is the “machines interactive” for the Living with Machines exhibit at Leeds City Museum 2022–23.

data-visualization history-of-technology industrial-revolution machines museum museum-experience museum-installation

Last synced: 20 Jan 2026

https://github.com/natanast/30daychartchallenge

This repository contains my contributions to the #30DayChartChallenge

30daychartchallenge data-science data-visualization data-viz

Last synced: 24 Mar 2025

https://github.com/erabossid/d3js-heatmap

Global Heatmap presentation with D3js and Reactjs

d3js data-science data-visualization heatmap javascript reactjs

Last synced: 10 Mar 2025

https://github.com/githubsolver123/bus-tracker

Real-time bus tracking simulation built with R Shiny and Google Maps API. Visualizes bus movement along Broadway in NYC with 2-second position updates.

data-visualization geospatial gis google-maps-api r r-shiny real-time shiny simulation transportation web-application

Last synced: 01 Apr 2025

https://github.com/saiteja-talluri/data-analytics-assignement

Report on World Happiness Data (Data Analysis and Visualisation of the data)

data-analysis data-visualization ipynb-jupyter-notebook

Last synced: 20 Jan 2026

https://github.com/h-sutiwas/r2de-2025

This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.

data data-engineering data-visualization docker gcp pipeline spark

Last synced: 30 Apr 2026

https://github.com/quantumudit/test-store-data-analysis

This repository showcases a web scraper with a pipeline structure for efficient data extraction and transformation from websites. The tool can be tailored to leverage its capabilities for insightful data analysis, providing valuable insights and informed decision-making.

data data-visualization dataanalytics python python-webscraping webscraper webscraping-data

Last synced: 11 Apr 2026

https://github.com/christs8920/process-mining-py

A process mining project that analyzes an event log and discovers its process model.

data-science data-visualization datavisualization pm4py process-mining processmining python

Last synced: 26 May 2026

https://github.com/apsinghanalytics/hranalytics_myersbriggspersonalityinsights

A Excel analytics study exploring the correlation between personality traits and key HR-relevant parameters, including tenure and performance

data-analysis data-visualization excel pivot-tables

Last synced: 30 Jan 2026

https://github.com/manikantasanjay/-spatial-analysis-of-temperature-trends

"Global Climate Insights" offers a deep dive into global temperature trends, using spatial analysis and interactive visualizations to highlight the impacts of climate change.

data-visualization folium geospatial matplotlib plotly spatial-analysis temperature-trends

Last synced: 04 Jul 2025

https://github.com/jabonsote/financial-anomaly-detection-with-deepseek-and-isolation-forest

🚀 Financial Anomaly Detection with DeepSeek and Isolation Forest – A powerful, locally-run tool for detecting financial anomalies using Isolation Forest and DeepSeek LLM. Features AI-powered insights, interactive time-series visualization, and automated PDF audit reports. 🔍📊

anomaly-detection chatbot data-visualization deepseek financial-analysis financial-data isolation-forest llm machienlearning ollama report-generator streamlit

Last synced: 12 Apr 2026

https://github.com/weybsonalves/prevendo-o-atrito-de-clientes

Projeto em que percorro as etapas que compõem o ciclo de vida da ciência de dados a fim de prever o atrito de clientes do serviço de cartões de crédito de um banco.

data-analysis data-science data-visualization machine-learning python

Last synced: 06 May 2026

https://github.com/pabvald/terrorism-eu

A linear graphic implemented using d3.js which shows the evolution of terrorism in the European Union from 1970 to 2016

d3v4 data-visualization graphics

Last synced: 11 Jun 2025

https://github.com/leonel-m/python_project_spring_2025

Interactive sales dashboard built with Dash and Plotly for data exploration and visualization.

css dash data-visualization interactive-dashboards pandas plotly python sales-dashboard

Last synced: 12 Apr 2026

https://github.com/ekenes/elections-timeline

Data visualization showing the results of the previous 5 U.S. presidential elections in a single map.

arcgis-js-api data-visualization elections gis mapping

Last synced: 24 Mar 2025

https://github.com/albertofaraujo/excel_dashboard_prev_fraudes

O objetivo da análise é extrair informações de performances individuais dos colabores de uma empresa fictícia para tomadas de decisão. (Dashbord em Excel)

analise-de-dados dashboard data-visualization excel

Last synced: 06 Jan 2026

https://github.com/leandrocollares/infant-mortality-in-africa

An interactive choropleth map that shows infant mortality rates in Africa between 1960 and 2018

d3 data-visualization react

Last synced: 07 May 2026

https://github.com/msikorski93/meteorite-landings

Basic data analysis focused mainly on visualizing geospatial data worldwide with cartopy.

cartopy data-visualization geopandas gis mapping meteorite-landing-sites shapefile

Last synced: 16 May 2026

https://github.com/urvee1810/market_basket_analysis

A data mining project analyzing Instacart's 3 million grocery orders to uncover customer shopping patterns and product associations. Using market basket analysis and the Apriori algorithm, the project reveals key insights about shopping behavior, product combinations, and temporal patterns, providing valuable recommendations for retail strategy

apriori-algorithm data-mining data-visualization machine-learning market-basket-analysis matplotlib mlxtend numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/akhdandann/squadevaluationdashboard-powerbi

A Power BI dashboard that visualizes squad evaluation metrics including happiness, contribution, commitment, delivery, and agile behavior across tribes at PT. XL Axiata Tbk. (with dummy data)

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/davidchocholaty/bithack_hackathon_2024

This repository contains my personal code tasks for the BIT_Hack hackathon, created in 2024.

data-mining data-science data-visualization exploratory-data-analysis hackaton hackaton-project machine-learning

Last synced: 06 May 2026

https://github.com/magnusrodseth/celeritas

A website for categorizing and visualizing data structures and algorithms.

algorithms data-structures data-visualization nextjs react tailwindcss typescript

Last synced: 12 Apr 2026

https://github.com/ashwin331133/gorkha_earthquake_damage_prediction

The main objective is to predict the level of damage to buildings caused by the 2015 Gorkha earthquake in Nepal.

data-analysis data-visualization machine-learning python

Last synced: 29 Apr 2026

https://github.com/shubham200137/cyclistic-case-study

This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.

data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau

Last synced: 11 Jun 2026

https://github.com/naveen88112/healthcare

HealthCare Data Analysis and Forecasting This project examines healthcare data by processing missing values with KNN imputation, preprocessing features, and training classification models (Logistic Regression and Random Forest). The output includes performance metrics such as accuracy, confusion matrix, precision, recall, and ROC analysis.

data-visualization feature-engineering machine-learning model-evaluation numpy pandas python scikitlearn-machine-learning

Last synced: 12 Apr 2026

https://github.com/shellynagar27/transportation-and-logistics-challenge

Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.

cleaning-data critical-thinking data-analysis data-visualization exploratory-data-analysis feature-engineering powerbi preprocessing-data problem-solving python

Last synced: 16 May 2026

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/crazy-dot/bank-laon-case-study

Used Exploratory Data Analysis (EDA) to analyse patterns in the bank dataset. The main was to analyse the potential defaulters list and identify the cause for payment default. Tried to understand one of the risk assessment used by the banks and have duplicated the same for this project.

advanced-excel bank-loan-analysis data-analytics data-visualization exploratory-data-analysis statistical-analysis

Last synced: 06 Jan 2026

https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics

Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.

beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping

Last synced: 25 Feb 2025

https://github.com/acdh-oeaw/visartist

Visual Artwork Analysis and Collection Tool

color-clustering color-space data-visualization visual-analysis

Last synced: 13 Jul 2025