An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/jabonsote/financial-anomaly-detection-with-deepseek-and-isolation-forest

🚀 Financial Anomaly Detection with DeepSeek and Isolation Forest – A powerful, locally-run tool for detecting financial anomalies using Isolation Forest and DeepSeek LLM. Features AI-powered insights, interactive time-series visualization, and automated PDF audit reports. 🔍📊

anomaly-detection chatbot data-visualization deepseek financial-analysis financial-data isolation-forest llm machienlearning ollama report-generator streamlit

Last synced: 12 Apr 2026

https://github.com/manikantasanjay/-spatial-analysis-of-temperature-trends

"Global Climate Insights" offers a deep dive into global temperature trends, using spatial analysis and interactive visualizations to highlight the impacts of climate change.

data-visualization folium geospatial matplotlib plotly spatial-analysis temperature-trends

Last synced: 04 Jul 2025

https://github.com/jigyasag18/power-bi-dashboard-project

The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.

dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization

Last synced: 04 Mar 2026

https://github.com/bhushan148/finance-domain-bank-loan-report-tableau

I analyzed 🏦 bank loan data to reveal trends, KPIs, and insights. Using Tableau 📈 for dashboards and SQL 🗃️ for data extraction, I visualized loan applications, borrower profiles, and repayment behaviors 💡.

bussiness-intelligence dashboard-design data-analysis data-visualization excel figma sql sqlqueries tableau

Last synced: 08 Apr 2025

https://github.com/nrobledosagredo/covid19-dashboard

Dashboard to visualize and compare COVID-19 cases across different countries over time.

data-visualization

Last synced: 11 Sep 2025

https://github.com/johannaschmidle/netflix-subscription-analysis

Examined Netflix subscription data to understand market behaviour, predict future trends, and identify consumer preferences. [SQL, Tableau]

data-analysis data-cleaning data-trend data-visualization netflix

Last synced: 05 Mar 2026

https://github.com/vaxdata22/salifort-motors-and-waze-churn

Employee retention predictive model development for Salifort Motors and Waze. This is a terminal project I did to earn the Google Advanced Data Analytics Professional Certificate.

data-analytics data-visualization model-development predictive-analytics python statistical-analysis

Last synced: 16 Apr 2026

https://github.com/nafisarkar/co-po_attainment_system

CO-PO Calculation for Outcome Based Education (OBE)

data-processing data-visualization python tkinter-gui

Last synced: 05 Mar 2026

https://github.com/saiteja-talluri/data-analytics-assignement

Report on World Happiness Data (Data Analysis and Visualisation of the data)

data-analysis data-visualization ipynb-jupyter-notebook

Last synced: 20 Jan 2026

https://github.com/erabossid/d3js-heatmap

Global Heatmap presentation with D3js and Reactjs

d3js data-science data-visualization heatmap javascript reactjs

Last synced: 10 Mar 2025

https://github.com/jigyasag18/amazon-prime-power-bi-dashboard

The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.

dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 05 Mar 2026

https://github.com/vasugi2003/customer_churn_analysis_using_tableau

Customer Churn Analysis - To identify various reasons for customer to discontinue a company services.

business-analytics business-intelligence charts csv data-science data-visualization dataanalytics predictive-modeling preprocessing tableau

Last synced: 05 Mar 2026

https://github.com/shashwat9kumar/us-accidents-data-analysis

Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle

accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python

Last synced: 17 Apr 2026

https://github.com/ashwin331133/hospital_allpatients_waitinglist_data

This Power BI project analyzes patient waiting lists across various medical specialties and case types (Day Case, Inpatient, Outpatient). The goal is to gain insights to improve healthcare management and resource allocation.

data-analysis data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/m4tice/qm_project

Bicycle project crowd evaluation.

data-analysis data-engineering data-visualization

Last synced: 16 Mar 2025

https://github.com/nathadriele/ifood-data-governance-pipeline

Este projeto demonstra uma solução completa de Data Governance com foco em qualidade, rastreabilidade, segurança e conformidade com LGPD. Utiliza tecnologias modernas como Streamlit, Airflow, dbt e Pydantic para implementar um ecossistema funcional e interativo com dashboard de governança de dados.

airflow dashboard data-analysis data-catalog data-engineering data-governance data-quality data-visualization dbt ifood lgpd matplotlib numpy observability-data pandas pipeline pyspark redis seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/ruajean/netflixmoviescraper

🎬 A powerful tool for gathering movie data and user reviews from FilmAffinity's Netflix category. This script scrapes movie details and iterates through user reviews, saving structured information to a CSV file for analysis. Ideal for insights into user sentiments and movie popularity on FilmAffinity.

data-analysis data-visualization dataset jupyter-notebook python scraping

Last synced: 17 Apr 2026

https://github.com/theo-liang/python-project-analysis-for-instacart

This project involved analyzing Instacart's sales data to understand customer purchasing behaviors and optimize marketing strategies.

aggregation data-visualization datatypes deriving-new-variables merging-data pandas-dataframe python subsetting wrangling-data

Last synced: 17 Apr 2026

https://github.com/sanand0/datastories

Small data visualizations and stories, mostly vibe-coded

data-visualization

Last synced: 03 Apr 2026

https://github.com/Sparsh7082/Data-Analysis-Portfolio

This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.

canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau

Last synced: 10 Mar 2025

https://github.com/humayun-raza-030/restaurant-recommendation-system

This project is a Restaurant Recommendation System that helps users find restaurants in Lahore based on their location, customer reviews, and ratings. The system scrapes restaurant data from Google Maps, analyzes user reviews for sentiment, and provides a visualization dashboard using Tableau.

data-analysis data-science data-visualization python

Last synced: 17 Apr 2026

https://github.com/ecastillo081/marketing-finance-dashboard

An analytics project that combines Google Ads + Meta Ads data and translates it into finance metrics.

business-intelligence cac cohort-analysis data-pipeline data-visualization ltv marketing-analytics mode-analytics postgres roas sql strategic-finance supabase

Last synced: 17 Apr 2026

https://github.com/alex-martineau/realisation_dashboard_veille_technique

Dashboard Streamlit de scoring crédit explicable + veille NLP comparative BERT vs MiniLM pour la classification de produits e-commerce.

api bert credit-scoring dashboard data-visualization ecommerce explainability flask heroku machine-learning minilm nlp sentence-transformers shap streamlit

Last synced: 17 Apr 2026

https://github.com/walkerdustin/ml-notebook-template

This is a template to Kickstart your ML project Let this be a starting point for your next Data analysis project

data-science data-visualization machine-learning machine-learning-algorithms ml notebook python

Last synced: 09 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/noluyorabi/bravais-pearson-interactive

Bravais-Pearson Interactive – A web app for visualizing Pearson correlation. Generate datasets, analyze correlations, and explore regression lines, r², and p-values interactively.

data-visualization interactive-analysis matplotlib numpy pearson-correlation scipy seaborn statistics streamlit

Last synced: 17 Apr 2026

https://github.com/datastalker/survival-cox

This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.

breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis

Last synced: 02 Apr 2025

https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn

This is the last project in the nanodegree udacity program. it's about data visualization.

data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree

Last synced: 09 May 2026

https://github.com/architj6/cancerguardian

CancerGuardian is a machine learning-powered web app that helps predict breast cancer diagnoses based on cytology measurements. 🩺✨ Built with Streamlit, Scikit-Learn, and Plotly, this tool visualizes tumor characteristics and provides predictions using a trained model. 🚀

binary-classification breast-cancer-prediction classification-models data-science data-visualization deep-learning healthcare healthcare-ai machine-learning medical-ai medical-diagnostics predictive-analytics python streamlit supervised-learning

Last synced: 01 May 2026

https://github.com/chaaalistaa/thelookecommerce---project

Analysis "TheLook" eCommerce with highlight goals such as identifying sales trends, understanding customer behaviors, enhancing customer retention, and driving repeat purchases.

big-data-analytics bigquery data-analytics data-visualization looker-studio sql

Last synced: 17 Apr 2026

https://github.com/mattbixley/tidy_tuesday

A home for some #tidytuesday code, plots table and general upskilling.

data-science data-visualization ggplot r4ds tidytuesday tidyverse

Last synced: 15 Feb 2026

https://github.com/amirdora/covid19_lockdown_policies_germany

Python visualisation - Covid19 lockdown policy effects and new cases in germany. Using "Oxford policy tracker" and "Coronavirus Source Data - Our World in Data" data.

data-science data-visualization

Last synced: 13 Mar 2025

https://github.com/arthurdanjou/studies

💼 This is the repository containing all my projects done during my studies in Python and R.

ai data data-science data-visualization jupyter jupyter-notebook ml python r

Last synced: 08 Apr 2025

https://github.com/zxjahid/matplotlib

A comprehensive guide to mastering data visualization with Matplotlib through hands-on examples and advanced techniques. 🚀📊

candlestick candlestick-chart cheatsheet data-analysis data-visualization gtk jupyter-notebook maps matplotlib-python pandas thesis-template tk tutorial wx

Last synced: 09 May 2026

https://github.com/subho004/housepricepredictor

Bangalore House Price Predictor: A web app using Flask and scikit-learn to predict house prices in Bangalore based on location, area, bedrooms, and bathrooms.

bootstrap css data-science data-visualization flask github html lasso-regression linear-regression machine-learning model-deployment pickle predictive-modeling real-estate-housing-prices regression-analysis ridge-regression web-development

Last synced: 04 Apr 2026

https://github.com/saagpatel/sovereign

Browser-based geopolitical simulator — apply policy levers to 18 countries and watch cascading effects over 60 months

d3 data-visualization geopolitics monte-carlo nextjs simulation typescript web-worker

Last synced: 28 Jun 2026

https://github.com/sanam2405/ahs

This contains the analysis of result of AHS Madhyamik Examination 2022

data-analysis data-visualization jupyter-notebook python

Last synced: 18 Apr 2026

https://github.com/rauhanahmed/auto-data-analyzer

AutoDataAnalyzer: Automate data ingestion, analysis, and visualization with AI/ML-powered pipelines. Features natural language query processing, interactive Plotly visualizations, and seamless deployment via Docker.

ai-powered-analysis automated-pipeline cicd data-analysis data-visualization docker end-to-end-project flask generative-ai langchain llama3-1 machine-learning natural-language-processing plotly python3 pywebio

Last synced: 12 Apr 2026

https://github.com/hossamAhmedSalah/Computer-Vision-

contains my training projects in this field

computer-vision data-visualization detection edge opencv

Last synced: 10 Mar 2025

https://github.com/gattsu001/telecom-churn-predictor

Predicts which telecom customers are likely to churn with 95% accuracy using engineered features from usage, billing, and support data. Implements Sturges-based binning, one-hot encoding, stratified 80/20 train-test split, and a two-level ensemble pipeline with soft voting. Achieves 94.60% accuracy, 0.8968 AUC, 0.8675 precision, 0.7423 recall.

churn-prediction classification classification-algorithm customer-retention data-science data-visualization feature-engineering joblib jupyter-notebook machine-learning pandas scikit-learn supervised-learning svm

Last synced: 18 Apr 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/priyadarshinijain/data-analysis-with-python

This Python project is focused on analyzing Diwali sales data to gain insights into customer demographics, purchase behavior, and sales trends.

data-visualization python

Last synced: 18 Apr 2026

https://github.com/nahiyanhkhan/data-processing-and-visualization

Loan Data Processing using Python's numpy and pandas libraries. For data visualization, matplotlib and seaborn are used.

data-analysis-python data-visualization matplotlib numpy pandas seaborn

Last synced: 09 May 2026

https://github.com/andersoncrs/clientes-restaurante-analisis-exploratorio-de-datos--eda-

En este proyecto comprendemos como a partir de los datos de los clientes de un restaurante, se pueden descubrir patrones y detalles interesantes que ayuden a una mejor toma de decisiones basadas en datos.

analisis data-exploration data-visualization notebook restaurant

Last synced: 03 Jul 2025

https://github.com/leandrocollares/urbanization-versus-income

A responsive scatter plot that shows urban population percentages and GDP per capita in Americas.

d3 data-visualization svelte

Last synced: 18 Apr 2026

https://github.com/leandrocollares/temperatures-in-victoria

Responsive line chart that shows maximum daily temperatures in Victoria, BC in June 2021

d3 data-visualization svelte

Last synced: 18 Apr 2026

https://github.com/josephbarbierdarnal/matoolkit

matoolkit is a python package containing a toolbox for creating visually appealing graphs/annotations in matplotlib

data-analysis data-visualization matplotlib

Last synced: 31 Mar 2025

https://github.com/nishanthmuruganantham/football-player-wages-eda

This repository uses Python for analyzing football player data, focusing on various aspects such as player positions, league distributions, wages, and the relationship between player age and appearances. It includes visualizations generated using Plotly to provide insights into the dynamics of football player demographics and performance.

data-analysis data-science data-visualization eda football football-analytics football-data kaggle kaggle-dataset pandas plotly python

Last synced: 18 Apr 2026

https://github.com/hannahgsimon/halmodeling2024graphs

Created code to develop and analyze statistical graphs for the spatial radiotherapy model, which can be found at https://github.com/hannahgsimon/HALModeling2024. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/manalisbhavsar/mall-customers-clustering

K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.

data-analysis data-visualization matplotlib numpy pandas python scikit-learn

Last synced: 18 Apr 2026

https://github.com/cicerolino/project-lab

Projeto da disciplina de Project Lab do curso de ciências da computação. Discipline project (Project Lab) of computer science course.

computer-science-project data-visualization docker-compose nextjs postgresql-database prisma-orm reactjs vercel-deployment

Last synced: 05 Apr 2026

https://github.com/mtimma001/clinical-trial-data-tool

Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.

bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python

Last synced: 05 Apr 2026

https://github.com/nowon1/insurance-claim-prediction_version

This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.

data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn

Last synced: 05 Apr 2026

https://github.com/faraazarsath/guvi-task-4

This repository contains Python scripts for assessing and categorizing student performance data from two CSV files. The tasks include categorizing students based on their CodeKata scores.

data-visualization matplotlib numpy pandas

Last synced: 18 Apr 2026

https://github.com/kwokhing/visualizing-datasets-with-facets

Demo on using Facets: An Open Source Visualization Tool for Machine Learning Training Data developed by Google's PAIR Initiative

anaconda data-analysis data-visualization facets jupyter-notebook missing-data open-source python skewness unbalanced-data visualisation visualization

Last synced: 18 Apr 2026

https://github.com/rahuls-1106/dataspark

DataSpark is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-visualization jupyter-notebook matplotlib numpy pandas-dataframe powerbi python seaborn sql

Last synced: 18 Apr 2026

https://github.com/kezouke/grossing-films-dataviz

Project that extracts information about the highest-grossing films from Wikipedia, stores it in a database, and presents it through an interactive, modern web page hosted on GitHub Pages

data-visualization data-wrangling json sqlite wikipedia

Last synced: 18 Apr 2026

https://github.com/laiba-iqrar/hands-on-machine-learning

Repository containing solutions to CS-324 (Machine Learning) practice problems assigned by my professors.

data-visualization feature-engineering gradient-descent regression-models

Last synced: 18 Apr 2026

https://github.com/fbarffmann/tornado-damage-dashboard

Built a Flask dashboard visualizing 1,000+ US tornadoes from 2023 using Leaflet.js and MongoDB. Interactive maps show tornado magnitude, damage, and frequency.

api data-visualization flask geospatial leaflet mongodb pandas python tornado-dashboard

Last synced: 11 Apr 2026

https://github.com/p1n2o/ngx-oracle-dv

Angular component to embed visualizations from Oracle Analytics Cloud into your Angular application.

angular data-visualization embed oracle-analytics-cloud oracle-visualization

Last synced: 18 Apr 2026

https://github.com/hfzdzakii/dicoding-solvinghrproblem

This repo is a master submission for my Dicoding Final Project. Employee Attrition & Performance Dataset was being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!

data-analysis data-visualization

Last synced: 16 May 2025

https://github.com/chauxvive/fccheatmap

A D3.js-driven heatmap visualizing monthly global land surface temperature variations over time. Built as part of FreeCodeCamp’s Data Visualization certification.

d3 d3js data-visualization dataviz

Last synced: 29 Jun 2026

https://github.com/mksingh431/free-data-science-courses

Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.

course data data-analysis data-science data-visualization free freecou python

Last synced: 19 Apr 2026

https://github.com/shyamkumarnagilla/ai-powered-forecasting-for-agricultural-productivity

AI Powered Forecasting for Agricultural Productivity is a project that utilizes machine learning to predict crop yields and optimize farming practices. By harnessing historical and real-time data, this model empowers farmers with data-driven insights to enhance productivity and sustainability in agriculture.

data-analysis data-visualization deep-learning flask neural-network

Last synced: 19 Apr 2026

https://github.com/yuvrajsaraogi/unemployment-analysis-with-python

Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.

big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql

Last synced: 19 Apr 2026

https://github.com/simoesleticia/data-visualization-tutorial

Guia prático de visualização de dados com Seaborn e Matplotlib

data-visualization data-visualization-python matplotlib-tutorial seaborn-tutorial

Last synced: 19 Apr 2026