An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sharoonjoseph321/liver_cirrhosis

This project aims to understand the factors contributing to liver cirrhosis, analyze its impact, and possibly predict disease outcomes using machine learning. It might also explore survival analysis or risk stratification for liver cirrhosis patients.

analytics data-science data-visualization dataanalysis machine-learning machine-learning-algorithms predictive-analytics predictive-modeling python random-forest-classifier visualization

Last synced: 15 Mar 2025

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 27 Apr 2026

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/azaz9026/myntra_review_project

Myntra Scraper Project. Project overview: the Myntra Scraper Project is designed to extract product data from the Myntra website. This tool enables users to gather information such as product names, prices, descriptions, ratings, and images for analysis, comparison, or personal use.

data-science data-structures data-visualization filesystem github mogodb mogoose python3 strreamlit web-scraping

Last synced: 10 Apr 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil

This exploratory data analysis (EDA) of a student performance dataset aims to identify and understand the factors that influence students' academic performance. Through data cleaning, transformation, and visualization, it seeks to uncover meaningful patterns and relationships.

data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn

Last synced: 30 Mar 2025

https://github.com/abhinavbammidi1401/covid-19_analytics

A comprehensive notebook of statistical models for analyzing and visualizing COVID-19 data.

analytics covid-19 data-analysis-python data-analytics data-science data-visualization jupyter-notebook predictive-modeling python

Last synced: 19 May 2026

https://github.com/lucertgvby/phat

Graphical PowerShell application designed to help investigators, security analysts, and IT professionals examine email headers for signs of phishing or spoofing. The tool parses headers from .eml and .msg files, highlights important fields, and provides insights into SPF, DKIM, and DMARC results.

data-visualization dimensionality-reduction distributed-computing hashcracking led-matrix-displays mqtt off-chain-compute phala phat raspberry-pi-library single-cell srp-phat unsupervised-learning visualization

Last synced: 21 May 2026

https://github.com/shaolans/projet_algav_trie

Implementation of the Patricia Trie and the Hybrid Trie in Java

algorithms data-structures data-visualization graphviz-dot hybrid-training java patricia-tree tree trie

Last synced: 11 Jun 2026

https://github.com/trimoyee-g/phishing-site-predictor

A phishing site prediction model using scikit-learn's Random Forest Classifier, achieving high accuracy and gaining insights into website characteristics.

data-visualization machine-learning python random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/ianjure/martial-law-in-data

A data visualization of how martial law shaped the Philippine economy.

data-visualization

Last synced: 05 Jan 2026

https://github.com/jeffbla/image-and-porosity-analysis-by-dash-plotly

This app uses Dash to explore core data. Compare images in the first row and interact with a line chart in the second row. Hovering reveals details, and clicking updates the images to match the selected data point.

dashboard data-visualization dicom-images interactive-visualizations plotly-dash xct

Last synced: 21 Jul 2025

https://github.com/sanad343/complete-data-analyst

Data analysis is the process of turning raw data into useful information for decision-making.

data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau

Last synced: 30 Jun 2025

https://github.com/jatin-s16/hr_mysql_powerbi

This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.

data-analysis data-science data-visualization mysql powerbi

Last synced: 29 May 2026

https://github.com/asuquoaa/bar_chart_visualization_with_confidence_intervals_and_interactive_slider

This project visualizes probabilistic data using bar charts with 95% confidence intervals, allowing users to explore deviations from a Value of Interest (V of I) interactively.

data-visualization interactive-visualizations statistics

Last synced: 01 Sep 2025

https://github.com/csoren66/customer-personality-analysis

Predict how different customer segments will respond to a particular product or service.

data-analysis data-visualization python

Last synced: 03 Mar 2025

https://github.com/mcommer/emtools

A toolbox for geophysical EM-simulation data- and model-file processing, analysis, plotting, and other gimmicks

data-visualization electromagnetics geophysics plotting-scripts shell-scripts

Last synced: 30 Jun 2025

https://github.com/khushi-sabarad/adinsights_dashboard

AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights

ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application

Last synced: 01 May 2026

https://github.com/vedantshi/coffee-sales-dashboard

This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.

business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends

Last synced: 05 Jan 2026

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc. Libraries include Pandas, NumPy, SciPy, SymPy, Matplotlib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard

Created an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics (including revenue, profit, orders, and return rates) across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.

business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis

Last synced: 05 Jan 2026

https://github.com/tsopermon/comparison-ml-algorithms

This repository compares the performance of Adaline, Logistic Regression, and Perceptron models on binary classification tasks using linearly, non-linearly, and marginally separable datasets from the Iris dataset. It includes MATLAB implementations, 10-fold cross-validation, and visualizations of decision boundaries and MSE histories.

adaline binary-classification classification-accuracy cross-validation data-visualization decision-boundaries iris-dataset logistic-regression machine-learning matlab mse neural-networks perceptron

Last synced: 15 Mar 2025

https://github.com/spear97/montecarlo-python

This was a project for my Programming Language Concepts class, where we were assigned to create a Monte Carlo simulation using Python.

data-science data-visualization matplotlib-figures matplotlib-python montecarlo pandas-library pandas-python python python-3

Last synced: 23 Mar 2025

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/chokzb/covid19_vaccination_analysis

An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination

Last synced: 07 May 2026

https://github.com/vidyadnina/cyclistic-sql-tableau-project

Trip data analysis for a bike-sharing service company using SQL and Tableau.

bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql

Last synced: 02 Jan 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/danhnnguyen0606/bitcoin-navigator

Bitcoin Navigator: A data-driven dashboard designed to analyze Bitcoin trends, empowering investors to refine their strategies and identify optimal investment opportunities.

bitcoin btc crypto cryptocurrency data-analysis data-analytics data-science data-visualization investment looker looker-studio

Last synced: 15 Mar 2025

https://github.com/carmendev/covid-19-tracker

Data visualization React.js project deployed with Firebase. Daily statistics about current, recovered and closed cases coming from an API.

data-visualization firebase numeral reactjs

Last synced: 11 Apr 2026

https://github.com/hanzopgp/lolanalysis

League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.

data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university

Last synced: 27 May 2026

https://github.com/pat8901/diskanalyzer-cli

Processes a PDF file holding storage utilization data to automatically create graph visualizations revealing the true demographics hidden in large data.

data-visualization graphs-generation matplotlib

Last synced: 27 Dec 2025

https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches

A data-driven exploration of international men's and women's football (soccer) match results using Python.

data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization

Last synced: 05 May 2026

https://github.com/navp7/roadaccident_powerbi

An interactive Power BI dashboard designed to analyze road accident data

dashboards data-analysis data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/gautam25raj/data-sync

A powerful platform designed to revolutionize the way teams collaborate and visualize data.

chat collaboration data-visualization express material-tailwind mongodb mongoose nextjs nodejs reactjs redux redux-toolkit tableau tableau-dashboard tailwindcss

Last synced: 11 Apr 2026

https://github.com/apelullo/cobalt_health_wellness_platform_ops

Cobalt is a mental health and wellness platform created for Penn Medicine employees that serves as a hub for support services such as therapy, wellness coaching, topic- and population-specific group sessions, and a variety of self-help resources.

academic-research data-cleaning-pipeline data-validation data-visualization decision-support feature-development healthcare-data hipaa key-performance-metrics mental-health-services operations-research product-analytics reporting-pipeline

Last synced: 23 Mar 2025

https://github.com/alfiyafatima09/heuristic_algorithms

This project compares pathfinding algorithms (A*, Greedy Best-First, and Hill Climbing) by visualizing their paths and comparing performance metrics (nodes explored, memory, execution time) on a grid with obstacles.

algorithms data-visualization

Last synced: 20 Jan 2026

https://github.com/gunjanmimo/d3-visualization

D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework

d3js data data-science data-visualization reactjs

Last synced: 29 Apr 2026

https://github.com/jfaccioli/citi-bike-tableau

A data analysis of Citi Bike users in Jersey City using Tableau

data-analysis data-visualization tableau tableau-public

Last synced: 26 Jan 2026

https://github.com/sayamalt/taxi-trip-fare-prediction

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 09 Nov 2025

https://github.com/sayamalt/credit-card-approval-prediction

Successfully developed a machine learning model that can predict, with up to 100% accuracy, whether a given applicant's credit card application would be approved, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation

Last synced: 09 Nov 2025

https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi

Activities from the Data Science Academy courses Análise de Dados com Microsoft Power BI (Data Analysis with Microsoft Power BI) and Clínica de BI (BI Clinic).

dashboards data-analysis data-visualization microsoft-power-bi power-bi

Last synced: 05 Jan 2026

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding very large estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 09 Nov 2025

https://github.com/sayamalt/customer-churn-prediction

Successfully built a machine learning model that can predict, to a great level of accuracy, whether a given customer currently using a company's products and services will churn at any time in the future, based on a set of features specific to that individual.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 09 Nov 2025

https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn

This is the last project in the Udacity nanodegree program. It is about data visualization.

data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree

Last synced: 09 May 2026

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/thenorthkun/movies-dataset-analysis

Analysis and categorization of movies based on actors, genres, gross, etc. 🦸🏼🧜🏼‍♀️🎧

data-analysis data-visualization filtering

Last synced: 23 Mar 2025

https://github.com/anurag-kumar-molankala/sales-performance-dashboard

A Power BI dashboard that analyzes sales trends, product performance, customer segmentation, and payment distribution. It uses DAX, time intelligence, and interactive visuals for data-driven insights. The model includes Sales, Product, and Customer tables for in-depth analysis.

dashboards data-analysis data-visualization dax dax-functions dax-measures dax-query etl-process powerbi powerbi-visuals powerquery sql-query sql-server

Last synced: 03 Apr 2025

https://github.com/debjyotisaha/power-bi-projects-phase-1

Portfolio projects related to data visualisation in Power BI

data-analysis data-visualization dax-expression powerbi powerquery

Last synced: 18 Jan 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/sehaj003/telco-churn-analysis

This repository contains files (dataset and Jupyter codebooks) for a project that aims to build machine learning models to predict customer churn based on given parameters.

data-science data-visualization exploratory-data-analysis machine-learning machine-learning-models predictive-modeling principal-component-analysis python

Last synced: 20 May 2026

https://github.com/mithoon278/us-visa-approval-prediction-mlops-project

This project presents an ML-based solution that predicts which visa applications will be approved and thus recommends suitable profiles for applicants whose visas have a high chance of approval.

aws classification data-visualization ec2-instance exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning random-forest-classifier s3-bucket

Last synced: 11 Apr 2026

https://github.com/Lightning-Chart/lcjs-example-0305-racingbars

A demo application showcasing the use of LightningChart JS to visualize COVID-19 tracking data.

chart data-visualization lcjs lightningchart-js

Last synced: 27 Dec 2025

https://github.com/pratanup/exploratory-data-analysis-eda-

The objective is to make the given data ready for modeling by cleaning it through EDA.

data-analytics data-science data-visualization exploratory-data-analysis python

Last synced: 19 Apr 2026

https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office

My Project to learn the Basics of Analysis & Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python

Last synced: 11 Apr 2026

https://github.com/ledsouza/dataviz_vendas_lojas

Data visualization project for making comparisons and showing distributions.

data-science data-visualization matplotlib pandas pyplot python seaborn vitrinedev

Last synced: 11 Apr 2026

https://github.com/pythoncoderunicorn/gi-joe

Dataset for GI Joe action figures from the 1980s and 1990s. My dataset for TidyTuesday.

data-science data-visualization gi-joe r toy-project

Last synced: 27 May 2026

https://github.com/dheyhasan/echo-trends

EchoTrends is a data visualization app that analyzes your Spotify playlists and reveals insightful patterns—such as track duration, popularity, and statistical correlations—using interactive charts and statistical tests. Built with React (frontend) and FastAPI (backend), it offers both functional analysis and a demo landing

correlation-analysis data-visualization fastapi javascript music-analysis python react recharts spotify-api tailwindcss

Last synced: 11 Apr 2026

https://github.com/shafaq-aslam/data-analytics-dairy

A comprehensive repository for Data Analytics learning and projects. It includes MySQL, Python, Power BI, Tableau, and Excel. The goal is to analyze data, generate insights, and create compelling visualizations for real-world datasets.

data-analysis data-visualization excel excel-based-data-analysis powerbi python-scripts sql sql-queries sql-queries-for-data-manipulation sql-query-for-data-visualization tableau

Last synced: 20 Jan 2026

https://github.com/hfzdzakii/dicoding-solvinghrproblem

This repo is a master submission for my Dicoding Final Project. The Employee Attrition & Performance Dataset was used to fulfill the submission. Feel free to explore, and I hope my work gives you some insight!

data-analysis data-visualization

Last synced: 16 May 2025

https://github.com/fbarffmann/tornado-damage-dashboard

Built a Flask dashboard visualizing 1,000+ US tornadoes from 2023 using Leaflet.js and MongoDB. Interactive maps show tornado magnitude, damage, and frequency.

api data-visualization flask geospatial leaflet mongodb pandas python tornado-dashboard

Last synced: 11 Apr 2026

https://github.com/camsai/notebooks

CAMSAI Notebooks provides interactive Jupyter notebooks for AI-driven materials science research. These notebooks demonstrate the use of CAMSAI tools, schemas, and workflows, offering hands-on examples for data validation, materials design, and AI integration to accelerate scientific discovery.

artificial-intelligence chemistry data-science data-standards data-structures data-visualization density-functional-theory machine-learning materials materials-design materials-informatics materials-science modeling-and-simulation molecular-dynamics

Last synced: 27 Oct 2025

https://github.com/filip-kustura/statistics-olympics-analysis

A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.

correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project

Last synced: 05 Jan 2026

https://github.com/lytello/data-visualizations

Assortment of data visualizations I have created

data-visualization r

Last synced: 28 May 2026

https://github.com/riju18/data-analysis-and-visualizaton

Complex data analysis for data science, covering clustering, data preparation, complex calculations, joins, cross-over, and more.

data-analysis data-mining data-science data-visualization powerbi tableau

Last synced: 04 Jan 2026

https://github.com/cube-m4st3r/sphere.discord-bot

The main Discord frontend project for sphere, intended for general-purpose commands and visualization.

data-visualization discord-bot discord-py discord-py-bot python3

Last synced: 16 Jan 2026

https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python

In this project, earthquakes in Turkey from 1915 to 2021 were analyzed. The data, taken from the dataset included in the repo, was first organized using data cleaning algorithms. The cleaned data was then rendered as graphics and animations using data visualization algorithms.

data-analysis data-cleaning data-visualization jupyter-notebook

Last synced: 23 Mar 2025

https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy

In this project, we analyze spaceflight data for the Falcon 9 rocket from the space company SpaceX.

data-analysis data-science data-visualization eda elonmusk spacex

Last synced: 23 Mar 2025

https://github.com/BAMresearch/SOFA

SOftware for Force Analysis - A graphical user interface to analyze Atomic Force Microscopy Force Spectroscopy data

atomic-force-microscopy data-science data-visualization

Last synced: 16 Jan 2026

https://github.com/batthulavinay/which-anime-dominates-the-world

This Kaggle notebook analyzes the global popularity of anime across different countries. Using data visualization and statistical insights, we explore which anime dominate in various regions and uncover trends in global anime consumption.

data-visualization exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 24 Mar 2025

https://github.com/cluzier/crypto-price-dashboard

Shows current crypto prices and trade history

charts cryptocurrency data-visualization

Last synced: 13 Oct 2025

https://github.com/ishmal793/dashboard-cms-

streamlit_dashboard (content management system) with both compliance and violation options

data-visualization streamlit streamlit-dashboard

Last synced: 02 Sep 2025