An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ved-coder-king/wheat_ai_project

This project, Smart Wheat Farming AI System, was developed as part of the coursework for the Artificial Intelligence program at Esprit School of Engineering.

agriculture data-analysis data-visualization deep-learning image-classification machine-learning object-detection python wheat

Last synced: 15 Apr 2025

https://github.com/bala-1409/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint power-bi powerbi powerbi-reports powerbi-visuals visualization

Last synced: 04 Jan 2026

https://github.com/samanhur/data_visualization_pcc

First experiences in data visualization with python

data-analysis data-science data-visualization python3

Last synced: 23 Mar 2025

https://github.com/mr-chang95/webpage_abtest_analysis_udacity

A/B Testing Project for Udacity's Data Analyst Nanodegree Program. Using Python in Jupyter Notebook.

abtesting data-science data-visualization matplotlib pandas python webpage

Last synced: 11 Apr 2026

https://github.com/abhay-sinha-0/carpricepredictionproject

A machine learning project that predicts the selling price of a car based on its features such as year, mileage, fuel type, transmission, and more. This model can assist individuals and dealerships in estimating fair market prices for used cars.

artificial-intelligence data-analysis data-science data-visualization exploratory-data-analysis machine-learning-algorithms matplotlib-pyplot mysql-database numpy-library pandas-library python skit-learn sklearn-library

Last synced: 15 May 2025

https://github.com/felinjob/ibm-applied-data-science-capstone

Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.

data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql

Last synced: 11 Apr 2026

https://github.com/sharoonjoseph321/liver_cirrhosis

This project aim to understanding the factors contributing to liver cirrhosis, analyzing its impact, and possibly predicting disease outcomes using machine learning. It might also explore survival analysis or risk stratification for liver cirrhosis patients.

analytics data-science data-visualization dataanalysis machine-learning machine-learning-algorithms predictive-analytics predictive-modeling python random-forest-classifier visualization

Last synced: 15 Mar 2025

https://github.com/raghavendranhp/phonepe-pulse-data-visualization-and-exploration

This code clones PhonePe data from GitHub. After processing the data, it is displayed in an appealing manner to gain insights from PhonePe's information. This can be used to increase productivity, profits, and focus specifically on business development.

data-visualization githubclone mysql mysqlconnector pandas plotly plotly-dash python sqlalchemy streamlit visualization

Last synced: 11 Apr 2026

https://github.com/azaz9026/myntra_review_project

Myntra Scraper Project Project Overview: The Myntra Scraper Project is designed to extract product data from the Myntra website. This tool enables users to gather information such as product names, prices, descriptions, ratings, and images for analysis, comparison, or personal use.

data-science data-structures data-visualization filesystem github mogodb mogoose python3 strreamlit web-scraping

Last synced: 10 Apr 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/udhaya2823/dataspark-illuminating-insights-for-global-electronics

✨DataSpark✨ is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-science data-visualization database-management datacleaning exploratory-data-analysis matplotlib numpy pandas powerbi python seaborn sql version-control

Last synced: 11 Apr 2026

https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil

Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.

data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn

Last synced: 30 Mar 2025

https://github.com/abhinavbammidi1401/covid-19_analytics

A very comprehensive notebook of statistical models to analyze Covid-19 data and visualization.

analytics covid-19 data-analysis-python data-analytics data-science data-visualization jupyter-notebook predictive-modeling python

Last synced: 19 May 2026

https://github.com/lucertgvby/phat

Graphical PowerShell application designed to help investigators, security analysts, and IT professionals examine email headers for signs of phishing or spoofing. The tool parses headers from .eml and .msg files, highlights important fields, and provides insights into SPF, DKIM, and DMARC results.

data-visualization dimensionality-reduction distributed-computing hashcracking led-matrix-displays mqtt off-chain-compute phala phat raspberry-pi-library single-cell srp-phat unsupervised-learning visualization

Last synced: 21 May 2026

https://github.com/ayaankhan98/covid-19-analysis

Covid-19 Analysis. This repository is a part of AMURoboHack 1.0, Here we tried to visulize the world data of Covid-19. Data Visulization gives an easy way to understand bunch of data. We tried plotting the data over a world map so that users can eaisly get the stats for a conuntry by just hovering the mouse pointer over the country in the world map, we also provided the zooming over the world map to bring a sense of attractiveness and user friendly interface.

covid-19 d3js data-visualization topojson

Last synced: 30 Mar 2025

https://github.com/trimoyee-g/phishing-site-predictor

A phishing site prediction model using scikit-learn's Random Forest Classifier, achieving high accuracy and gaining insights into website characteristics.

data-visualization machine-learning python random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/ianjure/martial-law-in-data

A data visualization of how martial law shaped the Philippine economy.

data-visualization

Last synced: 05 Jan 2026

https://github.com/albertofaraujo/pbi_dashboard_prouni

Analisar os dados referentes ao detalhamento quantitativo das bolsas PROUNI concedidas no ano de 2021.

data-visualization dax-studio power-query powerbi

Last synced: 03 Feb 2026

https://github.com/mitgar14/etl-workshop-1

Workshop #1 (Data Engineer) for the ETL course using Pandas, Matplotlib, SQLAlchemy and Power BI for the creation of the dashboard.

data-engineer data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/jeffbla/image-and-porosity-analysis-by-dash-plotly

This app uses Dash to explore core data. Compare images in the first row and interact with a line chart in the second row. Hovering reveals details, and clicking updates the images to match the selected data point.

dashboard data-visualization dicom-images interactive-visualizations plotly-dash xct

Last synced: 21 Jul 2025

https://github.com/sanad343/complete-data-analyst

Data analysis is the process of turning raw data into useful information for decision-making.

data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau

Last synced: 30 Jun 2025

https://github.com/jatin-s16/hr_mysql_powerbi

This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.

data-analysis data-science data-visualization mysql powerbi

Last synced: 29 May 2026

https://github.com/asuquoaa/bar_chart_visualization_with_confidence_intervals_and_interactive_slider

This project visualizes probabilistic data using bar charts with 95% confidence intervals, allowing users to explore deviations from a Value of Interest (V of I) interactively.

data-visualization interactive-visualizations statistics

Last synced: 01 Sep 2025

https://github.com/csoren66/customer-personality-analysis

Predict how different customer segments will respond for a particular product or service.

data-analysis data-visualization python

Last synced: 03 Mar 2025

https://github.com/vedantshi/coffee-sales-dashboard

This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.

business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends

Last synced: 05 Jan 2026

https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard

Createed an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics—including revenue, profit, orders, and return rates—across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.

business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis

Last synced: 05 Jan 2026

https://github.com/tsopermon/comparison-ml-algorithms

This repository compares the performance of Adaline, Logistic Regression, and Perceptron models on binary classification tasks using linearly, non-linearly, and marginally separable datasets from the Iris dataset. It includes MATLAB implementations, 10-fold cross-validation, and visualizations of decision boundaries and MSE histories.

adaline binary-classification classification-accuracy cross-validation data-visualization decision-boundaries iris-dataset logistic-regression machine-learning matlab mse neural-networks perceptron

Last synced: 15 Mar 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/upes-open/open-cryptocurrency-analysis

A web app to visualise and predict the cryptocurrency’s impact by using Web scraping, data exploration, EDA and Data Visualization.

analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization

Last synced: 15 Apr 2025

https://github.com/spear97/montecarlo-python

This was a project for my Programming Language Concepts Class were we were assigned to create a Monty Carlo Simulation using Python.

data-science data-visualization matplotlib-figures matplotlib-python montecarlo pandas-library pandas-python python python-3

Last synced: 23 Mar 2025

https://github.com/yaser-123/energy-consumption-dashboard

A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.

dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi

Last synced: 05 Jan 2026

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/vidyadnina/cyclistic-sql-tableau-project

Trip data analysis for a bike-sharing service company using SQL and Tableau.

bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql

Last synced: 02 Jan 2026

https://github.com/carmendev/covid-19-tracker

Data visualization React.js project deployed with Firebase. Daily statistics about current, recovered and closed cases coming from an API.

data-visualization firebase numeral reactjs

Last synced: 11 Apr 2026

https://github.com/franloza/contratosdemadrid

This project is an interactive web application for exploring and analyzing public contracts in the Community of Madrid. It allows users to search for companies and view their contract details, aiming to promote transparency and facilitate access to public information.

data-visualization duckdb evidence open-data

Last synced: 02 Sep 2025

https://github.com/hanzopgp/lolanalysis

League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.

data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university

Last synced: 27 May 2026

https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches

a data-driven exploration of international men's and women's football (soccer) match results using Python

data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization

Last synced: 05 May 2026

https://github.com/gautam25raj/data-sync

A powerful platform designed to revolutionize the way teams collaborate and visualize data.

chat collaboration data-visualization express material-tailwind mongodb mongoose nextjs nodejs reactjs redux redux-toolkit tableau tableau-dashboard tailwindcss

Last synced: 11 Apr 2026

https://github.com/alfiyafatima09/heuristic_algorithms

This project compares pathfinding algorithms (A*, Greedy Best-First, and Hill Climbing) by visualizing their paths and comparing performance metrics (nodes explored, memory, execution time) on a grid with obstacles.

algorithms data-visualization

Last synced: 20 Jan 2026

https://github.com/gunjanmimo/d3-visualization

D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework

d3js data data-science data-visualization reactjs

Last synced: 29 Apr 2026

https://github.com/jfaccioli/citi-bike-tableau

A data analysis of Citi Bike users in Jersey City using Tableau

data-analysis data-visualization tableau tableau-public

Last synced: 26 Jan 2026

https://github.com/sayamalt/taxi-trip-fare-prediction

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 09 Nov 2025

https://github.com/sayamalt/credit-card-approval-prediction

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation

Last synced: 09 Nov 2025

https://github.com/jiyanshgarg/delhivery-logistics-data-analysis

This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.

business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment

Last synced: 12 Jun 2026

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding enormous real estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 09 Nov 2025

https://github.com/farhannirzhor/vrinda_store_excel_project

This project is about excel analysis and visualization. In this project, I analyzed Vrinda Store's sales and made an annual sales report

data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting

Last synced: 05 Jan 2026

https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn

This is the last project in the nanodegree udacity program. it's about data visualization.

data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree

Last synced: 09 May 2026

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 10 Mar 2026

https://github.com/thenorthkun/movies-dataset-analysis

Analysis & categorizing of Movies based on Actors, Genres, Gross covered etc 🦸🏼🧜🏼‍♀️🎧

data-analysis data-visualization filtering

Last synced: 23 Mar 2025

https://github.com/nel-zi/nuga_bank

Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.

data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline

Last synced: 16 May 2025

https://github.com/debjyotisaha/power-bi-projects-phase-1

Portfolio projects related to data visualisation in Power BI

data-analysis data-visualization dax-expression powerbi powerquery

Last synced: 18 Jan 2026

https://github.com/nimomach/skateboarding-in-olympics

Skateboarding made its debut in Olympics at the 2020 Summer Olympics. This is a dashboard focused on "Skateboarding in the Olympics" representing a comprehensive overview of the sport's performance, popularity, and key metrics during the Olympic Games.

data-analysis data-visualization olympics paris skateboarding tokyo

Last synced: 10 Mar 2026

https://github.com/sundarsharma332/codeliner

code snippet highlighter with custom line selection

code data-visualization program snapshot visualizer

Last synced: 10 Mar 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/sehaj003/telco-churn-analysis

This repository contains files (dataset and Jupyter codebooks) for a project aimed to build machine learning models to predict customer churn based on given parameters.

data-science data-visualization exploratory-data-analysis machine-learning machine-learning-models predictive-modeling principal-component-analysis python

Last synced: 20 May 2026

https://github.com/jdfoster11/northwest_territories_collision_factors

Using Python & Tableau to perform a statistical and regression analysis on a NorthWest Territories Vehicle Collision Dataset

clustering-algorithm co-lab data-science data-visualization heatmap-visualization html python3 tableau

Last synced: 15 Mar 2025

https://github.com/Lightning-Chart/lcjs-example-0305-racingbars

A demo application showcasing using LightningChart JS to visualize tracking of COVID-19.

chart data-visualization lcjs lightningchart-js

Last synced: 27 Dec 2025

https://github.com/pratanup/exploratory-data-analysis-eda-

Objective is to make this data ready for modeling by transforming the given data into clean data by doing EDA

data-analytics data-science data-visualization exploratory-data-analysis python

Last synced: 19 Apr 2026

https://github.com/gustapinto/fatec_dsm_pi_terceiro_semestre

Um visualizador para tendências de animes

anime chartjs data-visualization django etl

Last synced: 05 Apr 2025

https://github.com/pythoncoderunicorn/gi-joe

Dataset for GI Joe action figures from 1980s & 1990s. My dataset for TidyTuesday

data-science data-visualization gi-joe r toy-project

Last synced: 27 May 2026

https://github.com/corndogit/dataspaceart

A generative art project which generates stylized patterns from weather data

data-visualization python weather

Last synced: 06 Oct 2025

https://github.com/hfzdzakii/dicoding-solvinghrproblem

This repo is a master submission for my Dicoding Final Project. Employee Attrition & Performance Dataset was being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!

data-analysis data-visualization

Last synced: 16 May 2025

https://github.com/camsai/notebooks

CAMSAI Notebooks provides interactive Jupyter notebooks for AI-driven materials science research. These notebooks demonstrate the use of CAMSAI tools, schemas, and workflows, offering hands-on examples for data validation, materials design, and AI integration to accelerate scientific discovery.

artificial-intelligence chemistry data-science data-standards data-structures data-visualization density-functional-theory machine-learning materials materials-design materials-informatics materials-science modeling-and-simulation molecular-dynamics

Last synced: 27 Oct 2025

https://github.com/filip-kustura/statistics-olympics-analysis

A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.

correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project

Last synced: 05 Jan 2026

https://github.com/lytello/data-visualizations

Assortment of data visualizations I have created

data-visualization r

Last synced: 28 May 2026

https://github.com/riju18/data-analysis-and-visualizaton

Most complex data analyzing for clustering, preparing, complex calculation, joining, cross-over & more for Data science.

data-analysis data-mining data-science data-visualization powerbi tableau

Last synced: 04 Jan 2026

https://github.com/ledsouza/curso_de_estatistica_parte_3

Projeto de curso de estatística sobre distribuições e teste de hipósteses

data-science data-visualization pandas scipy seaborn statsmodels vitrinedev

Last synced: 29 Apr 2026