An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/sunnybibyan/random_data_generation

A project that generates a dataset using various statistical distributions (Normal, Uniform, Exponential, Random Integers, and Binomial) and performs data analysis. Includes visualizations and an option to export the data as a CSV file.

data-analysis data-visualization python random-data-generation statistics streamlit-webapp

Last synced: 13 Jun 2026

https://github.com/mxagar/statistics_with_python_coursera

My personal notes done while following the Coursera Specialization "Statistics with Python", from the University of Michingan, hosted by Dr. Brenda Gunderson.

data-modeling data-science data-visualization hypothesis-testing machine-learning pandas python statistics

Last synced: 06 May 2026

https://github.com/douglasvolcato/fiis-analysis-brasilian-market

Brazilian investment fund analysis focused in dividend yield and minimun price variation

data-science data-visualization pandas python scraping scraping-websites

Last synced: 09 May 2026

https://github.com/vjo/d3-punchcard

D3 Punchcard chart 📊●•●

chart d3js data-visualization library punchcard visualization

Last synced: 13 Jun 2026

https://github.com/douglasvolcato/brazilian-stock-market-analysis

Brazilian stocks analysis focused in dividend yield, diversification and minimun price variation

data-science data-visualization pandas python scraping scraping-websites

Last synced: 09 May 2026

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/mohini1403/road_accident_data_analytics

This project aims to analyze road accident data to gain insights into the factors contributing to accidents, identify patterns, and propose data-driven recommendations for improving road safety. The dataset used in this project contains information about various aspects of road accidents, such as location, time, weather conditions, and severity.

analytics data-visualization pandas powerbi

Last synced: 14 Jun 2026

https://github.com/soufianboukir/ecom-analytics-platform

End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.

data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries

Last synced: 14 Jun 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/amirhosseinhonardoust/customer-sentiment-intelligence-platform

An enterprise-grade NLP + Streamlit + SQL platform for analyzing customer feedback. Performs automated sentiment detection, stores labeled reviews in SQLite, and delivers real-time dashboards with probability insights to support business, marketing, and product optimization decisions.

community-project cost-of-living dashboard data-analysis data-visualization economic-analysis inflation-tracking local-data open-data pandas price-tracker public-insight python sqlite streamlit

Last synced: 06 May 2026

https://github.com/darkmenacer/google-geolocation-bias

Tool to scrape web links for different queries from different cities across the world, all from your home

data-visualization postgresql python3 selenium webscraping

Last synced: 08 May 2026

https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction

This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection

classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python

Last synced: 09 May 2026

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 08 May 2026

https://github.com/hyoaru/philippine-poverty-area-estimates-choropleth

A web application providing a visual representation using a choropleth map of the estimated magnitude of poor families in the Philippines from the years 2006, 2009, 2012, and 2015.

data-visualization plotly python streamlit web-application

Last synced: 08 May 2026

https://github.com/kaushik0911/jubilant-guide

A Streamlit application for advanced route planning and accessibility analysis using OpenRouteService (ORS). Explore optimal routes while avoiding roadblocks and discover points of interest (POIs) within travel time ranges.

data-analysis data-visualization geospatial-analysis python streamlit

Last synced: 16 Jun 2026

https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk

The aim of this analysis is to understand the smoking patterns among adults in the UK.

data data-analysis data-visualization python3

Last synced: 09 May 2026

https://github.com/topfunky/learning-r-stats

Scripts and data while learning to use the R statistics and charting software program

data-visualization r statistics

Last synced: 16 Jun 2026

https://github.com/akshadk7/exploratory-data-analysis

Implementing EDA and Machine Learning Algorithms on Kaggle Car Dataset

data-visualization exploratory-data-analysis machine-learning-algorithms predictive-modeling

Last synced: 17 Jun 2026

https://github.com/barrarrr/fly-in

A dynamic, terminal-based drone network simulation application.

42 42school a-star algorithms breadth-first-search data-visualization drone fly-in

Last synced: 10 Jun 2026

https://github.com/vrostbyte/budget-app

Web app to manage personal finances: track expenses, income, bills, and visualize budgets with charts.

bills-management budget css data-visualization expense- finance html income-tracker javascript json personal-finance web-app

Last synced: 10 May 2026

https://github.com/texasbe2trill/exploring-world-population-with-r

In this project, I explored and analyzed a world population dataset in the R programming language. Utilizing tidyverse, I created two choropleth maps. One map shows the growth rate by continent, and the other shows the growth rate of African countries.

africa choropleth-map continents data-visualization datascience dataset demographics exploratory-data-analysis growthrate people population r rprogramming social-sciences world-population

Last synced: 17 Jun 2026

https://github.com/deva-246/business-insights-on-realtime-swiggy-data-using-python

Data analysis for business decision-making and insights of a real time segment of Swiggy data.

data-visualization jupyter pandas python seaborn

Last synced: 10 May 2026

https://github.com/msohaill/wrappedify

Full stack implementation of Wrappedify

data-visualization express music socket-io spotify sveltekit

Last synced: 05 May 2026

https://github.com/backdoorali/insider-threat-detection-project

Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.

cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence

Last synced: 07 May 2026

https://github.com/dimitryzub/walmart-stores-coffee-analysis

Walmart Coffee Exploratory Data Analysis. Data Extracted with SerpApi 🧡

analysis analytics data data-visualization matplotlib pandas python pythonanalysis seaborn

Last synced: 10 May 2026

https://github.com/shazeus/vizflow-cli

Data visualization pipeline tool for schema inspection, charts, dashboards, and export

charts cli dashboard data-visualization flask pandas plotly python

Last synced: 09 Jun 2026

https://github.com/gabrielmpinho/cs50-sql

Solutions and notes from CS50’s Introduction to Databases with SQL. Covers CRUD operations, data modeling, normalization, joins, views, indexes, and connecting SQL with Python and Java. Begins with SQLite for portability and introduces PostgreSQL and MySQL for scalability.

data-analysis data-structures data-visualization database databases javascript python sql

Last synced: 10 May 2026

https://github.com/vineet416/chronic-kidney-disease-prediction

This repository contain code of Chronic Kidney Disease Detection Prediction Project. The goal of this project is predict the chronic kidney disease using parameters like Diabetes Mellitus, Blood Urea, Sugar, Hypertension etc.. I used multiple machine learning algorithms with hyperparameter tuning which is having highest accuracy score of 97.5

data-visualization data-wrangling exploratory-data-analysis feature-engineering feature-selection hyperparameter-tuning machine-learning matplotlib numpy pandas plotly pre-processing python seaborn sklearn-library statsmodels

Last synced: 14 Apr 2026

https://github.com/emilhvitfeldt/ggtetris

Create Tetris Chart Visualizations in R

data-visualization datavisualization dataviz ggplot2 r rstats

Last synced: 05 Apr 2025

https://github.com/kinshuk-code-1729/data-visualisation-using-python

This Repository consists of several python snippets for creating Two-Dimensional (2D) Graphics

data-analysis data-science data-visualization matplotlib visualization

Last synced: 02 Jun 2026

https://github.com/shuklayash02/data_analysis_using_r

Covid19 analysis and cleaning of data where the death age and deaths of specific gender is cleaned and analysed

analysis cleaning-data data-analysis data-visualization rprogramming

Last synced: 09 Oct 2025

https://github.com/snehankekre/streamlit-vehicle-collisions-nyc

This application is a Streamlit dashboard to analyze motor vehicle collisions in NYC 🗽💥🚗

data-science data-visualization nyc-opendata python streamlit streamlit-dashboard

Last synced: 31 Jan 2026

https://github.com/mayankyadav23/air-bnb-data-analysis

Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.

advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt

Last synced: 19 Mar 2026

https://github.com/allanotieno254/data-analytics-with-tableau

Repository showcasing projects and insights generated through Tableau. Contains visualizations, dashboards, and analytical reports on various datasets,

analytics-intelligence business-intelligence dashboards-tableau data-analytics data-storytelling data-visualization tableau

Last synced: 19 Mar 2026

https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis

The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.

codebasics data-analysis data-science data-visualization mathematics python python3 statistics

Last synced: 30 Apr 2026

https://github.com/xuri/excelize-cs

Excelize is a C# port of Go Excelize library that allow you to write to and read from XLAM / XLSM / XLSX / XLTM / XLTX files.

agent ai chart csharp data-analysis data-science data-visualization excel excelize formula microsft office ooxml parser spreadsheet xlsm xlsx

Last synced: 03 Mar 2026

https://github.com/thevinh-ha-1710/rstudio-statistics

This project deeply studies 2 datasets using applied statistics techniques.

applied-statistics data-analysis data-science data-visualization rmarkdown rstudio

Last synced: 31 Jan 2026

https://github.com/trigeminal/hospital-respiratory-forensics

(DS) A comprehensive repository dedicated to the analysis of weekly hospital respiratory data and metrics reported to the Centers for Disease Control and Prevention’s (CDC) National Health Safety Network (NHSN) from August 2020 through October 2024.

data-visualization jupyter-notebooks python3

Last synced: 30 Apr 2026

https://github.com/robertopatino1/oscars2023_data_analysis

A deep data science analysis involving tweets regarding the upcoming Academy Awards

data data-analysis-python data-science data-visualization html jupyter-notebook lda-model machine-learning python trends tweepy twitter

Last synced: 24 Apr 2026

https://github.com/tejashah88/Visualflow-Unity

Visualizing Vitech insurance data with Unity and the Leap Motion (for HTN 2017).

data-visualization leap-motion unity unity3d virtual-reality vitech vr

Last synced: 11 Mar 2025

https://github.com/divyanshu-rawat/d3-data-viz

A D3.js based data visualization project.

d3js-bars d3js-graph data-visualization reactjs

Last synced: 15 Apr 2026

https://github.com/mohamedhany99/human-voice-identifier-counter

the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)

android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python

Last synced: 27 Mar 2026

https://github.com/robinmillford/analytics_for_fashion_supply_management

This Streamlit dashboard provides a comprehensive analysis of supply chain data, focusing on key metrics such as production volumes, stock levels, order quantities, revenue, manufacturing costs, lead times, shipping costs, transportation routes, risk factors, and sustainability factors

dashboard data-analysis data-visualization streamlit supply-chain-management

Last synced: 07 Sep 2025

https://github.com/alcestide/scianalytics

Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.

csv data-analysis data-science data-visualization pandas plotly python science-research statistics

Last synced: 30 Apr 2026

https://github.com/epfromer/x2-angular

Angular app for data visualization and searching of Enron email via x2 server.

angular auth0 data-visualization graphql

Last synced: 13 Apr 2026

https://github.com/gutyoh/narrative_visualization

Visualizations of the evolution of Netflix's content over the years using the D3 JavaScript library.

d3 data-visualization javascript

Last synced: 30 Apr 2026

https://github.com/alrza2003/alrza2003.github.io

This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.

data data-analysis data-visualization portfolio portfolio-website python

Last synced: 30 Apr 2026

https://github.com/14richa/forecastflow

ForecastFlow is a web application built with Streamlit that visualizes solar and wind power forecasts from Elexon.

analysis data-visualization python streamlit

Last synced: 30 Apr 2026

https://github.com/marcelohfonseca/moneychart

Projeto aberto para que interessadas em investimentos e análise de dados possam testar e contribuir com melhorias no código fonte e nas visualizações de dados.

business-intelligence data-analytics data-visualization investing power-bi python stocks stocks-analysis

Last synced: 17 May 2026

https://github.com/mkk-1817/hr-attrition

This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.

data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python

Last synced: 03 May 2026

https://github.com/affec-ds/dashboard-ventas-vinilos

Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.

business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records

Last synced: 30 Apr 2026

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/albertomorini/policesviolence

Repository for the project of the course Data Science (Fondamenti di Scienza dei Dati) at UniUD.

data-analysis data-science data-visualization r

Last synced: 31 May 2026

https://github.com/mansurpro/digitrecognizer

DigitRecognizer is a simple, handcrafted neural network implemented with NumPy to classify handwritten digits from the MNIST dataset. This project demonstrates the foundational principles of neural networks, including forward propagation, backpropagation, and gradient descent, all built from scratch without relying on deep learning frameworks.

backpropagation custom-dataset data-visualization deep-learning from-scratch gradient-descent handwritten-digit-recognition machine-learning minimal-dependencies mnist neural-network numpy python

Last synced: 16 May 2026

https://github.com/arda-guler/spaceguard

Data and programs related to minor planets (especially NEOs and potentially hazardous objects).

asteroid asteroids astronomy data-science data-visualization minor-planet-center minor-planets neo neos planetary-defense space spaceguard

Last synced: 07 Sep 2025

https://github.com/tufstraka/irizs

Data Visualization projects. 📈📉📊

data-visualization

Last synced: 01 Feb 2026

https://github.com/clever-boy/productclassification

Comprehensive product analysis and recommendation system with JSON data processing, visual analytics, and machine learning.

data-visualization json-processing machine-learning product-analysis python recommendation-system

Last synced: 14 Apr 2026

https://github.com/sunnybibyan/marketing_campaign_analysis_power_bi_dashboard

Campaign Performance Analysis This project analyzes the performance of Spring, Summer, and Fall marketing campaigns, revealing key insights and actionable recommendations.

data-analysis data-visualization dax marketing-campaign powerbi

Last synced: 19 Mar 2026

https://github.com/prernarohra/mental-health-prediction

This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.

data-analysis data-science data-visualization machine-learning mental-health python

Last synced: 20 May 2026

https://github.com/nafisalawalidris/springforth-university-foodbank

Springforth University Food Bank: A collaborative initiative with UNESCO to address student food insecurity. Contains code and resources for the web application, data analysis, and insights into the prevalence and impact of food insecurity on academic performance.

academic-performance collaborative-initiative data-analysis data-visualization excel pivot-tables powerbi springforth-university-food-bank student-food-insecurity unesco

Last synced: 17 Feb 2026

https://github.com/scarblase/salary-comparison

Submission for the DataCamp Salary Competition(1 level). 🏆

data data-analysis data-science data-visualization engineering python sql structured-data

Last synced: 01 May 2026

https://github.com/rakumar99/jp-morgan-chase-virtual-internship

This repository contains the various tasks assigned by JPMorgan Chase & Co. Virtual Internship on Microsoft Excel

conditional-formatting dashboard data-analysis data-visualization hlookup pivot-tables presentation vba-macros vlookup

Last synced: 02 Mar 2026

https://github.com/anarya22/heart-disease-classification

Predicting heart disease using machine learning. This notebook looks into various python base ML and DS libraries in an attempt to build a machine learning model capable of predicting whether or not someone has heart disease based on their medical attributes.

data-cleaning data-visualization machine-learning matplotlib numpy pandas scikit-learn

Last synced: 01 May 2026

https://github.com/smusab9152/pokemon_data_analysis

This repo that explores and analyzes a dataset of Pokémon attributes. The analysis includes data cleaning, exploratory data analysis (EDA), and visualizations .

analytics data-analysis data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas pokemon python seaborn statistical-analysis

Last synced: 02 May 2026

https://github.com/pradipece/interactive_analytics_data_automation

This project allows users to control complex dataset processing with Python programming by merging Streamlit with Plotly for visualizations.

automation data-science data-visualization pandas plotly streamlit

Last synced: 01 May 2026

https://github.com/pfund-software-ltd/pfund-plot

A plug-and-play plotting library designed for traders. No Learning curve. Supports real-time plots. Display in Jupyter/Marimo notebook, browser or desktop

data-visualization financial-charts financial-dashboard plotting real-time-graph

Last synced: 10 Oct 2025

https://github.com/prashver/sales-insights-tableau

SQL analysis followed by a Tableau dashboard creation was used to uncover the reasons behind declining sales.

dashboard data-visualization sql tableau

Last synced: 01 Feb 2026

https://github.com/riddhis2226/titanic-survival-data-analysis

Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.

data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3

Last synced: 01 May 2026

https://github.com/as16082023/hotel-booking-analysis-eda-

Exploratory Data Analysis on hotel booking data using Python

data-analysis data-visualization exploratory-data-analysis jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/madeiradata/microsoft-data-analysts-club

Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts

data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report

Last synced: 19 Mar 2026

https://github.com/ryantusi/ai_ml_fundamentals

Welcome to my GitHub repository showcasing projects completed during the AI/ML Foundation course from Codecademy. This repository contains a collection of projects that demonstrate my skills and knowledge in the field of artificial intelligence and machine learning.

artificial-intelligence data-science data-visualization exploratory-data-analysis foundation machine-learning numpy pandas python

Last synced: 14 Apr 2026

https://github.com/alejo1630/ibm_capstone_project

This project aims to leverage predictive analytics to forecast the outcomes of rocket launches for Space Y, a new player in the commercial space industry.

data-collection data-science data-visualization data-wrangling exploratory-data-analysis machine-learning predictive-modeling python spacex

Last synced: 01 May 2026

https://github.com/as16082023/space-missions-dashboard

Created a Tableau dashboard to visualize the history of space missions.

dashboard data-visualization space-mission tableau

Last synced: 20 Mar 2026