An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/mehnaz2004/data-cleaning-casestudy

This repository demonstrates data cleaning with a layoffs dataset. It covers handling missing values, detecting outliers, and encoding categorical data, using visualizations like boxplots and distplots to enhance data quality. Check out the code to see these techniques in action.

categorical-data-encoding data-cleaning data-integrity data-visualization missing-value-handling outlier-detection-and-removal pandas seaborn sklearn

Last synced: 01 Mar 2026

https://github.com/nagar2nd/airbnb-property-management-optimization

This project aims to analyze Airbnb’s dataset to optimize rental strategies, enhance customer satisfaction, and maximize revenue for property owners. Using Tableau, the insights generated will help improve decision-making for both Airbnb and its hosts.

data-analysis data-visualization tableau

Last synced: 01 Mar 2026

https://github.com/larsgw/nederlands-kruidkundig-archief

Biodiversity data of the Nederlands Kruidkundig Archief

botany data-visualization vegetation

Last synced: 19 Mar 2026

https://github.com/virajbhutada/excel-reports

A curated showcase of interactive dashboards created using MS Excel, designed to transform raw data into clear, actionable insights. This collection demonstrates advanced Excel visualization techniques tailored for data-driven storytelling.

analytics business-intelligence dashboards data-cleaning data-science data-visualization ecommerce msexcel retail

Last synced: 01 Mar 2026

https://github.com/living-with-machines/accidents-interactive

This is the “accidents interactive” for the Living with Machines exhibit at Leeds City Museum 2022–23.

accidents-analysis data-visualization industrial-revolution museum museum-experience museum-installation

Last synced: 20 Mar 2026

https://github.com/5hraddha/sda-megaline-the-best-plan

Megaline is a telecom operator and it offers its clients two prepaid plans, Surf and Ultimate. The commercial department wants to know which of the plans brings in more revenue in order to adjust the advertising budget.

data-visualization hypothesis-testing matplotlib numpy pandas scipy seaborn statistical-data-analysis

Last synced: 16 Apr 2026

https://github.com/qc20/emojiadventure

EmojiAdventure: An interactive digital art experience that generates a dynamic, ever-changing landscape of emojis using Perlin noise algorithms. Explore a vibrant world where emojis come to life through mouse interaction and creative coding techniques.

algorithmic-art creative-coding data-visualization digital-art emoji font-art generative-art interaction-design interactive-art interactive-graphics interactive-visualizations javascript mouse-interaction p5js perlin-noise ux-design web-animation web-art web-art-is-great-art web-fonts

Last synced: 21 Mar 2026

https://github.com/chdl17/ipl_analysis_r

This GitHub repository contains R code for analyzing Indian Premier League (IPL) data. The repository also includes a detailed report explaining the results and insights gained from the analysis. It is a great resource for anyone interested in understanding the performance of teams and players in the IPL using data science tools.

analysis data-analysis-r data-visualization flexdashboard

Last synced: 02 Mar 2026

https://github.com/lisabensoussan/sampling-data-wrangling-and-visualization

This project focuses on simulating rollup profit strategies and analyzing data on notable female scientists using R. It includes tasks like simulation, data scraping from Wikipedia, and generating various visualizations.

data-visualization data-wrangling probability simulation statistical-analysis

Last synced: 19 Mar 2026

https://github.com/controldata23/foresight-institution

An exploratory data analysis (EDA) of an educational institution dataset.

data-visualization exploratory-data-analysis spreadsheets sql tableau

Last synced: 02 Mar 2026

https://github.com/bhaskarbharati/ibm-datascience-hands-on-lab

A basic hands-on exercise using Jupyter Notebook, completed while taking IBM's Tools for Data Science course.

data-analysis data-science data-visualization datawrangling eda machine-learning

Last synced: 23 Apr 2025

https://github.com/madhuresh2011/sales-analysis-dashboard-using-power-bi

This project demonstrates the use of Power BI’s advanced analytics capabilities to transform raw data into actionable insights, helping organizations make data-driven decisions to optimize their sales strategies.

analysis dashboards data-insights data-visualization excel-data-import power-bi power-query presentation

Last synced: 02 Mar 2026

https://github.com/ladaegorova18/data_analysis

Learning the basics of data analysis in Python

analytics data-analysis data-visualization steam-games

Last synced: 24 Jun 2026

https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy

This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.

data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit

Last synced: 19 Apr 2026

https://github.com/muthukumar0908/imdb_movie_analysis_with_powerbi

This project analyzes an IMDb movies dataset using Power BI.

data-analysis data-visualization powerbi

Last synced: 12 Jun 2025

https://github.com/danielrosehill/eco-ninja-3

Configuration for an LLM assistant that performs analysis on sustainability data

data-visualization prompt-engineering prompting sustainability

Last synced: 22 Feb 2026

https://github.com/mituskillologies/ds-ref-cdac-aug24

Program for a refresher course on data science conducted for CDAC officials at CDAC Headquarters, Pune, in August 2024.

data-science data-visualization machine-learning mysql python-programming r-programming sql

Last synced: 10 May 2026

https://github.com/borjamome/soho_cholera

Cholera deaths in the Soho District (London)

data-analysis data-visualization london r

Last synced: 04 Sep 2025

https://github.com/quantumudit/groceries-basket-analysis

This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.

data-analysis data-visualization pandas powerbi python

Last synced: 12 Apr 2026

https://github.com/xre22zax/biodiversity---national-parks

National Parks Service data about endangered species

data-analysis-python data-visualization ipynb python python3

Last synced: 04 Mar 2026

https://github.com/samuelson777/titanic-dataset-analysis

Exploratory data analysis of the Titanic dataset, uncovering insights on passenger survival rates based on gender, age, and class. Includes data cleaning, visualization, and findings.

data-analysis data-visualization exploratory-data-analysis kaggle machine-learning matplotlib pandas python seaborn titanic-dataset

Last synced: 16 Apr 2026

https://github.com/pdiegel/nc-parcel-search

A web application enhancing access to North Carolina's parcel data with advanced search capabilities and an interactive map interface.

arcgis data-visualization geospatial gis map north-carolina parcel-data react typescript vercel

Last synced: 16 Apr 2026

https://github.com/tejaswirupa/impact-of-workplace-stress-on-mental-health-conditions-of-employees

Studied how remote, hybrid, and onsite work affects employee stress and wellness. Engineered metrics to quantify fatigue and work-life balance, uncovering mental health trends across industries and roles.

data-visualization datascience exploratory-data-analysis feature-engineering

Last synced: 24 Jan 2026

https://github.com/ddeepanshu-997/datascience_marketing_campaign

In this repository, I apply data preprocessing techniques and look for useful insights in the dataset using various data science libraries along with a data visualization library.

data-insights data-science data-visualization data-visualization-project datacleaning insights libraries matplotlib numpy-arrays output pandas-dataframe prepr techniques visualization visualization-library

Last synced: 09 Sep 2025

https://github.com/albertofaraujo/pbi_dashboard_anp

Analysis of the average price of automotive fuel in Brazil over the course of 2022

data-visualization dax-studio power-query powerbi

Last synced: 06 Jan 2026

https://github.com/anshajk/covid-vaccinations

A repository to track the rate of covid vaccinations in India

covid-19 data-visualization streamlit

Last synced: 17 May 2026

https://github.com/ashwin331133/gorkha_earthquake_damage_prediction

The main objective is to predict the level of damage to buildings caused by the 2015 Gorkha earthquake in Nepal.

data-analysis data-visualization machine-learning python

Last synced: 29 Apr 2026

https://github.com/johannaschmidle/netflix-subscription-analysis

Examined Netflix subscription data to understand market behaviour, predict future trends, and identify consumer preferences. [SQL, Tableau]

data-analysis data-cleaning data-trend data-visualization netflix

Last synced: 05 Mar 2026

https://github.com/farmradiohangar/living-dashboards

Modern visualization tools for real-time data.

campaign data-visualization ict4d inclusive-innovation survey

Last synced: 05 Mar 2026

https://github.com/vasugi2003/customer_churn_analysis_using_tableau

Customer churn analysis to identify the reasons customers discontinue a company's services.

business-analytics business-intelligence charts csv data-science data-visualization dataanalytics predictive-modeling preprocessing tableau

Last synced: 05 Mar 2026

https://github.com/drkbluescience/ibm-datascience-spacex

In this project, we predict whether the Falcon 9 first stage will land successfully by following the data science methodology.

data-visualization data-wrangling machine-learning-algorithms sql-query sqlite webscraping-data

Last synced: 10 May 2026

https://github.com/amafjarkasi/nwjs-dashboard-template

🚀 Professional NW.js Desktop Dashboard Template - React 18 + TypeScript + Vite with Ant Design UI, dark/light themes, data visualization, CRUD operations, and cross-platform builds. Production-ready foundation for modern desktop applications.

ant-design cross-platform crud dark-mode dashboard data-visualization desktop-app dexie electron-alternative indexeddb modern-ui nwjs production-ready react recharts tailwindcss template typescript vite zustand

Last synced: 02 Apr 2026

https://github.com/minervarose/exoplanet-discovery-observatory

Interactive Tableau exploration of exoplanet discoveries, planetary systems, and discovery trends using NASA Exoplanet Archive data.

analytics astronomy business-intelligence dashboard data-visualization exoplanets nasa space tableau tableau-public

Last synced: 24 Jun 2026

https://github.com/raphael-ufrj/analise_algodao

Historical analysis of cotton planting, examining planting based on climate and historical data.

analysis data-science data-visualization dataset docker pandas provenance python python3 scikit-learn seaborn streamlit

Last synced: 02 Apr 2026

https://github.com/sayamalt/employee-attrition-prediction

Built a machine learning model that predicts whether an employee of a given company will leave in the near future, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 08 Oct 2025

https://github.com/theo-liang/python-project-analysis-for-instacart

This project involved analyzing Instacart's sales data to understand customer purchasing behaviors and optimize marketing strategies.

aggregation data-visualization datatypes deriving-new-variables merging-data pandas-dataframe python subsetting wrangling-data

Last synced: 17 Apr 2026

https://github.com/shridhar1504/tableau-visualization-viz.-project-

This repository contains visualization projects built with Tableau. The visualizations yield insights and strategies that help a business grow and improve its profit margins, and in some cases they offer social value by estimating the damage and intensity of calamities.

dashboards data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-dashboards tableau-public tableau-workbooks visualization

Last synced: 04 Feb 2026

https://github.com/alex-martineau/realisation_dashboard_veille_technique

Streamlit dashboard for explainable credit scoring, plus a comparative NLP technology watch of BERT vs. MiniLM for e-commerce product classification.

api bert credit-scoring dashboard data-visualization ecommerce explainability flask heroku machine-learning minilm nlp sentence-transformers shap streamlit

Last synced: 17 Apr 2026

https://github.com/Sparsh7082/Data-Analysis-Portfolio

This repository is dedicated to showcasing my skills, sharing projects, and tracking my progress in Data Analytics and Data Science.

canva data-analytics data-manipulation data-visualization database-querying google-slides power-bi power-point presentation-tools programming-language python r spreadsheets sql tableau

Last synced: 10 Mar 2025

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/vikpires/ds_tips-dataset

Individual project for the Avanti 2024.2 data science bootcamp, aiming to analyze and observe patterns in the "Tips" dataset.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips

Last synced: 17 Sep 2025

https://github.com/subho004/housepricepredictor

Bangalore House Price Predictor: A web app using Flask and scikit-learn to predict house prices in Bangalore based on location, area, bedrooms, and bathrooms.

bootstrap css data-science data-visualization flask github html lasso-regression linear-regression machine-learning model-deployment pickle predictive-modeling real-estate-housing-prices regression-analysis ridge-regression web-development

Last synced: 04 Apr 2026

https://github.com/prcharan592/social-media-sentiment-analysis

Social media sentiment analysis using tweets involves analyzing tweet data to determine public sentiment (positive, negative, or neutral) using natural language processing (NLP) and machine learning techniques.

data-visualization machine-learning matplotlib nlp nltk numpy pandas python3 sentiment-analysis spacy tweets

Last synced: 04 Apr 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/loosenthedark/ci_data-visualisation-dashboard-mini-project

Code Institute IFD Module demo project using D3.js, Crossfilter, dc.js & queue.js to leverage sample data relating to salary levels & participation in academia parsed by gender. Bootstrap-based theme.

bootstrap4 code-institute crossfilter css3 d3js data-visualisation data-visualization dcjs frontend html5 javascript queue svg

Last synced: 11 May 2026

https://github.com/albertofaraujo/pbi_data_travels

Improve understanding of the sales data of the company Data Travels to identify growth opportunities and optimize its marketing strategies.

data-visualization dax-studio power-query powerbi

Last synced: 26 Jan 2026

https://github.com/awanraskall/retail-demand-analysis

Data analysis of retail meal orders, fulfillment centers, and product demand using Python

data-analysis data-visualization jupyter-notebook numpy pandas python

Last synced: 18 Apr 2026

https://github.com/11noel11/chaos_nonchaos_predictor_nn

AI-powered chaos detection using Simple Harmonic Motion (SHM) & Double Pendulum examples! Compare a Neural Network (NN) with the Lyapunov exponent method to classify chaotic vs. non-chaotic systems. Features Deep Learning, SHAP explainability, F1-score, precision, recall, and stunning visualizations!

chaos-theory classification data-visualization deep-learning double-pendulum explainable-ai feature-importance keras lyapunov-exponent machine-learning neural-networks nonlinear-dynamics python scientific-computing shap simple-harmonic-motion tensorflow time-series-analysis

Last synced: 18 Apr 2026

https://github.com/leandrocollares/urbanization-versus-income

A responsive scatter plot that shows urban population percentages and GDP per capita in the Americas.

d3 data-visualization svelte

Last synced: 18 Apr 2026

https://github.com/nishanthmuruganantham/football-player-wages-eda

This repository uses Python for analyzing football player data, focusing on various aspects such as player positions, league distributions, wages, and the relationship between player age and appearances. It includes visualizations generated using Plotly to provide insights into the dynamics of football player demographics and performance.

data-analysis data-science data-visualization eda football football-analytics football-data kaggle kaggle-dataset pandas plotly python

Last synced: 18 Apr 2026

https://github.com/hannahgsimon/halmodeling2024graphs

Created code to develop and analyze statistical graphs for the spatial radiotherapy model, which can be found at https://github.com/hannahgsimon/HALModeling2024. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/yashrajgithub/crop-recommendation

KrishiGyaan is a web app designed to help farmers make informed decisions on crop selection. By analyzing soil and environmental factors, the app provides personalized crop recommendations, enhancing agricultural productivity and promoting sustainable farming practices.

api artificial-intelligence crop-recommendation-system data-preprocessing data-visualization json machine-learning-algorithms pickle python random-forest-classifier scikit-learn streamlit supervised-learning train-test-split user-interface

Last synced: 05 Apr 2026

https://github.com/timjjting/escaping-flatland

A demo of optimization techniques for plotting large datasets in a 3D space using Three.js + Svelte integration

big-data data-visualization glsl-shaders lod octree svelte sveltekit three-js

Last synced: 11 May 2026

https://github.com/nowon1/insurance-claim-prediction_version

This project aims to predict the insurance claim amounts based on various customer attributes using machine learning techniques. The project involves data preprocessing, exploratory data analysis, feature engineering, and model training and evaluation.

data-preprocessing data-science data-visualization exploratory-data-analysis feature-engineering insurance jupyter-notebook machine-learning numpy pandas predictive-modeling python random-forest regression-analysis scikit-learn

Last synced: 05 Apr 2026

https://github.com/satti-hari-krishna-reddy/data-whisperer

Data Whisperer is an AI-driven tool that automates exploratory data analysis (EDA), generates actionable insights, and enables natural language querying of datasets. It combines the power of AI (Google Gemini) with interactive visualizations and professional reporting.

ai data-analysis data-visualization llm python3 streamlit

Last synced: 18 Apr 2026

https://github.com/kwokhing/visualizing-datasets-with-facets

Demo on using Facets: An Open Source Visualization Tool for Machine Learning Training Data developed by Google's PAIR Initiative

anaconda data-analysis data-visualization facets jupyter-notebook missing-data open-source python skewness unbalanced-data visualisation visualization

Last synced: 18 Apr 2026

https://github.com/rahuls-1106/dataspark

DataSpark is a powerful analytics project transforming raw retail data into actionable insights for Global Electronics. By leveraging Python, SQL, and interactive visualizations, it uncovers trends in customer behavior, sales performance, and product popularity, driving smarter business decisions and boosting growth.

data-visualization jupyter-notebook matplotlib numpy pandas-dataframe powerbi python seaborn sql

Last synced: 18 Apr 2026

https://github.com/szuzick/hr-analytics-pipeline

End-to-end HR analytics solution using PostgreSQL, dbt, and Power BI

data-analysis data-visualization database-maintenance dbt hr-analytics insights postgresql powerbi sql

Last synced: 10 Jun 2026

https://github.com/prakashjha1/loan-eligibility-prediction

This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.

data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest

Last synced: 19 Apr 2026

https://github.com/shyamkumarnagilla/ai-powered-forecasting-for-agricultural-productivity

AI Powered Forecasting for Agricultural Productivity is a project that utilizes machine learning to predict crop yields and optimize farming practices. By harnessing historical and real-time data, this model empowers farmers with data-driven insights to enhance productivity and sustainability in agriculture.

data-analysis data-visualization deep-learning flask neural-network

Last synced: 19 Apr 2026

https://github.com/patricoferris/data_visualisations

Playing around with Data and Web Graphics

data-visualization javascript

Last synced: 10 Jun 2026

https://github.com/vyjayanthipolapragada/data_analytics_medical_appointments

Analyzes a dataset of medical appointments to draw insights about patient no-shows.

data-analysis data-analytics data-cleaning data-visualization data-wrangling jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 19 Apr 2026

https://github.com/mlucifer27/bilateral-visualization

A Streamlit app that visualizes bilateral relationship scores between 100 countries from 1945 to 2024. It supports interactive heatmaps, network graphs, pairwise comparisons, and more.

d3blocks data-analysis data-visualization plotly-python python streamlit

Last synced: 04 Jun 2026

https://github.com/itskshitija/tesla-stock-price-prediction

Welcome to the Tesla Stock Price Forecasting project, where we delve into time-series analysis to predict stock price trends for one of the world's most innovative companies—Tesla Inc.

data-visualization eda python time-series-analysis

Last synced: 29 Jun 2026

https://github.com/nikolaos-mavromatis/etf-data-analysis-dashboard

Insights into SPY ETF performance with an interactive Streamlit dashboard powered by Alpha Vantage data.

api data-analysis data-visualization financial-analysis pandas plotly python streamlit

Last synced: 20 Apr 2026

https://github.com/namratha2301/carprice_analysisandprediction

This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.

data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn

Last synced: 20 Apr 2026

https://github.com/anjaliwork20/moodify

Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists, and playlists using machine learning.

artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs

Last synced: 20 Apr 2026

https://github.com/tryomar/data-miner

DataMiner is an interactive web application for data mining and machine learning. It helps users upload, clean, transform, and analyze datasets while building predictive models — all through a simple and powerful Streamlit interface.

data-cleaning data-mining data-preprocessing data-science data-visualization interactive-dashboards pandas python scikit-learn streamlit

Last synced: 20 Apr 2026

https://github.com/hrosicka/czechpopulationestimation

This GitHub repository contains Python code for data analysis and population prediction in the Czech Republic up to the year 2050. The code is written in Python and utilizes the Pandas and Matplotlib libraries.

data-analysis data-visualization matplotlib matplotlib-figures matplotlib-pyplot pandas pandas-dataframe pandas-library pandas-python python python3

Last synced: 11 May 2026

https://github.com/chris-santiago/ferrum

Ferrum is a statistical visualization library for Python: a grammar-first charting system that unifies exploratory plots, statistical graphics, interactive views, and model diagnostics, backed by a Rust engine. Built in 10 days by one human and an agentic Claude framework.

altair data-visualization ggplot2 grammar-of-graphics machine-learning matplotlib plotly plotnine plotting python rust seaborn statistical-graphics

Last synced: 05 Jun 2026

https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021

In this project, data on Hollywood film production companies from 1995 to 2021 were examined. Tables and graphs were created using data visualization techniques, with tickets sold divided into categories.

data data-analysis data-science data-visualization

Last synced: 23 Mar 2025

https://github.com/heremaps/data-lens-javascript-examples

Self-contained JavaScript examples for HERE Data Lens.

data-lens data-visualization examples interactive js

Last synced: 21 Apr 2026