An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/itrauco/data-dirtying-tool

a simple command line tool to generate dirty data and do common data things in google cloud

data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning

Last synced: 24 Feb 2025

https://github.com/shubham200137/cyclistic-case-study

This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.

data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau

Last synced: 11 Jun 2026

https://github.com/mr-chang95/webpage_abtest_analysis_udacity

A/B Testing Project for Udacity's Data Analyst Nanodegree Program. Using Python in Jupyter Notebook.

abtesting data-science data-visualization matplotlib pandas python webpage

Last synced: 11 Apr 2026

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 27 Apr 2026

https://github.com/azaz9026/myntra_review_project

Myntra Scraper Project Project Overview: The Myntra Scraper Project is designed to extract product data from the Myntra website. This tool enables users to gather information such as product names, prices, descriptions, ratings, and images for analysis, comparison, or personal use.

data-science data-structures data-visualization filesystem github mogodb mogoose python3 strreamlit web-scraping

Last synced: 10 Apr 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil

Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.

data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn

Last synced: 30 Mar 2025

https://github.com/albertofaraujo/pbi_dashboard_prouni

Analisar os dados referentes ao detalhamento quantitativo das bolsas PROUNI concedidas no ano de 2021.

data-visualization dax-studio power-query powerbi

Last synced: 03 Feb 2026

https://github.com/jatin-s16/hr_mysql_powerbi

This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.

data-analysis data-science data-visualization mysql powerbi

Last synced: 29 May 2026

https://github.com/haonamnguyen/costumer-shopping-trends-analysis

This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.

data-analysis data-visualization sql tableau

Last synced: 05 Jan 2026

https://github.com/khushi-sabarad/adinsights_dashboard

AdInsights Dashboard: An interactive web dashboard built with Python (Flask, Pandas, Plotly) to visualize and analyze digital advertising performance. Allows filtering by gender, ad type, and location for detailed insights

ad-performance advertising dashboard data-analysis data-visualization flask pandas plotly python web-application

Last synced: 01 May 2026

https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard

Createed an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics—including revenue, profit, orders, and return rates—across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.

business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis

Last synced: 05 Jan 2026

https://github.com/jleung51/visualizations

Javascript & D3.js visualizations of data.

d3js data-visualization javascript

Last synced: 27 Mar 2025

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/franloza/contratosdemadrid

This project is an interactive web application for exploring and analyzing public contracts in the Community of Madrid. It allows users to search for companies and view their contract details, aiming to promote transparency and facilitate access to public information.

data-visualization duckdb evidence open-data

Last synced: 23 Jun 2026

https://github.com/pat8901/diskanalyzer-cli

Processes a pdf file holding storage utilization data to automatically create graph visualizations revealing the true demographics hidden in large data.

data-visualization graphs-generation matplotlib

Last synced: 27 Dec 2025

https://github.com/sayamalt/taxi-trip-fare-prediction

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 09 Nov 2025

https://github.com/sayamalt/customer-churn-prediction

Successfully established a machine learning model which can predict whether any given customer currently utilizing the products and services offered by a company will churn at anytime in the future or not, depending upon a set of unique features/characteristics pertaining to that specific individual, to a great level of accuracy.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 09 Nov 2025

https://github.com/tolumie/exploratory-data-analytics-projects

Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.

data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis

Last synced: 11 Apr 2026

https://github.com/debjyotisaha/power-bi-projects-phase-1

Portfolio projects related to data visualisation in Power BI

data-analysis data-visualization dax-expression powerbi powerquery

Last synced: 18 Jan 2026

https://github.com/sehaj003/telco-churn-analysis

This repository contains files (dataset and Jupyter codebooks) for a project aimed to build machine learning models to predict customer churn based on given parameters.

data-science data-visualization exploratory-data-analysis machine-learning machine-learning-models predictive-modeling principal-component-analysis python

Last synced: 20 May 2026

https://github.com/ledsouza/dataviz_vendas_lojas

Projeto de visualização de dados para realizar comparações e distribuições

data-science data-visualization matplotlib pandas pyplot python seaborn vitrinedev

Last synced: 11 Apr 2026

https://github.com/corndogit/dataspaceart

A generative art project which generates stylized patterns from weather data

data-visualization python weather

Last synced: 06 Oct 2025

https://github.com/37743/ml-starterkit

This project is designed to streamline the data science workflow by allowing users to input a file path and receive a comprehensive analysis tailored to their needs. The tool automates key stages in the data science pipeline, including Exploratory Data Analysis (EDA), data preprocessing, and model training.

data-preprocessing data-visualization exploratory-data-analysis machine-learning python

Last synced: 07 Apr 2025

https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python

In this project, earthquakes in Turkey from 1915 to 2021 were analyzed. The data taken from the data set, which you can find in the repo, was first organized using data cleaning algorithms. Afterwards, these cleaned data were printed out as graphics and animation using data visualization algorithms.

data-analysis data-cleaning data-visualization jupyter-notebook

Last synced: 23 Mar 2025

https://github.com/BAMresearch/SOFA

SOftware for Force Analysis - A graphical user interface to analyze Atomic Force Microscopy Force Spectroscopy data

atomic-force-microscopy data-science data-visualization

Last synced: 16 Jan 2026

https://github.com/steviecurran/dashboards

Compilation of Links to the dashboards in the other repositories

dashboard data-analysis data-science data-visualization pandas powerbi python-dash tableau

Last synced: 21 Feb 2026

https://github.com/walid0912/rfm_analysis

RFM Analysis is employed to comprehend and categorize customers according to their purchasing patterns. RFM, an acronym for recency, frequency, and monetary value, comprises three essential metrics that offer insights into customer involvement, allegiance, and significance to a business.

data-analysis data-visualization python rfm-analysis

Last synced: 02 Sep 2025

https://github.com/ascender1729/sentitweet

SentiTweet: Advanced sentiment analysis tool using AWS Comprehend and TextBlob. Analyze text sentiment via CLI or web interface with visualizations.

aws-comprehend cli-tool data-visualization machine-learning natural-language-processing python sentiment-analysis text-analysis textblob web-application

Last synced: 31 Mar 2025

https://github.com/shubhamdeepkeshav/visualization-on-tips

📊 Data visualization project analyzing tipping behavior in restaurants using Python. 🍽️ Explores insights based on ⏰ time, 👥 party size, 🧑‍🤝‍🧑 gender, and 🚬 smoker status with Matplotlib and Seaborn.

data-visualization dataanalysis eda matplotlib python seaborn

Last synced: 08 Apr 2025

https://github.com/SebastianUrdaneguiBisalaya/datathon-expresate-peru-con-datos

Contiene la información de los 2 proyectos de análisis y ciencia de datos presentados en la Datatón de la Secretaría de Gobierno y Transformación Digital del Perú.

data-visualization jupyter-notebook machine-learning npl-data python

Last synced: 18 Jan 2026

https://github.com/ginalamp/covid_dashboard_twitternews

Corona Dashboard & report based on Twitter media outlet news.

dashboard data-analysis data-visualization twitter

Last synced: 28 Jan 2026

https://github.com/garcane/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 10 May 2026

https://github.com/mikeesto/ausvotes19

:bird: A collection of 67,284 public tweets published on the night of the 2019 Australian election

australia data-analysis data-visualization elections open-data twitter

Last synced: 06 Apr 2025

https://github.com/manel15279/datamining-project

A university project that aims to explore various data mining techniques like Data Exploration, Association Rule Mining, Supervised and Unsupervised Learning, applied to real-world datasets, focusing on soil fertility analysis and COVID-19 cases evolution over time.

covid-19 data-analysis data-mining data-visualization datascience gradio machine-learning python soil-properties

Last synced: 10 Jun 2025

https://github.com/srking501/futurelearn_mooc

A summative coursework for CSC8631 Data Management and Exploratory Data Analysis

crisp-dm data-mining data-preprocessing data-science data-visualization deployment eda exploratory-data-analysis

Last synced: 23 Mar 2025

https://github.com/thenazar9/user-behavior-email-campaign-analysis-sql

Analysis of user behavior and email campaign performance using BigQuery and Looker Studio, focusing on account creation trends, email engagement, and user segmentation.

analytics bigquery data-analysis data-visualization etl looker-studio sql structured-query-language

Last synced: 16 Oct 2025

https://github.com/wai-l/dataviz

Data visualisation with Excel, R and other tools.

data-visualisation data-visualization data-viz datavisualisation datavisualization excel ggplot2

Last synced: 05 Jan 2026

https://github.com/katiesaund/tidy_tuesday

A weekly data project in R from the R4DS online learning community

data-analysis data-visualization datascience plot r rstats tidytuesday

Last synced: 24 Mar 2025

https://github.com/mzprog/datatables

create a better tables with few lines of code.

data-visualization datatables laravel-package livewire

Last synced: 27 Apr 2026

https://github.com/binjewarkunal/top-10-worlds-largest-economy-analysis

Top 10 Largest Economies by GDP and GDP Per Capita. Data Collection: Forbes India, Data Visualization: Created a Horizontal Bar Chart and Documention.

data-visualization datacollection microsoft-excel notion

Last synced: 27 Mar 2026

https://github.com/jigyasag18/aircraft-data-management

This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat

aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql

Last synced: 04 Feb 2026

https://github.com/balajimohan18/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.

classification data-analysis data-cleaning data-science data-visualization loan-prediction loan-status machine-learning sql supervised-learning

Last synced: 03 Sep 2025

https://github.com/misaghmomenib/shop-revenue-analysis

A Data Analysis Project Aimed at Analyzing and Forecasting Shop Revenue Based on Sales and Other Business Metrics. It Helps to Identify Trends, Patterns, and Key Factors Influencing Revenue to Make Data-driven Decisions for Business Growth.

data-analysis data-visualization python

Last synced: 24 Mar 2025

https://github.com/analysisbyvivek/road-accident

Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.

data-analysis data-visualization eda jupyter-notebook kaggle tableau-public

Last synced: 19 Jun 2026

https://github.com/samaalharbi2/project-data-science-blog-post

A data science project from Udacity’s Nanodegree — exploring what drives developer success

crisp-dm data-analysis data-science data-visualization nanodegree udacity

Last synced: 26 Jan 2026

https://github.com/codesaadumair/exploratory-data-analysis

A centralized repository showcasing various Exploratory Data Analysis (EDA) projects using Jupyter notebooks, visualizations, and accompanying documentation.

data-analysis data-science data-visualization eda jupyter-notebook jupyterlab python

Last synced: 24 Mar 2025

https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021

In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.

data data-analysis data-science data-visualization

Last synced: 23 Mar 2025

https://github.com/hannahgsimon/halmodeling2024graphs

Created code to develop and analyze statistical graphs for the spatial radiotherapy model, which can be found at https://github.com/hannahgsimon/HALModeling2024. This project was in association with the Cleveland Clinic Lerner Research Institute, Jacob Scott Lab.

agent-based-model bifurcation-analysis cancer-models computational-biology data-visualization hybrid-automata immune-response mathematical-modelling ordinary-differential-equations radiation-therapy spatial-model statistics systems-biology

Last synced: 23 Nov 2025

https://github.com/mindlessmuse666/train-test-splitter

Анализ данных о пассажирах Титаника и разбиение на обучающую и тестовую выборки. Практическое задание по дисциплине "Основы применения методов искусственного интеллекта в программировании".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 12 Apr 2026

https://github.com/arthurdanjou/studies

💼 This is the repository containing all my projects done during my studies in Python and R.

ai data data-science data-visualization jupyter jupyter-notebook ml python r

Last synced: 08 Apr 2025

https://github.com/prgermux/image-scrapper

"Image Scrapper" is a Python application that recursively scrapes images from directories and displays them on an interactive, zoomable, and scrollable canvas. Ideal for organizing and navigating large image datasets.

data-visualization desktop-application file-explorer graphics-view gui-tool image-organization image-processing image-scraper image-viewer interactive-visualization pyqt5 python recursive-directory zoom-and-pan

Last synced: 24 Mar 2025

https://github.com/eubrunoo/beer-consumption-predictor

An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.

data-analysis data-science data-visualization machine-learning r statistical-analysis statistics

Last synced: 02 Apr 2025

https://github.com/vbhatsaccnt/softdrinktrendsanalysis

A Tableau dashboard project providing comprehensive insights into soft drink sales trends, allowing for detailed analysis and informed decision-making within the beverage industry.

dashboard data-visualization food-products marketing tableau trend-analysis

Last synced: 01 Mar 2026

https://github.com/living-with-machines/machines-interactive

This is the “machines interactive” for the Living with Machines exhibit at Leeds City Museum 2022–23.

data-visualization history-of-technology industrial-revolution machines museum museum-experience museum-installation

Last synced: 20 Jan 2026

https://github.com/manikantasanjay/-spatial-analysis-of-temperature-trends

"Global Climate Insights" offers a deep dive into global temperature trends, using spatial analysis and interactive visualizations to highlight the impacts of climate change.

data-visualization folium geospatial matplotlib plotly spatial-analysis temperature-trends

Last synced: 04 Jul 2025

https://github.com/leandrocollares/infant-mortality-in-africa

An interactive choropleth map that shows infant mortality rates in Africa between 1960 and 2018

d3 data-visualization react

Last synced: 07 May 2026

https://github.com/davidchocholaty/bithack_hackathon_2024

This repository contains my personal code tasks for the BIT_Hack hackathon, created in 2024.

data-mining data-science data-visualization exploratory-data-analysis hackaton hackaton-project machine-learning

Last synced: 06 May 2026

https://github.com/mahmoudnamnam/superstore-analysis

This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.

clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn

Last synced: 12 Apr 2026

https://github.com/mostafa-ghorab/global-happiness-analysis

An analysis of global happiness rankings based on various factors like GDP, family support, health, and freedom from the World Happiness Report (2015-2017). This project provides data visualizations and statistical insights into how these factors influence happiness scores in different regions.

business-analysis data-analysis data-visualization matplotlib numpy pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/filip-kustura/python-covid-19-behaviors-analysis

Using Jupyter Notebook, this university project analyzes attitudes and behaviors related to the COVID-19 pandemic using a two-year survey from Imperial College London and YouGov research company. Utilizing Pandas, NumPy and Matplotlib, the data analysis focuses on three countries, exploring trends and insights throughout the pandemic.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python university-project

Last synced: 12 Apr 2026

https://github.com/quocduyenanhnguyen/roi-modeling-and-analysis-of-sports-dataset

In this project, you will find my ROI model for retirement savings and PowerPoint presentation of my ROI model, as well as my data analysis/visualization of Sports Ticket Sales dataset that I concluded with a PDF group written report

data-analysis data-visualization microsoft-excel rate-of-return-modeling sports-ticket-sales-dataset

Last synced: 08 Feb 2026

https://github.com/ddeepanshu-997/datascience_marketing_campaign

In this repository i am going to perform data preprocessing techniques and try to findout some useful insights using the various datascience libraries along with data visualisation library to get the precise outputs on the dataset

data-insights data-science data-visualization data-visualization-project datacleaning insights libraries matplotlib numpy-arrays output pandas-dataframe prepr techniques visualization visualization-library

Last synced: 09 Sep 2025

https://github.com/tanaybhadula/twitter-trends-dashboard

An interactive dashboard to visualizes data on current Twitter trends by country and globally. Collects data of over 60 countries using the python Tweepy library, processed it,and visualized it in the form of bar chart and pie chart using the Plotly Dash framework.

dash dashboard data-analysis data-visualization plotly python trends twitter

Last synced: 31 May 2026

https://github.com/aninditaws/investly

Investly: A personal finance platform for young investors, offering tailored portfolio recommendations by integrating user risk profiles, real-time market data, and optimization algorithms.

api-integration data-visualization goal-based-allocation react-frontend supabase-backend

Last synced: 01 Apr 2025

https://github.com/borjamome/soho_cholera

Cholera deaths in the Soho District (London)

data-analysis data-visualization london r

Last synced: 04 Sep 2025

https://github.com/devanshsahu47/hr-dashboard-mysql-powerbi

A comprehensive HR dashboard that visualizes key workforce metrics such as employee demographics, attrition rates, and performance trends. Built using Power BI/Excel, it enables data-driven HR decision-making with interactive charts and KPIs.

data-analytics data-visualization excel power-bi

Last synced: 04 Feb 2026

https://github.com/smpotts/sp500_index_analysis

Uses the Plotly Dash framework to visualize publicly available data for companies listed on the S&P 500 index

dash-plotly data-visualization financial-analysis pandas-dataframe python

Last synced: 01 Apr 2025

https://github.com/shrutiijoshi/e-commerce

The dataset contains various attributes related to orders, customers, and products, providing a comprehensive view of the sales process.

analysis data-visualization tableau-public visualization

Last synced: 07 Jan 2026

https://github.com/zeroxjackson/trendviz

A data visualization tool for Twitter trends in the United States.

data-visualization twitter

Last synced: 01 Apr 2025

https://github.com/saba-gul/google_data_analystics_belabeat_fitness_capstone_project

This project focuses on leveraging Fitbit user data to derive valuable insights and facilitate data-driven decision-making for Bellabeat, a leading wellness company. The objective is to harness the wealth of information captured by Fitbit devices to enhance the wellness offerings provided by Bellabeat.

bellabeat-case-study bellabeat-eda data-analytics data-visualization fitbit google-casestudy

Last synced: 08 Jun 2026

https://github.com/kirby-b/assorted-r-files

Mainly files from learning to use datasets and do data analysis with R

barchart data-visualization r-language r-programming

Last synced: 25 Mar 2025

https://github.com/erabossid/d3js-treemap

Data visualization with D3js Treemap

d3js data-science data-visualisation data-visualization reactjs

Last synced: 10 Mar 2025

https://github.com/ricardo-melo-martins/docker

⚡ RMM ⚡:: 🐳 docker with database for fun development

data-visualization database datascience docker mysql postgres sakila sakila-database sqlite

Last synced: 12 Apr 2026

https://github.com/bhaskarbharati/ibm-datascience-hands-on-lab

This is the basic hands-on exercise using Jupyter Notebook. This lab is done in the process of learning course Tools For Data Science | IBM

data-analysis data-science data-visualization datawrangling eda machine-learning

Last synced: 23 Apr 2025

https://github.com/kathyreid/geelong-council-elections-2017

Chord diagram of distributed preferences based on Victorian Electoral Commission data

chord-diagram d3js data-visualization

Last synced: 13 Mar 2025

https://github.com/teja-1403/game-of-thrones-analysis

Demonstrate Exploratory Data Analysis on GOT Dataset using plots and graphs and using the information extracted from text.

analysis data-visualization datascience machine-learning python

Last synced: 12 Apr 2026