An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/gvatsal60/ds-on-kaggle

A collection of data science projects, experiments, and insights from Kaggle competitions and datasets

data data-science data-visualization numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/machinelearningzuu/data-engineering-projects

This repository is a curated collection of projects and tools that exemplify best practices in data engineering. It serves as a resource for data professionals seeking to enhance their data infrastructure, optimize data pipelines, and implement cutting-edge data processing techniques.

airflow bigquery data-engineering data-science data-visualization data-warehouse

Last synced: 30 Apr 2026

https://github.com/tashi-2004/global-ecommerce-retail-trends-analysis

The Global E-commerce & Retail Analysis project involves data preprocessing, dimensionality reduction with PCA, CLV calculation, and What-If analysis. Key insights include effective PCA for data reduction, detailed CLV analysis across segments, and the impact of pricing strategies on sales.

boxplot clv-analysis data-science data-visualization dataintegration deep-learning dimensionality-reduction ecommerce heatmap machine-learning normalization outlier-detection outlier-removal pca-analysis preprocessing python scatter-plot whatif-analysis

Last synced: 30 Apr 2026

https://github.com/shridhar1504/rafik-s-kitchen-data-analysis

This project analyzes the sales and expense data of a well-known fast-food restaurant. It focuses on gaining insights that will boost future sales and on business strategies to improve profit margins. Tools used: SQL, Python, Power BI, and MS Office tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 10 May 2026

https://github.com/mattsebastianh/Make-the-Other-Charts.-Silly-s-Ice-Cream-Shop-Project

Data Visualization with Matplotlib | Matplotlib Fundamentals | Silly's Ice Cream Shop Project

data-visualization matplotlib python

Last synced: 18 Jun 2026

https://github.com/tolumie/aviva-insurance-statistics-hypothesis-abtesting-modelling

This project explores the impact of demographic and lifestyle factors on insurance charges using statistical hypothesis testing (ANOVA, Chi-Square, T-tests) and predictive modeling (Elastic Net, Random Forest, Gradient Boosting). The analysis is deployed using Streamlit.

anova chi-square-test data-visualization eda gradient-boosting hypothesis-testing insurance-dataset machine-learning predictive-modeling python random-forest statistical-analysis streamlit

Last synced: 30 Apr 2026

https://github.com/knutsynstad/tic-tac-toe-poster

Visualizing the 765 unique gameboards of tic-tac-toe as a matrix, using React and create-react-app, for a solution space poster.

data-visualization matrix react svg visualization

Last synced: 10 May 2026
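
The 765 figure in the entry above can be reproduced by brute force. The sketch below is not taken from the repository (which is built with React); it is a minimal Python illustration under stated assumptions: X moves first, play stops at a win or a full board, and boards that differ only by rotation or reflection count as one. All function names are hypothetical.

```python
# Count tic-tac-toe boards reachable in legal play, then merge boards
# that are rotations or reflections of each other.
# Assumptions (not from the repository): X moves first, and play stops
# as soon as a player has three in a row or the board is full.

WIN_LINES = [
    (0, 1, 2), (3, 4, 5), (6, 7, 8),  # rows
    (0, 3, 6), (1, 4, 7), (2, 5, 8),  # columns
    (0, 4, 8), (2, 4, 6),             # diagonals
]

# Index maps: new_board[i] = board[perm[i]]
ROTATE_90 = (6, 3, 0, 7, 4, 1, 8, 5, 2)  # quarter turn clockwise
MIRROR = (2, 1, 0, 5, 4, 3, 8, 7, 6)     # left-right reflection


def winner(board):
    """Return 'X' or 'O' if that player has three in a row, else None."""
    for a, b, c in WIN_LINES:
        if board[a] != " " and board[a] == board[b] == board[c]:
            return board[a]
    return None


def transform(board, perm):
    """Apply an index permutation to a 9-character board string."""
    return "".join(board[i] for i in perm)


def canonical(board):
    """Return the smallest of the 8 symmetric variants of a board."""
    variants = []
    current = board
    for _ in range(4):
        variants.append(current)
        variants.append(transform(current, MIRROR))
        current = transform(current, ROTATE_90)
    return min(variants)


def reachable_boards():
    """Enumerate every board that can occur in a legal game."""
    seen = set()
    stack = [" " * 9]
    while stack:
        board = stack.pop()
        if board in seen:
            continue
        seen.add(board)
        if winner(board) or " " not in board:
            continue  # game over: no further moves from this board
        player = "X" if board.count("X") == board.count("O") else "O"
        for i in range(9):
            if board[i] == " ":
                stack.append(board[:i] + player + board[i + 1:])
    return seen


if __name__ == "__main__":
    boards = reachable_boards()
    unique = {canonical(b) for b in boards}
    print(len(boards), "reachable boards")
    print(len(unique), "boards after merging symmetric ones")
```

Under these assumptions the script should report 5,478 reachable boards and 765 after merging symmetric ones, both counts including the empty board.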

https://github.com/kashirin-alex/thither.direct-onamove

An Android skeleton example application for using data from the Thither.Direct platform in mobile applications

android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management

Last synced: 27 Apr 2026

https://github.com/01110011011101010110010001101111/tigercosmosbootstrapdash

Sample Repository for Visualising TigerGraph Data with Cosmos in a Bootstrap Dashboard

bootstrap cosmos data-visualization graph-visualization tigergraph

Last synced: 30 Apr 2026

https://github.com/abhi227070/ipl-2024-sold-player-data-analysis

This project analyzes IPL 2024 auctioned players' data, including name, team, cricket type, nationality, and price. Users input a player's name to access team, style, nationality, and auction price, aiding research and fantasy leagues. It offers insights into player dynamics, serving cricket enthusiasts with comprehensive data exploration.

data-analysis data-visualization dataanalytics machine-learning machine-learning-algorithms python3

Last synced: 30 Apr 2026

https://github.com/amitkaps/vaccines

India COVID Vaccines Status Visualisation

data-visualization

Last synced: 25 Jan 2026

https://github.com/caesaredia/la-cafe-market-analysis

A data-driven feasibility study exploring the potential of launching a robot-staffed café in Los Angeles, based on real F&B business data.

business-intelligence cafe data-analysis data-visualization food-industry franchise los-angeles market-research pandas python

Last synced: 01 May 2026

https://github.com/tashi-2004/data-visualization-tableau-traffic-collision-insights

Analysis of traffic collision data using Tableau, featuring interactive visualizations that highlight trends in injuries and fatalities, contributing factors, and geographic distributions. It includes various sheets and dashboards, with recommendations for enhancing road safety. The dataset is available for further exploration.

data-analysis data-visualization eda geospatial-analysis machine-learning predictive-modeling statistics tableau traffic-analysis

Last synced: 19 Mar 2026

https://github.com/gui-sitton/timeseries-taxi

To attract more drivers during peak hours, we need to predict the number of cab requests for the next hour. Build a model for this prediction.

data-science data-visualization machine-learning ml python time-series time-series-analysis time-series-prediction

Last synced: 20 May 2026

https://github.com/fbarffmann/project1

Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.

box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/treyhamilton/baccarat-betting-simulator

Streamlit app simulating a baccarat betting strategy with full results visualization and CSV output.

baccarat betting data-visualization gambling machine-learning simulator streamlit

Last synced: 01 May 2026

https://github.com/shubhamdeepkeshav/visualization-on-tips

📊 Data visualization project analyzing tipping behavior in restaurants using Python. 🍽️ Explores insights based on ⏰ time, 👥 party size, 🧑‍🤝‍🧑 gender, and 🚬 smoker status with Matplotlib and Seaborn.

data-visualization dataanalysis eda matplotlib python seaborn

Last synced: 08 Apr 2025

https://github.com/ezrahsieh/academicdatabasedashboard

The Academic Faculty and Research Insight Dashboard utilizes SQL and NoSQL databases and is designed to support academic institutions, research departments, and individual students by providing comprehensive insights into faculty members and their research activities.

dashboard data-visualization database-management mongodb mysql neo4j sql

Last synced: 21 Feb 2026

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/rajatdiptabiswas/iris-flower-dataset

🌸 Trying out data visualization and data science on the iris flower dataset

data-science data-visualization iris iris-dataset

Last synced: 15 Mar 2025

https://github.com/rios0rios0/investmate

Go-based application designed to scrape and analyze ETF (Exchange-Traded Fund) data, focusing on dividend cash amounts, average closing prices, and dividend yields over a specified number of years. The application uses the colly library for web scraping and the tablewriter library for displaying the data in a formatted table.

crawling data-visualization etf-investments financial-analysis golang

Last synced: 23 May 2026

https://github.com/guptakushal03/whatsapp-chat-analyser

The WhatsApp Chat Analyzer is a Python-based tool built with Streamlit for analyzing WhatsApp chat data. It provides insights such as total messages, word count, media shared, links shared, monthly activity timeline, most active users, activity maps, and word clouds.

chat-analysis data-analysis data-visualization python streamlit text-processing whatsapp word-cloud

Last synced: 01 May 2026

https://github.com/ishmal793/dashboard-cms-

Streamlit dashboard (content management system) with both compliance and violation options

data-visualization streamlit streamlit-dashboard

Last synced: 02 Sep 2025

https://github.com/cluzier/crypto-price-dashboard

Shows current crypto prices and trade history

charts cryptocurrency data-visualization

Last synced: 13 Oct 2025

https://github.com/manisharora96/instagram-reach-analysis

This project provides a detailed approach to analyzing Instagram reach and engagement metrics. By leveraging the code and tools shared here, you can gain valuable insights into your Instagram content's performance and optimize your strategy to grow your audience effectively.

data-analysis data-visualization instagram-reach python-tools

Last synced: 23 Mar 2025

https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy

In this project, we analyze space flight data for the Falcon 9 rocket of the space company SpaceX.

data-analysis data-science data-visualization eda elonmusk spacex

Last synced: 23 Mar 2025

https://github.com/ginalamp/covid_dashboard_twitternews

Corona Dashboard & report based on Twitter media outlet news.

dashboard data-analysis data-visualization twitter

Last synced: 28 Jan 2026

https://github.com/riju18/data-analysis-and-visualizaton

Complex data analysis for data science: clustering, data preparation, complex calculations, joins, cross-over, and more.

data-analysis data-mining data-science data-visualization powerbi tableau

Last synced: 04 Jan 2026

https://github.com/trim0500/fe-stats-classifier

An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.

creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton

Last synced: 11 Apr 2026

https://github.com/kingrg-buff/portfolio

🛠️ About Me I am a Computer Information Systems student at WTAMU, building a foundation in networking, security, databases, analytics, and programming. My passion lies in data science, particularly in using machine learning and data wrangling to uncover insights. This portfolio highlights projects that showcase my technical work.

asp-net-core azuremachinelearning bootstrap csharp data-analytics data-science data-visualization database-management efcore html-css-javascript machine-learning ooad powerbi predective-modeling python r sql system-integration technical-documentation web-development

Last synced: 13 Apr 2026

https://github.com/garcane/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 10 May 2026

https://github.com/fbarffmann/tornado-damage-dashboard

Built a Flask dashboard visualizing 1,000+ US tornadoes from 2023 using Leaflet.js and MongoDB. Interactive maps show tornado magnitude, damage, and frequency.

api data-visualization flask geospatial leaflet mongodb pandas python tornado-dashboard

Last synced: 11 Apr 2026

https://github.com/mikeesto/ausvotes19

🐦 A collection of 67,284 public tweets published on the night of the 2019 Australian election

australia data-analysis data-visualization elections open-data twitter

Last synced: 06 Apr 2025

https://github.com/hfzdzakii/dicoding-solvinghrproblem

This repo is a master submission for my Dicoding Final Project. The Employee Attrition & Performance dataset was used to fulfill the submission. Feel free to explore; I hope my work gives you some insight!

data-analysis data-visualization

Last synced: 16 May 2025

https://github.com/dheyhasan/echo-trends

EchoTrends is a data visualization app that analyzes your Spotify playlists and reveals insightful patterns—such as track duration, popularity, and statistical correlations—using interactive charts and statistical tests. Built with React (frontend) and FastAPI (backend), it offers both functional analysis and a demo landing

correlation-analysis data-visualization fastapi javascript music-analysis python react recharts spotify-api tailwindcss

Last synced: 11 Apr 2026

https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office

My Project to learn the Basics of Analysis & Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python

Last synced: 11 Apr 2026

https://github.com/jdfoster11/northwest_territories_collision_factors

Using Python & Tableau to perform a statistical and regression analysis on a Northwest Territories vehicle collision dataset

clustering-algorithm co-lab data-science data-visualization heatmap-visualization html python3 tableau

Last synced: 15 Mar 2025

https://github.com/hyoaru/rph-retraction-relationship-visualization

A task output in GEC 3 RPH, retraction relationship visualization

data-visualization graph

Last synced: 31 Mar 2025

https://github.com/ladaegorova18/data_analysis

Learning the basics of data analysis in Python

analytics data-analysis data-visualization steam-games

Last synced: 16 May 2025

https://github.com/solygambas/d3-firebase

5 small projects to understand D3.js basics using Firebase and Materialize.

d3 d3js data-visualization firebase firestore javascript materialize materializecss

Last synced: 11 Apr 2026

https://github.com/mmzong/gee_lifestyleeffectsonhypertension

Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.

aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots

Last synced: 29 Jul 2025

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully built a machine learning model using PySpark that predicts the energy consumption of the steel industry with an R² score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 10 Mar 2026

https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn

This is the last project in the Udacity Nanodegree program. It's about data visualization.

data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree

Last synced: 09 May 2026

https://github.com/incubated-geek-cc/gis-data-viewer

A web-based GIS data viewer built in JavaScript with Leaflet JS plugin.

data-visualization geospatial geospatial-analysis geospatial-data gis leafletjs

Last synced: 11 Apr 2026

https://github.com/thenazar9/user-behavior-email-campaign-analysis-sql

Analysis of user behavior and email campaign performance using BigQuery and Looker Studio, focusing on account creation trends, email engagement, and user segmentation.

analytics bigquery data-analysis data-visualization etl looker-studio sql structured-query-language

Last synced: 16 Oct 2025

https://github.com/vinitgurjar/r_lang_exp

This is a collection of my college Data Analytics lab work and assignments; the files here contain programs written in R.

data-analysis data-visualization r

Last synced: 02 Jul 2025

https://github.com/darkdk123/simple-heart-disease-classification

This experiment provides a comprehensive approach to forecasting heart disease risk through detailed data analysis, predictive modeling, and hyperparameter tuning. The result is a `LinearSVC` model with 90% accuracy.

classification-algorithm data-science data-visualization exploratory-data-analysis heart-disease-prediction machine-learning

Last synced: 17 Nov 2025

https://github.com/katiesaund/tidy_tuesday

A weekly data project in R from the R4DS online learning community

data-analysis data-visualization datascience plot r rstats tidytuesday

Last synced: 24 Mar 2025

https://github.com/eea/eea.reveal

Reveal hidden knowledge by visualizing network structure in your data.

data-analysis data-visualization graphviz network-visualization

Last synced: 18 Mar 2025

https://github.com/mzprog/datatables

Create better tables with a few lines of code.

data-visualization datatables laravel-package livewire

Last synced: 27 Apr 2026

https://github.com/binjewarkunal/top-10-worlds-largest-economy-analysis

Top 10 Largest Economies by GDP and GDP Per Capita. Data collection: Forbes India. Data visualization: created a horizontal bar chart and documentation.

data-visualization datacollection microsoft-excel notion

Last synced: 27 Mar 2026

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 20 Jan 2026

https://github.com/rohan3122k/social-media-sentiment-analysis-of-finance-defence-and-healthcare-in-the-usa

This project provides a comprehensive, data-driven analysis of three critical sectors (Finance, Defense, and Healthcare) under the administrations of Donald Trump and Joe Biden.

api aws data-visualization datamining financial-analysis healthcare-application nytimes-api python reddit-api sentiment-analysis wordcloud-visualization

Last synced: 11 May 2026

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model that predicts house prices, excluding very large estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 09 Nov 2025

https://github.com/jigyasag18/aircraft-data-management

This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat

aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql

Last synced: 04 Feb 2026

https://github.com/balajimohan18/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed using SQL to produce a proper dataset with valid information.

classification data-analysis data-cleaning data-science data-visualization loan-prediction loan-status machine-learning sql supervised-learning

Last synced: 03 Sep 2025

https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi

Activities from the Data Science Academy courses Análise de Dados com Microsoft Power BI (Data Analysis with Microsoft Power BI) and Clínica de BI (BI Clinic).

dashboards data-analysis data-visualization microsoft-power-bi power-bi

Last synced: 05 Jan 2026

https://github.com/sayamalt/employee-attrition-prediction

Successfully built a machine learning model that can accurately predict whether an employee of a given company will leave it in the near future, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 08 Oct 2025

https://github.com/harshmule1/product-sales-anlysis

Product Sales Analysis Using Power BI

analysis data-visualization powerbi

Last synced: 04 Feb 2026

https://github.com/jfaccioli/citi-bike-tableau

A data analysis of Citi Bike users in Jersey City using Tableau

data-analysis data-visualization tableau tableau-public

Last synced: 26 Jan 2026

https://github.com/noturlee/sales-dataanalysis

This project aims to predict product sales based on advertising expenditures, focusing on 'TV advertising'. Machine learning techniques are employed to analyze and interpret data, enabling businesses to optimize advertising strategies and maximize sales potential.

data-modeling data-science data-structures-and-algorithms data-visualization linear-regression

Last synced: 08 Apr 2025

https://github.com/codyguru/energy-monitoring-dashboard

Energy monitoring dashboard that helps the client quickly find sensors with issues

data-visualization mock-server react tailwindcss typescript

Last synced: 10 Apr 2026

https://github.com/apelullo/cobalt_health_wellness_platform_ops

Cobalt is a mental health and wellness platform created for Penn Medicine employees that serves as a hub for support services such as therapy, wellness coaching, topic- and population-specific group sessions, and a variety of self-help resources.

academic-research data-cleaning-pipeline data-validation data-visualization decision-support feature-development healthcare-data hipaa key-performance-metrics mental-health-services operations-research product-analytics reporting-pipeline

Last synced: 23 Mar 2025

https://github.com/franloza/contratosdemadrid

This project is an interactive web application for exploring and analyzing public contracts in the Community of Madrid. It allows users to search for companies and view their contract details, aiming to promote transparency and facilitate access to public information.

data-visualization duckdb evidence open-data

Last synced: 02 Sep 2025

https://github.com/chokzb/covid19_vaccination_analysis

An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.

covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination

Last synced: 07 May 2026

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021

In this project, data on Hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.

data data-analysis data-science data-visualization

Last synced: 23 Mar 2025

https://github.com/spear97/montecarlo-python

This was a project for my Programming Language Concepts class, where we were assigned to create a Monte Carlo simulation using Python.

data-science data-visualization matplotlib-figures matplotlib-python montecarlo pandas-library pandas-python python python-3

Last synced: 23 Mar 2025

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/gaurav0502/router-traffic-analysis

Exploratory analysis of the different kinds of traffic experienced by a router.

data-analytics data-visualization network-analysis python

Last synced: 06 Apr 2025

https://github.com/jaewonson37/data_visualization2

Topic: Revealing and analyzing the distribution, frequency, and impact of significant earthquakes across various regions and time periods using several visualization techniques.

bar-plot binning-plot data-visualization ggplot2 mosaic-plots scatter-plot world-map

Last synced: 11 Jun 2025

https://github.com/mumtaz4118/nlp-course

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning

Last synced: 24 Nov 2025

https://github.com/mindlessmuse666/train-test-splitter

Analysis of Titanic passenger data and splitting it into training and test sets. A practical assignment for the course "Fundamentals of Applying Artificial Intelligence Methods in Programming".

data-analysis data-preprocessing data-visualization machine-learning pandas python scikit-learn seaborn titanic train-test-split

Last synced: 12 Apr 2026

https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard

Created an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics, including revenue, profit, orders, and return rates, across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.

business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis

Last synced: 05 Jan 2026

https://github.com/mcommer/emtools

A toolbox for geophysical EM-simulation data- and model-file processing, analysis, plotting, and other gimmicks

data-visualization electromagnetics geophysics plotting-scripts shell-scripts

Last synced: 30 Jun 2025

https://github.com/arthurdanjou/studies

💼 This is the repository containing all my projects done during my studies in Python and R.

ai data data-science data-visualization jupyter jupyter-notebook ml python r

Last synced: 08 Apr 2025