An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/hemangsharma/streamingcontentanalyzer

This Streamlit application provides an interactive dashboard for analyzing streaming content data. It allows users to explore movie and TV show ratings, distributions, temporal trends, and genre breakdowns through various visualizations and filters.

dashboard data-analysis data-science data-visualization python streamlit-dashboard streamlit-webapp

Last synced: 02 Apr 2025

https://github.com/syedzaheerabbas/aerofit-descriptive_analysis

Analyzed customer profiles for Aerofit treadmills to enhance product recommendations. The project includes visualizations and probability calculations to understand how customer demographics impact treadmill purchases.

data-visualization descriptive-statistics eda insights probability-analysis python

Last synced: 28 Apr 2026

https://github.com/melih0132/all-my-projects

This repository showcases projects from my computer science journey, covering technologies like web development and interactive applications.

csharp data-visualization database game-development html-css ia javascript kotlin-android python software-development swift unity web-development

Last synced: 05 Apr 2026

https://github.com/nazir20/scraping-tweets-using-python-and-preprocessing-tweets-for-sentiment-analysis

This is repo is about how to scrape tweets from Twitter using Python and also proprocessing tweets for sentiment analysis

data-cleaning data-visualization jupyter-notebook python twitter-sentiment-analysis

Last synced: 13 Apr 2026

https://github.com/zeptosec/bpscrapper

Shows history of oil prices

data data-visualization database nodejs scraper

Last synced: 13 Apr 2026

https://github.com/virajbhutada/titanic-survival-prediction

ML project focused on predicting Titanic passenger survival using various algorithms and extensive data analysis techniques. This project includes detailed data visualization and interpretation to uncover key factors affecting survival. By leveraging various ML models the analysis aims to achieve high predictive accuracy.

ada-boost-classifier data-exploration data-science data-visualization decision-tree-classifier hyperparameter-tuning knn-classification logistic-regression machine-learning model-interpretation random-forest-classifier roc-curve titanic-classification

Last synced: 14 Jun 2026

https://github.com/deliprofesor/income-analytics-interpretable-machine-learning-model

This project predicts whether an individual earns more than 50K using the Adult Income dataset. A Random Forest model is trained and evaluated, with explanations provided through DALEX and LIME for feature importance and model transparency.

classification dalex data-preprocessing data-science data-visualization feature-engineering income-prediction lime machine-learning model-explainability predictive-modeling r-programming random-forest

Last synced: 10 Apr 2025

https://github.com/akansharajput280799/covid19-impact-analysis-usa

Data Analysis and Predictive Modeling to study COVID-19 impact across age groups, regions, and seasons in the USA.

classification-algorithm clustering-algorithm data-preprocessing data-visualization descriptive-statistics exploratory-data-analysis matplotlib numpy pandas seaborn

Last synced: 13 Apr 2026

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 05 Jul 2025

https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights

This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.

clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality

Last synced: 14 Jun 2025

https://github.com/umutonder97/project-network-ids

Network-Based Intrusion Detection System - dev/deploy-ment of a Hybrid Intrusion Detection System (HIDS) that integrates Signature-based Network Intrusion Detection Systems (SNIDS)

artificial-neural-networks convolutional-neural-networks covid-19 covid-19-russia covid19-data data-visualization genetic-algorithm ids keras-tensorflow knn microservices-architecture network-behavioral-analysis python time-series-forecasting

Last synced: 05 Jul 2025

https://github.com/kivanc57/feature_comparison

This project explores the relationship between features and diagnosis in cancer data. Using methods like boxplots, scatterplots, PCA, k-means clustering, and logistic regression, we analyze and visualize data to understand health indicators.

boxplot clustering correlation data-science data-visualization descriptive-statistics explanatory-data-analysis pearson-correlation r scatter-plot spearman

Last synced: 06 Jun 2026

https://github.com/fazzaan/gitbook-sciencing

GitBook sync for Sciencing publishing & training projects

data-presentation data-visualization ebook gitbook science science-communication science-research

Last synced: 08 Jan 2026

https://github.com/treyhamilton/ds-project-1

A compilation of various programming concepts written in Python/R covering the topics listed below

covid19-data data-science data-visualization exploratory-data-analysis

Last synced: 06 Jul 2025

https://github.com/brazer27/iris-classification

A Python implementation of Naive Bayes algorithm for Iris flower classification. Features include cross-validation, data preprocessing, and prediction capabilities. Built from scratch without ML libraries, achieving ~95% accuracy on the classic Iris dataset.

cross-validation data-science data-visualization flower-classification iris-dataset machine-learning naive-bayes python

Last synced: 06 Sep 2025

https://github.com/mituskillologies/data-science-mar25

Programs of Data Science batch @ MITU Skillologies, March 2025

data-analytics data-science data-visualization machine-learning python

Last synced: 16 Mar 2025

https://github.com/master-helix/music-queries

This is a beginner Data Analyst Portfolio Project aimed at providing data insights based on a music store dataset

data-analytics data-visualization ms-excel postgresql sql

Last synced: 06 Sep 2025

https://github.com/abhisek-13/whatsapp-chat-analyzer

The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.

data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/as16082023/heart-health-dashboard

A Tableau project analyzing heart disease risks by age, gender, and key health metrics to identify critical trends and insights

data-visualization tableau

Last synced: 21 Jan 2026

https://github.com/shuyib/mouse_gut_otu

Vectorization and Unsupervised Learning of Mouse Operation Taxonomic Units to determine which species of bacteria form distinct groups in a dataset.

16s-rrna anaconda analysis data-visualization dataset gut-microbiome matplotlib-figures mothur numpy-arrays pandas-dataframe pca-analysis python3 scikitlearn-machine-learning sops t-sne unsupervised-learning

Last synced: 13 Apr 2026

https://github.com/grandechowhiskey/fcc-data_analysis-projects

A collection of projects completed as part of the FreeCodeCamp "Data Analysis with Python" certification. These projects cover statistical calculations, data visualization, and trend analysis using real-world datasets.

data-analysis data-visualization matplotlib pandas python3 scikit-learn seaborn

Last synced: 01 May 2026

https://github.com/k8hertweck/tidytuesdaydataviz

data viz for TidyTuesday lunch meetup at the Hutch

data-visualization tidytuesday

Last synced: 30 May 2026

https://github.com/srinibas-masanta/olympics-data-analysis

The Olympics Analysis project explores Olympic data to uncover trends in athlete performance, medal distribution, and participation across countries and demographics. By leveraging detailed datasets, it provides insights into the evolution of the Games, highlighting key patterns and disparities over time.

data-analysis data-science data-visualization olympics olympics-visualization

Last synced: 02 Apr 2025

https://github.com/domingosdeeulariadumba/ablisk

A Python module for design, analysis and decision-making of A/B tests.

ab-testing data-visualization statistics

Last synced: 14 Jan 2026

https://github.com/bretsw/eme6356-fs22-module5

Slide deck for EME6356, Module 5: Data Visualization (Fall 2022)

analytics data-analytics data-visualization slides visualization

Last synced: 08 Jan 2026

https://github.com/izadoraluz/uber-twitter-feedback-analysis

Uma pesquisa exploratória ccom análise de feedback positivo sobre a empresa Uber no Twitter (X) usando visualização de dados, com o objetivo de criar um projeto prático usando PLN e um dashboard intuitivo

dashboard data-visualization pln

Last synced: 05 Feb 2026

https://github.com/diogocarrola/freecodecamp-projects

A collection of projects completed as part of the freeCodeCamp curriculum. This repository showcases my progress and skills in web development, including HTML, CSS, JavaScript and more.

apis data-visualization front-end javascript responsive-design

Last synced: 26 Mar 2025

https://github.com/sssshefer/covid-map

Interactive map showing covid data implemented on R language

big-data data-visualization r r-studio

Last synced: 01 Mar 2025

https://github.com/samruddhi3012/public-health-data-analysis

Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.

dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup

Last synced: 05 Feb 2026

https://github.com/joaopalmeiro/vtils

A Python package providing utility functions for Data Visualization.

data-visualization python visualization

Last synced: 26 Mar 2025

https://github.com/joaopalmeiro/data-vis-assets

A collection of various assets for Data Visualization assembled from inspiring sources.

catalog data-visualization documentation markdown

Last synced: 26 Mar 2025

https://github.com/samwhaaa/da_portfolio

Showcasing some of my Data Analytics projects

data-analysis data-analytics data-visualization jupyter jupyter-notebook python

Last synced: 01 Mar 2025

https://github.com/robinmillford/cardiac-care-performance-dashboard

This project presents a comprehensive data analysis and interactive dashboard focused on Cardiac Surgery and Percutaneous Coronary Interventions (PCI) performance by hospital, spanning from 2008 onwards.

cardiac data-analysis data-visualization plotly-express streamlit-dashboard tableau tableau-public

Last synced: 07 Sep 2025

https://github.com/fbarffmann/mycitibike

Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.

citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping

Last synced: 07 Jul 2025

https://github.com/beatussum/pmsexp

A software for recovering the position of an object in a video

cpp cpp17 data-visualization physics qt qt5 science utility video

Last synced: 17 May 2026

https://github.com/nimomach/cafe-sales

This analysis focuses on evaluating the sales performance of a cafe by examining key metrics such as total revenue, sales by product category, peak sales times, and many more.

cafe data-analysis data-visualization sales

Last synced: 12 Mar 2026

https://github.com/deliprofesor/health-status-and-heart-attack-risk-eda-regression-and-hypothesis-testing-analysis

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

data-cleaning data-visualization exploratory-data-analysis healthcare-insights hypothesis-testing machine-learning ridge-regression

Last synced: 10 Apr 2025

https://github.com/lvsvendsen/shime-monitor-r

R script for visualizing pH and pump activity in SHIME gut microbiome experiments.

data-visualization microbiome r research-tool shime

Last synced: 13 Sep 2025

https://github.com/mrkirushko/oxywitleaf2csv

Witleaf pulse oxymeter BIN format data decoder

d data-visualization oximeter oxymeter pulseoximeter pulseoxymeter r witleaf

Last synced: 02 Apr 2025

https://github.com/sehgal-vishal/blinkit-sales-project

This dashboard is designed to provide in-depth insights into key performance indicators (KPIs) critical to the operations of Blinkit.

data-visualization dataanalysis powerbi sales

Last synced: 19 Mar 2026

https://github.com/albanecoiffe/uber_data_visu_streamlit

Tableau de bord interactif avec Streamlit permettant d'explorer les données des trajets Uber de janvier 2015 à New York.

data-visualization streamlit

Last synced: 02 May 2026

https://github.com/01110011011101010110010001101111/tigergraph_cosmos_template

Template for TigerGraph and Cosmograph Projects with pyTigerGraph, Fast API, and Cosmos

cosmograph data-visualization tigergraph

Last synced: 26 Mar 2025

https://github.com/mohsinraza2999/new-york-taxi-fare-analysis

This project analyzes and predicts taxi fares estimate fares in advance using Regression Analysis. Conducted EDA, hypothesis testing, to identify key variables. Developed ML models (Random Forest, XGBoost) with GridSearchCV for hyperparameter tuning to predict generous tip giver accurately.

ab-testing data-un data-visualization exploratory-data-analysis fea random-forest regression-analysis sklearn xgboost

Last synced: 17 May 2026

https://github.com/jianxi-erin/bigdata-machinelearning-lab

本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026

https://github.com/tsbarr/belly-button-challenge

Using front-end development tools (javascript, html and css) I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 04 Mar 2026

https://github.com/deliprofesor/kizbasina_odev4_trafik

Bu proje, 2005-2014 yılları arasında İngiltere’de gerçekleşen trafik kazalarına ait kapsamlı veri setlerini kullanarak trafik kazalarının sebeplerini, şiddetini ve zaman içindeki değişimini analiz etmektedir.

data-science data-visualization istatistik matplotlib pandas python statistics

Last synced: 14 Apr 2026

https://github.com/bdice/signac-micde-cnsccs-2018

Slides and demos for the MICDE CNSCCS Symposium, October 15, 2018

data-management data-visualization demo signac workflow-automation

Last synced: 07 Oct 2025

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://github.com/omarsolieman/socialgiveawaydataanalysis

This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis

data-analysis data-science data-visualization instagram scraping threejs

Last synced: 14 May 2026

https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset

This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.

data-analysis data-visualization excel project

Last synced: 18 Jan 2026

https://github.com/pranavsp108/market_basket_analysis-instacart

Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.

customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn

Last synced: 05 May 2026

https://github.com/shivani0126/resturant_rating_analysis

Restaurant ratings Analysis is a project where real consumers from 2012, including additional information about each restaurant and their cuisines, and each consumer and their preferences are visualised through Power BI dashboard.

dashboard data-visualization dataanalysis datamodeling dataprep dax-functions powerbi

Last synced: 27 Jan 2026

https://github.com/alexquilis1/spanish-fuel-stations-analysis

Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons

data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse

Last synced: 08 Oct 2025

https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report

This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.

dashboard data-analysis data-visualization financial-analysis powerbi statistics

Last synced: 21 Jan 2026

https://github.com/jlee9503/telecommunication-churn

Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.

data-analysis data-visualization matplotlib pandas python scikit-learn

Last synced: 18 Jan 2026

https://github.com/amish5ingh/cricket-data-analytics-ipl

Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.

data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/lixx21/tableau_netflix_movies_tvshows_2021

Visualize netflix movies and tv shows in 2021

data-visualization dataset netflix tableau

Last synced: 19 Jan 2026

https://github.com/juanes0023/dashboard-mtp

🚗 Track user activity and revenue in real-time with the Mileage Tracker Pro Dashboard for clear insights and growth trends.

analytics business-intelligence dashboard data-visualization plotly python real-time-analytics saas streamlit supabase

Last synced: 20 Apr 2026

https://github.com/adithya2369/safa_public

AI-powered customer feedback analyzer that uses generative AI to transform customer reviews into actionable business insights. Upload review data, get instant summaries, satisfaction scores, detailed reports, and improvement suggestions—all in an easy-to-deploy Docker container.

data-analysis data-visualization docker-containerization full-stack-development generative-ai langchain langchain-groq web-development

Last synced: 10 Oct 2025

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/hirkojoba/fintrack

Full-stack financial tracking app with ML forecasting and AI insights. Built with Rails, PostgreSQL, Python/scikit-learn, and OpenAI API.

artificial-intelligence data-visualization fintech full-stack machine-learning openai postgresql python ruby-on-rails scikit-learn

Last synced: 14 Apr 2026

https://github.com/nidhi-kekatpure/churn-prediction-for-subscription-customers

End-to-end customer churn prediction using AWS SageMaker, XGBoost, and Python - complete ML pipeline from data generation to real-time inference with 79% AUC score.

aws business-intelligence data-science data-visualization jupyter-notebook predictive-analytics

Last synced: 16 May 2026

https://github.com/rohanchawla972/business-analytics-and-visual-storytelling

Business analytics repository integrating multi-source datasets and applying advanced methods — EDA, clustering, panel regression, sentiment analysis, and data storytelling — to extract actionable insights across hospitality, innovation, and financial services.

business-analytics clustering data-visualization jupyter-notebook predictive-modeling python storytelling

Last synced: 18 May 2026

https://github.com/anandu-jpg/coffee-shop-sales-analysis

This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.

business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton

Last synced: 18 May 2026

https://github.com/ratna-babu/generating-graphs

Generate, color, and visualize random graphs using Python's NetworkX and Matplotlib. Includes compression and storage of graph data with .gz and pickle. Ideal for exploring graph coloring and greedy algorithms in graph theory.

data-visualization erdos-renyi graph-coloring graph-theory greedy-algorithm matplotlib networkx python random-graph random-graph-generation

Last synced: 10 Oct 2025

https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard

This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.

charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi

Last synced: 23 Feb 2026

https://github.com/mouradhamzaoui/End-To-End-MLOPS-Airline-Project

This project aims to predict the number of passengers, freight quantity, and mail quantity for American airlines operating between Canadian and U.S. airports using an MLOps approach. It involves automating the data pipeline, from data extraction and preparation to model training and evaluation, leveraging tools like DVC, MLflow, and Docker for vers

data-visualization docker dvc github-actions machine machine-learning-algorithms mlflow

Last synced: 14 Apr 2026

https://github.com/ahsankhizar5/retail-sales-analysis-python-powerbi

A complete retail sales analytics project using Python for data cleaning and EDA, and Power BI for dashboard visualization. Built as a capstone for the Business Analytics Bootcamp by CourseMea.

business-analytics capstone-project coursemea dashboard data-visualization eda exploratory-data-analysis powerbi python python3 retail-data

Last synced: 31 May 2026

https://github.com/alexondata/daan_eda-exploratory-data-analysis_ecommerce

This project presents an Exploratory Data Analysis (EDA) pipeline for an eCommerce dataset, integrating Python, SQL Server, and Power BI to transform raw transactional data into meaningful business insights. The project was developed as part of an academic assignment at Transilvania University of Brașov, Faculty of Mathematics and Computer Science.

data-analysis data-visualization ecommerce microsoft-sql-server powerbi python

Last synced: 18 May 2026

https://github.com/jackiboi307/simpleplot

Simple plotting tool made with pygame

data-visualization pygame python

Last synced: 13 Oct 2025

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/anushkundu/london-housing-market-analysis

London Housing Market Analysis: An Insightful Power BI Dashboard"

data-analysis data-visualization powerbi transformation

Last synced: 27 Jan 2026

https://github.com/ayorick23/python-data-science-cheat-sheet

Guía rápida y práctica de sintaxis, comandos y funciones esenciales de Python para Ciencia de Datos. Perfecta para recordar cómo usar las librerías más comunes como NumPy, Pandas, Matplotlib y Scikit-learn en tus análisis diarios.

cheat-sheet data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning matplotlib ml numpy pandas python scikit-learn scipy seaborn statistics sympy tensorflow

Last synced: 07 Apr 2026

https://github.com/markusbegerow/powerbi-navigation-menu

Interactive navigation menu visual for Power BI with slide-out filtering and hierarchical data support

business-intelligence d3js data-visualization filter hamburger-menu navigation powerbi powerbi-custom-visuals powerbi-visuals typescript

Last synced: 14 Oct 2025

https://github.com/pngo1997/chicago-airbnb-listings

Interactive Chicago Airbnb listings geospatial map.

data-visualization geospatial html python visualization

Last synced: 31 May 2026

https://github.com/coderjolly/utilisation-analysis

This provides a small glimpse of the IISc's, Supercomputer Education Research Centre (SERC) resource data, and how it was ingested, extracted to produced relevant results for data analysis between actual resource utilisation and simulated resource utilisation.

csv-parser-python data-transformation data-visualization flow plotly-dash plotly-python

Last synced: 14 Oct 2025