An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/azmainadel/twitter-data-neo4j

Playing with graph database on a large dataset of twitter data.

data-analysis data-visualization neo4j-database snap

Last synced: 06 Apr 2025

https://github.com/moindalvs/assignment_multi_linear_regression_2

Consider only the below columns and prepare a prediction model for predicting Price. Corolla<-Corolla[c("Price","Age_08_04","KM","HP","cc","Doors","Gears","Quarterly_Tax","Weight")]

cooks-distance data-science data-structures data-visualization exploratory-data-analysis feature-engineering feature-selection influencers multi-collinearity-issue outlier-removal outliers-detection predictive-modeling

Last synced: 18 Apr 2026

https://github.com/qtle3/decision-tree-regression

This project implements **Decision Tree Regression** to predict the salary of an employee based on their position level. By using a dataset containing position levels and their corresponding salaries, the project highlights how decision trees can capture complex relationships in data through non-linear regression.

data-visualization decision-tree-regression prediction-model

Last synced: 08 Apr 2025

https://github.com/redayzarra/ml-skincancer-project

This project utilizes a convolutional network to identify 9 different kinds of skin cancers including melanoma, nevus, and more. The model is trained on over 2,200 pictures of various skin cancers based off of this dataset. This model implements fundamental computer vision and classification techniques and includes a step-by-step implementation.

computer-vision convolutional-neural-networks data-visualization deep-learning lenet lenet-5 lenet-architecture machine-learning machine-learning-algorithms neural-network

Last synced: 17 Apr 2026

https://github.com/dadosdelaplace/hilostwitter

Códigos R de los hilos de Twitter (R codes of Twitter activity) https://twitter.com/dadosdelaplace

data-visualization dataviz divulgacion ggplot2 maths plotly r rstats statistical-analysis threads twitter

Last synced: 11 Mar 2026

https://github.com/samuelbarbosadev/roof_imoveis_data_analysis

The company hired you because they want to know what would be the 5 properties they should invest in and why, and which 5 you would not recommend investing in at all.

data-preparation data-understanding data-visualization pandas python

Last synced: 12 Apr 2026

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/repnot/gauge-demo

A Streamlit demo application for the streamviz Python package publicly available and distributed via the Python Package Index.

data-visualization plotly python streamlit

Last synced: 19 Apr 2026

https://github.com/xilinjia/csv-analyser

Visualize and analyze data in csv files. A multiplatform project runnable on Linux, Android, Windows, MacOS

android compose csv data-science data-visualization kotlin kotlin-multiplatform linux macos multiplatform windows

Last synced: 09 Apr 2026

https://github.com/dsnchz/solid-plotly

SolidJS wrapper for Plotly.js – reactive and performant charts powered by Plotly, built for Solid.

analysis analytics charting-library charts data-visualization solidjs visualization

Last synced: 02 Apr 2026

https://github.com/ipanalytics/tor-radar

The project collects public Tor relay data once per hour, stores compact snapshots in the repository, and renders an interactive dashboard without a database or backend.

asn data-visualization network-intelligence onionoo osint threat-intelligence tor tor-exit-nodes tor-relays

Last synced: 25 May 2026

https://github.com/neerajcodes888/whatsapp-chat-analyzer

A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱

analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud

Last synced: 12 Apr 2026

https://github.com/michalredm/carsdatasetdashboard

Shiny dashboard with data visualizations for the 90,000+ Cars Dataset.

data-visualization shinydashboard

Last synced: 20 Apr 2026

https://github.com/ahmetzamanis/bayesianusedcars

Predicting used car prices using Bayesian Linear Regression, visualizing results and comparing with OLS regression.

bayesian car-prices data-science data-visualization linear modeling r regression rmarkdown

Last synced: 16 Dec 2025

https://github.com/ipanalytics/vpn-infrastructure-atlas

Interactive VPN infrastructure atlas: aggregate provider footprint by reported endpoint country, ASN, and hosting network. No raw configs or per-IP feed.

asn-analysis cybersecurity data-visualization fraud-detection github-pages ip-reputation network-intelligence proxy-detection threat-intelligence vpn-intelligence

Last synced: 25 May 2026

https://github.com/sleeplessglory/big-data

Projects regarding big data analysis, presented within Jupyter Notebook

big-data data-analysis data-visualization jupyter python

Last synced: 16 Apr 2026

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 30 Mar 2025

https://github.com/gino-freud-hobayan/excel_project_dataset_gino

Simple Excel Project on Data Cleaning and Dashboard Creation

dashboard data-cleaning data-visualization excel

Last synced: 19 Mar 2026

https://github.com/rakumar99/power-bi-projects

This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.

dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports

Last synced: 04 Jun 2026

https://github.com/gholamrezadar/favourite-youtube-channels

this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!

data-analysis data-visualization python

Last synced: 16 Jan 2026

https://github.com/mayankyadav23/air-bnb-data-analysis

Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.

advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt

Last synced: 19 Mar 2026

https://github.com/prernarohra/mental-health-prediction

This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.

data-analysis data-science data-visualization machine-learning mental-health python

Last synced: 20 May 2026

https://github.com/evoluteur/madeleine-radars

Using D3 radar charts to compare the proportions of flour, sugar, butter, and eggs in 147 madeleine cakes recipes.

charts comparison d3 d3-visualization d3js data-visualization food madeleine radar radar-chart recipes spider-chart visualization

Last synced: 17 Apr 2026

https://github.com/zouari-oss/user-risk-detection

FastAPI-based microservice that predicts the risk level of a user session

data-generation data-visualization ml pandas xgboost xgboost-classifier

Last synced: 29 Jun 2026

https://github.com/akshadk7/exploratory-data-analysis

Implementing EDA and Machine Learning Algorithms on Kaggle Car Dataset

data-visualization exploratory-data-analysis machine-learning-algorithms predictive-modeling

Last synced: 17 Jun 2026

https://github.com/sunnybibyan/random_data_generation

A project that generates a dataset using various statistical distributions (Normal, Uniform, Exponential, Random Integers, and Binomial) and performs data analysis. Includes visualizations and an option to export the data as a CSV file.

data-analysis data-visualization python random-data-generation statistics streamlit-webapp

Last synced: 13 Jun 2026

https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries

🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈

ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics

Last synced: 21 Jun 2026

https://github.com/vjo/d3-punchcard

D3 Punchcard chart 📊●•●

chart d3js data-visualization library punchcard visualization

Last synced: 13 Jun 2026

https://github.com/rudra-g-23/find-my-joint

A utility to find potential join keys (matching columns) across multiple DataFrames.

data-analysis data-visualization join network-graph pandas pandas-dataframe

Last synced: 24 Jun 2026

https://github.com/newking9088/marketing_campaign_ml_prediction_dashboard

Transform your marketing strategy with our intuitive ML Prediction Dashboard, providing real-time, data-driven insights to optimize campaign success.

data-visualization finance-application streamlit-webapp

Last synced: 29 Jun 2026

https://github.com/duskmoon314/gongbi

Gongbi (工笔) is a data visualization crate inspired by ggplot2

data-visualization grammar-of-graphics rust

Last synced: 20 Aug 2025

https://github.com/autodesk-platform-services/aps-dataviz-fader

Fader application using Data Visualization API

data-visualization iot sample viewer

Last synced: 08 Feb 2026

https://github.com/msthamizh/chicago-crime-analyzer

Developing a PowerBI dashboard that allows users to interactively analyze and predict crime patterns using machine learning. The supports crime trend analysis and enabling users to explore insights such as crime hotspots, peak hours, arrest rates, and incident locations while providing dynamic visualizations for enhanced decision-making.

data-preprocessing data-visualization exploratory-data-analysis machine-learning matplotlib powerbi python seaborn

Last synced: 28 Apr 2026

https://github.com/tanishq-ctrl/cyberattack-analysis-and-insights

This repository contains an in-depth analysis of a cybersecurity dataset. The primary goal is to identify patterns, vulnerabilities, and trends in cyberattacks by leveraging various visualizations and statistical insights. The project provides actionable insights for enhancing cybersecurity measures.

cyberattack cyberattacks data-science data-visualization dataanalysis dataanalysisusingpython dataanalytics

Last synced: 28 Jun 2025

https://github.com/sandravizz/global_inequality_story

Dataviz Project about Global Inequality

data data-visualization inequality

Last synced: 03 Jul 2025

https://github.com/edseldim/FirstRoundElectionsFr

A data visualization spreadsheet on Excel

data-analysis data-visualization excel pandas python

Last synced: 02 Aug 2025

https://github.com/sourabh-kumar04/numpy-basic

Numpy-Basic is a structured learning repo covering NumPy from basics to advanced. It includes arrays, indexing, reshaping, filtering, vector ops, angle functions, stats, and .npy file handling. Each concept is explained with code, examples, and Matplotlib visualizations in both light and dark modes. Ideal for students and data learners.

data-analysis data-science data-visualization learning learning-resources machine-learning matplotlib numerical-computing numpy python python-library python-programming

Last synced: 10 May 2026

https://github.com/mastersign/mastersign-datascience

High level helpers for data science in Python with Pandas.

data-science data-visualization database-access pandas python

Last synced: 05 May 2026

https://github.com/uba/cmap2png

Useful script to convert a color map definition to PNG image file.

colormap converter cpt data-science data-visualization matplotlib plot png python

Last synced: 17 Sep 2025

https://github.com/sanchariii/stock-price-predictor

This project utilizes Deep Learning models, Long-Short Term Memory (LSTM) Neural Network algorithm, to predict stock prices.

data-visualization keras-tensorflow neural-network python

Last synced: 26 Jul 2025

https://github.com/patelabhi574/hotel_reservation_analysis

Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.

data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server

Last synced: 19 Feb 2026

https://github.com/gui-sitton/churn-finalproject

predict its customers' churn. If it is discovered that a user is planning to switch operator, the company will offer them promotional codes and special plan options.

churn-prediction data-analysis-python data-science data-visualization

Last synced: 26 Jul 2025

https://github.com/bzbradford/wibee-dashboard

Data visualization app for WiBee pollinator surveys.

data-visualization pollinators shiny

Last synced: 30 Jul 2025

https://github.com/melihcanndemir/3d-fractal-explorer

Interactive 3D visualization tool for exploring the mesmerizing world of Mandelbrot and Julia fractals. Built with Python, OpenGL and PyQt5, offering real-time animation and intuitive controls.

3d-graphics complex-numbers computer-graphics data-visualization educational fractal graphics-programming interactive julia-set mandelbrot math-visualization mathematical-art mathematics numpy opengl pyqt pyqt5 python scientific-visualization visualization

Last synced: 30 Jul 2025

https://github.com/myself-aas/quantium_data_analytics_forage

This project analyzes retail customer chip purchasing behavior using Python, focusing on customer segmentation and key spending drivers to provide data-driven insights for strategic category management recommendations.

data-analysis data-engineering data-science data-visualization feature-engineering forage internship-project matplotlib-pyplot numpy-library pandas-dataframe pearson-correlation python quantium-virtual-experience scipy-stats seaborn

Last synced: 31 Jul 2025

https://github.com/soumyaco/hotel-price-data-analysis

Data analysis on Indian hotels price. A beginners guide to data analysis.

data-analysis-python data-science data-visualization matplotlib seaborn-plots

Last synced: 01 Aug 2025

https://github.com/shubhamgoyal575/diwali-sankranti-promotion-sales

This Power BI dashboard analyzes sales performance during Diwali and Sankranti festivals. It provides insights into revenue trends, top-selling products, regional sales distribution, and customer purchasing behavior to help optimize festive season sales strategies. 🚀

buisness-intelligence dashboard data-analysis data-visualization diwali-sankranti-sales-analysis excel fast-moving-consumers-goods fmcg microsoft-power-bi mysql power-query powerbi revenue-insights sales-dashboard sales-insights sql

Last synced: 02 Mar 2026

https://github.com/jahnavigupta06/zepto-delivery-customer-analytics

Real-time SQL + Power BI Analytics Project replicating Zepto's customer & delivery insights.

business-intelligence churn-analysis customer-segmentation data-analysis data-visualization powerbi sql-server

Last synced: 02 Aug 2025

https://github.com/shriram-vibhute/data-analysis

This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.

data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn

Last synced: 02 Aug 2025

https://github.com/jcaperella29/methylation_shiny

An interactive R Shiny web app for DNA methylation analysis with support for DE, pathway enrichment, power analysis, and machine learning — deployable on HPC via Singularity/Apptainer.

apptainer bioconductor bioinformatics data-visualization differential-expression epigenetics genomics machine-learning methylation pathway-enrichment-analysis pca r random-forest shiny singularity umap

Last synced: 02 Aug 2025

https://github.com/t-mohamed-shafeek/data-analysis-on-tamil-nadu-road-accidents

The "Data Analysis on Tamil Nadu Road Accidents" is a project deals with analysis of data on Road Accidents encountered by Tamil Nadu ( one of the states of India ) in the year of 2020 and 2021. But the dataset is most recently created (created on February 15, 2023 with source form TN Police).

dashboard data-analysis data-science data-visualization jupyter-notebook tableau

Last synced: 03 Aug 2025

https://github.com/sirawin/ona-visualization

🔗 Interactive Organizational Network Analysis visualization with D3.js, community detection, and team analytics. Live demo: https://ona-lh77jtqk9-sirawins-projects.vercel.app

community-detection d3js data-visualization javascript network-graph organizational-network-analysis vercel vite

Last synced: 05 May 2026

https://github.com/simranjeet97/ipl-dataanalysis

Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.

artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python

Last synced: 08 May 2026

https://github.com/leocornus/leocornus-visualdata

JavaScript libraries to make data visualization simpler and easier.

data-analysis data-mining data-visualization data-visualization-simpler javascript-library

Last synced: 10 Aug 2025

https://github.com/dianaow/singapore_stories

Exploring and visualizing datasets related to Singapore issues

d3 d3js d3v4 data-visualization maps singapore singapore-government

Last synced: 29 Jun 2026

https://github.com/nelsonkariuki/dataanalysis

This project involves data analysis of vido game sales from https://www.kaggle.com/gregorut/videogamesales/download

data-analysis data-visualization python

Last synced: 11 Jun 2026

https://github.com/floressek/data_analysis_and_visualization

This repository contains a collection of statistical data analysis laboratories using R. Each lab focuses on different aspects of data exploration, visualization, and analysis techniques.

data-analysis data-visualization

Last synced: 05 Oct 2025

https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning

An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.

class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote

Last synced: 02 May 2026

https://github.com/udaykumar-dhokia/d8a

d8a is a modern data analytics platform that provides powerful visualization and analysis tools for your data.

data-analysis data-visualization fullstack-development

Last synced: 16 Sep 2025

https://github.com/viztruth/google-play-store-data-analysis

This repository contains all the materials of my final project 'Google Play store Data Analysis' for the 'Telling Stories with Data' course at PES University.

data-analysis data-visualization

Last synced: 21 Aug 2025

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/aymane-maghouti/mobile-data-hive-insights

This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,

apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi

Last synced: 09 Mar 2026

https://github.com/abbasi0abolfazl/commentanalyzer

machine learning project designed to analyze Instagram comments for sentiment detection, question identification, and topic modeling. Utilizing algorithms such as LDA, LSA, NMF, and BERT, CommentAnalyzer provides valuable insights into user interactions, helping brands and researchers understand audience sentiments and trends.

data-visualization instagram-data-analysis machine-learning question-detection sentiment-analysis social-media-analytics topic-modeling

Last synced: 30 Aug 2025

https://github.com/paul019/pappe

A CLI to draw your data on top of millimeter paper

automation data-visualization diagram diagram-generator python

Last synced: 05 Mar 2025

https://github.com/ipanalytics/vpn-infrastructure-intelligence-lab

Aggregate VPN infrastructure intelligence dataset and interactive dashboard for provider, country, ASN, relationship, archetype, and hosting dependency analysis without publishing raw endpoint data.

asn cybersecurity data-visualization geoip github-pages hosting-analysis infrastructure-intelligence internet-measurement network-analysis osint privacy-research static-dashboard threat-intelligence vpn vpn-research

Last synced: 25 May 2026

https://github.com/karlyndiary/amazon-sentiment-analysis-eda

Amazon Customer Reviews Sentiment Analysis utilizing Python for Data Extraction and Pre-Processing, with real-time data from the Amazon API and Tableau for dashboard visualizations.

amazon-api api-data-extraction dashboard data-cleaning data-pipeline data-visualization etl roberta-sentiment-analysis sentiment-analysis tableau-dashboards vader-sentiment-analysis

Last synced: 22 Mar 2025

https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel

This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.

capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql

Last synced: 20 Jun 2025

https://github.com/s-araromi/smart-device-data-analysis-bellabeat_casestudy

This repository contains a comprehensive analysis of smart device data for Bellabeat, focusing on user activity, sleep, calorie burn, and weight management. Insights from Fitbit data are used to inform Bellabeat's marketing strategy and product recommendations.

bellabeat data-analysis-in-r data-visualization fitbit-smart-devices fitness-tracking r-programming sleep-analysis smart-devices

Last synced: 25 May 2026

https://github.com/ninadpatil09/bankcard-analytics---credit-debit-card-usage-monitoring

This project is a comprehensive data analysis initiative aimed at extracting valuable insights from bank card usage data. The tools and techniques includes Python, Excel, Tableau, web scraping, pandas. It centers around understanding and visualizing trends and patterns in credit and debit card usage across multiple banks.

data-cleaning data-visualization excel python tableau tableau-public web-scraping

Last synced: 18 Apr 2026

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 10 May 2026

https://github.com/amirnaghibi/companion

find the safest way back home with a trusted companion

data-visualization django-framework pathfinding python3 webapp

Last synced: 10 Apr 2026

https://github.com/nouman6093/advanced-statistical-models

in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl

data data-analysis data-science data-visualization

Last synced: 11 Jun 2026

https://github.com/njanakiev/iotserver

Data analysis and data visualization of mobile sensor data

d3js data-visualization mongodb nodejs sensor sensor-data udp-server

Last synced: 10 Apr 2025

https://github.com/nowke/cricviz

Cricket data analysis/visualizations

cricket-data data-visualization reactjs

Last synced: 15 Apr 2026

https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken

Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.

3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics

Last synced: 11 Apr 2026

https://github.com/prime-infinity/type-one

Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues

data-analysis data-visualization

Last synced: 03 Feb 2026

https://github.com/willie-conway/global-superstore-data-modeling-analysis

A comprehensive data modeling and analysis project for the 🌍Global Super Store, focusing on database design 🗃️, sales data analysis 📊, and interactive visualizations 📍 using MySQL 🖥️ and Tableau 📈.

business-analytics business-intelligence data-exploration data-modeling data-preprocessing data-restructuring data-visualization database-design er-diagram geographic-analysis interactive-dashboard mysql profit-analysis sales-analysis sales-performance sales-trends sql star-schema tableau time-series-analysis

Last synced: 21 Jul 2025