An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2

Q2) Salary_hike: Build a prediction model for Salary_hike. Build a simple linear regression model by performing EDA, applying the necessary transformations, and selecting the best model using R or Python. Covers EDA and data visualization, correlation analysis, model building, model testing, and model predictions.

correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score

Last synced: 08 May 2026

https://github.com/vatshayan/pokemon-analysis

Visualization and analysis of Pokemon data, and evaluation of how accurately machine learning predicts Pokemon power, attack, and speed.

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 30 May 2026

https://github.com/abhisek-13/whatsapp-chat-analyzer

The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.

data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn

Last synced: 13 Apr 2026

https://github.com/nittygritty-zzy/quantlab

🚀 Professional quantitative trading research platform with ML-powered backtesting, multi-source options analysis, portfolio management, and interactive Plotly visualizations. Built on qlib with CLI interface.

algorithmic-trading backtesting cli data-visualization financial-analysis machine-learning options-trading plotly portfolio-management python qlib quantitative-finance

Last synced: 14 Jan 2026

https://github.com/shuyib/mouse_gut_otu

Vectorization and unsupervised learning of mouse Operational Taxonomic Units (OTUs) to determine which species of bacteria form distinct groups in a dataset.

16s-rrna anaconda analysis data-visualization dataset gut-microbiome matplotlib-figures mothur numpy-arrays pandas-dataframe pca-analysis python3 scikitlearn-machine-learning sops t-sne unsupervised-learning

Last synced: 13 Apr 2026

https://github.com/superskyyy/stackoverseer

This is a StackOverflow monitor where you can easily access the most trending and up-to-date questions on a particular set of tags. The project can be modified to support a wider range of tags and provide additional functionality.

charts data-visualization stackoverflow-api stackoverflow-questions

Last synced: 08 Jan 2026

https://github.com/ak-abhilash/insightcat

📊 One-click open-source EDA tool for CSV, Excel, JSON

csv- data-analysis- data-visualization eda- fastapi- open-source- pandas- react-

Last synced: 14 Jun 2025

https://github.com/sudodevinci/devincicloud

Flask website template used for devinci.cloud, the cost-effective weather prediction project with Växjö Linnaeus Science Park.

api api-gateway arduino data-visualization esp32-arduino flask flask-api lcl mysql-database weather-station

Last synced: 18 Apr 2026

https://github.com/alainamariajoe/netflix-data-visualizer

A simple data visualization project using Netflix movies and TV shows dataset to create basic insights and visual representations.

data-visualization matplotlib pandas python seaborn

Last synced: 17 May 2026

https://github.com/srinibas-masanta/olympics-data-analysis

The Olympics Analysis project explores Olympic data to uncover trends in athlete performance, medal distribution, and participation across countries and demographics. By leveraging detailed datasets, it provides insights into the evolution of the Games, highlighting key patterns and disparities over time.

data-analysis data-science data-visualization olympics olympics-visualization

Last synced: 02 Apr 2025

https://github.com/bretsw/eme6356-fs22-module5

Slide deck for EME6356, Module 5: Data Visualization (Fall 2022)

analytics data-analytics data-visualization slides visualization

Last synced: 08 Jan 2026

https://github.com/greatwoman23/hotel_reservation_analysis

In this project, we delve into the intricate world of hotel reservations, utilizing a multifaceted analytical approach to uncover valuable insights. Through a combination of SQL queries and Tableau visualizations, we meticulously dissect a rich dataset comprising booking details, customer demographics, and reservation statuses.

data-analysis data-science data-visualization hotel hotel-reservation publications sql sql-query sqlite3 tableau

Last synced: 15 May 2026

https://github.com/spacebakery/nba-trends-project

Data Science Foundations I | Exploratory Data Analysis in Python | Summarizing Relationship Between Two Features

categorical-variables data-analysis data-visualization matplotlib nba-dataset quantitative-variables scipy seaborn subset summary-statistics

Last synced: 11 Mar 2025

https://github.com/joaopalmeiro/vtils

A Python package providing utility functions for Data Visualization.

data-visualization python visualization

Last synced: 26 Mar 2025

https://github.com/nmelgar/healthy_child_dataviz

Data visualization project to analyze what a healthy child is.

analysis data data-analysis data-science data-visualization dataviz research tableau visualization

Last synced: 23 Feb 2026

https://github.com/nurulashraf/telco-customer-churn-prediction-model

This repository contains a Telco Customer Churn Prediction project using machine learning. It includes data preprocessing, exploratory data analysis, feature engineering, and model development to predict customer churn. Key tools used are Python, Pandas, NumPy, Matplotlib, Seaborn, and scikit-learn.

churn-prediction classification-model customer-churn data-visualization exploratory-data-analysis machine-learning predictive-analytics python scikit-learn

Last synced: 16 Mar 2025

https://github.com/aaronjmars/token-race

Data visualization tool for tokens, based on their FDV or MC.

coinmarketcap data-visualization dexscreener web3

Last synced: 14 Sep 2025

https://github.com/rohanag03/whatsapp-chat-analyzer

This project is a comprehensive tool for analyzing WhatsApp chat histories. Using Python and Streamlit, it visualizes chat data to provide insights into messaging patterns and media sharing habits.

chat-analysis data-visualization python streamlit

Last synced: 12 May 2026

https://github.com/esther-poniatowski/multitask-context-dependent-behavior

Data analysis of neuronal recordings in naive and trained animals performing multiple tasks in active and passive attentional states

cognitive-neuroscience computational-neuroscience data-analysis data-visualization information-processing

Last synced: 26 Mar 2025

https://github.com/satyam4229/prediction-of-different-diseases

Real-time prediction of diseases from their symptoms. The dataset contains 132+ different symptoms on which the model is trained to predict the disease.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 13 Apr 2026

https://github.com/samruddhi3012/rfm-sales-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 12 Mar 2025

https://github.com/nimomach/cafe-sales

This analysis focuses on evaluating the sales performance of a cafe by examining key metrics such as total revenue, sales by product category, peak sales times, and many more.

cafe data-analysis data-visualization sales

Last synced: 12 Mar 2026

https://github.com/aayushwankhade/z

z is a versatile programming language known for its simplicity and ease of use in developing web applications. With a strong focus on clean, readable code and efficient performance, z is ideal for both beginner and experienced developers looking to create high-quality software.

apache chatgpt data-engineering data-science data-visualization fish free game immutability machine-learning pattern-matching python statistics zwave

Last synced: 07 Sep 2025

https://github.com/deliprofesor/arrhythmia-classification-and-anomaly-detection

This project classifies arrhythmias and detects anomalies using machine learning and deep learning. It includes preprocessing the "INCART 2-lead Arrhythmia Database," feature engineering, KMeans clustering, Random Forest, IsolationForest, and an LSTM model for classification.

classification clustering data-science data-visualization deep-learning machine-learning

Last synced: 10 Apr 2025

https://github.com/deliprofesor/health-status-and-heart-attack-risk-eda-regression-and-hypothesis-testing-analysis

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

data-cleaning data-visualization exploratory-data-analysis healthcare-insights hypothesis-testing machine-learning ridge-regression

Last synced: 10 Apr 2025

https://github.com/archanakokate/iris_flower_classification

Analyzing and modeling the Iris dataset with the aim of classifying the species of Iris flowers.

analytics data-visualization exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/jansim/ridges

R package for downloading and visualizing topographical elevation data.

data-visualization geospatial r ridgeline

Last synced: 02 Mar 2025

https://github.com/dcostachar/bellabeat-case-study

An analysis of Fitbit Fitness Tracker data with R to examine user behaviour and conduct a competitor analysis to optimize Bellabeat's product marketing strategies.

consumer-behaviour-analysis data-visualization exploratory-data-analysis ggplot2 health-data marketing-analytics r statistical-analysis tidyverse

Last synced: 02 Apr 2025

https://github.com/albanecoiffe/uber_data_visu_streamlit

Interactive dashboard built with Streamlit for exploring Uber trip data from January 2015 in New York.

data-visualization streamlit

Last synced: 02 May 2026

https://github.com/freya135/personal-finance-manager

This project is a web-based personal finance manager dashboard built using Next.js and Vercel PostgreSQL. The dashboard aggregates essential financial data to help users track metrics like profits, sales, and customer activity, and it provides easy-to-read visualizations to support data-driven decision-making.

data-visualization nextjs personal-finance-manager postgresql vercel webdashboard

Last synced: 13 Apr 2026

https://github.com/jianxi-erin/bigdata-machinelearning-lab

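This project is a comprehensive big data and machine learning lab platform comprising two main tasks, each covering three key technical modules: big data processing, data analysis, and machine learning. It is based on real competition designs and provides complete data processing simulation and modeling practice.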

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/christophertrimboli/trends-vs-stock

A tool to browse culture's impact on stocks. Made in React, Node, Bootstrap, and Chart.js to compare data from the Google Trends and Alpha Vantage APIs.

bootstrap chartjs data-visualization google-trends intrinio nodejs react stocks

Last synced: 28 Apr 2026

https://github.com/davifeliciano/modern_physics_experiments

Collection of data analysis and visualization scripts developed in Python around some modern physics experiments

data-analysis data-visualization modern-physics physics physics-experiments

Last synced: 18 Jan 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/boss294/credisync

CrediSync is a comprehensive web application designed to manage and track your financial transactions, including credits, debits, creditors, and debtors. With modern UI/UX and advanced features, it provides a seamless experience for managing your financial records.

credit-tracker css data-structures data-visualization dsa html html3 js management-system money-management web-dev website websoftware

Last synced: 17 May 2026

https://github.com/theglobemc/theglobemc.github.io

An interactive HTTP visualization of Minecraft books on the web from GlobeMC.

books data-visualization datamining minecraft

Last synced: 07 Oct 2025

https://github.com/bdice/signac-micde-cnsccs-2018

Slides and demos for the MICDE CNSCCS Symposium, October 15, 2018

data-management data-visualization demo signac workflow-automation

Last synced: 07 Oct 2025

https://github.com/18mahi/tweet-sentiment-analysis

Classify tweets into happy, sad, angry, excited, and neutral with this interactive Python model. Combining TF-IDF text features with engineered numeric features like emoji sentiment, polarity, subjectivity, and punctuation counts, it demonstrates intermediate-level NLP, feature engineering, and visualization skills.

data-science data-visualization jupyter-notebook machine-learning matplotlib nlp numpy pandas sklearn string textblob

Last synced: 30 Apr 2026

https://github.com/iankitnegi/tableautales

"Discover my Tableau journey! Dive into data-driven stories, visualizations, and projects as I explore the power of data visualization."

data data-visualization tableau

Last synced: 21 Jan 2026

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/sayamalt/mental-health-classification-using-fine-tuned-distilbert

A multiclass text classification model built by fine-tuning a pretrained DistilBERT transformer to classify several distinct mental health statuses, such as anxiety, stress, and personality disorder, with an accuracy of 77%.

data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 08 Oct 2025

https://github.com/lgibson7/stat_651

Coursework for STAT 651 Data Visualization, Cal State East Bay Fall 2022

data-visualization leaflet shiny-apps tableau

Last synced: 05 Feb 2026

https://github.com/pranavsp108/market_basket_analysis-instacart

Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.

customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn

Last synced: 05 May 2026

https://github.com/syncfusionexamples/how-to-add-arrows-to-the-chart-axis-in-wpf-chart

Learn how to enhance WPF charts by adding arrows to the chart axes using annotations for improved visualization and clarity.

axis-with-arrows chart-annotations chart-axis chart-customization charting-library charts data-visualization line-annotation wpf-char wpf-sfcharts

Last synced: 08 Oct 2025

https://github.com/alexquilis1/spanish-fuel-stations-analysis

Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons

data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse

Last synced: 08 Oct 2025

https://github.com/HarmoniCode/Filtra

Digital Filter Designer is a powerful application built using PyQt5 and Matplotlib. It allows users to design and visualize digital filters, including standard filters and all-pass filters, and generate corresponding C code. Ideal for students, researchers, and engineers in digital signal processing.

data-visualization digital-signal-processing filter-design pyqt5 real real-time-processing

Last synced: 09 Oct 2025

https://github.com/amish5ingh/cricket-data-analytics-ipl

Data analysis and visualization of IPL 2022 matches using Python, Pandas, Matplotlib, and Seaborn. Includes insights on match outcomes, player performances, toss trends, and venue stats with 12+ charts.

data-analysis data-visualization ipl-data-analysis ipl-data-visualization jupiter-notebook matplotlib-pyplot numpy pandas python seaborn

Last synced: 09 May 2026

https://github.com/gauravsy704/sct_ds_2

Performed data cleaning and exploratory data analysis (EDA) on the Titanic dataset from Kaggle. Investigated the relationships between variables and identified key patterns and trends in the data using Python, with a focus on survival rates, passenger demographics, and embarkation details.

data-science data-visualization jupyter-notebook pandas python seaborn

Last synced: 06 May 2026

https://github.com/armahdavi/analytics_statistics_ml_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1, Phase 2: Mahdavi & Siegel (2020), Aerosol Science & Technology (AS&T). Shares all the data pipelines, processing code, descriptive statistics, statistical modeling, and plotting/visualizations. Project milestone: 2017-2020. Full-length article is available.

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 14 Apr 2026

https://github.com/mgckaled/rs_data-analytics

Aggregator repository for the content of Rocketseat's Data Analytics training program.

data-analytics data-visualization python sql statistics

Last synced: 09 Oct 2025

https://github.com/do-me/excel-column-analyzer

A free online tool to analyze Excel column data. Instantly count unique values, calculate frequencies, and visualize results in charts.

chartjs data-science data-visualization tailwind

Last synced: 09 Oct 2025

https://github.com/damisparks/machine-learning

Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns, and make decisions with minimal human intervention.

data-science data-visualization deeplearning machine-learning machine-learning-algorithms machinelearning-python

Last synced: 09 Oct 2025

https://github.com/adithya2369/safa_public

AI-powered customer feedback analyzer that uses generative AI to transform customer reviews into actionable business insights. Upload review data, get instant summaries, satisfaction scores, detailed reports, and improvement suggestions—all in an easy-to-deploy Docker container.

data-analysis data-visualization docker-containerization full-stack-development generative-ai langchain langchain-groq web-development

Last synced: 10 Oct 2025

https://github.com/ianhaggerty/final-capstone

This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.

amazon data-analytics data-science data-visualization dataset matplotlib nlp nlp-machine-learning numpy pandas plotly reviews seaborn spacy tabulate textblob wordcloud

Last synced: 19 Jan 2026

https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python seaborn

Last synced: 04 May 2026

https://github.com/controldata23/automobiles-data-exploration

An exploratory data analysis of an automobiles dataset from Kaggle.

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Jan 2026

https://github.com/gorlix/sp-grafana-bridge

A lightweight, real-time data bridge to export Super Productivity tasks to InfluxDB v2 for advanced analytics on Grafana. Transform your time-tracking into actionable insights.

data-visualization grafana influxdb-v2 plugin productivity-tool super-productivity superproductivity time-tracking

Last synced: 31 May 2026

https://github.com/itskshitija/hr-data-analysis

The HR Data Analytics Dashboard project uses Power BI to analyze employee data, visualizing key HR metrics and KPIs to support data-driven decisions for improving workforce management, employee satisfaction, and organizational growth.

analytics data-science data-visualization dataanalysis dataanalytics hrdataanalysis powerbi-desktop powerbidashboard

Last synced: 21 Jan 2026

https://github.com/priyanshubiswas-tech/deloitte-daikibo-telemetry-analysis-task-1

Tableau dashboard analyzing Daikibo telemetry data. Tracks downtime by factory/device with interactive filters. Deloitte task solution with JSON processing.

data-analysis data-visualization deloitte json tableau tableau-public

Last synced: 11 Oct 2025

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/azaz9026/email-spam-detection

Welcome to the Email Spam Detection project! This repository provides a machine learning model for detecting spam emails using a Naive Bayes classifier and a simple web interface built with Streamlit.

data-analysis data-cleaning data-structures data-visualization deep-learning machine-learning python sql streamlit

Last synced: 14 Apr 2026

https://github.com/benst099/circlesplot

Visualize proportions with circles in a plot

cran cran-r data-science data-visualization proportions r visualization

Last synced: 11 Oct 2025

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/ahsankhizar5/titanic-eda-visualization

Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.

data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset

Last synced: 31 May 2026

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/dzakwanalifi/stadata-x

Terminal UI for interactively exploring and downloading BPS Indonesia data.

bps-api cli-app data-analysis data-visualization indonesia-statistics indonesian-data open-data python statistics terminal-ui textual tui

Last synced: 20 Jan 2026

https://github.com/tzerk/esr

R package 'ESR' for plotting and analysing ESR spectra in dating applications

data-analysis data-visualization electron-spin-resonance geochronology r

Last synced: 13 Mar 2026

https://github.com/alexondata/daan_eda-exploratory-data-analysis_ecommerce

This project presents an Exploratory Data Analysis (EDA) pipeline for an eCommerce dataset, integrating Python, SQL Server, and Power BI to transform raw transactional data into meaningful business insights. The project was developed as part of an academic assignment at Transilvania University of Brașov, Faculty of Mathematics and Computer Science.

data-analysis data-visualization ecommerce microsoft-sql-server powerbi python

Last synced: 18 May 2026

https://github.com/louisfernando1204/websocket-benchmark

A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.

benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws

Last synced: 09 Apr 2026

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025

https://github.com/meettomb/focusetrack

FocusTrack is a simple desktop app built with WPF and C# that helps you see how you spend time on your computer. It tracks the apps you use, records how long you use them, and shows the data in easy-to-read charts and lists. With options to filter by today, last week, or last month, FocusTrack helps you understand your digital habits.

app-usage-tracker csharp data-visualization desktop-app livecharts monitoring productivity time-management wpfh

Last synced: 13 Oct 2025

https://github.com/mishraanuraagx/finsight

FinSight is a machine learning-driven financial analytics tool designed to explore, cluster, and visualize different financial assets based on their risk and return behaviors.

clustering cryptocurrency data-science data-visualization deep-learning fastapi finance financial-analysis gru hierarchical-clustering investment-analysis lstm machine-learning portfolio-analysis predictive-modeling quantitative-finance risk-management stock-market time-series yfinance

Last synced: 24 Apr 2026

https://github.com/ashioyajotham/data_centers_are_eating_the_world

Supercomputers (AI data centers) are being built rapidly; this project is an attempt to map them, as Semi-Analysis does.

data-center data-visualization supercomputers

Last synced: 14 Oct 2025

https://github.com/markusbegerow/powerbi-navigation-menu

Interactive navigation menu visual for Power BI with slide-out filtering and hierarchical data support

business-intelligence d3js data-visualization filter hamburger-menu navigation powerbi powerbi-custom-visuals powerbi-visuals typescript

Last synced: 14 Oct 2025

https://github.com/miserman/splot

An R package to ease data visualization

data-visualization r

Last synced: 22 Jan 2026

https://github.com/mahambilalandahaan/week8

K-Means Deep Dive: Clustering analysis with Elbow and Silhouette methods in Python

clustering data-visualization jupyter-notebook k-means machine-learning python scikit-learn unsupervised-learning

Last synced: 20 Apr 2026