An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/jahnavigupta06/zepto-delivery-customer-analytics

Real-time SQL + Power BI Analytics Project replicating Zepto's customer & delivery insights.

business-intelligence churn-analysis customer-segmentation data-analysis data-visualization powerbi sql-server

Last synced: 02 Aug 2025

https://github.com/lemniscate-world/stratai

This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.

ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading

Last synced: 23 Oct 2025

https://github.com/faizantkhan/regression-project-bangalore-property-price-prediction

🏠 Bangalore Property Price Prediction is a comprehensive project designed to accurately predict property prices in Bangalore. Leveraging advanced regression techniques and a dataset sourced from Kaggle, the model undergoes meticulous feature engineering, data cleaning, and parameter tuning to ensure high accuracy.

backend-api css data-cleaning data-science data-visualization eda flask html javascript machine-learning-algorithms numpy pandas project project-repository property python regression-models server

Last synced: 14 Apr 2026

https://github.com/fengxiaoxiao-001/data_preprocessing

提供处理缺失值,处理异常值,处理特征工程以及多种数据绘图功能;适合大型数据,以及配备处理超多不同数据类型分布的方法

data-science data-visualization processing

Last synced: 29 Apr 2026

https://github.com/mksingh431/python-project

Learn Pandas with exercises and sample projects

data-analysis data-science data-visualization project projects python

Last synced: 03 May 2026

https://github.com/tushard48/product-cluster-analysis

This project performs clustering analysis on a product dataset to identify and group similar products. The analysis includes data preprocessing, application of various clustering algorithms, and visualization of results to gain insights into product patterns. Key techniques used are K-Means, Mini Batch K-Means, evaluated using metr

data-visualization excel machine-learning powerbi streamlit unsupervised-learning

Last synced: 03 May 2026

https://github.com/pxaris/expenditure-analyzer

Application for analyzing expenditure data over time

data-analysis data-visualization docker python statistics

Last synced: 29 Apr 2026

https://github.com/patelabhi574/hotel_reservation_analysis

Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.

data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server

Last synced: 19 Feb 2026

https://github.com/shubhamgoyal575/diwali-sankranti-promotion-sales

This Power BI dashboard analyzes sales performance during Diwali and Sankranti festivals. It provides insights into revenue trends, top-selling products, regional sales distribution, and customer purchasing behavior to help optimize festive season sales strategies. 🚀

buisness-intelligence dashboard data-analysis data-visualization diwali-sankranti-sales-analysis excel fast-moving-consumers-goods fmcg microsoft-power-bi mysql power-query powerbi revenue-insights sales-dashboard sales-insights sql

Last synced: 02 Mar 2026

https://github.com/dsrodrigovieira/houserocketsales

Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python

data-analysis data-visualization heroku kaggle-dataset python

Last synced: 29 Apr 2026

https://github.com/nomadsdev/sys-moninsight

System Monitoring and Analysis Tool is a utility for real-time performance tracking. It logs CPU, memory, and disk usage, provides visual graphs, and offers performance recommendations. Perfect for optimizing system efficiency.

automation cpu-usage data-analysis data-visualization disk-usage matplotlib memory-usage performance-analysis performance-optimization psutil python real-time-monitoring resource-management sys-moninsight system-metrics

Last synced: 19 Jun 2026

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 28 Apr 2026

https://github.com/gracysapra/heart-disease-prediction-using-logistic-regression

This project uses Logistic Regression to predict the likelihood of heart disease based on medical attributes such as age, cholesterol levels, and blood pressure. It includes model training, evaluation, and an interactive Gradio interface for real-time heart disease risk prediction.

classification data-preprocessing data-science data-visualization gradio-interface heart-disease-prediction logistic-regression machine-learning

Last synced: 11 Jun 2026

https://github.com/karthikmprakash/911-call-dataanalysis

Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA

911-call-analysis data-analysis data-visualization python3 united-states-data

Last synced: 10 May 2026

https://github.com/sathyasris27/environmental-classification-based-on-gaming-patterns

The aim of this project is to create a more nuanced understanding of the interactions between socio-demographic characteristics, in-game behaviours, and global-scale environmental consciousness.

data-engineering data-mining data-science data-visualization python-3

Last synced: 28 Apr 2026

https://github.com/mastersign/mastersign-datascience

High level helpers for data science in Python with Pandas.

data-science data-visualization database-access pandas python

Last synced: 05 May 2026

https://github.com/johnwalley/data-viz-resources

Data visualization resources

data-visualization

Last synced: 08 Jan 2026

https://github.com/anarya22/tata-data-visualization-empowering-business-with-effective-insights-job-simulation-on-forage

Completed a simulation involving creating data visualizations for Tata Consultancy Services. Created visuals for data analysis to help executives with effective decision making.

business-analysis data-cleaning-and-preprocessing data-visualization excel powerbi

Last synced: 07 Jan 2026

https://github.com/mulliru/estudo-data-visualization

Este repositório contém os códigos que desenvolvi e utilizei durante o curso de Visualização de Dados da Alura. Aqui, você encontrará diversos scripts e notebooks que abordam os principais conceitos e técnicas de Data Science, aplicados em diferentes contextos e projetos.

alura data-visualization

Last synced: 08 Jun 2026

https://github.com/rahul-404/full_stack_data_science_with_generative_ai

Welcome to the repository for the course "Full Stack Data Science with Generative AI". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science and generative AI techniques.

data-analysis data-science data-visualization database deep-learning exploratory-data-analysis feature-engineering generative-ai machine-learning nlp python statistics

Last synced: 12 Apr 2026

https://github.com/vetrivel07/data-visualization-portfolio

This repository showcases my Data Visualization projects using Power BI and Tableau, along with Python-based exploratory analysis. It includes dashboards, data storytelling, and business insights

dashboard data-cleaning data-schema data-visualization tableau

Last synced: 02 Mar 2026

https://github.com/getkey/stereotype-map

Map of national stereotypes

data-visualization google-suggestions vuejs vuex

Last synced: 12 Apr 2026

https://github.com/yusuf-abol/alumni-interaction-and-conversation-dynamics-nlp

This Natural Language Processing (NLP) project took a dive into chat engagement dynamics within the University of Ilorin’s Class of 2018 Statistics alumni group. By applying Latent Dirichlet Allocation (LDA) for topic modeling and network analysis, I uncovered communication patterns, topic distributions, and member interactions.

alumni-network anonymization conversation data-science data-visualization engagement machine-learning network-analysis nlp python-3 sentiment-analysis statistics whatsapp

Last synced: 05 May 2026

https://github.com/aryathel/twittersentimentanalysis

WORK IN PROGRESS. This project will allow users to enter in a keyword or hashtag to search for on Twitter, as well as the number of Tweets to include in their search, and the program will return an analysis of the general sentiment of that topic.

data-science data-visualization flask python twitter-api twitter-sentiment-analysis

Last synced: 27 Apr 2026

https://github.com/robertopatino1/oscars2023_data_analysis

A deep data science analysis involving tweets regarding the upcoming Academy Awards

data data-analysis-python data-science data-visualization html jupyter-notebook lda-model machine-learning python trends tweepy twitter

Last synced: 24 Apr 2026

https://github.com/linuxto5re/salesandguestmanagementmldotnet

welcome to our Sales and Guest Projection repository! Discover precise guest predictions via ML.NET, historical data, and advanced tech. This model also applies to sales forecasts, fueled by ML.NET's capabilities. In addition, we've added data visualization.

csharp data-visualization machine-learning mldotnet mvvm-architecture oxyplot sql-server

Last synced: 27 Apr 2026

https://github.com/albertomorini/policesviolence

Repository for the project of the course Data Science (Fondamenti di Scienza dei Dati) at UniUD.

data-analysis data-science data-visualization r

Last synced: 31 May 2026

https://github.com/webmobiledev/d3-visuals

There are some data visualizations like treechart, bubblechart, mbostock, mapchart using d3.js.

bootstrap d3js data-visualization highcharts mbostock-d3

Last synced: 27 Apr 2026

https://github.com/ascender1729/cds-iisc-p1-datasci-predoc

A data science project utilizing machine learning to predict movie release years and genres based on directors' previous works.

data-cleaning data-visualization film-industry machine-learning movie-metadata multi-label-classification predictive-modeling regression-analysis-feature-engineering

Last synced: 31 Mar 2025

https://github.com/avallecam/cdcper

Miscelanea de funciones customizadas a tareas de análisis en CDC Perú

data-manipulation data-mining data-visualization data-wrangling r tidyverse

Last synced: 07 Jun 2026

https://github.com/clubgamma/machine-learning-and-data-analysis-tasks

This repository is a part of Hacktoberfest 2022. I have created few basic task of machine learning and Exploratory data analysis. Have a look at it

contributions-welcome data-visualization exploratory-data-analysis hacktoberfest hacktoberfest2022 machine-learning-algorithms open-source python

Last synced: 25 Apr 2026

https://github.com/jdanielgoh/mortalidad-infantil

Este proyecto se realizó para el datatón sobre niñez y adolescencia en México 2023. Contiene visualizaciones sobre mortalidad infantil con d3 y vue

d3 d3js data-visualization

Last synced: 09 Jun 2026

https://github.com/faezeh-gholamrezaie/visual-google-scholar-search

A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.

academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud

Last synced: 25 Apr 2026

https://github.com/eduardobursa/d3-nuget

D3 package for aspnet applications.

data-visualization javascript

Last synced: 04 May 2026

https://github.com/zane/plot

A tiny Clojure library for plotting things at the REPL.

clojure data-science data-visualization repl statistics

Last synced: 01 Sep 2025

https://github.com/asifdotexe/quickvu

Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.

automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit

Last synced: 06 Jun 2026

https://github.com/kitestring/scidata

Extracts and cleanses flat data exported from a Chemical Analyzer instrument (Time of Flight Mass Spectrometer), then loads data into a SQL database using a Star schema. A basic CLI is implemented to query SQL database to create data visualizations which describe instrument performance in a simple and digestible manner. The following python libraries are utilized: numpy, pandas, matplotlib, sklearn, statistics, sqlite3, os, time, & datetime.

data-visualization extract matplotlib pandas python3 sql-database

Last synced: 25 Apr 2026

https://github.com/Rayyan9477/Calorie-Burnage-Exploratory-Data-Analysis

Calorie Burnage is the measure of calories burned during physical activity or exercise, crucial for weight management and fitness goals. This project focuses on analyzing a dataset that includes information on duration, pulse rates, and calories burned during exercise sessions.

data-analytics data-science data-visualization exploratory-data-analysis linear-regression r-language r-programming

Last synced: 29 Apr 2025

https://github.com/flazefy2/ds-50k_songs_dataset_generated_by_ai

https://www.kaggle.com/datasets/refiaozturk/spotify-songs-dataset

csv data-science data-visualization jupyter-notebook python statistics

Last synced: 25 Apr 2026

https://github.com/nicholas-miklaucic/rho_plus

The Python data viz nitro canister you didn't know you needed

aesthetics bokeh colormap data-visualization matplotlib plotly python

Last synced: 05 May 2026

https://github.com/flazefy2/ds-global_cybersecurity_threats

https://www.kaggle.com/datasets/atharvasoundankar/global-cybersecurity-threats-2015-2024

data-science data-visualization numpy python squarify statistics

Last synced: 05 May 2026

https://github.com/gholamrezadar/favourite-youtube-channels

this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!

data-analysis data-visualization python

Last synced: 16 Jan 2026

https://github.com/leosimoes/uerj-tcc-analisador-dados

Trabalho de conclusão de curso (TCC) em Engenharia de Computação. Aplicativo Web para preparação e análise de dados, criação de gráficos e modelos de regressão linear e logistica.

computer-engineer data-analysis data-science data-visualization linear-logistic linear-regression python streamlit

Last synced: 24 Apr 2026

https://github.com/amr-yasser226/datagovernanceworkflow

Comprehensive data governance pipeline for SSH honeypot logs—covering data profiling, cleansing, quality assurance, encryption, classification, and GDPR/CCPA/HIPAA compliance. Built with Pandas, Pandera, YData Profiling, and cryptography, with simulated Caesar cipher attacks to demonstrate practical data-security techniques.

caesar-cipher ccpa cryptography cybersecurity data-cleaning data-encryption data-governance data-profiling data-quality data-validation data-visualization gdpr hipaa honeypot-analysis open-source pandas privacy-compliance python ssh-logs

Last synced: 05 Feb 2026

https://github.com/sandravizz/global_inequality_story

Dataviz Project about Global Inequality

data data-visualization inequality

Last synced: 03 Jul 2025

https://github.com/shuddha2021/stellar-candidate-selector

A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.

candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn

Last synced: 05 May 2026

https://github.com/hecatops/insightbench

Insight Bench is a web-based CSV analysis tool built with Streamlit, designed for quick, effortless exploration of your CSV files. Simply upload your file and get instant insights without needing any setup or coding.

data-visualization exploratory-data-analysis python shadcn-ui streamlit

Last synced: 24 Apr 2026

https://github.com/archanakokate/movielens-case-study-eda-prediction-

Exploratory Data Analysis on Movielens data files and Model building using Decision Tree Classifier , Random Forest Classifier and XG Boost.

data-visualization dataengineering exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/ninadpatil09/bankcard-analytics---credit-debit-card-usage-monitoring

This project is a comprehensive data analysis initiative aimed at extracting valuable insights from bank card usage data. The tools and techniques includes Python, Excel, Tableau, web scraping, pandas. It centers around understanding and visualizing trends and patterns in credit and debit card usage across multiple banks.

data-cleaning data-visualization excel python tableau tableau-public web-scraping

Last synced: 18 Apr 2026

https://github.com/hakaneroztekin/web-scraping

☕ A web scraping project developed with Python. 📈 It scrapes the website, collects and visualizes data.

data-science data-visualization python web-scraping

Last synced: 15 Jun 2026

https://github.com/dianaow/star-wars-viz

A simple implementation of a scatter plot built with React + D3 (Typescript)

d3js d3v4 data-visualization reactjs star-wars star-wars-api typescript

Last synced: 21 Apr 2026

https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing

Examined factors influencing demand for micro-mobility shared electric cycles Performed exploratory analysis and hypothesis testing, revealing the distinct influence of weather-season association on hourly counts

colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn

Last synced: 12 Apr 2026

https://github.com/as16082023/coffee-bean-sales-analysis

Analyzing coffee bean sales data to optimize consumer targeting, product offerings, and strategic marketing in the coffee industry.

coffee-bean-sales dashboard data-analysis data-visualization ms-excel

Last synced: 22 Jan 2026

https://github.com/ricardolsmendes/sna-dw

Companion repository for my Data Science & Analytics MBA Term Paper: "Social Network Analysis applied to Data Warehouses: opportunities and constraints for Data Governance"

data-analytics data-governance data-science data-visualization network-science social-network-analysis

Last synced: 21 Apr 2026

https://github.com/hamdaniqhmqd/kelompok6-sistem-cerdas-bbri

Repository group6-system-smart-bbri is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 03 Apr 2025

https://github.com/md-emon-hasan/ml-projects-telcom-customer-churn-prediction

📱 Customers are likely to leave a telecom service, enabling companies to take measures for retention and create accurate churn prediction models.

boostrap5 customer-churn customer-segmentation data-engineering data-science data-visualization logestic-regression machine-learning telco-customer-churn-prediction telcom-churn

Last synced: 05 May 2026

https://github.com/rakumar99/power-bi-projects

This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.

dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports

Last synced: 04 Jun 2026

https://github.com/msohaill/wrappedify

Full stack implementation of Wrappedify

data-visualization express music socket-io spotify sveltekit

Last synced: 05 May 2026

https://github.com/neerajcodes888/diwali-sales-analysis

An open-source repository for sales data analysis. Dive into insightful trends, metrics, and visualizations to empower data-driven decision-making. Ideal for data analysts, business professionals, and enthusiasts seeking comprehensive sales insights. Clone, customize, and contribute to enhance your sales analytics journey.

data-science-projects data-visualization numpy pandas-dataframe python3 sales-analysis seaborn-plots

Last synced: 26 Mar 2025

https://github.com/kristinbaumann/data-art-pi

Data Vis Coding - Data Art with Pi

d3 data-visualization pi random

Last synced: 20 Apr 2026

https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity

In this project a brief analysis has been done by our team in the export economy of Bangldesh for the past three decades.

data-analysis data-science data-visualization inequalipy matplotlib pandas plotly

Last synced: 22 May 2026

https://github.com/gappeah/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 31 Jul 2025

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/kuroko1t/geoview

A lightweight, browser-based GIS data viewer built with Streamlit and Geopandas. Visualize Shape files, GeoJSON, and more instantly

data-visualization folium geojson geopandas gis shapfile streamlit

Last synced: 29 May 2026

https://github.com/karlyndiary/amazon-sentiment-analysis-eda

Amazon Customer Reviews Sentiment Analysis utilizing Python for Data Extraction and Pre-Processing, with real-time data from the Amazon API and Tableau for dashboard visualizations.

amazon-api api-data-extraction dashboard data-cleaning data-pipeline data-visualization etl roberta-sentiment-analysis sentiment-analysis tableau-dashboards vader-sentiment-analysis

Last synced: 22 Mar 2025

https://github.com/romerorodriguezd/housing-market-tracker

Jupyter Notebook to store and evaluate long-run evolution of house sales, based on Idealista website.

data-visualization matplotlib pandas python scraping scraping-python sqlite3

Last synced: 06 May 2026

https://github.com/sayamalt/fraudulent-transactions-prediction

Successfully trained a machine learning model which can predict whether a given transaction is fraud or not.

data-visualization exploratory-data-analysis imblearn machine-learning model-based-testing model-building predictive-analytics sklearn

Last synced: 29 Apr 2026

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/azmainadel/twitter-data-neo4j

Playing with graph database on a large dataset of twitter data.

data-analysis data-visualization neo4j-database snap

Last synced: 06 Apr 2025

https://github.com/aubainmbk/optimisation-ventes-supermarche

Notre objectif est de ressortir des informations grâce aux données de ventes d'un Supermarché.

data-visualization dataanalysis excel powerbi

Last synced: 04 Feb 2026

https://github.com/dan-alvares/climaterbot

Repositório do projeto do bot CLIMATER, que fornece visualização de dados agroclimáticos, suas médias e séries temporais para usuários do Telegram, democratizando o acesso desses dados para pequenos produtores.

data-visualization inmet telegram-bot time-series

Last synced: 04 Apr 2026

https://github.com/jolars/qualpal-py

A Python package for automatic generation of qualitative color palettes

colors data-visualization palette-generation

Last synced: 04 Apr 2026

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/amirhosseinhonardoust/customer-sentiment-intelligence-platform

An enterprise-grade NLP + Streamlit + SQL platform for analyzing customer feedback. Performs automated sentiment detection, stores labeled reviews in SQLite, and delivers real-time dashboards with probability insights to support business, marketing, and product optimization decisions.

community-project cost-of-living dashboard data-analysis data-visualization economic-analysis inflation-tracking local-data open-data pandas price-tracker public-insight python sqlite streamlit

Last synced: 06 May 2026

https://github.com/dsnchz/solid-plotly

SolidJS wrapper for Plotly.js – reactive and performant charts powered by Plotly, built for Solid.

analysis analytics charting-library charts data-visualization solidjs visualization

Last synced: 02 Apr 2026

https://github.com/namratha2301/best-selling-books

Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance.This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.

data-analysis data-visualization matplotlib pandas sckiit-learn seaborn

Last synced: 06 May 2026

https://github.com/soufianboukir/ecom-analytics-platform

End-to-end data science project on an Amazon sales dataset, including data preprocessing, analysis, modeling, and a Streamlit dashboard for insights and decision-making.

data-analysis data-science data-visualization data-visualization-dashboard forecasting-models timeseries

Last synced: 14 Jun 2026