An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/tejas-130704/whatsapp-analyser

ChatMate is a web app that analyzes WhatsApp chats, providing insightful visualizations like word clouds, heatmaps, and activity timelines. It calculates total messages, words, media, links, and more, helping you understand chat patterns for groups or individuals with ease. Simply upload your chat file and get detailed reports instantly!

data-analysis data-visualization python streamlit web-application whatsapp-analysis

Last synced: 26 Apr 2026

https://github.com/sankalp130/call-center_data-analysis

This repository contains Tableau dashboard, visualization, and data analysis projects I have created.

data-cleaning data-schema data-visualization tableau

Last synced: 05 Jan 2026

https://github.com/czheluo/fst-manhattan

Fst manhattan Plot

data-visualization fst

Last synced: 14 Apr 2026

https://github.com/bydmiller/textwizards

This repository houses a collection of my natural language processing (NLP) projects, showcasing a variety of applications and experiments in the field. From sentiment analysis to language modeling, explore the power of NLP through my code. Feel free to use, modify, and contribute!

char-rnn chat-bot-nltk classification data-visualization encoder-decoder-attention gpt gradient-descent math-question-solver n-gram-language-models naive-bayes-classifier nlp patient-survival-analysis regression-algorithms sentiment-analysis smallgpt spell-checker titanic-dataset translation trax word-embeddings

Last synced: 30 Jul 2025

https://github.com/mastercruelty/gokart-data-hub

It manages data about gokart races and plot graphs about your times!

data-analysis-python data-science data-visualization gokart matplotlib pandas race

Last synced: 15 Apr 2025

https://github.com/jorgeatgu/apaga-luz

💡 ¿Cuánto cuesta la luz? 💶

data data-visualization flat-data

Last synced: 04 Feb 2026

https://github.com/linuxto5re/salesandguestmanagementmldotnet

welcome to our Sales and Guest Projection repository! Discover precise guest predictions via ML.NET, historical data, and advanced tech. This model also applies to sales forecasts, fueled by ML.NET's capabilities. In addition, we've added data visualization.

csharp data-visualization machine-learning mldotnet mvvm-architecture oxyplot sql-server

Last synced: 27 Apr 2026

https://github.com/guruakaashjn/te_project_microsoft_ai

AI based statistical analysis of land-use plastic pollution in India using AI/ML techniques.

artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi

Last synced: 27 Feb 2025

https://github.com/jfoster17/save

Search Analysis and Visualization Environment

analysis data-science data-visualization environment search visualization

Last synced: 07 Apr 2025

https://github.com/prankshaw/election-analytica

Analyzing previous election results for Haryana Vidhan Sabha and other factors and to compare them with various parameter to conclude results.

anaconda collection data-analysis data-science data-visualization elections jupyter-notebook python python-3 wrangling

Last synced: 16 May 2026

https://github.com/floressek/data_analysis_and_visualization

This repository contains a collection of statistical data analysis laboratories using R. Each lab focuses on different aspects of data exploration, visualization, and analysis techniques.

data-analysis data-visualization

Last synced: 05 Oct 2025

https://github.com/pavankethavath/dataspark-illuminating-insights-for-global-electronics

DataSpark is a retail analytics project for Global Electronics leveraging Python, SQL, and Power BI. It uncovers customer insights, sales trends, and store performance to optimize marketing, inventory, and operations. Features include clean datasets, SQL-driven analysis, and interactive dashboards, driving data-driven growth and decision-making.

data-engineering data-visualization dataanalytics powerbi python retail-data sql

Last synced: 27 Apr 2026

https://github.com/lilivalgo/coal-production-colombia

Data analysis that includes information on annual coal production, royalties generated, and climate variables. Descriptive analysis and visual analysis techniques were used

analysis data-visualization dataframes insights manipulation matplotlib python seaborn transformation

Last synced: 27 Apr 2026

https://github.com/noturlee/titanic-datamodel

This project demonstrates the process of data preprocessing, model training, evaluation, and tuning in building a predictive model for a classic dataset. The Random Forest model, with its ability to handle complex relationships and interactions between features, proved to be the most effective in this case.

data-modeling data-science data-visualization python

Last synced: 08 Apr 2025

https://github.com/valeriiaanisimova/fractals-project

Interactive fractal-based contact map visualizer with significance filtering and relationship display.

contact-maps css data-visualization fractals html interactive javascript tree-layout ui visualization

Last synced: 06 Oct 2025

https://github.com/strickvl/knitting-llms

Generating Knitting Patterns from Language Models

art crafts data-visualization knitting llm llms

Last synced: 06 Oct 2025

https://github.com/ahmetcansolak/developer-insights

New project of ClubRockers from Sarıyer Hills

bitbucket data data-science data-visualization github python3

Last synced: 28 Apr 2026

https://github.com/zane/plot

A tiny Clojure library for plotting things at the REPL.

clojure data-science data-visualization repl statistics

Last synced: 01 Sep 2025

https://github.com/ondrejhruby/airbnb-analysis-machine-learning

A comprehensive end-to-end machine learning project analyzing Airbnb listings data. This project includes exploratory data analysis, model training, optimization, and model interpretability, using a randomly generated dataset for demonstration purposes.

airbnb-data data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-interpretability python regression-analysis

Last synced: 20 Jul 2025

https://github.com/nfaltir/word_visualizer

🧩 Visualize speeches for fun.

data-visualization python streamlit wordcloud

Last synced: 28 Apr 2026

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 21 Jan 2026

https://github.com/qtle3/support-vector-regression

This project implements Support Vector Regression (SVR) to predict the salary of an employee based on their position level. The script uses a dataset that contains position levels and corresponding salaries, applying feature scaling to improve the performance of the SVR model. The results are visualized to show how well the model fits the data.

data-visualization feature-scaling gaussian-kernel inverse-transform-method support-vector-regression

Last synced: 25 Oct 2025

https://github.com/fatihilhan42/eda-on-data-science-salary-with-python

You can access the files of this project, which analyzes people working in the field of data science according to countries and working wages.

analysis data-science data-scientists data-visualization jupyter-notebook pyhton salary

Last synced: 23 Mar 2025

https://github.com/mulliru/estudo-data-visualization

Este repositório contém os códigos que desenvolvi e utilizei durante o curso de Visualização de Dados da Alura. Aqui, você encontrará diversos scripts e notebooks que abordam os principais conceitos e técnicas de Data Science, aplicados em diferentes contextos e projetos.

alura data-visualization

Last synced: 08 Jun 2026

https://github.com/mitevpi/vue-d3-bar-chart

Reusable, reactive, animated bar chart using D3 + Vue.js. Written in idiomatic Vue, rather than D3 syntax.

d3 data data-visualization frontend interactive svg vue web

Last synced: 18 May 2026

https://github.com/cwendorf/polyplot

Visualizing Distributional Statistics [R Package]

data-visualization distribution-shape polyplot r r-package statistics

Last synced: 15 Mar 2025

https://github.com/paul019/pappe

A CLI to draw your data on top of millimeter paper

automation data-visualization diagram diagram-generator python

Last synced: 05 Mar 2025

https://github.com/lamp-lucas/foton_contacts

Plugin para Redmine que gerencia pessoas, empresas e vínculos com foco em projetos da indústria AEC. Totalmente integrado, responsivo e parte do ecossistema Livre e OpenSource Mundo AEC.

aec-industry construction-tech contacts-management crm csv-import data-visualization foton issue-linking mundo-aec open-source project-management redmine-plugin ruby-on-rails team-collaboration user-profile vcard

Last synced: 09 May 2026

https://github.com/gutyoh/netflix-dashboard

Dashboard created with Python 🐍 and the Dash 📊 framework to visualize Netflix’s content distribution and classification.

dash dashboard data-visualization plotly python

Last synced: 08 Apr 2025

https://github.com/sathyasris27/environmental-classification-based-on-gaming-patterns

The aim of this project is to create a more nuanced understanding of the interactions between socio-demographic characteristics, in-game behaviours, and global-scale environmental consciousness.

data-engineering data-mining data-science data-visualization python-3

Last synced: 28 Apr 2026

https://github.com/sourabh-kumar04/numpy-basic

Numpy-Basic is a structured learning repo covering NumPy from basics to advanced. It includes arrays, indexing, reshaping, filtering, vector ops, angle functions, stats, and .npy file handling. Each concept is explained with code, examples, and Matplotlib visualizations in both light and dark modes. Ideal for students and data learners.

data-analysis data-science data-visualization learning learning-resources machine-learning matplotlib numerical-computing numpy python python-library python-programming

Last synced: 10 May 2026

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 23 Mar 2025

https://github.com/pxaris/expenditure-analyzer

Application for analyzing expenditure data over time

data-analysis data-visualization docker python statistics

Last synced: 29 Apr 2026

https://github.com/tiesen243/data-analyst

A simple data visualization with python

data-visualization jupyter-notebook

Last synced: 18 Aug 2025

https://github.com/neerajcodes888/diwali-sales-analysis

An open-source repository for sales data analysis. Dive into insightful trends, metrics, and visualizations to empower data-driven decision-making. Ideal for data analysts, business professionals, and enthusiasts seeking comprehensive sales insights. Clone, customize, and contribute to enhance your sales analytics journey.

data-science-projects data-visualization numpy pandas-dataframe python3 sales-analysis seaborn-plots

Last synced: 26 Mar 2025

https://github.com/mansurpro/digitrecognizer

DigitRecognizer is a simple, handcrafted neural network implemented with NumPy to classify handwritten digits from the MNIST dataset. This project demonstrates the foundational principles of neural networks, including forward propagation, backpropagation, and gradient descent, all built from scratch without relying on deep learning frameworks.

backpropagation custom-dataset data-visualization deep-learning from-scratch gradient-descent handwritten-digit-recognition machine-learning minimal-dependencies mnist neural-network numpy python

Last synced: 16 May 2026

https://github.com/archanakokate/movielens-case-study-eda-prediction-

Exploratory Data Analysis on Movielens data files and Model building using Decision Tree Classifier , Random Forest Classifier and XG Boost.

data-visualization dataengineering exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/rizerkrof/dataviz-smokingimpactoninsurancecharges

Data visualization about smoking impact on insurance annual charges

data-visualization insurance-pricing plotly-python

Last synced: 30 Apr 2026

https://github.com/fengxiaoxiao-001/data_preprocessing

提供处理缺失值,处理异常值,处理特征工程以及多种数据绘图功能;适合大型数据,以及配备处理超多不同数据类型分布的方法

data-science data-visualization processing

Last synced: 29 Apr 2026

https://github.com/gui-sitton/churn-finalproject

predict its customers' churn. If it is discovered that a user is planning to switch operator, the company will offer them promotional codes and special plan options.

churn-prediction data-analysis-python data-science data-visualization

Last synced: 26 Jul 2025

https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning

An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.

class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote

Last synced: 02 May 2026

https://github.com/uba/cmap2png

Useful script to convert a color map definition to PNG image file.

colormap converter cpt data-science data-visualization matplotlib plot png python

Last synced: 17 Sep 2025

https://github.com/treyhamilton/stat-project-1

A compilation of various programming concepts written in R covering the topics listed below

data-visualization exploratory-data-analysis regression-models

Last synced: 28 Jun 2025

https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project

This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.

apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres

Last synced: 03 Jul 2025

https://github.com/luabagg/worldwide-trends

Worldwide Google Trends visualization and classification

data-analysis data-visualization google-trends trends

Last synced: 03 Feb 2026

https://github.com/reubenmathew/dataopsexperiments

Machine Learning, NLP and various data operations

data-science data-visualization python3

Last synced: 22 Apr 2025

https://github.com/datalopes1/ds_salaries2024_eda

Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) a partir do dataset Data Science Salaries 2024, que pode ser encontrado no Kaggle, com licensa Database: Open Database e enviado por Sazidul Islam.

data-analysis data-visualization eda exploratory-data-analysis jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/403errors/offset-to-hydrostatic-curve

Group project for making hydrostatic curves from the given ship offset table using python libraries

data-visualization matplotlib-pyplot numpy python3 sklearn

Last synced: 10 May 2026

https://github.com/arif-miad/global-plastic-waste-analysis

Global plastic waste is a pressing environmental issue, with massive production, limited recycling, and high risks to ecosystems and human health

catboost-classifier data-science data-visualization geopandas machine-learning matplotlib python random-forest-classifier seaborn

Last synced: 29 Apr 2026

https://github.com/patelabhi574/hotel_reservation_analysis

Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.

data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server

Last synced: 19 Feb 2026

https://github.com/prajakta1321/kaggle-ai-report-2023

A Report describing the trends in emergence of AI over the years !

data-analysis data-visualization python3

Last synced: 28 Jun 2025

https://github.com/chrnthnkmutt/theartofstatistic_python

This repository is implemented from David Spiegelhalter's The Art of Statistics Book, for making Python Visualization

data data-science data-visualization machine-learning statistics

Last synced: 08 Jun 2026

https://github.com/dhrupad17/ibm-data-analyst-professional-certificate

Prepare for a career as a data analyst. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.

assignment-solutions coursera data-analytics data-science data-visualization excel ibm pandas professional-certificate professional-certificates python quiz updated-2024

Last synced: 13 Apr 2026

https://github.com/p0u4a/leet-track

Web app for tracking and visualising LeetCode progress.

data-visualization leetcode leetcode-stats web-app

Last synced: 01 Apr 2025

https://github.com/albertomorini/policesviolence

Repository for the project of the course Data Science (Fondamenti di Scienza dei Dati) at UniUD.

data-analysis data-science data-visualization r

Last synced: 31 May 2026

https://github.com/robinmillford/analytics_for_fashion_supply_management

This Streamlit dashboard provides a comprehensive analysis of supply chain data, focusing on key metrics such as production volumes, stock levels, order quantities, revenue, manufacturing costs, lead times, shipping costs, transportation routes, risk factors, and sustainability factors

dashboard data-analysis data-visualization streamlit supply-chain-management

Last synced: 07 Sep 2025

https://github.com/pkx8326/simple_math_curves_sql_python_plotly

This project demonstrates the use of mysql.connect package with Python and Plotly to create an SQL database, tables, and visualize data from those tables.

data-visualization database datavisualization mysql mysql-connector plotly python sql

Last synced: 10 Apr 2026

https://github.com/gappeah/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 31 Jul 2025

https://github.com/natanast/immunovisual

A collection of charts made with the R programming language, focusing on immunogenetics analyses. Different charts types are being organized into multiple sections, each accompanied by its reproducible code. The gallery spotlights the utilization of prominent R packages such as tidyverse, data.table, and ggplot2.

data-visualization ggplot2 quarto r-programming

Last synced: 11 Mar 2026

https://github.com/robertopatino1/oscars2023_data_analysis

A deep data science analysis involving tweets regarding the upcoming Academy Awards

data data-analysis-python data-science data-visualization html jupyter-notebook lda-model machine-learning python trends tweepy twitter

Last synced: 24 Apr 2026

https://github.com/uudigitalhumanitieslab/digital-atlas

Interface to visualise connections between postcolonial intellectuals

data-visualization postcolonialism

Last synced: 23 Mar 2025

https://github.com/gusenov/radial-tree-js

Реализация радиального дерева на JavaScript.

canvas data-visualisation data-visualization javascript svg tree visualization

Last synced: 20 Feb 2026

https://github.com/jubinjacob03/heartdiseaseclassify-ml

Heart Disease Dataset Analysis & Classification using ML models such as linear, support vector machine, k-means, k-nearest neighbors and logistic regression.

data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine

Last synced: 18 Jan 2026

https://github.com/vetrivel07/data-visualization-portfolio

This repository showcases my Data Visualization projects using Power BI and Tableau, along with Python-based exploratory analysis. It includes dashboards, data storytelling, and business insights

dashboard data-cleaning data-schema data-visualization tableau

Last synced: 02 Mar 2026

https://github.com/VektorTech/air-pollution-map

Air pollution data visualization on an interactive globe made with three js.

api-integration chartjs data-visualization geojson geolocation-api glsl gsap threejs typescript webgl

Last synced: 08 Apr 2025

https://github.com/vatshayan/hospital-discharge-analysis

Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc

data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn

Last synced: 04 Mar 2025

https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis

The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.

codebasics data-analysis data-science data-visualization mathematics python python3 statistics

Last synced: 30 Apr 2026

https://github.com/viseshrp/community_health_indicator

Android app to fetch,organize and represent NYC health data

android data-analysis data-visualization health

Last synced: 03 Mar 2025

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 16 Mar 2025

https://github.com/nrobledosagredo/lda-topic-analysis-news-chile

Topic analysis on news from Chile using LDA for extracting and visualizing relevant patterns.

data-visualization latent-dirichlet-allocation

Last synced: 16 Mar 2025

https://github.com/alcestide/scianalytics

Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.

csv data-analysis data-science data-visualization pandas plotly python science-research statistics

Last synced: 30 Apr 2026

https://github.com/gutyoh/narrative_visualization

Visualizations of the evolution of Netflix's content over the years using the D3 JavaScript library.

d3 data-visualization javascript

Last synced: 30 Apr 2026

https://github.com/edseldim/FirstRoundElectionsFr

A data visualization spreadsheet on Excel

data-analysis data-visualization excel pandas python

Last synced: 02 Aug 2025

https://github.com/alrza2003/alrza2003.github.io

This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.

data data-analysis data-visualization portfolio portfolio-website python

Last synced: 30 Apr 2026

https://github.com/mkk-1817/hr-attrition

This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.

data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python

Last synced: 03 May 2026

https://github.com/gracysapra/heart-disease-prediction-using-logistic-regression

This project uses Logistic Regression to predict the likelihood of heart disease based on medical attributes such as age, cholesterol levels, and blood pressure. It includes model training, evaluation, and an interactive Gradio interface for real-time heart disease risk prediction.

classification data-preprocessing data-science data-visualization gradio-interface heart-disease-prediction logistic-regression machine-learning

Last synced: 11 Jun 2026

https://github.com/affec-ds/dashboard-ventas-vinilos

Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.

business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records

Last synced: 30 Apr 2026

https://github.com/ahmednasef3/udemy-courses-full-eda

Exploratory Data Analysis on the factors that can affect the promotions and earnings in Udemy Courses and the perfect way to make a good saled course in Udemy.

data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project

Last synced: 01 May 2026

https://github.com/jahnavigupta06/zepto-delivery-customer-analytics

Real-time SQL + Power BI Analytics Project replicating Zepto's customer & delivery insights.

business-intelligence churn-analysis customer-segmentation data-analysis data-visualization powerbi sql-server

Last synced: 02 Aug 2025

https://github.com/sirawin/ona-visualization

🔗 Interactive Organizational Network Analysis visualization with D3.js, community detection, and team analytics. Live demo: https://ona-lh77jtqk9-sirawins-projects.vercel.app

community-detection d3js data-visualization javascript network-graph organizational-network-analysis vercel vite

Last synced: 05 May 2026