An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ascender1729/cds-iisc-p1-datasci-predoc

A data science project utilizing machine learning to predict movie release years and genres based on directors' previous works.

data-cleaning data-visualization film-industry machine-learning movie-metadata multi-label-classification predictive-modeling regression-analysis-feature-engineering

Last synced: 31 Mar 2025

https://github.com/muhammadhilmyputrarisma/ab-test

Python code for A/B testing on Cookie Cats game data. This project analyzes the impact of moving the first gate from level 30 to level 40 on player retention and game rounds, helping to evaluate if delaying the gate improves player engagement and gameplay experience.

ab-testing cookie-cats data-analysis data-visualization game-analytics python statistics

Last synced: 18 May 2026
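
The retention comparison described in the entry above can be sketched as a two-proportion z-test. This is only an illustrative sketch, not the repository's own code: the function name `two_proportion_z_test` and the player counts are hypothetical, and the repository may use a different method.

```python
import math


def two_proportion_z_test(retained_a, total_a, retained_b, total_b):
    """Return (z, two-sided p-value) for the difference in retention rates."""
    p_a = retained_a / total_a
    p_b = retained_b / total_b
    # Pooled retention rate under the null hypothesis of no difference.
    pooled = (retained_a + retained_b) / (total_a + total_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution, via erfc.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts, not taken from the Cookie Cats data:
# group A has the gate at level 30, group B at level 40.
z, p = two_proportion_z_test(retained_a=450, total_a=1000,
                             retained_b=420, total_b=1000)
print(f"z = {z:.3f}, p = {p:.4f}")
```

A small p-value would suggest the retention rates of the two gate positions differ by more than chance alone would explain.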

https://github.com/johnwalley/data-viz-resources

Data visualization resources

data-visualization

Last synced: 08 Jan 2026

https://github.com/singh-dhruv/data-visualization-with-accuracy-metrics

Welcome to this repository! Here we load the iris_dataset, then train and test models on it, producing data visualizations along with accuracy metrics. Data visualizations and accuracy metrics are provided for some of the models used.

accuracy-metrics data-visualization hyperparameter-optimization knn-model logistic-regression svm-model

Last synced: 02 Jul 2025

https://github.com/emredurukn/data-analysis

Example notebooks for analyzing data

data-analysis data-visualization python

Last synced: 12 May 2026

https://github.com/MinaRavi/first-practice-data-cleaning-visualization

Basic practice of data cleaning and data visualization

data-analytics data-cleaning data-visualization git

Last synced: 31 Mar 2025

https://github.com/chandkund/spam-email-detection

This project focuses on detecting spam emails using a fine-tuned DistilBERT model, a lighter version of the BERT model. The model is trained to classify email text into two categories: spam (1) and not spam (0). The dataset consists of email texts labeled as either spam or non-spam.

data-visualization datapreprocessing matplotlib pandas python pytorch sklearn transformer

Last synced: 20 Jan 2026

https://github.com/javitocor/d3-data-visualization

Different projects to visualize data from APIs using D3 library

d3-visualization d3js data-visualization freecodecamp javascript practice-project

Last synced: 30 Apr 2026

https://github.com/katiesaund/market_size_app

An interactive dashboard to plot a company's estimated market size.

data-visualization finance r rshiny shiny shinyapp

Last synced: 24 Mar 2025

https://github.com/czoido/conan-graph3d

Generate a Conan dependency graph visualization in 3D

conan cpp data-visualization

Last synced: 20 Jan 2026

https://github.com/jorgeatgu/apaga-luz

💡 How much does electricity cost? 💶

data data-visualization flat-data

Last synced: 04 Feb 2026

https://github.com/guruakaashjn/te_project_microsoft_ai

AI-based statistical analysis of land-use plastic pollution in India using machine learning techniques.

artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi

Last synced: 27 Feb 2025

https://github.com/sreejabethu/sales-data-analysis-forecasting

Welcome to the Sales Data Analysis & Forecasting project! 🚀 This repository showcases my data analysis skills through exploratory data analysis (EDA), data cleaning, and visualization of sales and customer feedback data. The goal is to extract actionable insights to drive business decisions.

analysis barchart data-visualization datacleaning exploratory-data-analysis forecasting histogram matplotlib-pyplot numpy-library pandas-library pycharm-ide sales-analysis salesdata salesdataanalysis seaborn-plots transformation

Last synced: 19 May 2026

https://github.com/prem-ium/tax-merge

Consolidate multiple 1099 brokerage tax forms, visualize realized gains/losses, and track data for Fidelity, Chase, Vanguard, Schwab, and others!

1099 accounting chase data-science data-visualization data-wrangling desktop-application fidelity financial-analysis firstrade graphical-user-interface schwab tastytrade tax tax-forms taxes vanguard

Last synced: 02 Feb 2026

https://github.com/noturlee/titanic-datamodel

This project demonstrates the process of data preprocessing, model training, evaluation, and tuning in building a predictive model for a classic dataset. The Random Forest model, with its ability to handle complex relationships and interactions between features, proved to be the most effective in this case.

data-modeling data-science data-visualization python

Last synced: 08 Apr 2025

https://github.com/noturlee/imdb-dataanalysis

A data model that uses regression techniques to predict the IMDb rating of a movie based on features like genre, director, and actors.

data-analysis data-cleaning data-modeling data-science data-visualization

Last synced: 08 Apr 2025

https://github.com/GZ430/global-christianity-dataviz-jp

A web app built in R Shiny for users to explore global Christianity data from Joshua Project, World Watch List, and others.

christianity data-visualization r shiny

Last synced: 10 Mar 2025

https://github.com/kusmn/tableau-googletrends-canada-analytics

Visualizes Canada’s top Google search terms (2020–2024) using BigQuery and Tableau to explore regional and temporal trends.

bigquery data-visualization tableau

Last synced: 26 Aug 2025

https://github.com/gutyoh/netflix-dashboard

Dashboard created with Python 🐍 and the Dash 📊 framework to visualize Netflix’s content distribution and classification.

dash dashboard data-visualization plotly python

Last synced: 08 Apr 2025

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

This repo contains the course contents of the Python for Data Science training for Siemens, delivered in cooperation with Academy Peak Information Technologies Training and Consultancy between 28 June and 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 23 Mar 2025

https://github.com/leosimoes/uerj-tcc-analisador-dados-texto

Text of the final undergraduate thesis (TCC) in computer engineering. Web application for data analysis.

data-analysis data-science data-visualization python streamlit

Last synced: 24 Mar 2025

https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project

This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.

apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres

Last synced: 03 Jul 2025

https://github.com/reubenmathew/dataopsexperiments

Machine Learning, NLP and various data operations

data-science data-visualization python3

Last synced: 22 Apr 2025

https://github.com/403errors/offset-to-hydrostatic-curve

Group project for making hydrostatic curves from a given ship offset table using Python libraries

data-visualization matplotlib-pyplot numpy python3 sklearn

Last synced: 10 May 2026

https://github.com/nemeslaszlo/lending-club-analysis

Lending Club analysis, with a model that can predict whether or not a borrower will pay back their loan.

binary-classification classification data-visualization keras matplotlib numpy pandas seaborn tensorflow

Last synced: 12 Apr 2026

https://github.com/hendersontrent/hotter

R package for webscraping, statistical analysis, and data visualisation of the Triple J Hottest 100 Countdown and related data.

data-visualisation data-visualization r triple-j webscraping

Last synced: 08 Apr 2025

https://github.com/VektorTech/air-pollution-map

Air pollution data visualization on an interactive globe made with three.js.

api-integration chartjs data-visualization geojson geolocation-api glsl gsap threejs typescript webgl

Last synced: 08 Apr 2025

https://github.com/GiveMePseudonyms/PiVisualisations

A way to visualise millions of digits of Pi. Written in Python using Pygame and Tkinter.

data data-visualization pi pygame python self-organising-criticality tkinter

Last synced: 08 Apr 2025

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 30 Mar 2025

https://github.com/nrobledosagredo/lda-topic-analysis-news-chile

Topic analysis on news from Chile using LDA for extracting and visualizing relevant patterns.

data-visualization latent-dirichlet-allocation

Last synced: 16 Mar 2025

https://github.com/neerajcodes888/whatsapp-chat-analyzer

A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱

analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud

Last synced: 12 Apr 2026

https://github.com/aubainmbk/optimisation-ventes-supermarche

Our goal is to extract insights from a supermarket's sales data.

data-visualization dataanalysis excel powerbi

Last synced: 04 Feb 2026

https://github.com/qtle3/decision-tree-regression

This project implements Decision Tree Regression to predict the salary of an employee based on their position level. By using a dataset containing position levels and their corresponding salaries, the project highlights how decision trees can capture complex relationships in data through non-linear regression.

data-visualization decision-tree-regression prediction-model

Last synced: 08 Apr 2025

https://github.com/azmainadel/twitter-data-neo4j

Playing with a graph database on a large dataset of Twitter data.

data-analysis data-visualization neo4j-database snap

Last synced: 06 Apr 2025

https://github.com/kuroko1t/geoview

A lightweight, browser-based GIS data viewer built with Streamlit and Geopandas. Visualize shapefiles, GeoJSON, and more instantly.

data-visualization folium geojson geopandas gis shapfile streamlit

Last synced: 29 May 2026

https://github.com/ksmooi/mscs_data_visualization

This series of reports demonstrates the process of creating, refining, and evaluating interactive data visualizations using Altair, focusing on Seattle weather data, with an emphasis on accessibility, usability, and effectively communicating insights.

altair data-visualization

Last synced: 01 Apr 2025

https://github.com/ineelhere/shinyDwight

A dashboard depicting the great battle of Dwight Schrute and the Computer | The Office

bslib data-visualization imola r rshiny rshiny-application sass server shiny-apps ui-components

Last synced: 01 Apr 2025

https://github.com/dadosdelaplace/hilostwitter

R code for the Twitter threads of https://twitter.com/dadosdelaplace

data-visualization dataviz divulgacion ggplot2 maths plotly r rstats statistical-analysis threads twitter

Last synced: 11 Mar 2026

https://github.com/samuelbarbosadev/roof_imoveis_data_analysis

The company hired you because they want to know what would be the 5 properties they should invest in and why, and which 5 you would not recommend investing in at all.

data-preparation data-understanding data-visualization pandas python

Last synced: 12 Apr 2026

https://github.com/mohagungnursalim/da-covid-data-cleansing

Identifying and correcting errors, inconsistencies, and inaccuracies within datasets related to the COVID-19 pandemic. This is essential for obtaining reliable and actionable insights from the data.

data-analytics data-visualization pandas-dataframe

Last synced: 25 Mar 2025

https://github.com/ax-va/interactive-data-visualization-with-d3-dale-2023

These examples on Interactive Data Visualization with D3.js in the web browser are compiled with some modifications from the book "Data Visualization with Python and JavaScript: Scrape, Clean, Explore, and Transform Your Data", Second Edition, written by Kyran Dale and published by O'Reilly Media in 2023.

ax-va d3 data-science data-visualization dataviz frontend javascript web

Last synced: 12 Jun 2025

https://github.com/robthepcguy/ahk-mouse-heatmap

An AutoHotkey script that records left, right, and middle mouse clicks, logging the date, time, and x and y coordinates. It features automatic GUI updates and generates a visual heatmap via a Python script, accessible from the system tray. This tool is ideal for analyzing user interaction and creating detailed mouse activity maps.

autohotkey-script click-counter click-log click-map data-analysis data-visualization heatmap heatmap-visualization mouse-events mouse-tracking python

Last synced: 01 Apr 2025

https://github.com/yusuf-abol/alumni-interaction-and-conversation-dynamics-nlp

This Natural Language Processing (NLP) project took a dive into chat engagement dynamics within the University of Ilorin’s Class of 2018 Statistics alumni group. By applying Latent Dirichlet Allocation (LDA) for topic modeling and network analysis, I uncovered communication patterns, topic distributions, and member interactions.

alumni-network anonymization conversation data-science data-visualization engagement machine-learning network-analysis nlp python-3 sentiment-analysis statistics whatsapp

Last synced: 05 May 2026

https://github.com/anarya22/tata-data-visualization-empowering-business-with-effective-insights-job-simulation-on-forage

Completed a simulation involving creating data visualizations for Tata Consultancy Services. Created visuals for data analysis to help executives with effective decision making.

business-analysis data-cleaning-and-preprocessing data-visualization excel powerbi

Last synced: 07 Jan 2026

https://github.com/jabhij/predictionmodels_heartdiseases

Comparison of various Machine Learning algorithms for Heart Diseases (Heart Attack) prediction.

big-data-analytics bigdata data-visualization datamodeling decision-tree hive knn logistic-regression machine-learning mapreduce naviebayes random-forest svm

Last synced: 04 Jun 2026

https://github.com/kurosawaxyz/covid4eu-sorbonne

Economy: “Analysis of Labor Market decisions of men and women during the COVID-19 pandemic in the 4EU+ countries”.

covid-19 data-analysis data-science data-visualization pandas

Last synced: 04 Jul 2025

https://github.com/alexeyraspopov/vepr

🚧 Visualization protocol for complex and sizable data visualizations

data-visualization zero-dependency

Last synced: 04 Jul 2025

https://github.com/alejo1630/chicago_crimes

A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium

data-analysis data-visualization folium pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/rani-sikdar/pwc-virtual-internship-powerbi

Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.

business-analytics business-intelligence data-analysis data-cleaning data-visualization interactive interactive-visualizations powerbi

Last synced: 07 Jan 2026

https://github.com/gracysapra/pandas-numpy-data-visualisation

This repository contains essential Python scripts and notebooks for data analysis and visualization. It includes pandas (data manipulation and analysis, including operations on series and dataframes), NumPy (efficient numerical computations and array processing), and data visualization (creating insightful visualizations using Matplotlib and Seaborn).

data-science data-visualization matplotlib numpy numpy-arrays pandas pandas-dataframe pandas-series seaborn

Last synced: 07 May 2026

https://github.com/cpscript/github-profile-monitor

A system for tracking GitHub followers with automated data collection, change detection, and a sleek dashboard interface. It uses GitHub Actions for automation and GitHub Pages for hosting.

data-visualization github-actions github-analytics github-pages json-api responsive-design web-dashboard

Last synced: 04 Sep 2025

https://github.com/whitehathackerpr/data-visualization-tool

This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations.

data data-analysis data-visualization python python3

Last synced: 05 Sep 2025

https://github.com/md-emon-hasan/3-eda-basketball-ml-app

An ML application focused on EDA and basketball analytics, showcasing data visualization and insights using Python and relevant libraries.

basketball-analysis csv data-visualization eda exploratory-data-analysis exploratory-data-analysis-eda ml-app

Last synced: 12 Apr 2026

https://github.com/isatyamks/chatsense

This is my first machine learning model, designed to predict the mood and behavior of users by analyzing their WhatsApp chat archives.

analysis artificial-intelligence behavior data-visualization machine-learning matplotlib pandas prediction seaborn vercel-deployment whatsapp-chat wordcloud

Last synced: 07 Jan 2026

https://github.com/anurag-kumar-molankala/blinkit-grocery-sales-dashboard

The BlinkIT Grocery Sales Dashboard is an interactive Power BI dashboard that provides insights into grocery sales performance. It includes key KPIs, sales trends, and outlet performance analysis.

business-intelligence dashboards data-analysis data-visualization dax excel kpi-dashboard power-bi powerquery slicers-kpi-card-multirow-card sql-server ssis ssms ssrs-reports

Last synced: 09 Apr 2025

https://github.com/archanakokate/diabetes_prediction_capstoneproject

Analyzing and modeling the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) dataset for accurate prediction of diabetes in patients, with Tableau dashboard visualization.

data-engineering data-visualization exploratory-data-analysis machine-learning-algorithms tableau-dashboards

Last synced: 17 Mar 2025

https://github.com/cwendorf/devise

Data Exploration, Visualization, and Inference with Statistical Estimation [R Package]

confidence-intervals data-visualization devise effect-size estimation r r-package statistics

Last synced: 29 May 2026

https://github.com/geo-y20/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber

Last synced: 13 Apr 2026

https://github.com/ddihora1604/iitk_esg

Researching and analyzing key ESG (Environmental, Social, Governance) metrics and their impact on stock performance and market behavior. Leveraging AI techniques (like Machine Learning and NLP) in finance to extract insights from ESG disclosures, enhancing financial predictions and sustainable investment strategies.

data-analysis data-visualization esg python yahoo-finance

Last synced: 24 Apr 2025

https://github.com/saidsef/ff18

A complete catalog of all the players in FIFA 2018 and their complete statistics

data-analysis data-visualization fifa18 machine-learning machine-learning-algorithms world-cup-2018 world-cup-ranking

Last synced: 29 May 2026

https://github.com/vincent-tran-94/dataviz_tweets_chatgpt

A Streamlit application for analyzing and visualizing data and tweets about the release of ChatGPT. The project covers data management, sentiment analysis, emerging trends, and potential applications of ChatGPT.

data-management data-visualization python streamlit text-mining twitter

Last synced: 08 May 2026

https://github.com/webuccinoco/mysql-pivot-tables

Build complex MySQL pivot tables without touching a single line of code. This free PHP tool lets you visually connect your database and map out your data sources with a few simple clicks.

business-analytics business-intelligence crosstab data-analysis data-analytics data-visualization mysql mysql-database mysql-pivot-table mysql-reports mysql-virtualization php php-pivot-table php-reports pivot-tables reporting-tools

Last synced: 04 Feb 2026

https://github.com/radhikareddy-chintareddy/exploratory-data-analysis-and-regression-for-property-pricing

This project analyzes property listing data to uncover trends and predict property prices using regression modeling. Key steps include exploratory data analysis (EDA), data cleaning, multicollinearity checks, and model evaluation using metrics like RMSE.

data-visualization exploratory-data-analysis python regression-analysis rmse-score statistical-analysis

Last synced: 24 Apr 2026
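
The RMSE metric mentioned in the entry above is the square root of the mean squared difference between actual and predicted values. A minimal sketch, assuming plain Python lists of prices; the function name `rmse` and the sample numbers are hypothetical and are not taken from the repository:

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Hypothetical property prices and model predictions, for illustration only.
actual_prices = [250_000, 310_000, 180_000]
predicted_prices = [240_000, 330_000, 185_000]
print(f"RMSE = {rmse(actual_prices, predicted_prices):.2f}")
```

Because the errors are squared before averaging, RMSE penalizes a few large misses more heavily than many small ones, and it is expressed in the same unit as the predicted price.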

https://github.com/sanam2405/chatinfo

Analysing the WhatsApp Chat with my crush over a 6M period

data-analysis data-visualization python

Last synced: 27 Apr 2026

https://github.com/zrkhadija/analyzing-parisian-associations-a-data-exploration-project

This project focuses on analyzing data about associations registered in Paris. The data was fetched from an API in JSON format and transformed into a pandas dataframe for exploration and analysis. The dataset includes details such as the statutory name, postal code, city of registration, sectors of activity, and domains of activity.

analysis data-visualization json-api paris

Last synced: 03 Apr 2025

https://github.com/open-data-plan/ava-react

React Component wrapper for AVA

auto-chart ava chart data-visualization g2plot react

Last synced: 10 Jun 2025

https://github.com/gauranshgoel123/predictive-demand-analysis

Demand Forecasting Project: a web application for predicting future demand for part numbers based on historical data. Built with React for the frontend and FastAPI with Python for the backend, this application visualizes demand trends and allows users to input additional data for improved accuracy. On Render, "analyzer" is the frontend and "analysis" is the backend.

chartjs data-analysis data-science data-visualization dataset deployment full-stack machine-learning numpy pandas predictive-analysis prophet-model python reactjs render

Last synced: 13 Apr 2026

https://github.com/aravindnathan02/sales-and-customer-analytics

This is a repository for a sales and customer performance Tableau dashboard.

customer-dashboard dashboard data-analysis data-visualization sales-analysis sales-dashboard tableau

Last synced: 08 Jan 2026

https://github.com/imageomics/emb-explorer

An interactive tool for classifying images with a pretrained model and exploring clustering results in 2D space.

clustering data-visualization dimensionality-reduction eda embedding exploratory-data-analysis exploratory-data-visualizations image-exploration

Last synced: 03 Mar 2026

https://github.com/lebrancconvas/streaming-service-price

Visualizes the prices of streaming services available in Thailand.

data-visualization price-comparison streaming-video

Last synced: 13 Jun 2025

https://github.com/shantoroy/data-visualization-streamlit-app

Simple CSV data analytics platform built using Python Streamlit

data-analytics data-visualization machine-learning python streamlit

Last synced: 13 Apr 2026

https://github.com/guidodipietro/fmcranks

Raw text input to nicely formatted XLSX file + PNG image of the range

data-analysis-python data-visualization python

Last synced: 26 Jan 2026

https://github.com/prernarohra/mental-health-prediction

This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.

data-analysis data-science data-visualization machine-learning mental-health python

Last synced: 20 May 2026

https://github.com/soumya-thoutam/revenue-and-demand-forecasting-analysis

End-to-end data analysis project with SQL and Power BI for revenue and demand forecasting in bike-sharing.

data-analysis data-visualization powerbi sql sql-server

Last synced: 16 Mar 2025

https://github.com/tejashah88/Visualflow-Unity

Visualizing Vitech insurance data with Unity and the Leap Motion (for HTN 2017).

data-visualization leap-motion unity unity3d virtual-reality vitech vr

Last synced: 11 Mar 2025

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/sushantdhumak/lstm_for_household_power_consumption

This project explores the application of Long Short-Term Memory (LSTM) networks in predicting household power consumption. Using data collected at one-minute intervals, we demonstrate how LSTM can be leveraged for accurate forecasting.

correlation-analysis data-visualization dropout evaluation-metrics feature-engineering lstm lstm-model resampling reshaping-datasets time-series-analysis time-series-forecasting

Last synced: 26 Mar 2025

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 16 Mar 2025

https://github.com/jubinjacob03/heartdiseaseclassify-ml

Heart Disease Dataset Analysis & Classification using ML models such as linear regression, support vector machine, k-means, k-nearest neighbors and logistic regression.

data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine

Last synced: 18 Jan 2026

https://github.com/albertomorini/policesviolence

Repository for the project of the course Data Science (Fondamenti di Scienza dei Dati) at UniUD.

data-analysis data-science data-visualization r

Last synced: 31 May 2026