An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/sayamalt/customer-churn-prediction

Successfully established a machine learning model which can predict whether any given customer currently utilizing the products and services offered by a company will churn at anytime in the future or not, depending upon a set of unique features/characteristics pertaining to that specific individual, to a great level of accuracy.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 09 Nov 2025

https://github.com/pratyusha-r/air-quality-index-visualization

This project visualizes the Air Quality Index (AQI) of various cities across India using Power BI. The aim is to provide insights into the air quality trends and help raise awareness about environmental issues affecting urban areas.

air-quality-data dashboard data-visualization powerbi

Last synced: 03 Jan 2026

https://github.com/sayamalt/twitter-sentiment-analysis

Successfully established a machine learning model which can accurately classify the sentiment of any particular tweet into either positive, negative or neutral category.

data-visualization exploratory-data-analysis nlp sentiment-analysis supervised-learning text-processing

Last synced: 09 Nov 2025

https://github.com/sayamalt/credit-card-approval-prediction

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation

Last synced: 09 Nov 2025

https://github.com/xre22zax/airline-analysis

Travel agency and need to know the ins and outs of airline prices for your clients

data-analysis data-visualization python python3 visualization

Last synced: 13 Apr 2026

https://github.com/mehmetkahya0/gallstone_dataset_analysis_project

Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)

analysis analytics data data-analysis data-science data-visualization database graph matplotlib python

Last synced: 25 Apr 2026

https://github.com/bowenfu/cylindervtk

Generate deformed flexible cylinder based on given displacements

data-visualization python vtk

Last synced: 07 Jul 2025

https://github.com/sayamalt/taxi-trip-fare-prediction

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-selection model-training-and-evaluation regression-modelling

Last synced: 09 Nov 2025

https://github.com/ibttf/isolation-index

Independent research project tracking risk factors for social isolation in different US counties.

data-science data-visualization gis mapbox psychology react

Last synced: 13 Apr 2026

https://github.com/gunjanmimo/d3-visualization

D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework

d3js data data-science data-visualization reactjs

Last synced: 29 Apr 2026

https://github.com/alfiyafatima09/heuristic_algorithms

This project compares pathfinding algorithms (A*, Greedy Best-First, and Hill Climbing) by visualizing their paths and comparing performance metrics (nodes explored, memory, execution time) on a grid with obstacles.

algorithms data-visualization

Last synced: 20 Jan 2026

https://github.com/fbarffmann/mycitibike

Built an interactive Leaflet.js map visualizing over 750 Citi Bike station locations in NYC. Analyzed usage patterns, station density, and user navigation across the network.

citibike data-analysis data-visualization geojson geospatial interactive-map javascript leaflet nyc web-mapping

Last synced: 07 Jul 2025

https://github.com/apelullo/cobalt_health_wellness_platform_ops

Cobalt is a mental health and wellness platform created for Penn Medicine employees that serves as a hub for support services such as therapy, wellness coaching, topic- and population-specific group sessions, and a variety of self-help resources.

academic-research data-cleaning-pipeline data-validation data-visualization decision-support feature-development healthcare-data hipaa key-performance-metrics mental-health-services operations-research product-analytics reporting-pipeline

Last synced: 23 Mar 2025

https://github.com/beatussum/pmsexp

A software for recovering the position of an object in a video

cpp cpp17 data-visualization physics qt qt5 science utility video

Last synced: 17 May 2026

https://github.com/stefagnone/airbnb-data-analysis

Data analysis and visualization of Airbnb listings using text mining frameworks, Tableau dashboards, and MongoDB to uncover business insights for optimizing strategies.

airbnb-data-analysis business-insights data-visualization mongodb r-programming sentiment-analysis tableau-dashboards text-mining

Last synced: 25 Apr 2026

https://github.com/subratamondal1/heart-attack-prediction

Heart Attack Prediction of patients based on the required data. Data Ingestion - Data Preparation - Exploratory Data Analysis (EDA) - Modelling - Evaluation.

data-analysis data-science data-visualization kaggle-dataset machine-learning matplotlib-pyplot numpy pandas python3 scikit-learn seaborn

Last synced: 09 Apr 2026

https://github.com/navp7/roadaccident_powerbi

An interactive Power BI dashboard designed to analyze road accident data

dashboards data-analysis data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/hanzopgp/lolanalysis

League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.

data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university

Last synced: 27 May 2026

https://github.com/fbarffmann/belly-button-challenge

Built an interactive JavaScript dashboard to visualize bacterial biodiversity from belly button samples. Analyzed data from 153 participants and identified OTU 1167 as the most common bacteria.

biodiversity dashboard data-analysis data-visualization interactive-charts javascript json plotly

Last synced: 25 Apr 2026

https://github.com/deliprofesor/arrhythmia-classification-and-anomaly-detection

This project classifies arrhythmias and detects anomalies using machine learning and deep learning. It includes preprocessing the "INCART 2-lead Arrhythmia Database," feature engineering, KMeans clustering, Random Forest, IsolationForest, and an LSTM model for classification.

classification clustering data-science data-visualization deep-learning machine-learning

Last synced: 10 Apr 2025

https://github.com/carmendev/covid-19-tracker

Data visualization React.js project deployed with Firebase. Daily statistics about current, recovered and closed cases coming from an API.

data-visualization firebase numeral reactjs

Last synced: 11 Apr 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/lvsvendsen/shime-monitor-r

R script for visualizing pH and pump activity in SHIME gut microbiome experiments.

data-visualization microbiome r research-tool shime

Last synced: 13 Sep 2025

https://github.com/archanakokate/ml_mercedes_benz_greener_manufacturing_project

This project involves reducing testing time for car configurations. The tasks include removing columns with zero variance, checking for null values, applying label encoding, performing dimensionality reduction, and using XGBoost to predict testing time.

data-visualization dimentionality-reduction encoding exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/tmoulik/bikeshare-python

Analysis of Bikeshare data from three major cities

data-analysis data-visualization python udacity-nanodegree

Last synced: 25 Apr 2026

https://github.com/dcostachar/bellabeat-case-study

An analysis of Fitbit Fitness Tracker data with R to examine user behaviour and conduct a competitor analysis to optimize Bellabeat's product marketing strategies.

consumer-behaviour-analysis data-visualization exploratory-data-analysis ggplot2 health-data marketing-analytics r statistical-analysis tidyverse

Last synced: 02 Apr 2025

https://github.com/albanecoiffe/uber_data_visu_streamlit

Tableau de bord interactif avec Streamlit permettant d'explorer les données des trajets Uber de janvier 2015 à New York.

data-visualization streamlit

Last synced: 02 May 2026

https://github.com/vidyadnina/cyclistic-sql-tableau-project

Trip data analysis for a bike-sharing service company using SQL and Tableau.

bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql

Last synced: 02 Jan 2026

https://github.com/sukitsubaki/image-color-scheme

Extract dominant colors from images and create beautiful color palettes with minimal dependencies. Supports various palette types: monochromatic, analogous, complementary, triadic, and tetradic.

color-extraction color-palette data-visualization design-tools image-analysis minimal python python-library

Last synced: 29 Apr 2026

https://github.com/yaser-123/energy-consumption-dashboard

A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.

dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi

Last synced: 05 Jan 2026

https://github.com/tanyakuznetsova/world-happiness-report-2023-in-europe

Happiness Insight '23: Navigating global joy. Exploring trust's role in life satisfaction with my World Happiness Report analysis.

citizen-science data-storytelling data-visualization global-indicators life-satisfaction social-trust world-happiness-report

Last synced: 25 Apr 2026

https://github.com/01110011011101010110010001101111/tigergraph_cosmos_template

Template for TigerGraph and Cosmograph Projects with pyTigerGraph, Fast API, and Cosmos

cosmograph data-visualization tigergraph

Last synced: 26 Mar 2025

https://github.com/nero103/airbnb-destination

This is and end-to-end project to uncover the ideal destination based on listings and hosts. Strategy included: Data workflow-SQL analysis-Data modeling-Data Visualization-Findings

data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau

Last synced: 27 Mar 2026

https://github.com/ilke-kas/multivariate-data-analysis

A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.

classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics

Last synced: 05 Oct 2025

https://github.com/spear97/montecarlo-python

This was a project for my Programming Language Concepts Class were we were assigned to create a Monty Carlo Simulation using Python.

data-science data-visualization matplotlib-figures matplotlib-python montecarlo pandas-library pandas-python python python-3

Last synced: 23 Mar 2025

https://github.com/mohsinraza2999/new-york-taxi-fare-analysis

This project analyzes and predicts taxi fares estimate fares in advance using Regression Analysis. Conducted EDA, hypothesis testing, to identify key variables. Developed ML models (Random Forest, XGBoost) with GridSearchCV for hyperparameter tuning to predict generous tip giver accurately.

ab-testing data-un data-visualization exploratory-data-analysis fea random-forest regression-analysis sklearn xgboost

Last synced: 17 May 2026

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 11 Apr 2026

https://github.com/lukakerr/us-surnames

US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.

data data-visualization r

Last synced: 05 Oct 2025

https://github.com/tsopermon/comparison-ml-algorithms

This repository compares the performance of Adaline, Logistic Regression, and Perceptron models on binary classification tasks using linearly, non-linearly, and marginally separable datasets from the Iris dataset. It includes MATLAB implementations, 10-fold cross-validation, and visualizations of decision boundaries and MSE histories.

adaline binary-classification classification-accuracy cross-validation data-visualization decision-boundaries iris-dataset logistic-regression machine-learning matlab mse neural-networks perceptron

Last synced: 15 Mar 2025

https://github.com/samjoesilvano/adventureworks_sales_performance_dashboard

Createed an interactive 4-page dashboard for AdventureWorks that visualizes key sales metrics—including revenue, profit, orders, and return rates—across 2020 to 2022. Featuring dynamic geographic analysis and detailed customer insights, this dashboard empowers data-driven decision-making and enhances business performance.

business-intelligence data-analysis-python data-analytics data-driven-decisions data-modeling data-visualization geographic-analysis interactive-dashboards kpi-metrics powerbi sales-performance-analysis

Last synced: 05 Jan 2026

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/trismald/eurosoccer1023

Data Analyst - European Soccer 2010 2023

data-analysis data-visualization jupyter-notebook pandas powerbi python

Last synced: 06 May 2026

https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends

Last synced: 27 Apr 2026

https://github.com/kruthiktr/crop-recommendation-system-using-machine-learning

A machine learning-based system recommending crops based on soil, climate, and environmental conditions to optimize agricultural yields.

ai-in-agriculture crop-recommendation data-visualization machine-learning prediction python python3 recommendation-system

Last synced: 25 Apr 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/christophertrimboli/trends-vs-stock

A tool to browse culture's impact on stocks. Made in React, Node, Bootstrap and Charts.js to compare data from Google Trends and Alpha Vantage APIs.

bootstrap chartjs data-visualization google-trends intrinio nodejs react stocks

Last synced: 28 Apr 2026

https://github.com/mcommer/emtools

A toolbox for geophysical EM-simulation data- and model-file processing, analysis, plotting, and other gimmicks

data-visualization electromagnetics geophysics plotting-scripts shell-scripts

Last synced: 30 Jun 2025

https://github.com/subhamghimire/dataanavis

Learning Data analysis and visualization

data-analysis data-science data-visualization dataset

Last synced: 06 Oct 2025

https://github.com/davifeliciano/modern_physics_experiments

Collection of data analysis and visualization scripts developed in Python around some modern physics experiments

data-analysis data-visualization modern-physics physics physics-experiments

Last synced: 18 Jan 2026

https://github.com/gerhynes/d3-character-frequencies

A character frequency analyzer built using D3.js. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 25 Apr 2026

https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit

Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.

analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics

Last synced: 08 May 2026

https://github.com/deliprofesor/kizbasina_odev4_trafik

Bu proje, 2005-2014 yılları arasında İngiltere’de gerçekleşen trafik kazalarına ait kapsamlı veri setlerini kullanarak trafik kazalarının sebeplerini, şiddetini ve zaman içindeki değişimini analiz etmektedir.

data-science data-visualization istatistik matplotlib pandas python statistics

Last synced: 14 Apr 2026

https://github.com/theglobemc/theglobemc.github.io

An interactive HTTP visualization of Minecraft books on the web from GlobeMC.

books data-visualization datamining minecraft

Last synced: 07 Oct 2025

https://github.com/bdice/signac-micde-cnsccs-2018

Slides and demos for the MICDE CNSCCS Symposium, October 15, 2018

data-management data-visualization demo signac workflow-automation

Last synced: 07 Oct 2025

https://github.com/csoren66/customer-personality-analysis

Predict how different customer segments will respond for a particular product or service.

data-analysis data-visualization python

Last synced: 03 Mar 2025

https://github.com/sanad343/complete-data-analyst

Data analysis is the process of turning raw data into useful information for decision-making.

data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau

Last synced: 30 Jun 2025

https://github.com/mitgar14/etl-workshop-1

Workshop #1 (Data Engineer) for the ETL course using Pandas, Matplotlib, SQLAlchemy and Power BI for the creation of the dashboard.

data-engineer data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 11 Apr 2026

https://github.com/ianjure/martial-law-in-data

A data visualization of how martial law shaped the Philippine economy.

data-visualization

Last synced: 05 Jan 2026

https://github.com/trimoyee-g/phishing-site-predictor

A phishing site prediction model using scikit-learn's Random Forest Classifier, achieving high accuracy and gaining insights into website characteristics.

data-visualization machine-learning python random-forest-classifier scikit-learn

Last synced: 11 Apr 2026

https://github.com/gabboraron/biostatisztika_es_alkalmazasai

"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"

biostatistics data-analysis data-visualization r statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/ayaankhan98/covid-19-analysis

Covid-19 Analysis. This repository is a part of AMURoboHack 1.0, Here we tried to visulize the world data of Covid-19. Data Visulization gives an easy way to understand bunch of data. We tried plotting the data over a world map so that users can eaisly get the stats for a conuntry by just hovering the mouse pointer over the country in the world map, we also provided the zooming over the world map to bring a sense of attractiveness and user friendly interface.

covid-19 d3js data-visualization topojson

Last synced: 30 Mar 2025

https://github.com/sayamalt/mental-health-classification-using-fine-tuned-distilbert

Successfully established a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify several distinct types of mental health statuses such as anxiety, stress, personality disorder, etc. with an accuracy of 77%.

data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 08 Oct 2025

https://github.com/shaolans/projet_algav_trie

Implementation of the Patricia Trie and the Hybrid Trie in Java

algorithms data-structures data-visualization graphviz-dot hybrid-training java patricia-tree tree trie

Last synced: 11 Jun 2026

https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset

This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.

data-analysis data-visualization excel project

Last synced: 18 Jan 2026

https://github.com/mardavsj/matplotlib-in-python

The fundamentals of Python Matplotlib Library.

data-visualization matplotlib python

Last synced: 15 May 2026

https://github.com/dikshadamahe/fossee-project

A hybrid web and desktop application for visualizing chemical equipment parameters, built with Django, React, and PyQt5.

chartjs chemical-engineering dashboard data-visualization django fossee fossee-2026 fossee-internship hybrid-application pyqt5 python react rest-api

Last synced: 25 Apr 2026

https://github.com/ddihora1604/iit_patna

A multifaceted project involving applying ML models like Ridge Classifier, RNN, RIDOR, Rotation Forest and RUSBoost, integrating SMOTE for class balancing, and handling diverse datasets including those for seating arrangement tasks.

data-analysis data-visualization datamodelling machine-learning-algorithms python

Last synced: 25 Apr 2026

https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil

Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.

data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn

Last synced: 30 Mar 2025

https://github.com/syncfusionexamples/how-to-add-arrows-to-the-chart-axis-in-wpf-chart

Learn how to enhance WPF charts by adding arrows to the chart axes using annotations for improved visualization and clarity.

axis-with-arrows chart-annotations chart-axis chart-customization charting-library charts data-visualization line-annotation wpf-char wpf-sfcharts

Last synced: 08 Oct 2025

https://github.com/alexquilis1/spanish-fuel-stations-analysis

Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons

data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse

Last synced: 08 Oct 2025

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/marielachirinosr/bellabeat-wellness-data-trends

Analyzing smart device data for insights on user activity patterns to optimize interventions for better health outcomes.

data data-analysis data-visualization pandas python python3 tableau tableau-public

Last synced: 25 Apr 2026