An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/satyam4229/omnify-dataanalysis

Our assessment of Omnify focused on data-driven strategies to maximize profitability. We identified "Product X" as the most profitable product and recommended leveraging the "Wellness Solutions" keyword category for optimal keyword strategy.

data-analysis data-science data-visualization excel omnify

Last synced: 04 Jan 2026

https://github.com/aplgr/grovegrid

Interactive growth maps for rows of trees. CSV in; single HTML out.

alpinejs cli csv data-visualization echarts forestry go heatmap

Last synced: 25 May 2026

https://github.com/archanakokate/adidas_us_sales_analysis_powerbi

Analysis of Adidas Sales in US for year 2020-2021 using PowerBI

analysis data-visualization modelling powerbi

Last synced: 04 Jan 2026

https://github.com/swethajoseph/urological-cancer-referral-forecast

Analysing and forecasting urological cancer referral patterns for NHS Scotland, aiming to improve management and operational efficiency.

data-visualization datacleaning excel forcasting statistical-analysis tableau time-series-analysis

Last synced: 04 Jan 2026

https://github.com/0xarchit/covid-data-dashboard

This repo consists files related to Data Visualization Covid Data Dashboard Assignment

covid-19 covid19-data dashboard data-visualization streamlit

Last synced: 10 Apr 2026

https://github.com/jaymax01/dvd-rental-data-analysis

Data analysis of a DVD rental database

data-visualization postgresql sql

Last synced: 22 Jul 2025

https://github.com/brianyu28/old-sheets-flying

Data analysis and graphics tool for The Harvard Crimson's Data and Design Teams

data-visualization harvard-university journalism

Last synced: 15 May 2025

https://github.com/metalbolicx/tipviz

Show data only by hovering an element in your D3.js project.

d3 d3-js d3js data-visualization tooltip

Last synced: 20 Jan 2026

https://github.com/mjanez/spain-cultural-pulse

Interactive web app to explore contemporary Spanish culture, values, politics & social norms with beautiful data visualizations (Next.js + Leaflet + Recharts + D3). Based on 2024 nationwide survey (3k respondents).

csic culture d3js data-visualization i18n nextjs norpol open-data politics social-norms sociology spain spain-culture spain-politics survey-data tailwindcss

Last synced: 13 Jan 2026

https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics

Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.

beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping

Last synced: 25 Feb 2025

https://github.com/lotfiferaga/google-play-store-sentiment-analysis

Perform sentiment analysis on Google Play Store reviews using Python. Analyze user feedback to determine the overall sentiment (positive, negative, or neutral) towards various apps. Gain insights to aid developers and businesses in understanding user satisfaction levels and improving their products.

data-analysis data-visualization googleplayservices python reviewsanalysis-nlp

Last synced: 26 Feb 2025

https://github.com/akhdandann/squadevaluationdashboard-powerbi

A Power BI dashboard that visualizes squad evaluation metrics including happiness, contribution, commitment, delivery, and agile behavior across tribes at PT. XL Axiata Tbk. (with dummy data)

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/msikorski93/meteorite-landings

Basic data analysis focused mainly on visualizing geospatial data worldwide with cartopy.

cartopy data-visualization geopandas gis mapping meteorite-landing-sites shapefile

Last synced: 16 May 2026

https://github.com/ultra-bugs/pyside6-datatable-widget

A PySide6 DataTable widget with jQuery DataTable-like functionality

data-visualization desktop-app desktop-application gui pyside6 qt qt6 table

Last synced: 30 Jun 2025

https://github.com/usk2003/vnrvjiet-lab-work

This repository contains my lab work for the B.Tech CSE-AIML program (2022-2026) under the R22 regulation at VNR Vignana Jyothi Institute of Engineering and Technology. It includes various subjects like Machine Learning, OS, Data Structures, C Programming, and more, showcasing my practical learning and implementations.

c-programming compiler-design computer-networks data-engineering data-structures data-visualization dbms engineering-drawing java machine-learning operating-system python software-engineering

Last synced: 11 Apr 2026

https://github.com/tashi-2004/apache-hadoop-spark-hive-cyberanalytics

This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.

ai apache-hadoop apache-hive big-data-analytics big-data-processing data-analysis data-engineering data-science data-security data-visualization hdfs machine-learning network-analysis network-security pyspark python3 threat-detection unsw-nb15-dataset

Last synced: 02 May 2026

https://github.com/aphp/jupyter-eds-notebooks

jupyter-eds-notebooks provides Docker images with preconfigured Jupyter environments for clinical and health data analysis, tailored for AP‑HP Datalabs and the HELIX platform.

data-analysis data-science data-visualization healthcare lab

Last synced: 13 Jan 2026

https://github.com/samanhur/data_visualization_pcc

First experiences in data visualization with python

data-analysis data-science data-visualization python3

Last synced: 23 Mar 2025

https://github.com/mr-chang95/webpage_abtest_analysis_udacity

A/B Testing Project for Udacity's Data Analyst Nanodegree Program. Using Python in Jupyter Notebook.

abtesting data-science data-visualization matplotlib pandas python webpage

Last synced: 11 Apr 2026

https://github.com/emcramer/clockplot

Plotting utility for a "clockplot" that puts groups into a time-ordered heterogeneity visualization

biology data-analysis data-visualization heterogeneity pseudotemporal-ordering

Last synced: 10 Mar 2026

https://github.com/badranalyst/student-tests-data-analysis-application

Python-based analysis of student test scores in math, reading, and writing, examining correlations with parental education, lunch type, and test preparation. Includes data cleaning, visualization, and statistical insights into factors influencing academic performance.

data-analysis data-visualization dataset matplotlib numpy pandas python sklearn

Last synced: 05 May 2026

https://github.com/allanreda/telco-customer-churn-predictor-app

A web-based machine learning application that predicts customer churn using a logistic regression model. Built with Scikit-Learn for model training, Gradio for the user interface, and deployed on Google Cloud App Engine. The app allows users to input customer data and receive predictions on churn risk to support business decision-making.

app-engine data-visualization deployment google-cloud gradio hyperparameter-tuning logistic-regression machine-learning numpy pandas scikit-learn

Last synced: 16 Apr 2026

https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil

Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.

data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn

Last synced: 30 Mar 2025

https://github.com/shaolans/projet_algav_trie

Implementation of the Patricia Trie and the Hybrid Trie in Java

algorithms data-structures data-visualization graphviz-dot hybrid-training java patricia-tree tree trie

Last synced: 11 Jun 2026

https://github.com/jeffbla/image-and-porosity-analysis-by-dash-plotly

This app uses Dash to explore core data. Compare images in the first row and interact with a line chart in the second row. Hovering reveals details, and clicking updates the images to match the selected data point.

dashboard data-visualization dicom-images interactive-visualizations plotly-dash xct

Last synced: 21 Jul 2025

https://github.com/sanad343/complete-data-analyst

Data analysis is the process of turning raw data into useful information for decision-making.

data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau

Last synced: 30 Jun 2025

https://github.com/akhdandann/itutilizationdashboard-powerbi

Interactive Power BI dashboard for monitoring IT utilization, application uptime, and infrastructure performance at PT PLN (2014-2018).

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/tsopermon/comparison-ml-algorithms

This repository compares the performance of Adaline, Logistic Regression, and Perceptron models on binary classification tasks using linearly, non-linearly, and marginally separable datasets from the Iris dataset. It includes MATLAB implementations, 10-fold cross-validation, and visualizations of decision boundaries and MSE histories.

adaline binary-classification classification-accuracy cross-validation data-visualization decision-boundaries iris-dataset logistic-regression machine-learning matlab mse neural-networks perceptron

Last synced: 15 Mar 2025

https://github.com/jleung51/visualizations

Javascript & D3.js visualizations of data.

d3js data-visualization javascript

Last synced: 27 Mar 2025

https://github.com/spear97/montecarlo-python

This was a project for my Programming Language Concepts Class were we were assigned to create a Monty Carlo Simulation using Python.

data-science data-visualization matplotlib-figures matplotlib-python montecarlo pandas-library pandas-python python python-3

Last synced: 23 Mar 2025

https://github.com/pekiiipy/credit-card-fraud-detection

🔍 Detect credit card fraud efficiently using advanced machine learning techniques, achieving high accuracy rates on a large dataset of transactions.

adasyn anomaly-detection class-imbalance credit-card-fraud data-visualization fraud fraud-detection frauddetection kaggle keras logistic-regression plotly-python postgresql random-forest scikit-learn tensorflow tree-model xgboost

Last synced: 11 Apr 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/carmendev/covid-19-tracker

Data visualization React.js project deployed with Firebase. Daily statistics about current, recovered and closed cases coming from an API.

data-visualization firebase numeral reactjs

Last synced: 11 Apr 2026

https://github.com/hanzopgp/lolanalysis

League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.

data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university

Last synced: 27 May 2026

https://github.com/navp7/roadaccident_powerbi

An interactive Power BI dashboard designed to analyze road accident data

dashboards data-analysis data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/gautam25raj/data-sync

A powerful platform designed to revolutionize the way teams collaborate and visualize data.

chat collaboration data-visualization express material-tailwind mongodb mongoose nextjs nodejs reactjs redux redux-toolkit tableau tableau-dashboard tailwindcss

Last synced: 11 Apr 2026

https://github.com/sayamalt/credit-card-approval-prediction

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

binary-classification cicd-deployment cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-retraining model-selection model-testing model-training-and-evaluation

Last synced: 09 Nov 2025

https://github.com/sayamalt/house-price-prediction

Successfully created a regression model for predicting the price of any house, excluding enormous real estates and mansions, to a significant level of accuracy.

data-visualization exploratory-data-analysis feature-engineering feature-selection machine-learning regression-analysis regression-testing

Last synced: 09 Nov 2025

https://github.com/farhannirzhor/vrinda_store_excel_project

This project is about excel analysis and visualization. In this project, I analyzed Vrinda Store's sales and made an annual sales report

data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting

Last synced: 05 Jan 2026

https://github.com/sayamalt/steel-energy-consumption-prediction-using-pyspark

Successfully established a machine learning model using PySpark which can precisely predict the energy consumption of the steel industry, up to an r2 score of approximately 99.5%.

apache-spark big-data-analytics big-data-processing cross-validation data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-training-and-evaluation python regression spark sql

Last synced: 10 Mar 2026

https://github.com/nel-zi/nuga_bank

Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.

data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline

Last synced: 16 May 2025

https://github.com/debjyotisaha/power-bi-projects-phase-1

Portfolio projects related to data visualisation in Power BI

data-analysis data-visualization dax-expression powerbi powerquery

Last synced: 18 Jan 2026

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/mithoon278/us-visa-approval-prediction-mlops-project

This project presents a ML based solution using ML Algorithm to predict which visa applications will be approved and thus recommend a suitable profile for applicants whose visa have a high chance of approval.

aws classification data-visualization ec2-instance exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning random-forest-classifier s3-bucket

Last synced: 11 Apr 2026

https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office

My Project to learn the Basics of Analysis & Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python

Last synced: 11 Apr 2026

https://github.com/gustapinto/fatec_dsm_pi_terceiro_semestre

Um visualizador para tendências de animes

anime chartjs data-visualization django etl

Last synced: 05 Apr 2025

https://github.com/pythoncoderunicorn/gi-joe

Dataset for GI Joe action figures from 1980s & 1990s. My dataset for TidyTuesday

data-science data-visualization gi-joe r toy-project

Last synced: 27 May 2026

https://github.com/shafaq-aslam/data-analytics-dairy

A comprehensive repository for Data Analytics learning and projects. It includes MySQL, Python, Power BI, Tableau, and Excel. The goal is to analyze data, generate insights, and create compelling visualizations for real-world datasets.

data-analysis data-visualization excel excel-based-data-analysis powerbi python-scripts sql sql-queries sql-queries-for-data-manipulation sql-query-for-data-visualization tableau

Last synced: 20 Jan 2026

https://github.com/jimohola/streamlit_ml

How to build a Web App using Streamlit for Machine Learning Algorithms

data-visualization exploratory-data-analysis machine-learning streamlit webapp

Last synced: 09 May 2026

https://github.com/filip-kustura/statistics-olympics-analysis

A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.

correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project

Last synced: 05 Jan 2026

https://github.com/hemangsharma/dataanalysis

This repo contains analysis like a dashboard and time series forecast on NASDAQ data

analysis data data-analysis data-visualization python

Last synced: 10 Mar 2026

https://github.com/riju18/data-analysis-and-visualizaton

Most complex data analyzing for clustering, preparing, complex calculation, joining, cross-over & more for Data science.

data-analysis data-mining data-science data-visualization powerbi tableau

Last synced: 04 Jan 2026

https://github.com/ledsouza/curso_de_estatistica_parte_3

Projeto de curso de estatística sobre distribuições e teste de hipósteses

data-science data-visualization pandas scipy seaborn statsmodels vitrinedev

Last synced: 29 Apr 2026

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 10 Mar 2026

https://github.com/walid0912/rfm_analysis

RFM Analysis is employed to comprehend and categorize customers according to their purchasing patterns. RFM, an acronym for recency, frequency, and monetary value, comprises three essential metrics that offer insights into customer involvement, allegiance, and significance to a business.

data-analysis data-visualization python rfm-analysis

Last synced: 02 Sep 2025

https://github.com/akhdandann/productdashboard-powerbi

A Power BI dashboard to monitor engineering effectiveness at PT. XL Axiata Tbk. It tracks release frequency, production defects, cycle time, developer activity, and team happiness. Note: All names have been changed and the data has been modified - this is dummy data for demonstration purposes only.

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/sudarsann27/basic_machine_learning_algorithms

Basic Machine learning algorithms using scikit-learn and other fundamental libraries

data-science data-visualization ensemble-model kaggle numpy pandas scikit-learn supervised-machine-learning

Last synced: 20 Jan 2026

https://github.com/ascender1729/sentitweet

SentiTweet: Advanced sentiment analysis tool using AWS Comprehend and TextBlob. Analyze text sentiment via CLI or web interface with visualizations.

aws-comprehend cli-tool data-visualization machine-learning natural-language-processing python sentiment-analysis text-analysis textblob web-application

Last synced: 31 Mar 2025

https://github.com/shivabajelan/usgs_earthquake_visualisation

USGS Earthquake Visualisation is an open-source project that provides an interactive map to visualise earthquake data collected by the USGS, highlighting the relationship between tectonic plates and seismic activity. Built with JavaScript, Leaflet.js, D3.js, HTML, and CSS, the project is available on GitHub under the MIT License.

api css d3 data-visualization html javascript leafletjs

Last synced: 31 Mar 2025

https://github.com/rios0rios0/investmate

Go-based application designed to scrape and analyze ETF (Exchange-Traded Fund) data, focusing on dividend cash amounts, average closing prices, and dividend yields over a specified number of years. The application uses the colly library for web scraping and the tablewriter library for displaying the data in a formatted table.

crawling data-visualization etf-investments financial-analysis golang

Last synced: 23 May 2026

https://github.com/vassilevsky/4sq73

Демонстрация неточности координат заведений Foursquare на примере Ульяновска

data-visualization foursquare-api geo yandex-maps

Last synced: 15 Mar 2025

https://github.com/garcane/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 10 May 2026

https://github.com/jmjuanes/vizjar

:chart: Micro visualization toolkit for building interactive graphics for data exploration

data-visualization visualization

Last synced: 31 Mar 2025

https://github.com/harshmule1/crypto-market-trend

Crypto Market Trend Using Power Bi

analysis data-visualization excel powerbi

Last synced: 29 Jan 2026

https://github.com/shefreenkaur/comp_430_project

A comprehensive, open-source business intelligence visualization tool designed for algorithmic trading systems. This application transforms complex trading data into intuitive visualizations, enabling traders and analysts to make data-driven decisions.

algorithmic-trading api-development business-intelligence data-analytics data-visualization etl-pipeline fastapi finance financial-analysis interactive-dashboard plotly streamlit

Last synced: 13 Apr 2026

https://github.com/hyoaru/rph-retraction-relationship-visualization

A task output in GEC 3 RPH, retraction relationship visualization

data-visualization graph

Last synced: 31 Mar 2025

https://github.com/anilyigitsel/tourist-attraction-data-analysis

This project analyzes tourism trends from 2017 to 2021, focusing on visitor numbers, ratings, and attraction popularity during these years.

data-analysis data-visualization excel sql tourism

Last synced: 26 Jan 2026

https://github.com/vinitgurjar/r_lang_exp

This is a collection of my collage Data Analytics lab work and assignment, the files here contains program of R language

data-analysis data-visualization r

Last synced: 02 Jul 2025

https://github.com/darkdk123/simple-heart-disease-classification

This Experiment provides a comprehensive approach to forecast heart disease risks by performing a detailed data analysis, predictive modeling & hyperparameter tuning. This leads to a `LinearSVC` model with 90% Accuracy

classification-algorithm data-science data-visualization exploratory-data-analysis heart-disease-prediction machine-learning

Last synced: 17 Nov 2025

https://github.com/katiesaund/tidy_tuesday

A weekly data project in R from the R4DS online learning community

data-analysis data-visualization datascience plot r rstats tidytuesday

Last synced: 24 Mar 2025

https://github.com/rahult18/atmo-flow

AtmoFlow is a robust data engineering pipeline built on Google Cloud Platform (GCP) that processes and analyzes weather and air quality data in both batch and streaming modes

airflow data data-modeling data-science data-visualization dataengineering gcp-bigquery gcp-cloud-composer gcp-cloud-functions pyspark

Last synced: 23 Jun 2026

https://github.com/relostar-devil/analyzing-naming-trends-using-python

Analyzes naming trends by processing baby names data from the Social Security Administration (SSA)

data-visualization python

Last synced: 30 Apr 2026

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 20 Jan 2026

https://github.com/sumanadithan/react-admin-dashboard

A modern React Admin Dashboard built with React, TypeScript, Vite, Tailwind CSS, Framer Motion, TanStack Table, Zustand, and Recharts. It features fast performance, responsive design, dynamic data handling, and testing with Vitest.

admin-dashboard data-visualization framer-motion frontend react recharts responsive-design state-management tailwind-css tanstack-table testing typescript vite vitest web-development zustand

Last synced: 12 Apr 2026

https://github.com/balajimohan18/loan-classification-datascience-project

This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.

classification data-analysis data-cleaning data-science data-visualization loan-prediction loan-status machine-learning sql supervised-learning

Last synced: 03 Sep 2025

https://github.com/berkeley-gif/caladapt-website-2021

Redesign and rewrite of the website for Cal-Adapt.org

cal-adapt california climate-change climate-models data-visualization svelte

Last synced: 26 Jan 2026

https://github.com/prekshivyas/datastreamingetl

Utilizing my background and love for Apache Airflow and Data to build a real-time data streaming pipeline

apache-airflow apache-kafka apache-spark apache-zookeeper cassandra data-engineering data-ingestion data-pipeline data-processing data-visualization docker docker-compose

Last synced: 20 Jan 2026