An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/wa-lead/ml485_blind_preprocessing_prediction_comp

This project aims to achieve the best prediction results by applying various preprocessing techniques and blind data engineering.

data-engineering data-visualization machine-learning python

Last synced: 19 May 2026

https://github.com/samir-atra/share-lm_dataset_analysis

Analysis, studies and optimizations on the ShareLM extension dataset

data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas

Last synced: 19 May 2026

https://github.com/sukhitashvili/pca_tutorial

PCA algorithm from scrach, using only matrix-vector multiplications

data-analysis data-science data-visualization machine-learning-algorithms pca

Last synced: 29 Mar 2025

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/analyticalnahid/seaborn-tutorial

A complete Notebook on Seaborn for Data Science

data-visualization seaborn seaborn-tutorial

Last synced: 23 Aug 2025

https://github.com/saketr3/voting-policy-impact-visualizer

Data visualization web app where users can compare voter turnout of different demographics with states’ voting policy fairness scores

data-visualization voting

Last synced: 14 Mar 2025

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/the-ethan-hunt/dekh-data

Playground for data visualization notebooks

data-visualization jupyter-notebook python

Last synced: 28 Mar 2025

https://github.com/otsaloma/pollen-chart

Helsinki pollen count visualization

data-visualization javascript lambda pollen python

Last synced: 17 Apr 2026

https://github.com/cudavailable/gdp-data-visualization

A data visualization project for GDP data

data-visualization gdp vue

Last synced: 20 May 2026

https://github.com/prakhar-code/house_sales_analysis

House Sales Analysis Of King County, Washington, USA and Clean Visualization.

data-cleaning data-visualization excel tableau tableau-dashboards tableau-public

Last synced: 12 Jan 2026

https://github.com/archanakokate/bank_term_deposit_prediction

Build a Decision Tree classifier to predict if the client will subscribe to a Term Deposit based on their demographic and behavioral data.

data-analysis data-visualization exploratory-data-analysis machine-learning

Last synced: 14 Sep 2025

https://github.com/alra-code/data-analytics-com-power-bi

Desafio de projetos do Boocamp Data Analytics realizado pela Dio Me em 2024

analytics data-visualization desafios-resolvidos dio-bootcamp powerbi pt-br

Last synced: 25 Jan 2026

https://github.com/shuddha2021/interactive-data-visualization-app

An interactive web application for visualizing data using Chart.js. Users can explore and analyze data through dynamic charts and customize their view

chart data-visualization event-handling interactive-ui javascript real-time-updates responsive-design web-development

Last synced: 01 Nov 2025

https://github.com/mohamed-walied/customer-behavior-analysis-using-r

Customer Behavior Analysis project utilizing the "Groceries Market Basket Dataset" from Kaggle. The project employs a data-driven approach to uncover customer purchasing patterns and relationships within the grocery market using K-means Clustering and Association Rules using Apriori-Algorithm. In collaboration with some friends.

apriori-algorithm association-rule-learning dashboard data-cleaning data-visualization k-means-clustering r-programming-language

Last synced: 26 Jul 2025

https://github.com/williamd1k0/metacritic-games

Distribution of Metacritic scores for console games.

data-scraping data-visualization metacritic web-scraping

Last synced: 26 Jun 2025

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/irishmorales/ph-poverty-statistics

An exploratory data analysis of Philippine poverty data. Data includes given 1991-2015 data, appended FIES 2018 & 2021 data, and 2024 & 2027 poverty estimates calculated using ARIMA.

data-visualization exploratory-data-analysis philippines poverty-alleviation

Last synced: 22 Mar 2025

https://github.com/samruddhi3012/rfm-analysis

Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.

data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau

Last synced: 27 Jun 2025

https://github.com/catalina2820/inteligencia-de-negocios

This repository contains materials and resources for the Business Intelligence course. It includes notes, workshops, and practical exercises that cover essential concepts and applications in data science, data visualization, machine learning, and big data.

bigdata data-cleaning data-science data-visualization web-scraping

Last synced: 04 Apr 2025

https://github.com/anergictcell/esbmeplots

An extension of the D3.js library for fast and flexible generation of basic plot types

d3js data-visualization javascript plotting

Last synced: 13 Jun 2026

https://github.com/gappeah/solana-ml-forecast

This project uses machine learning, specifically an XGBoost regressor, to predict the price of Solana (SOL) based on historical data and engineered features.

cryptocurrency data-visualization machine-learning solana xgboost

Last synced: 25 Feb 2025

https://github.com/vlad1343/data-visualisation

Python project showcasing interactive and static visualizations using Plotly and Matplotlib. It includes analysis of CSV, JSON, and API data, turning complex datasets into clear, insightful charts.

anova api csv-files data-analysis data-visualization json matplotlib matplotlib-pyplot pandas pandas-python plotly python3 seaborn seaborn-python

Last synced: 08 Apr 2026

https://github.com/faizantkhan/python_matplotlib

Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more

data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python

Last synced: 20 May 2026

https://github.com/traccyyyyy/employeehrwebapp

Modern web application built with Lit, featuring Web Components, real-time data visualization, responsive UI, and RESTful API integration.

api-rest data-visualization developer-tools frontend interactive-dashboard javascript lit real-time state-management ui-ux webapp webcomponents

Last synced: 20 May 2026

https://github.com/fvdavid/d3-in-action

Angular 19 Data Visualization D3js

angular d3-visualization d3js data-visualization typescript

Last synced: 08 May 2026

https://github.com/xmen3em/kaggle-competitions

This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.

data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit

Last synced: 09 Apr 2026

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 01 Jan 2026

https://github.com/gui-sitton/timeseries-taxi

To attract more drivers during peak hours, we need to predict the amount of cab requests for the next hour. Build a model for this prediction.

data-science data-visualization machine-learning ml python time-series time-series-analysis time-series-prediction

Last synced: 20 May 2026

https://github.com/tanishpoddar/logitrack

LogiTrack is a Python & Streamlit-powered inventory management system for real-time warehouse optimization. It offers multi-warehouse planning, interactive maps, and supply chain analytics, supporting global coordinates, CSV/SQL data, and customizable parameters.

data-visualization database inventory-management logistics optimization python streamlit supply-chain supply-chain-analytics warehouse-optimization

Last synced: 02 Nov 2025

https://github.com/andersoncrs/regularizacion_lasso_en_modelos_de_regresion_lineal

Este repositorio contiene un análisis detallado sobre la implementación de la regularización Lasso en modelos de regresión lineal para predecir el precio de vehículos. Se parte de un conjunto de datos limpio y se aplican diversas transformaciones y modelados para mejorar la precisión de las predicciones.

data-analysis data-science data-visualization jupyter-notebook linear-regression regularization-methods seaborn sklearn

Last synced: 16 May 2026

https://github.com/mvharsh/blinkit-sales-dashboard

An interactive Power BI dashboard visualizing Blinkit's sales performance across outlets, item types, and customer ratings for strategic insights.

blinkitdashboard data-analysis data-visualization powerbi

Last synced: 25 Jan 2026

https://github.com/anonymo2239/big-data-churn-analyzer

Scalable customer churn prediction using PySpark. Includes EDA, feature engineering, modeling, and real-time inference on new data.

big-data churn-analysis churn-prediction classification-algorithm data-analysis data-science data-visualization modeling pyspark

Last synced: 21 May 2026

https://github.com/arekflo2002/analiza_danych-rstudio-_dyskryminacja_kobiet

Wykorzystując rstudio oraz zestawy dane ze strony https://www.gapminder.org/data/ badam tematykę dyskrminacjii kobiet na poszczególnych kontynentach i wyciągam odpowiednie wnioski

data data-preparation-and-analysis data-visualization rstudio statistics

Last synced: 14 Apr 2025

https://github.com/driversti/formula1

Formula 1 companion dashboard built on the public live-timing archive — starting with pre-race tyre inventory, more race-weekend insights to follow.

data-visualization f1 formula-1 github-pages python react tailwindcss typescript vite

Last synced: 21 May 2026

https://github.com/gabboraron/plague-inc

Plague Inc: An Epidemic Forecast Concept and Data Visualization Tool. Previously accessible at http://20.234.177.167/. You are welcome to host it on your own server.

big-data data-mining data-science data-visualization epidemic-simulations hackathon

Last synced: 06 Apr 2025

https://github.com/mmzong/gee_lifestyleeffectsonhypertension

Generalized Estimating Equations (GEE), Quasi-likelihood under the Independence Model Criterion (QIC), Longitudinal data, Embedded box plots within violin plots with hypertension risk categories, spaghetti plots, aggregate line plots, histograms, faceted-area plots, box and jitter plots. Investigating the impact of lifestyle on health.

aggregate-line-plot area-faceted-plots box-plots data-analysis data-manipulation data-science data-visualization generalized-estimating-equations histograms jitter-plots longitudinal-data qic quasi-likelihoods r spaghetti-plots violin-plots

Last synced: 29 Jul 2025

https://github.com/abhishekyadav915/diwali_sales_analysis

This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python

Last synced: 05 Apr 2025

https://github.com/thesfinox/mltools

A collection of simple tools for data science and machine learning projects.

ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox

Last synced: 14 May 2025

https://github.com/vishal-038/real_estate_price_prediction

The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features

data-analysis data-science data-visualization machine-learning python

Last synced: 21 May 2026

https://github.com/hfagerlund/machine-learning-iris-analysis

No longer maintained. Moved to https://github.com/hfagerlund/machine-learning-classifier-iris/.

data-visualization jupyter-notebook machine-learning python37

Last synced: 22 Jul 2025

https://github.com/nishumehta/british-airways-reviews-analysis

This project analyzes British Airways reviews using Tableau to create an interactive dashboard. The dashboard visualizes average ratings across multiple metrics and trends over time.

dashboard data-analysis data-visualization tableau tableau-public

Last synced: 12 Jan 2026

https://github.com/divyashah0510/sales-insights-for-retail-company

This project is a data analysis project for a retail company. The company has dataset: sales_data.csv that contains the sales data for the company. The objective of this project is to analyze the sales data and provide insights to the company to improve their sales.

dash data-visualization pandas plotly sales sales-insights streamlit visualization

Last synced: 02 Jan 2026

https://github.com/maazie-khan/austin-housing-insights-powerbi

Worked with a real estate dataset, we will build a tool to evaluate trends and drivers of house prices around Austin, Texas.

dashboard data-analysis data-science data-visualization database powerbi

Last synced: 02 Jan 2026

https://github.com/saniyaacharya04/youtube-trending-video-analyzer

Modular Streamlit dashboard for analyzing trending YouTube videos by views, engagement, and category—powered by the YouTube Data API.

api-analysis clustering-engagement-metrics dashboard data-visualization modular-architecture streamlit trending youtube

Last synced: 21 May 2026

https://github.com/rudra-g-23/power-bi-custom-visual

A custom Power BI visual that displays a customizable, interactive charts with advanced capabilities.

custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization

Last synced: 02 Jan 2026

https://github.com/tushar2704/employee-distribution

This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.

data-analysis data-visualization excel postgresql powerbi sql tushar2704

Last synced: 04 Nov 2025

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/anuuragg/human-microbiome---eda

Fundamentals of Data Science - End Semester Project 1

data-science data-visualization eda fds microbiome

Last synced: 14 Mar 2025

https://github.com/foufou-exe/occitanie-report-rental-yields

This project aims to develop a datavisualization and reporting tool to analyze rental yields in the Occitanie region, for use by real estate investors.

data-visualization jasper java opendata python reporting

Last synced: 22 May 2026

https://github.com/tamanna2005/streamlit-crime-dashboard

A Streamlit-based personal project that visualizes crime data in Pittsburgh through an interactive dashboard, focusing on data storytelling and insightful exploration.

crime-data data-analysis-project data-visualization eda interactive-dashboard python streamlit

Last synced: 28 Jun 2025

https://github.com/vedikasnehil/sql-50

This project focuses on solving 50 SQL problems every weekend from LeetCode to strengthen SQL skills, master advanced techniques, and build consistency. Each solution is documented with clear explanations, creating a valuable resource for learning and application.

data-visualization database-management sql

Last synced: 06 Jan 2026

https://github.com/namratagulati/fraud_detection

This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.

data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python

Last synced: 04 Jun 2026

https://github.com/zmyzheng/stack_overflow_qa_assistant

Big Data Analysis project with recommendation, cluster analysis and graph database

big-data-analytics cluster-analysis data-visualization graph-database hadoop mahout recommendation-system

Last synced: 30 Mar 2025

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends

Last synced: 27 Apr 2026

https://github.com/bala-1409/loan-clustering-datascience-projects

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering clustering-algorithm data-analysis data-science data-visualization kmeans-clustering machine-learning machine-learning-algorithms sql unsupervised-learning unsupervised-machine-learning

Last synced: 22 Mar 2025

https://github.com/rfonod/tableau-dashboard

A Tableau dashboard visualizing the change in the Summary Innovation Index (SII) from 2012 to 2019, relative to selected academic research trends in Europe. It includes a bar chart for country comparisons, a scatter plot for trend analysis, and a map to show geographic patterns, with interactive features for enhanced insights.

dashboard data-visualization innovation-index tableau visualization

Last synced: 25 Jan 2026

https://github.com/dimits-ts/visualization-assignments

Visualizing and analyzing results from the PISA-2018 competitions with regards to Greek performance and gender gap.

data-analysis data-visualization interactive-graphs presentation-slides r-language tableau

Last synced: 06 Nov 2025

https://github.com/noor188/preswald-data-app

A data app to visualize and manipulate the graduate admission dataset

data-analysis data-visualization open-source

Last synced: 04 Jul 2025

https://github.com/oelin/textgram

A simple text-based data visualisation library.

ascii-art data-visualization diagram python

Last synced: 23 May 2026

https://github.com/nmatthews2203-del/rent-affordability-explorer

Interactive housing analytics dashboard using Zillow rent data and Census income data to analyze affordability, rent trends, and geographic housing differences across U.S. counties.

altair data-analytics data-visualization housing-data interactive-dashboard pandas plotly python real-estate sql sqlite streamlit

Last synced: 03 May 2026

https://github.com/samuelbarbosadev/walrmart_data_analysis

You have been hired by Walmart to survey the revenue of their stores in the USA and point out which store would be best to expand its size. It is necessary to analyze the weekly sales of each store, calculate some important information that will be asked, and at the end of it all, indicate which store should be invested in.

data-preparation data-understanding data-visualization pandas python

Last synced: 08 May 2026

https://github.com/muthukumar0908/phonepe-pulse-data-visualization-and-exploration

Creating a dashboard by using streamlit application. in this app visualizing the data taken from Phonepe pulse Github repository.

data-visualization github-config mysql-connector-python mysql-database pandas plotly python streamlit-webapp

Last synced: 29 Apr 2026

https://github.com/sivkri/shiny-scatter-plot-app

This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.

data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny

Last synced: 22 Mar 2025

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026