An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/netesf13d/expt-sequence-analysis

Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.

cold-atoms data-analysis data-visualization optical-tweezers

Last synced: 24 Jul 2025

https://github.com/istinnew/eniac_ab_insight

Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button. The analysis compares different button designs to determine the most effective strategy for increasing engagement.

ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code

Last synced: 29 Apr 2026

https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice

A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.

data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks

Last synced: 02 Aug 2025

https://github.com/shariqayan/diwali_sales_analysis_python

The Diwali Sales Analysis project focuses on analyzing sales data during the Diwali festival to gain insights into customer behavior, improve customer experience, and optimize sales strategies.

data-visualization matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard

Last synced: 01 Aug 2025

https://github.com/stanleynguyen/so.cube

World map visualisation of World Cube Association data 🌏

cas cube data-visualization leaftlet map

Last synced: 24 Jul 2025

https://github.com/farhad-here/student_performance_analyzer

Student Performance Analyzer in Python, one of my data analysis course projects. It teaches filter(), lambda, and map() in Python.

data-analysis data-visualization filter kaggle kaggle-dataset lambda map pandas python python-tutorial streamlit

Last synced: 29 Apr 2026

https://github.com/vishal-bhandary/sql-data-analytics

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.

analytics business-intelligence customer-segmentation dashboarding data-analysis data-reporting data-visualization data-warehouse etl kpi product-analysis sql sql-server star-schema t-sql

Last synced: 30 Jun 2026

https://github.com/wonderwoman2020/atomic-sensors-website

Repository for a .NET academic project completed in 2023

academic-project asp-net data-visualization website

Last synced: 08 Jun 2026

https://github.com/angchekar28/air-quality-index-analysis

This project analyzes Air Quality Index (AQI) data to identify pollution trends, seasonal variations, and the impact of different pollutants. It includes data visualization, correlation analysis, and insights into air quality variations over time.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python

Last synced: 30 Apr 2026

https://github.com/devprnvk/realestateml

This Python program analyzes a dataset (HousePricePrediction.xlsx) containing information about house prices. It utilizes pandas for data manipulation, matplotlib for plotting, and seaborn for visualizing correlations and distributions.

data-science data-visualization datasets houses npm plotting prediction-model seaborn

Last synced: 30 Apr 2026

https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python

This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.

csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/diogojorgebasso/dataanalysis_r_minesnancy

Code and materials for the data analysis in R courses at Mines de Nancy. You will also find R scripts, notebooks, and other resources for each lesson.

analyse-data data-analysis data-science data-visualization estatistics r statistiques statistiques-descriptives

Last synced: 30 Apr 2026

https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-

This study is based on confirmed cases and deaths collected from Pakistan. Results demonstrate the promising potential of time series models in forecasting COVID-19 cases and highlight their superior performance compared to the LSTM. We apply AI-based forecasting models such as ARIMA, LSTM, Prophet, and VAR.

arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression

Last synced: 19 Jun 2026

https://github.com/dbolotov/ts_smoothing_visualizer

Streamlit app for visualizing and comparing time series smoothing methods on real and synthetic datasets.

data-science data-visualization streamlit time-series

Last synced: 24 Jul 2025

https://github.com/rijul007/smartwatch-data-analysis-using-python

Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python

Last synced: 30 Apr 2026

https://github.com/samuelson777/iris-flower-classification

Iris Flower Classification: A machine learning project that classifies iris flowers into three species based on sepal and petal dimensions. Includes data exploration, visualization, and model evaluation using Python and scikit-learn.

classification data-science data-visualization iris-dataset jupyter-notebook machine-learning python scikit-learn

Last synced: 09 May 2026

https://github.com/shyamkumarnagilla/big-sales-prediction

The "Big Sales Prediction" model is a machine learning project that aims to accurately forecast sales for a given period. The model utilizes the Random Forest Regressor algorithm, a powerful ensemble learning technique, to analyze historical sales data and make predictions. It can be valuable for businesses looking to optimize sales forecasting.

data-analytics data-preprocessing data-science data-visualization machine-learning model-evaluation model-training

Last synced: 09 May 2026

https://github.com/mmartin46/county-health-findings-project

Analyze the data set provided by United Health Group (UHG) to determine the impact of race, social, and demographic factors on health, survival, and mortality.

analysis data-science data-visualization linear-regression machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/gerhynes/d3-births-pie-chart

A D3 pie chart showing UN birth data grouped by month and quarter. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 30 Apr 2026

https://github.com/farrelfaricaf/exploratorydataanalyst---titanic

This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.

data data-analysis data-science data-visualization eda python titanic-dataset

Last synced: 31 Jul 2025

https://github.com/the-clone-xyz/stats-lapas-pakam

Visualization of inmate data by gender at the Lubuk Pakam prison (Lapas Lubuk Pakam), generated automatically from BPS Deli Serdang data via GitHub Actions.

bps-api data-visualization github-actions lubuk-pakam statistics

Last synced: 30 Apr 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

In this project I apply data preprocessing techniques to clean the dataset using libraries, derive insights to find the hot picks among shopping items, and use data visualization libraries to surface trends and other aspects, as a small contribution to the field of data science.

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/miguelmedinacastro/trabalho-dados-r

Final project for the Exploratory Data Analysis course

data data-science data-science-projects data-visualization database r rstudio

Last synced: 01 May 2026

https://github.com/gitchaell/computer-scrapping

Tool that extracts data from the pages of companies that sell computers in the city of Trujillo, Peru, exports it to an XLSX file according to a relational data model, and displays it on a Power BI dashboard.

data-analysis data-structures data-visualization database dbdiagram export-excel powerbi scrapper-script scrapping xlsx

Last synced: 01 May 2026

https://github.com/falakrana/data-analysis-visualization

This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.

data-analysis data-visualization python tableau-public

Last synced: 01 May 2026

https://github.com/alrza2003/google-data-analysis-case-study-cyclistic

This project analyzes Cyclistic’s trip data to identify patterns in bike usage between casual riders and annual members. The findings help optimize marketing strategies and membership conversions.

business-task cyclistic-bike-share-analysis-case-study data-analysis data-science data-visualization google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional jupyter-notebook python rmarkdown tableau

Last synced: 09 May 2026

https://github.com/gerhynes/d3-movie-quotes

A simple page built to practice binding data to elements using D3. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 01 May 2026

https://github.com/jayavarshini-jayakumaran/nba-exploratory-data-analysis

A data analytics project that explores NBA game and player data using Python and Power BI. Features data preprocessing, EDA, feature engineering, and an interactive dashboard for visualizing team and player performance trends.

data-analysis data-visualization exploratory-data-analysis powerbi python3

Last synced: 20 Jun 2026

https://github.com/teamtigers/echartify

A web application built with .NET Core 2.2 that reads the national election dataset of Bangladesh as quickly as possible and then presents it with different statistical charts.

bangladesh chartjs code-first-migration cross-platform data-analysis data-structures data-visualization dotnet-core election-analysis election-data entity-framework-core materializecss mvc npoi razor-pages

Last synced: 16 Apr 2026

https://github.com/hooopo/ossinsight-pick

Handpicks, features, or highlights a selection of open-source repositories each week. We cherry-pick the best, trending, or otherwise interesting repositories, providing an in-depth analysis you won't find elsewhere, thus enabling developers to discover, learn from, and contribute to these noteworthy projects.

analytics data-visualization github open-source trending-repositories visualization

Last synced: 30 Jul 2025

https://github.com/ishmal793/bi-dummy-

An interactive and beginner-friendly data dashboard built using Streamlit. Upload your own CSV or Excel file, apply filters, view key statistics, and generate beautiful visualizations with no coding required.

data-analytics data-visualization eda pandas plotly python-dashboard streamlit

Last synced: 01 May 2026

https://github.com/corezen/graph-voyager

Interactive React app for visualizing and exploring graph data structures. Built with Vite and Tailwind CSS.

data-visualization github-pages graph-visualization react tailwindcss vite

Last synced: 01 May 2026

https://github.com/abdoomohamedd/python-data-analysis-projects

A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp

data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python

Last synced: 01 May 2026

https://github.com/jfaccioli/leaflet-earthquake

Geo mapping earthquakes with Leaflet / JavaScript / GeoJSON

data-visualization geojson javascript json leaflet

Last synced: 01 May 2026

https://github.com/rafath0ssain/predihome

Data analysis using economic factors affecting living conditions across Canadian provinces.

data-analysis data-visualization dplyr ggplot2 graph kaggle linear-regression prediction-model r shiny tidyr

Last synced: 01 May 2026

https://github.com/hamza-ali-shahjahan/thousandworlds-explorer

An explorable map of every world humanity has discovered — built on the NASA Exoplanet Archive.

astronomy data-visualization exoplanets nasa react space typescript vite

Last synced: 29 Jun 2026

https://github.com/leandrocollares/beyond-the-3-point-arc

A responsive scatter plot that shows the percentage of points scored by NBA teams via 3-point and mid-range field goals

d3 data-visualization react

Last synced: 01 May 2026

https://github.com/ujjwalll/get-that-flair

A repository for a project that detects the flair of Reddit posts from their links. You can find the working model at https://get-that-flair.herokuapp.com/

data-analysis data-visualization django-application herokuapp machine-learning naive-bayes-classifier praw-reddit python3 random-forest reddit-api sentiment-analysis topic-modeling

Last synced: 01 May 2026

https://github.com/mrwilbroad/react-data-visualizaion

Data visualization with React.js and react-chartjs-2

chartjs data-visualization graphs react-chartjs-2 reactjs

Last synced: 02 May 2026

https://github.com/shaheerazam-dev/cyclistic-case-study-google-data-analytics-certificate

This case study simulates the real-world experience of a junior data analyst at Cyclistic, a fictional company. We will leverage the data analysis process framework (Ask, Prepare, Process, Analyze, Share, Act) to address critical business questions and provide data-driven insights to guide strategic decision-making.

bigquery data-science data-visualization spreadsheet sql tableau

Last synced: 06 Feb 2026

https://github.com/harshindcoder/salifort_motors_project

This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.

data-analysis data-science data-visualization hr-analytics machine-learning tree-models

Last synced: 02 May 2026

https://github.com/shridhar1504/rafik-s-kitchen-data-analysis

This project analyzes the sales and expenses data of a famous fast-food restaurant. It focuses on gaining insights that will boost future sales and on business strategies to improve profit margins. Tools used: SQL, Python, Power BI, and MS Office tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 10 May 2026

https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast milk production. The model is evaluated using various metrics.

adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends

Last synced: 02 May 2026

https://github.com/hhrh/real-time-fraud-detection

A production-style real-time fraud detection pipeline using Kafka, FastAPI, XGBoost/CatBoost/LightGBM, Prometheus, and Grafana.

apache-kafka catboost data-visualization fastapi grafana lightgbm mlops prometheus python xgboost

Last synced: 10 May 2026

https://github.com/gerhynes/d3-mobile-subscription-literacy-scatterplot

A D3 scatterplot showing mobile phone subscriptions against literacy rates. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 02 May 2026

https://github.com/hafs96/prediction_consommation-de-carburant

In this project, the goal is to develop a model that predicts whether a car has high or low fuel consumption based on its technical characteristics.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/bhawnagoyal18/ai-doctor-a-symptom-checker-disease-predictor

AI Doctor is an intelligent healthcare application that utilizes machine learning (ML) and Python to predict potential diseases based on user-input symptoms. The project integrates data from multiple medical datasets and provides an interactive web-based UI for an intuitive user experience.

data-analysis data-engineering data-visualization dataset flask html5 machine-learning python sql stacking statistics

Last synced: 02 May 2026

https://github.com/rorrell/employmentdata

A Jupyter Notebook where I use group by to analyze the average unemployment rate by year

data-analysis data-visualization jupyter-notebook python3

Last synced: 02 May 2026

https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python

We developed a song recommendation system for the user with the data we received from our Spotify song dataset. Data set and other applications are given in the description. Have a nice day.

data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system

Last synced: 02 May 2026

https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio

This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time.

data data-analysis data-visualization pandas seaborn

Last synced: 10 May 2026

https://github.com/dissorial/prx21_erikz

Analysis of self-tracked data: interactive visualizations & predictive algorithms

analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization

Last synced: 02 May 2026

https://github.com/femincan/d3-treemap-diagram

My solution for the Visualize Data with a Treemap Diagram project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 02 May 2026

https://github.com/inevolin/multivariate-data-analysis

Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.

analytics data-analysis data-science data-visualization javascript

Last synced: 09 Jun 2026

https://github.com/ronaldkanyepi/southern-africa-health-indicators-analysis

Visualizing Health Trends in Southern Africa: Analyzing World Bank data (2013–2020) to uncover patterns in life expectancy, infant mortality, HIV prevalence, and maternal mortality.

botswana data-visualization dplyr ggplot2 ggtext health healthcare mozambique patchwork r south-africa tidyverse viridis zimbabwe

Last synced: 09 Jun 2026

https://github.com/miteshgupta07/covid-19-report-dashboard-using-streamlit

A Streamlit dashboard for COVID-19 reporting that provides real-time updates, visualizations, and analysis of global and local COVID-19 data to track the pandemic's progress and impact.

data-visualization python streamlit

Last synced: 03 May 2026

https://github.com/nix7amcm/fcc-data-viz-cert-projects

These are my projects for the freeCodeCamp Data Visualization certification.

d3 d3-visualization d3js data-visualization data-viz freecodecamp freecodecamp-project html-css-javascript

Last synced: 03 May 2026

https://github.com/rohitinu6/tesla-price-prediction

A machine learning project that predicts future stock price movements using Logistic Regression, SVC, and XGBoost with engineered financial features.

data-analysis data-visualization feature-engineering financial-analysis logistic-regression machine-learning matplotlib python scikit-learn seaborn stock-market stock-price-prediction support-vector-machine time-series xgboost

Last synced: 03 May 2026

https://github.com/drkbluescience/ibm-datascience-spacex

In this project, we predict whether the Falcon 9 first stage will land successfully by following the data science methodology.

data-visualization data-wrangling machine-learning-algorithms sql-query sqlite webscraping-data

Last synced: 10 May 2026

https://github.com/swanie21/svg-graphs

Data visualization with svg graphs

data-visualization react svg-graphs

Last synced: 03 May 2026

https://github.com/codeslash21/tmdb_data_analysis

We analyzed the TMDB dataset, which contains details of around 11,000 movies, to find some interesting facts about it.

data-analysis data-visualization matplotlib nanodegree-project numpy pandas python tmdb-movie

Last synced: 03 May 2026

https://github.com/loosenthedark/ci_data-visualisation-dashboard-mini-project

Code Institute IFD Module demo project using D3.js, Crossfilter, dc.js & queue.js to leverage sample data relating to salary levels & participation in academia parsed by gender. Bootstrap-based theme.

bootstrap4 code-institute crossfilter css3 d3js data-visualisation data-visualization dcjs frontend html5 javascript queue svg

Last synced: 11 May 2026

https://github.com/hauntedhost/modern-drive

ModernDive: An Introduction to Statistical and Data Sciences via R at http://www.moderndive.com

data-science data-visualization r statistics

Last synced: 29 Jul 2025

https://github.com/leandrocollares/foreign-born-population-in-canada

Responsive bar chart that shows the percentage of foreign-born population in Canada between 1871 and 2011

d3 data-visualization svelte

Last synced: 03 May 2026

https://github.com/ljadhav25/swiggy-restaurant-analysis

This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots

Last synced: 03 May 2026

https://github.com/ibrahim1993-2/iraq-primary-enrollment

Data analysis of Iraq’s primary school enrollment (1960–2024). Includes SQL-style exploration, 14 insights with CSV exports, visualizations, and a polished Jupyter Notebook (PDF + HTML).

data-visualization eda education-data iraq jupyter-notebook matplotlib pandas portfolio-project python3 world-bank

Last synced: 03 May 2026

https://github.com/malakasupun/crime-data-analysis-of-lapd

This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).

data-analysis data-visualization llm nlp sql

Last synced: 29 Jul 2025

https://github.com/akashparley/ipl-data-analysis

The IPL Data Analysis project focuses on extracting valuable insights from IPL match data using various data analytics techniques. By analyzing historical match outcomes, player performances, team comparisons, and venue statistics, the project visualizes trends and patterns through graphs like bar charts, line graphs, and scatter plots.

analytics data-visualization excel metplotlib numpy pandas python seaborn tableau-dashboards

Last synced: 04 May 2026

https://github.com/pejpero/multi-class_classifier

This project demonstrates multiclass classification using Perceptron and Logistic Regression, implemented from scratch without using built-in libraries. It includes techniques like One-Versus-The-Rest and One-Versus-One for Perceptron, and Softmax for Logistic Regression, with a focus on understanding core ML concepts.

data-visualization logistic-regression machine-learning multiclass-classification one-vs-one one-vs-rest perceptron python scratch-implementation softmax

Last synced: 09 Jun 2026

https://github.com/samruddhi3012/screen-time-analysis

Hi! This repo demonstrates a Python project on screen time analysis.

data-analysis data-visualization python

Last synced: 04 May 2026

https://github.com/mindlessmuse666/titanic-data-visualization

A project visualizing Titanic passenger data using the Python libraries Matplotlib, Seaborn, and Plotly.

data-analysis data-visualization matplotlib pandas plotly python seaborn titanic

Last synced: 04 May 2026