An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/riyanshibariyaa/Vehicle-Emission-Analysis_MACHINE_LEARNING_

Vehicle Emissions Analysis: This project focuses on analyzing vehicle emissions data using various machine learning techniques. The dataset used for analysis contains information about vehicle emissions, including engine size, CO2 emissions, transmission type, smog level, and fuel consumption.

artificial-intelligence data-visualization exploratory-data-analysis feature-engineering linear-regression machine

Last synced: 12 Aug 2025

https://github.com/ashish-kr-srivastava/olympic-games-eda---python

Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.

data-visualization datacleaning eda matpotlib numpy pandas python seaborn seaborn-python

Last synced: 09 Apr 2026

https://github.com/mindlessmuse666/eda-pandas

An exploratory data analysis (EDA) project on Titanic passenger data using the Pandas library. It covers data loading, preprocessing, statistical analysis, visualization, and pivot table creation. The goal of the project is to demonstrate the core EDA methods and tools for analyzing and understanding data.

data-analysis data-processing data-science data-visualization eda exploratory-data-analysis matplotlib pandas python titanic

Last synced: 18 Apr 2026

https://github.com/r12habh/canada-imigration-data-analysis

Dataset: Immigration to Canada from 1980 to 2013 - International migration flows to and from selected countries - The 2015 revision from the United Nations' website. (Cognitive Class Data Analysis with Python)

canada data-analysis data-science data-visualization datascience python python3

Last synced: 23 May 2026

https://github.com/tsbarr/belly-button-challenge

Using front-end development tools (JavaScript, HTML, and CSS), I built an interactive dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.

data data-visualization javascript

Last synced: 04 Mar 2026

https://github.com/nathanaelmutua/british-airways-data-science-challenge

My solutions for the Forage program: web scraping, data cleaning, analysis, and visualization to extract business insights. Demonstrating practical data science skills for real-world problem-solving.

british-airways british-airways-virtual-program data-science data-visualization dataanalysis forage internship-project internship-task jupyter-notebook python sentiment-analysis webscraping

Last synced: 12 Aug 2025

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/joselvillaronga/wifi-scan-measure-raspberry-pi-4

A Flask-based web tool that scans WiFi networks on 2.4 GHz and 5 GHz, estimates distances from RSSI, and offers interactive visualizations of channels, signal levels, and scan history. It supports storage in JSON or MongoDB and runs as a systemd service for continuous monitoring.

data-visualization debian flask graphs iot json mongodb network-monitoring network-tools python rssi-distance-estimation systemd wifi-scanner

Last synced: 09 Apr 2026
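
The wifi-scan-measure-raspberry-pi-4 entry above mentions estimating distance from RSSI. The listing does not say which formula the repository uses, so the following is only a minimal sketch of one common approach, the log-distance path-loss model. The function name, the reference RSSI at 1 m (-40 dBm), and the path-loss exponent (2.0, roughly free space) are all illustrative assumptions.

```python
def estimate_distance_m(rssi_dbm, rssi_at_1m_dbm=-40.0, path_loss_exponent=2.0):
    """Estimate distance in metres with the log-distance path-loss model.

    rssi_at_1m_dbm and path_loss_exponent are assumed calibration values,
    not figures taken from the listed repository.
    """
    return 10 ** ((rssi_at_1m_dbm - rssi_dbm) / (10 * path_loss_exponent))


# A reading equal to the 1 m reference gives 1 m; every further drop of
# 20 dB multiplies the estimate by 10 when the exponent is 2.
near = estimate_distance_m(-40.0)
far = estimate_distance_m(-60.0)
```

In practice both calibration values depend on the transmitter and the environment, so estimates of this kind are rough at best.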

https://github.com/the-ethan-hunt/dekh-data

Playground for data visualization notebooks

data-visualization jupyter-notebook python

Last synced: 28 Mar 2025

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/smpotts/dash-live-updates

Figuring out how to do live updates in Dash Plotly.

dash-plotly data-visualization python

Last synced: 27 Jul 2025

https://github.com/jakebrehm/geophotos

🗺 📍 A Python package to pull, analyze, and plot coordinates from various sources.

data-visualization gdal geopandas heatmap osgeo photos plot plotting python python-3

Last synced: 09 Jun 2026

https://github.com/rb-thompson/machine-learning-basics

Implement a machine learning pipeline on the Iris flower dataset.

data-preprocessing data-visualization model-training python scikit-learn

Last synced: 18 Jun 2026

https://github.com/grascya/sleep-health_-lifestyle-dataset

Classifier to predict the presence of a sleep disorder based on the other columns in the dataset.

data-visualization exploratory-data-analysis joblib machine-learning-algorithms pickle python statistical-analysis

Last synced: 20 May 2026

https://github.com/alan-oliveir/previsao_cartao_fidelidade

Data science project for predicting the loyalty plan for an airline's customers.

data-science data-visualization database gradio python sql

Last synced: 04 May 2026

https://github.com/easonlai/covid19_hk_analysis

This is a code sample of data analysis (with visualization) for COVID-19 cases in Hong Kong. Data is obtained from the official data.gov.hk.

covid-19 data-analytics data-science data-visualization matplotlib pandas python seaborn seaborn-plots

Last synced: 12 Apr 2026

https://github.com/lucas-mazzolim/superstore-bi

Project where I prepared two data sources for querying and created a BI visualization in Data Studio. Used tools such as MySQL, Looker Studio, Google Spreadsheet, and Python.

business-intelligence data-analysis data-visualization google-looker-studio mysql spreadsheet

Last synced: 27 Jul 2025

https://github.com/sitek94/react-interactive-scatter-plot

An interactive scatter plot made with React and D3

d3 data-visualization react

Last synced: 07 May 2026

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/shellynagar27/marketing-content-performance-analysis

Analyzed 2024 social media campaign data from TikTok, Instagram, LinkedIn, and X.com using Power BI to uncover performance trends across platforms, content types, and regions. Built an interactive dashboard to drive insights on engagement, optimal posting times, and content strategy.

data-analysis data-modelling data-visualization excel figma marketing-analytics powerbi powerquery wireframing

Last synced: 26 Jun 2025

https://github.com/erictleung/tidytuesdays

📈 My attempts at #tidytuesday

data data-science data-visualization r rstats tables tidytuesday tidyverse

Last synced: 19 Sep 2025

https://github.com/saketr3/voting-policy-impact-visualizer

Data visualization web app where users can compare voter turnout of different demographics with states’ voting policy fairness scores

data-visualization voting

Last synced: 14 Mar 2025

https://github.com/warazkhan/airplane-crashes-and-fatalities-since-1908-

This project analyzes airplane crash data (1908-2008) ✈️📊 to uncover trends in aviation accidents, fatalities, and safety improvements. Using exploratory data analysis (EDA) and data visualization, we examine key factors influencing crashes, identify high-risk regions, and explore advancements in aviation safety.

data-analysis data-visualization exploratory-data-analysis

Last synced: 10 Jun 2026

https://github.com/analyticalnahid/plotly-tutorial

An intro to Plotly for Data Science

data-science data-visualization ploty python3

Last synced: 28 Mar 2025

https://github.com/analyticalnahid/seaborn-tutorial

A complete Notebook on Seaborn for Data Science

data-visualization seaborn seaborn-tutorial

Last synced: 23 Aug 2025

https://github.com/akhi07rx/petals-using-r

This R code generates a plot of a flower. It uses polar coordinates and the sine function to create the petal shapes and then plots them.

data-visualization graphics opensource plot r trignometry

Last synced: 23 May 2026
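
The petals-using-r entry above describes drawing petals with polar coordinates and the sine function. The sketch below illustrates that idea in Python rather than the repository's R. The curve r = |sin(k·θ/2)|, which yields k petals, and the function name `petal_points` are assumptions chosen for illustration, not the repository's actual code.

```python
import math


def petal_points(petals=5, samples=1000):
    """Return (x, y) points on the flower outline r = |sin(petals * theta / 2)|."""
    points = []
    for i in range(samples + 1):
        theta = 2 * math.pi * i / samples             # angle sweeps one full turn
        r = abs(math.sin(petals * theta / 2))         # radius rises and falls once per petal
        points.append((r * math.cos(theta), r * math.sin(theta)))  # polar -> Cartesian
    return points


points = petal_points(petals=5)
```

The resulting list can be passed to any plotting library as x and y sequences; each petal tip lies on the unit circle and the outline returns to the origin between petals.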

https://github.com/leandrocollares/population-in-dutch-provinces

A responsive bar chart showing the population of Dutch provinces

d3 data-visualization svelte

Last synced: 16 Apr 2026

https://github.com/nivasharmaa/friskwatch

A Java program for analyzing stop-and-frisk data from the NYPD. Features data import, organization, and statistical analysis to compare occurrences during and after policy implementation.

data-analysis data-visualization dataprocessing datascience file-io java java-oop nypd-data

Last synced: 19 May 2026

https://github.com/dineshram0212/youtube-analysis

This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.

data data-visualization pandas python webscraping youtube-api-v3

Last synced: 19 Jun 2026

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains my solution for the 2022 Women in Data Science Kaggle competition, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/antrikshy/personalmovieanalysis

Finds interesting patterns in an IMDb ratings export; written as a Jupyter notebook, viz using Seaborn

data-visualization imdb jupyter-notebook movie-ratings pandas python seaborn

Last synced: 07 May 2026

https://github.com/macorisd/instagram-fake-account-analysis

A project in R focused on detecting fake Instagram accounts. It includes exploratory data analysis, data visualization, and analysis using three techniques: association rules, formal concept analysis, and regression. The results are presented in an interactive Quarto book.

data-analysis data-science data-visualization r

Last synced: 10 Jun 2025

https://github.com/christos-pelekis/harsourcerer

An inclusive MERN stack-based platform for comprehensive analysis and exploration of HTTP traffic data extracted from HAR (HTTP Archive) files.

data-visualization har-files http-traffic mern-stack

Last synced: 29 Jul 2025

https://github.com/danielrosehill/value-factors-data-vis

Streamlit app containing visualisations of the Global Value Factors Database (GVFD) released by the IFVI in 2024

data data-visualization sustainability sustainability-data

Last synced: 29 Jul 2025

https://github.com/shreedata/covid-da-dasboard-using-powerbi

This repository showcases a PowerBI dashboard focused on visually representing COVID-19 data for Indian states and Union Territories in an easily understandable way. The dataset is sourced from Kaggle.

data-cleaning data-visualization datanalaysis microsoft microsoft-powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/archanakokate/eda_amazon_products_and_discounts_2023

Exploratory Data Analysis (EDA) on Amazon's 2023 Products and Discounts data

data-analysis data-mining data-visualization exploratory-data-analysis

Last synced: 03 Jan 2026

https://github.com/swethajoseph/statistical-stock-performance-analysis

Conducted a statistical analysis of Microsoft, Tesla, and Apple stock performance compared to the S&P 500, examining price trends, volatility, and correlations to derive investment insights.

advancedexcel comparative-analysis data-analysis data-visualization datapreparation descriptive-statistics moving-average msexcel performance-analysis performance-metrics regression-analysis statistical-analysis

Last synced: 03 Jan 2026

https://github.com/fawadeqbal/data-science

A comprehensive repository covering essential Data Science concepts using PyTorch, including anomaly detection, classification, clustering, regression, and more. Includes hands-on implementations and tutorials for each concept.

calculus calculus-2 data-science data-visualization dataset machine-learning python pytorch statistics

Last synced: 29 Jul 2025

https://github.com/hasinii12/-chocolate-analysis-dashboard

This Power BI report provides a comprehensive analysis of chocolate ratings and related attributes.

data-analysis data-visualization powerbi

Last synced: 09 Feb 2026

https://github.com/chahelgupta/interactive-data-visualization-tool-java

The JavaFX project aims to build an interactive data visualization tool offering Pie Charts, Bar Graphs, and Line Graphs. Users can input data for each chart type, customize visual aspects like colors and labels, and interact with zooming and tooltips.

data-visualization data-visualization-javafx data-visualization-project java java-application javafx javafx-application javafx-application-framework javafx-project

Last synced: 02 Jan 2026

https://github.com/malakasupun/crime-data-analysis-of-lapd

This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).

data-analysis data-visualization llm nlp sql

Last synced: 29 Jul 2025

https://github.com/yash22222/literacy-exploration-analysis

Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.

csv data-analysis data-visualization government-data india literacy literacy-analysis states

Last synced: 29 Jul 2025

https://github.com/saroshfarhan/irish_hospital_data_anaysis

Analysis of Irish hospital patient discharge data for four counties

data-analysis data-science data-visualization healthcare irish-data r-programming-language

Last synced: 18 Feb 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

🚴‍♂️ Analysis of how the pandemic affected bicycle use in CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/hauntedhost/modern-drive

ModernDive: An Introduction to Statistical and Data Sciences via R at http://www.moderndive.com

data-science data-visualization r statistics

Last synced: 29 Jul 2025

https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds

Contains recommendations to the UK Department for Education on 10 Local Authorities where the National Tutoring Programme (NTP) should be intensified, and a response to the UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target that seems far-fetched.

data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics

Last synced: 21 Sep 2025

https://github.com/sejalkoli/powerbi-dashboard

Exploring insights and boosting business success with my Superstore Sales Dashboard project using Power BI.

dashboard data-analytics data-visualization powerbi

Last synced: 07 Nov 2025

https://github.com/prady2309/stock-analysis

Analysis on the stock prices of Apple, Google, Microsoft and Amazon

data-analysis data-science data-visualization python stock-market

Last synced: 19 May 2026

https://github.com/jabulente/tukey-s-hsd-for-pairwise-group-comparisons

This repository contains a Python project dedicated to performing Tukey’s Honest Significant Difference (HSD) test for pairwise group comparisons.

ai anova-analysis anova-test data-science data-visualization machine-learning math matplotlib-pyplot post-hoc post-hoc-analysis re real-world-problem-solving scipy-stats seaborn-plots statistics statsmodels string turkey-hsd

Last synced: 29 Jul 2025

https://github.com/athenyx04/arion

Smart animal weighing module for Demeter

data-visualization firebase livestock nextjs

Last synced: 07 May 2026

https://github.com/samukiszhsd/alteryx-analytics

You are working with Itaú bank transaction data and need to run some analyses to help the audit team detect unusual patterns and possibly suspicious transactions.

alteryx data-analysis data-structures data-visualization etl workflow

Last synced: 18 Feb 2026

https://github.com/no-country-simulation/c21-55-n-data-bi

Statistical analysis project in Power BI on student dropout in university cultural degree programs in Argentina.

data-visualization

Last synced: 18 Feb 2026

https://github.com/stat-by-tish/house-insurance-data-analysis

Fraud detection in house insurance using MATLAB – EDA, classification (trees, KNN, SVM, RF), and clustering. Built for a student project.

classification classification-trees clustering data-visualization exploratory-data-analysis house-data insurance-claims kmeans-clustering knn-classification matlab

Last synced: 26 Jun 2025

https://github.com/sukhitashvili/pca_tutorial

PCA algorithm from scratch, using only matrix-vector multiplications

data-analysis data-science data-visualization machine-learning-algorithms pca

Last synced: 29 Mar 2025
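
The pca_tutorial entry above describes PCA built from matrix-vector multiplications alone. One way this can be done is power iteration on the covariance matrix, sketched below. The listing does not say whether the repository uses this method, so the approach, the helper names, and the toy data are all assumptions for illustration.

```python
import math


def mat_vec(matrix, vector):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def covariance(rows):
    """Sample covariance matrix of the data, one observation per row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    return [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def first_component(rows, iterations=200):
    """Approximate the first principal component by power iteration.

    Assumes the data has non-zero variance and that the all-ones start
    vector is not orthogonal to the dominant eigenvector.
    """
    cov = covariance(rows)
    v = [1.0] * len(cov)
    for _ in range(iterations):
        w = mat_vec(cov, v)                      # the only heavy operation
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]                # renormalise to unit length
    return v


# Hypothetical toy data lying close to the line y = 2x.
data = [(1.0, 2.0), (2.0, 4.1), (3.0, 5.9), (4.0, 8.0)]
component = first_component(data)
```

Further components could be obtained by deflation (subtracting the found component's contribution from the covariance matrix and repeating), which also needs nothing beyond matrix-vector products.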

https://github.com/amarlearning/exploring-67-years-of-lego

In this project, I explored a database of every LEGO set ever built.

data-manipulation data-visualization importing-and-cleaning-data jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/borjamome/radiografia-madrid

Analysis of the population, economy, and society of Madrid with R.

data-analysis data-visualization madrid r

Last synced: 17 Jun 2025

https://github.com/alinababer/covid19-timeseries-cases-and-deaths-forecasting-

This study is based on confirmed COVID-19 cases and deaths collected from Pakistan. We apply AI-based forecasting models such as ARIMA, LSTM, Prophet, and VAR. Results demonstrate the promising potential of time series models in forecasting COVID-19 cases and highlight their superior performance compared to the LSTM.

arima covid-19 data-analysis data-science data-visualization fbprophet forecasting lstm rnn time-series var vectorautoregression

Last synced: 19 Jun 2026

https://github.com/shuyib/london_weather_prediction

The London Weather Project aims to predict the mean temperature in London using historical weather data, involving data cleaning, feature engineering, and modeling with techniques like imputation, transformation, scaling, and the use of MLflow for tracking model performance and hyperparameters.

data-cleaning data-lab data-science data-visualization datacamp-projects environmental-science feature-engineering forecasting jupyter-notebook machine-learning mlflow open-data python random-forest regression-analysis time-series weather-prediction

Last synced: 29 Mar 2025

https://github.com/danasilver/twacker

Track your Twitter friends (following) and followers.

data-visualization heroku-app twitter

Last synced: 12 Jul 2025

https://github.com/samir-atra/share-lm_dataset_analysis

Analysis, studies and optimizations on the ShareLM extension dataset

data-analysis data-visualization gemma3n huggingface huggingface-transformers pandas

Last synced: 19 May 2026

https://github.com/imnotamr/datasets-used

A comprehensive collection of datasets for machine learning and data science projects, covering topics from advertising and sales to health and sports analytics

ai classification data-analysis data-science data-visualization deep-learning jupyter-notebook machine-learning models python regression-models

Last synced: 19 May 2026

https://github.com/an4pdm/relatorio-de-vendas

This project was built with the tools offered by Power BI in order to improve my knowledge of ETL. The data used came from the "Kaggle" website.

data-analysis data-visualization database etl powerbi

Last synced: 20 Jun 2026

https://github.com/zborovskaanna/e-commerce-web-events-analysis

SQL project based on the BigQuery public database 'The Look e-Commerce' and a dashboard in Looker Studio

analysis bigquery dashboard data-visualization looker-studio sql

Last synced: 03 Jan 2026

https://github.com/sammdu/global-warming-hurricane-typhoon

The Effect of Global Warming on Hurricane and Typhoon Occurrence

data-science data-visualization global-warming hurricane-data

Last synced: 07 May 2026

https://github.com/farseenmanekhan1232/analyse-economic-cycle

A Python-based CLI tool for analyzing economic cycles and making data-driven investment decisions in the Indian stock market using Kite Connect API.

data-visualization investment matplotlib portfolio-optimization python stock-market

Last synced: 30 Jul 2025

https://github.com/vyjayanthipolapragada/genai_smart_retail_recommendation

GenAI Smart Retail is a recommendation system designed for retail environments. It provides personalized product recommendations to users based on product descriptions using a content-based filtering approach. The system leverages FastAPI for backend integration, allowing users to interact with the recommendation engine via an API. This project aim

content-based-recommendation data-analysis data-science data-visualization fastapi gen-ai instacart-data jupyter-notebook open-ai python3 retail scikitlearn-machine-learning stream

Last synced: 07 May 2026

https://github.com/shubhamgoyal575/credit-card-fraud-detection

📌 Credit Card Fraud Detection using Machine Learning: This project focuses on detecting fraudulent credit card transactions using machine learning models like Random Forest, XGBoost, and Deep Learning. The dataset is preprocessed to handle class imbalance, and multiple models are evaluated based on ROC AUC Score and F1 Score.

adaboost-classifier artificial-neural-networks credit-card-fraud data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis lightgbm machine-learning machine-learning-algorithms random-forest-classifer scikit-learn tensorflow xgboost

Last synced: 08 Feb 2026

https://github.com/nagar2nd/zomato-bangalore-analysis-tableau

Analysing restaurant data in Bengaluru to enhance customer satisfaction by optimizing the restaurant experience. The focus is on improving the popularity of different cuisines, enhancing delivery times, and boosting restaurant ratings. An interactive Tableau dashboard has been developed to help Zomato identify key areas for improvements.

data-analysis data-visualization tableau

Last synced: 05 Mar 2026

https://github.com/kimaruthagna/geodjango

This project introduces GeoDjango and the storage of spatial data in a database. Postgres was used in this project.

data-visualization donut-chart extension-postgis geodjango geomap graphos layers postgis postgresql-database python-json spatial-data

Last synced: 29 Oct 2025

https://github.com/sinsunsan/earth-survival-kit

Global warning data visualisation app to make everyone understand global warning and take actions that matter

angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts

Last synced: 05 May 2026

https://github.com/andrewzgheib/football-database-analysis

Football database utilizing PostgreSQL and Pandas for data management, with PowerBI for intuitive KPI visualization

data-analysis data-visualization database pandas pgsql postgr powerbi sql

Last synced: 04 Apr 2025

https://github.com/kameronbrooks/datalys2-reporting

Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.

data data-visualization html react

Last synced: 08 Apr 2026

https://github.com/lucasdota/bar_chart

Bar chart with JSON data fetching

d3js data-visualization fetch-api javascript json json-api

Last synced: 17 May 2026

https://github.com/moustafamohamed01/mall-customer-segmentation-data

Customer segmentation using K-Means clustering based on annual income and spending score.

data-science data-visualization k-means-clustering machine-learning python scikit-learn unsupervised-learning

Last synced: 08 May 2026

https://github.com/nathadriele/transaction_fraud_prevention_pipeline

A solution for detecting and preventing fraud in financial transactions, combining machine learning, business rules, and advanced statistical analysis. The system offers an interactive dashboard for real-time monitoring, data analysis, and fraud alert management.

data-analysis data-visualization docker fraud-prevention machine-learning matplotlib numpy pandas pipeline pytest python scikit-learn scipy seaborn streamlit tensorflow transaction xgboost

Last synced: 10 Apr 2026

https://github.com/ramyacp14/global-warming-analysis

Analyzes and visualizes global temperature data to identify climate change trends. The project includes global and seasonal temperature trend analysis, country-wise and city-specific temperature visualizations, and geospatial analysis with interactive maps and heatmaps.

data-visualization folium geospatial-analysis jupyter-notebook matplotlib numpy opencage-geocoder pandas plotly python seaborn

Last synced: 07 Apr 2026

https://github.com/shaheerazam-dev/cyclistic-case-study-google-data-analytics-certificate

This case study simulates the real-world experience of a junior data analyst at Cyclistic, a fictional company. We will leverage the data analysis process framework (Ask, Prepare, Process, Analyze, Share, Act) to address critical business questions and provide data-driven insights to guide strategic decision-making.

bigquery data-science data-visualization spreadsheet sql tableau

Last synced: 06 Feb 2026

https://github.com/jakobtroidl/barrio

A visual tool to compare and analyze nanoscale brain structures.

comparison data-visualization neuroscience scientific-visualization

Last synced: 09 Apr 2026