An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/singhdivyank/visualization

Wrangling NYPD data and visualising using graphs and maps in Python, Tableau, and R

data-visualization data-wrangling geopandas ggplot2 plotly pygwalker

Last synced: 13 Jun 2026

https://github.com/smahala02/materials-science-data-analysis

Analysis of diffraction and spectrum data in materials science using Python for data visualization and interpretation.

data-visualization diffraction-analysis materials-science python spectrum-analysis

Last synced: 18 May 2026

https://github.com/yusuf4030/the-data-analyst-toolkit

📊 Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.

budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis

Last synced: 18 May 2026

https://github.com/aidanv22/kagglecompetitions2024

These are coding projects I worked on during my 2024 Fall Semester. Each of these was ranked either in the 18pt or 20pt benchmark.

data-visualization data-wrangling dplyr embeddings feature-engineering feature-selection linear-regression logistic-regression models pca-analysis r xgboost

Last synced: 06 Apr 2025

https://github.com/lucs1590/triathlon-dashboard

This is a repository that shows some graphics and makes a dashboard related to triathlon data.

angular dashboard data-visualization data-viz graphs plotly plotly-dash plotlyjs storytelling triathlon

Last synced: 12 May 2026

https://github.com/dkoh2018/car_shopping

A car price analysis tool with brand comparisons, trend tracking, and interactive visualizations. Built with Python and Streamlit

automotive car-market data-visualization price-analysis price-tracker python streamlit web-scraping

Last synced: 18 May 2026

https://github.com/gui-sitton/games

Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/kevinwood15/python_twitter_datawrangling_project

The main objectives of this project is to wrangle (clean) and analyze twitter data. I deal with some messy data, clean it, then plot some visualizations of the data to analyze it.

cleaning-data data-science data-visualization python wrangling-data

Last synced: 18 May 2026

https://github.com/sayamalt/titanic-survival-prediction

Successfully developed a Logistic Regression model for predicting the survival of a passenger aboard the Titanic ship based on his/her various features such as gender, age, passenger class, no. of siblings, embarkation location, etc.

data-cleaning data-preprocessing data-visualization exploratory-data-analysis logistic-regression machine-learning sklearn

Last synced: 18 May 2026

https://github.com/adriangalvanzamora/ecommerce-analytics-olist

Data analysis project based on the Olist Brazilian E-Commerce dataset. Includes data cleaning, exploratory analysis, delivery performance metrics, customer satisfaction modeling, and geospatial insights. Built entirely in Python (Jupyter Notebook) using real-world data from Kaggle.

brazil customer-satisfaction data-analysis data-visualization ecommerce folium geospatial-analysis machine-learning matplotlib notebook pandas plotly python seaborn

Last synced: 06 May 2026

https://github.com/ahmedmmahrous/movie-recommendation-and-analysis

Perform analysis and Basic Recommendations based on Similar Genres and Movies which Users prefer.

data-visualization feature-engineering nu pan py recommender-system seaborn

Last synced: 03 Feb 2026

https://github.com/vaxdata22/zillow-rapid-api-end-to-end-etl-data-pipeline-by-airflow-on-ec2

This is an end-to-end AWS Cloud ETL project. This data pipeline orchestration uses Apache Airflow on AWS EC2 as well as AWS Lambda. It demonstrates how to build ETL data pipeline that would perform data transformation using Lambda function as well as loading into a Redshift cluster table. The data would then be visualized using Amazon QuickSight.

amazon-quicksight amazon-redshift apache-airflow aws-ec2 aws-lambda aws-s3 business-intelligence dags data-visualization etl-pipeline orchestration python3 rapid-api zillow-house-listings

Last synced: 19 May 2026

https://github.com/ramonanf/tc1002s_semanatec

Herramientas computacionales: El arte de la analítica

data-analysis data-visualization jupiter-notebook pandas-python

Last synced: 15 Jun 2025

https://github.com/amlanmohanty1/fannie-mae-borrower-behavior-and-characteristics-2007vs2019

Analysis using R and tidyverse to compare borrower behavior and characteristics between the years 2007 and 2019, focusing on key financial metrics such as credit scores, interest rates, debt to income ratios, and loan to value ratios.

data-visualization fannie-mae r tidyverse

Last synced: 13 Sep 2025

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice graph showing and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/srvcl/lung-cancer-survival-analysis

Data Cleaning of a dataset and Survival Analysis in R Language

data-analysis data-science data-visualization r survival-analysis

Last synced: 11 May 2026

https://github.com/franksunye/streamlitccdemo

A fully-featured Streamlit application demo, showing how to quickly deploy interactive web apps on Streamlit Community Cloud. Supports English, Chinese, and Spanish (i18n), including user interaction, file processing, data display, and database features.

data-visualization demo i18n streamlit

Last synced: 19 May 2026

https://github.com/jabulente/geospatial-data-visualizations-tanzania-s-administrative-geographic-and-socioeconomic-landscape

This repository showcases geospatial data visualizations focused on Tanzania's administrative boundaries, geographic features, and selected socioeconomic indicators. Using GeoPandas, Matplotlib, and other geospatial libraries, the project provides static and customizable maps of regions, districts, and population distributions.

ai data-science data-visualization geopandas geospatial-analysis geospatial-visualization machine-learning oops python tanzania tanzania-locations

Last synced: 29 Jul 2025

https://github.com/kimaruthagna/geodjango

the project introduces the aspect of geodjango and storing of spatial data in a database.Postgres was used in this project

data-visualization donut-chart extension-postgis geodjango geomap graphos layers postgis postgresql-database python-json spatial-data

Last synced: 29 Oct 2025

https://github.com/nagar2nd/zomato-bangalore-analysis-tableau

Analysing restaurant data in Bengaluru to enhance customer satisfaction by optimizing the restaurant experience. The focus is on improving the popularity of different cuisines, enhancing delivery times, and boosting restaurant ratings. An interactive Tableau dashboard has been developed to help Zomato identify key areas for improvements.

data-analysis data-visualization tableau

Last synced: 05 Mar 2026

https://github.com/sukhitashvili/pca_tutorial

PCA algorithm from scrach, using only matrix-vector multiplications

data-analysis data-science data-visualization machine-learning-algorithms pca

Last synced: 29 Mar 2025

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/analyticalnahid/seaborn-tutorial

A complete Notebook on Seaborn for Data Science

data-visualization seaborn seaborn-tutorial

Last synced: 23 Aug 2025

https://github.com/grascya/sleep-health_-lifestyle-dataset

Classifier to predict the presence of a sleep disorder based on the other columns in the dataset.

data-visualization exploratory-data-analysis joblib machine-learning-algorithms pickle python statistical-analysis

Last synced: 20 May 2026

https://github.com/otsaloma/pollen-chart

Helsinki pollen count visualization

data-visualization javascript lambda pollen python

Last synced: 17 Apr 2026

https://github.com/cudavailable/gdp-data-visualization

A data visualization project for GDP data

data-visualization gdp vue

Last synced: 20 May 2026

https://github.com/shivasairam1706/mlops-project1

End-to-end ML-Ops project using PySpark and AWS, covering environment setup, model training, deployment with data capture, execution, and analysis. CI/CD pipelines (AWS CodePipeline) and monitoring (CloudWatch) ensure automated deployment, performance tracking, and model retraining for production-ready ML solutions.

aws aws-lambda aws-s3 data-engineering data-science data-visualization delta-lake docker forcasting mlops-project pyspark unix-shell

Last synced: 20 May 2026

https://github.com/gui-sitton/carsells

In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 20 May 2026

https://github.com/mohamed-walied/customer-behavior-analysis-using-r

Customer Behavior Analysis project utilizing the "Groceries Market Basket Dataset" from Kaggle. The project employs a data-driven approach to uncover customer purchasing patterns and relationships within the grocery market using K-means Clustering and Association Rules using Apriori-Algorithm. In collaboration with some friends.

apriori-algorithm association-rule-learning dashboard data-cleaning data-visualization k-means-clustering r-programming-language

Last synced: 26 Jul 2025

https://github.com/c2r0b/2q

Manage data and relationships with AI

data-visualization graphql relationships rust tauri

Last synced: 09 Apr 2026

https://github.com/catalina2820/inteligencia-de-negocios

This repository contains materials and resources for the Business Intelligence course. It includes notes, workshops, and practical exercises that cover essential concepts and applications in data science, data visualization, machine learning, and big data.

bigdata data-cleaning data-science data-visualization web-scraping

Last synced: 04 Apr 2025

https://github.com/anergictcell/esbmeplots

An extension of the D3.js library for fast and flexible generation of basic plot types

d3js data-visualization javascript plotting

Last synced: 13 Jun 2026

https://github.com/maruf-hossen/kaggle-projects-and-learning

Comprehensive data science learning journey through Kaggle courses and exercises. Documenting progress in SQL, Python, ML, and data visualization with practical projects and business applications.

business-intelligence data-cleaning data-science data-visualization kaggle learning-journey machine-learning pandas python sql

Last synced: 05 May 2026

https://github.com/hemant-kumar786/heart-disease-prediction

Heart Disease Analysis project in RStudio using statistical methods and data visualization. Includes data cleaning, exploratory data analysis (EDA), correlation study, and insights on key health indicators influencing heart disease.

correlation-study data-analysis data-visualization eda healthcare heart-disease r rstudio statical-analysis

Last synced: 02 Nov 2025

https://github.com/mondial7/echarts-wc-import

Webcomponent to import Echarts library - current version echarts 3.8.5

data-visualization echarts3 polymer2 webcomponents

Last synced: 03 May 2026

https://github.com/errea/vet_clinic_database

For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.

data data-analysis data-structures data-visualization database

Last synced: 21 May 2026

https://github.com/ivangrana/data-visualization

Data visualization repository made with Chart.Js, D3,Plotly and Rstudio

d3js data-visualization

Last synced: 20 Jul 2025

https://github.com/arekflo2002/analiza_danych-rstudio-_dyskryminacja_kobiet

Wykorzystując rstudio oraz zestawy dane ze strony https://www.gapminder.org/data/ badam tematykę dyskrminacjii kobiet na poszczególnych kontynentach i wyciągam odpowiednie wnioski

data data-preparation-and-analysis data-visualization rstudio statistics

Last synced: 14 Apr 2025

https://github.com/muneeb706/nei_pm2.5_data_analysis

Exploratory Data Analysis of PM2.5 Emission records from EPA National Emission Inventory

data-visualization exploratory-data-analysis r-programming

Last synced: 27 Jun 2025

https://github.com/balajimohan18/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce damages by accidents & calamities.

data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-visuals powerpoint-slides

Last synced: 08 Mar 2026

https://github.com/gustavo-b-morales-s/footwear-market-data-pipeline

This project uses Python, Scrapy, SQLite3 and Streamlit to extract sports shoe data from Mercado Livre, perform transformations using Pandas, store the data in an SQLite database and create a data visualization interface emphasizing the main KPIs using Streamlit.

data-engineering data-visualization etl-pipeline webscraping

Last synced: 06 Apr 2025

https://github.com/abhishekyadav915/diwali_sales_analysis

This project aims to analyze sales data during the Diwali festival using Python. The analysis focuses on identifying key trends, customer purchasing behavior, and sales performance across different segments. By leveraging data visualization and statistical analysis, we uncover insights.

data-analysis data-visualization matplotlib-pyplot numpy-library pandas-dataframe seaborn-python

Last synced: 05 Apr 2025

https://github.com/thesfinox/mltools

A collection of simple tools for data science and machine learning projects.

ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox

Last synced: 14 May 2025

https://github.com/rizz1406/crypto-price-tracker

A user-friendly Crypto Price Tracker built using Python and Streamlit. Track top 20 cryptocurrencies in INR and USD, visualize historical trends, and set up price alerts easily.

coingecko-api crypto-tracker cryptocurrency cryptocurrency-prices data-visualization pandas price-alerts python streamlit

Last synced: 11 May 2026

https://github.com/hfagerlund/machine-learning-iris-analysis

No longer maintained. Moved to https://github.com/hfagerlund/machine-learning-classifier-iris/.

data-visualization jupyter-notebook machine-learning python37

Last synced: 22 Jul 2025

https://github.com/sahilmaurya28/youtube-data-analysis

YouTube Data Analysis using Python — uncovering trends, engagement patterns, and correlations between likes, comments, views, and categories to understand what drives content success.

analysis data-analysis data-visualization matplotlib-pyplot numpy pandas portfolio-project python seaborn youtube

Last synced: 13 Apr 2026

https://github.com/sreekar0101/-movie-recommendation-system-using-python

The Movie Recommendation System is designed to suggest personalized movie recommendations by analyzing extensive datasets containing movie details and credits.ultilizes python libraries numpy pandas and scikit learn.The system achieved a 15% improvement in accuracy compared to the baseline model by identifying key factors that influence user choice

data-analysis data-visualization numpy-library pandas-dataframe scikit-learn seaborn-python

Last synced: 02 Jan 2026

https://github.com/jarif87/pokeinsights

A Selenium-Powered Data Scraping and Tableau Visualization Project

data-visualization python scraping selenium tableau

Last synced: 21 May 2026

https://github.com/rudra-g-23/power-bi-custom-visual

A custom Power BI visual that displays a customizable, interactive charts with advanced capabilities.

custom-visuals data-analysis data-visualization dax powerbi powerbi-custom-visuals svg visualization

Last synced: 02 Jan 2026

https://github.com/tushar2704/employee-distribution

This repository contains valuable insights and visualizations derived from an extensive HR dataset spanning from 2000 to 2020, with over 22,000 rows.

data-analysis data-visualization excel postgresql powerbi sql tushar2704

Last synced: 04 Nov 2025

https://github.com/tamanna2005/streamlit-crime-dashboard

A Streamlit-based personal project that visualizes crime data in Pittsburgh through an interactive dashboard, focusing on data storytelling and insightful exploration.

crime-data data-analysis-project data-visualization eda interactive-dashboard python streamlit

Last synced: 28 Jun 2025

https://github.com/vedikasnehil/sql-50

This project focuses on solving 50 SQL problems every weekend from LeetCode to strengthen SQL skills, master advanced techniques, and build consistency. Each solution is documented with clear explanations, creating a valuable resource for learning and application.

data-visualization database-management sql

Last synced: 06 Jan 2026

https://github.com/leosolar8/stock-price-prediction-ai-model

This project shows how to use a special type of AI called Long Short-Term Memory (LSTM) to predict stock prices. The project is split into two main parts: Training the AI Model and Making Predictions (Inference)

ai csv-dataset data-science data-visualization deep-learning finance financial-data forecasting keras lstm machine-learning python rnn stock-market stock-prediction tensorflow time-series time-series-forecasting

Last synced: 08 Apr 2026

https://github.com/sanjana-bongale/cancer_survival_data_analysis_and_prediction_using_logistic_regression

This project performs data analysis using Python to predict cancer patient survival outcomes. It involves data cleaning, exploratory analysis, and visualizations to explore factors like cancer type, stage, and treatments. A logistic regression model is built to predict patient survival based on demographic and medical data.

data-analysis data-cleaning data-science data-visualization eda jupyter-notebook kaggle logistic-regression machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn seaborn

Last synced: 08 Apr 2026

https://github.com/sayamalt/fake-news-classification-using-fine-tuned-bert

Successfully developed a text classification model to predict whether a given news text is fake or not by fine-tuning a pretrained BERT transformed model imported from Hugging Face.

bert-embeddings bert-model data-analysis data-visualization deep-learning fine-tuning-bert model-evaluation model-training-and-evaluation text-classification text-preprocessing text-tokenization tokenizer-nlp wordcloud-visualization

Last synced: 05 Apr 2025

https://github.com/bala-1409/rafik-s-kitchen-data-analysis

The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 06 May 2026

https://github.com/jocelynvelarde/embraceplus-visualizer

Visualize your raw data from .avro files for the EmbracePlus device from Empatica

avro-schema csv-files data-visualization empatica health-monitoring monitoring-tool python streamlit

Last synced: 14 May 2026

https://github.com/rfonod/tableau-dashboard

A Tableau dashboard visualizing the change in the Summary Innovation Index (SII) from 2012 to 2019, relative to selected academic research trends in Europe. It includes a bar chart for country comparisons, a scatter plot for trend analysis, and a map to show geographic patterns, with interactive features for enhanced insights.

dashboard data-visualization innovation-index tableau visualization

Last synced: 25 Jan 2026

https://github.com/constantinoschillebeeckx/meowcow

Quickly visualize multidimensional data.

d3js data-visualization nvd3 viz

Last synced: 28 Jun 2025

https://github.com/noor188/preswald-data-app

A data app to visualize and manipulate the graduate admission dataset

data-analysis data-visualization open-source

Last synced: 04 Jul 2025

https://github.com/muthukumar0908/cardekho_used_car_price_prediction

The project aim is to build a machine learning model that offers users to find current valuations for used cars.

data-analysis data-visualization datacleaning eda machine-learning python streamlit

Last synced: 30 Mar 2025

https://github.com/tolumie/web-scraping-rest-api-stock-data-operations

Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.

api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping

Last synced: 19 May 2026

https://github.com/netesf13d/expt-sequence-analysis

Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.

cold-atoms data-analysis data-visualization optical-tweezers

Last synced: 24 Jul 2025

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 09 Apr 2026

https://github.com/armahdavi/analytics_statistics_ML_plotting_dust_extraction_hvac_filters_ph2

PhD Technical Paper 1 - Phase 2 - Mahdavi & Siegel (2020) (Aerosol Science & Technology; AS&T) - Sharing all the data pipelines, processing codes, descriptive statistics, statistical modellings, and plotting/visualizations - Project Miestone: 2017 - 2020 - Full-length article is available

data-pipelines data-science data-visualization machine-learning matplotlib-pyplot numpy pandas-dataframe python scipy-stats sklearn statistics

Last synced: 17 Sep 2025

https://github.com/andersoncrs/clasificacion-propina-restaurante

Este informe desarrolla, de manera clara y práctica, un análisis completo del conocido conjunto de datos de propinas (tips), mostrando paso a paso cómo transformar la información cruda en modelos predictivos útiles.

clasification data-analysis data-visualization tips

Last synced: 26 Jul 2025

https://github.com/kushalagarwalla/netflix-movie-data-analysis

🚀 Netflix Data Analytics Project 🎬📊 | Analyzed 9K+ movies to uncover insights on genres, popularity, votes & release trends. Includes EDA, KPIs & visualizations using Python (Pandas, NumPy, Matplotlib, Seaborn). Supports data-driven content & engagement strategy.

data-analysis data-visualization jupyter-notebook numpy pandas python seaborn

Last synced: 06 May 2026

https://github.com/vitor-ace/sunspots-data-analysis

This is a Jupyter Notebook which works with Data Analysis logic and libraries implementation with Python.

data-analysis data-visualization debbuging error-handling file-handling matplotlib-pyplot numpy pandas python

Last synced: 06 May 2026

https://github.com/syncfusionexamples/how-to-add-arrows-to-the-chart-axis-in-.net-maui-chart

Learn how to enhance MAUI charts by adding arrows to the chart axes using annotations for improved visualization and clarity.

axis-with-arrows chart-annotations chart-customization charting-library charts data-visualization line-annotation maui-charts

Last synced: 26 Jul 2025

https://github.com/ornl/covid19vis

Visualizations of COVID-19 case data

data-visualization scientific-visualization

Last synced: 03 Jan 2026

https://github.com/leandrocollares/population-in-dutch-provinces

A responsive bar chart showing the population of Dutch provinces

d3 data-visualization svelte

Last synced: 16 Apr 2026

https://github.com/shreedata/covid-da-dasboard-using-powerbi

This repository showcases a PowerBI dashboard focused on visually representing COVID-19 data for Indian states and Union Territories in an easily understandable way. The dataset is sourced from Kaggle.

data-cleaning data-visualization datanalaysis microsoft microsoft-powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/swethajoseph/statistical-stock-performance-analysis

Conducted a statistical analysis of Microsoft, Tesla, and Apple stock performance compared to the S&P 500, examining price trends, volatility, and correlations to derive investment insights.

advancedexcel comparative-analysis data-analysis data-visualization datapreparation descriptive-statistics moving-average msexcel performance-analysis performance-metrics regression-analysis statistical-analysis

Last synced: 03 Jan 2026

https://github.com/yash22222/literacy-exploration-analysis

Delve into India's literacy landscape through data analysis. Uncover regional disparities, high/low literacy states & gender imbalances.

csv data-analysis data-visualization government-data india literacy literacy-analysis states

Last synced: 29 Jul 2025