An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/nadahamdy217/skincaresentinel

This project analyzes customer feedback for skincare products by predicting sentiment using an unsupervised model. It includes a web application for real-time sentiment analysis, an ETL pipeline built with Azure Data Factory, Azure Databricks, and Azure Synapse Analytics, and a Power BI dashboard for visualizing review trends.

azure customer-feedback data-engineering data-science data-visualization database databricks etl-pipeline flask machine-learning powerbi python sentiment-analysis synapse-analytics unsupervised-learning web-application

Last synced: 07 Apr 2026

https://github.com/saikiran76/titanicdata-analysis-eda

In this notebook, we're going to analyse the famous Titanic dataset from Kaggle. The dataset is meant for supervised machine learning, but we're only going to do some exploratory analysis at this stage. We'll try to answer some questions using metrics and EDA.:

analysis data-science data-visualization eda python

Last synced: 19 May 2026

https://github.com/vincent-tran-94/Dataviz_Tweets_ChatGPT

Une application Streamlit pour analyser et visualiser les données et les tweets sur la sortie de ChatGPT. Ce projet comprend la gestion des données, l'analyse des sentiments, les tendances émergentes et les applications potentielles de ChatGPT.

data-management data-visualization sentiment-analysis streamlit text-mining twitter

Last synced: 10 Aug 2025

https://github.com/sayamalt/concrete-strength-prediction

Successfully developed a machine learning model which can accurately predict the strength of cement based on various features such as blast furnace slag, water, coarse aggregate, etc.

cross-validation data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/sayamalt/superstore-sales-prediction

Successfully established a machine learning model that can accurately predict the sales of a superstore based on various features such as quantity, profit, discount, postal code, etc. The features are mainly associated with order details and customer demographics.

azure-machine-learning azure-web-app-service cicd-deployment cross-validation data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering github-actions-ci-cd hyperparameter-tuning machine-learning model-deployment model-retraining model-testing model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/sayamalt/quora-duplicate-question-pairs-identification

Successfully developed a machine learning model which can accurately detect whether any given pair of Quora questions are duplicate or not.

data-visualization machine-learning natural-language-processing nltk paraphrase-detection text-preprocessing

Last synced: 09 Nov 2025

https://github.com/hemangsharma/hotel-revenue-booking-analysis

This project provides a comprehensive revenue and reservation analysis for Highfield Hotel using historical data exported from booking systems and internal revenue reports. The goal is to derive actionable insights to improve room profitability, understand booking patterns, and support data-driven decision-making.

analysis data-analysis data-visualization hotel

Last synced: 10 Aug 2025

https://github.com/ddihora1604/iitk_task

A comprehensive financial data analysis system that collects, processes, and analyzes data from approximately 500 tickers in the S&P Global Index. It provides detailed financial information, ESG metrics, and various financial statements for comprehensive market analysis.

beautifulsoup4 data-analysis data-visualization datamodelling dataset esg machine-learning python yahoo-finance

Last synced: 29 Oct 2025

https://github.com/nafisrayan/decentai

A comprehensive platform built using ReactJS and Flask, combining blockchain technology with AI to create a secure and intelligent space for community engagement and policy discussions. Leverages NLP and LLM for meaningful interactions and sentiment analysis while ensuring data security and user privacy.

chatbot data-analysis data-visualization flask gemini gemini-ai gemini-ai-chatbot gemini-api government government-tech llm mongodb nlp polls python react tailwind voting-systems winknlp

Last synced: 12 Apr 2026

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 09 Apr 2026

https://github.com/nadamarei/data-analyzer

The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns

data-analysis data-visualization python-3 streamlit

Last synced: 18 May 2026

https://github.com/andersoncrs/analisis-de-texto-tweets

En este proyecto exploro el análisis de texto de tweets para descubrir tendencias, opiniones y temas relevantes en redes sociales. Usando herramientas de procesamiento de lenguaje natural, convierto grandes volúmenes de mensajes en información clara y visualmente atractiva.

data-analysis data-visualization eda text-mining

Last synced: 21 Jul 2025

https://github.com/kylemit/livedataisbeautiful

A casual attempt at data visualizations

data-visualization highcharts

Last synced: 20 May 2026

https://github.com/0xhericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 09 Feb 2026

https://github.com/dantasl/map-covid-brazil

A map from Brazil for COVID-19 confirmed cases and deaths powered by Google Charts API.

covid-19 data-visualization google-charts map

Last synced: 11 Aug 2025

https://github.com/rajkumargara/bike_rental_data_analysis

Chicago bike rental data analysis for business insights using R programming

data-analysis data-visualization data-wrangling large-dataset machine-learning-algorithms

Last synced: 11 Aug 2025

https://github.com/erayagdogan/simplecharts

Simple Charts is a chart maker compose app with material 3 design. Charts are created using the lets-plot-compose library.

android android-app charts data-analysis data-visualization jetpack-compose lets-plot-kotlin material-3 viewmodel

Last synced: 11 Aug 2025

https://github.com/riyanshibariyaa/Vehicle-Emission-Analysis_MACHINE_LEARNING_

Vehicle Emissions Analysis This project focuses on analyzing vehicle emissions data using various machine learning techniques. The dataset used for analysis contains information about vehicle emissions, including engine size, CO2 emissions, transmission type, smog level, and fuel consumption.

artificial-intelligence data-visualization exploratory-data-analysis feature-engineering linear-regression machine

Last synced: 12 Aug 2025

https://github.com/ashish-kr-srivastava/olympic-games-eda---python

About Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.

data-visualization datacleaning eda matpotlib numpy pandas python seaborn seaborn-python

Last synced: 09 Apr 2026

https://github.com/mindlessmuse666/eda-pandas

Проект по разведочному анализу данных (EDA) о пассажирах Титаника с использованием библиотеки Pandas. Включает в себя загрузку данных, предобработку, статистический анализ, визуализацию и создание сводных таблиц. Цель проекта - демонстрация основных методов и инструментов EDA для анализа и понимания данных.

data-analysis data-processing data-science data-visualization eda exploratory-data-analysis matplotlib pandas python titanic

Last synced: 18 Apr 2026

https://github.com/r12habh/canada-imigration-data-analysis

Dataset: Immigration to Canada from 1980 to 2013 - International migration flows to and from selected countries - The 2015 revision from United Nation's website. (Cognitive Class Data Analysis with Python)

canada data-analysis data-science data-visualization datascience python python3

Last synced: 23 May 2026

https://github.com/itskshitija/tesla-stock-price-prediction

Welcome to the Tesla Stock Price Forecasting project, where we delve into time-series analysis to predict stock price trends for one of the world's most innovative companies—Tesla Inc.

data-visualization eda python time-series-analysis

Last synced: 12 Aug 2025

https://github.com/nathanaelmutua/british-airways-data-science-challenge

My solutions for the Forage program: web scraping, data cleaning, analysis, and visualization to extract business insights. Demonstrating practical data science skills for real-world problem-solving.

british-airways british-airways-virtual-program data-science data-visualization dataanalysis forage internship-project internship-task jupyter-notebook python sentiment-analysis webscraping

Last synced: 12 Aug 2025

https://github.com/fbarffmann/cryptoclustering

Clustered over 100 cryptocurrencies using K-Means and PCA to identify market patterns. Optimized clustering retained 89.5% explained variance.

clustering crypto-analysis data-visualization hvplot k-means machine-learning pandas pca python sklearn

Last synced: 09 Apr 2026

https://github.com/farhad-here/adventureworks_interactive_sales_dashboard_powerbi

An interactive Power BI dashboard for Adventure Works sales team to analyze performance, customers, products, and employees. Includes data cleaning, data modeling, DAX measures and advanced visualization features.

business-intelligence chart csv data-analysis data-cleaning data-cleaning-and-preprocessing data-visualization dax powerbi

Last synced: 13 Aug 2025

https://github.com/siddhant4srivastava/numeric-and-visual-summary

Exploring Data with Numeric and Visual Summaries of a Bank Loan Dataset

data-science data-visualization

Last synced: 10 Nov 2025

https://github.com/vit0r/trino-datavirtualization

POC trino - some catalogs, mariadb,postgresql,mongodb and minio

data-visualization

Last synced: 07 Mar 2026

https://github.com/radhikareddy-chintareddy/big-data-analysis-ny-weather-air-quality-2022

End-to-end workflow showcasing database setup, API development, and interactive data retrieval of large datasets. Includes integration and analysis of 2022 SURFACE HOURLY weather data (global, US, and NY) merged with NY air pollution data from the EPA to uncover actionable insights.

big-data-analytics data-integration data-visualization flask-restful jupyter-notebook pymysql python

Last synced: 18 May 2026

https://github.com/cosmoduende/r-ggcats

StrangeR things: Adding… Cats? to your plots on R. How to analyze and visualize data with the help of funny cats with the ”ggcat” package.

data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio

Last synced: 22 Jul 2025

https://github.com/oiricaud/angularjs

Obtain the values from the JSON variables to display area, bar and linear graphs.

data-visualization graphs

Last synced: 09 Mar 2026

https://github.com/mukeshlilawat1/netflix-data-visualization

Netflix Data Visualization – This project explores the Netflix dataset using Pandas for data manipulation and Matplotlib for creating meaningful visualizations. It highlights trends in movies and TV shows, distribution by release year, ratings, duration, and categories, making the data easy to understand through graphical insights.

data-visualization matplotlib pandas pip python

Last synced: 09 Apr 2026

https://github.com/itsachrafmansari/moroccan-real-estate-analysis

Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.

api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping

Last synced: 13 Aug 2025

https://github.com/raghul-m/stock-price

Simple Stock Price App Using Streamlit and Yfinance

data-science data-visualization streamlit-webapp yfinance-library

Last synced: 04 Oct 2025

https://github.com/bretsw/eme6356-ss25-module5

Slide deck for EME6356, Module 5: Data Visualization (Spring 2025)

analytics data-analytics data-visualization slides

Last synced: 09 Mar 2026

https://github.com/alexeyraspopov/vizmath

Set of useful functions for data visualization

data-visualization

Last synced: 04 Oct 2025

https://github.com/cyprianfusi/new-york-city-public-schools-and-sat-scores

One of the most controversial issues in the U.S. educational system is the efficacy of standardized tests and whether they're unfair to certain groups. We could correlate SAT scores with factors like race, gender, income, and more.

data-analysis-python data-cleaning data-visualization data-wrangling

Last synced: 21 Mar 2025

https://github.com/cyprianfusi/uk-covid-19-data-via-opendata-api

With recommendation to the UK government to halt all mandatory testing! Tests should only be conducted on patients as part of diagnosis and treatment. This is because with low prevalence of the disease most positive test results are false positives. This is due to irreducible error in the test.

api covid-19 data-visualization pandas-python uk

Last synced: 21 Mar 2025

https://github.com/piras-s/tuningcurvesnestedbayesianinference

Bayesian inference of neural tuning curves using nested sampling (PyMultiNest), with theory, simulation, and diagnostic visualizations.

bayesian-inference data-visualization machine-learning model-evaluation nested-sampling neuroscience pymultinest python3 simulation

Last synced: 18 May 2026

https://github.com/bessouat40/photo-gallery-viewer

This project is a photo gallery app 🎨 It leverages a CLIP model for powerful image search based on text keywords. You can easily filter through your images using AI-driven queries!

artificial-intelligence data-visualization elasticsearch embeddings image-gallery image-search mvc-architecture offline photo-gallery python

Last synced: 09 Apr 2026

https://github.com/aathithya-shanmuga-sundaram/cyber-threat-intelligence-dashboard

Interactive Streamlit dashboard for visualizing and analyzing cyber threats, featuring real-time data insights, severity classification, geolocation mapping, and customizable dark-cyber UI.

cyber-threat-intelligence cyber-threat-tool cybersecurity cybersecurity-tools data-visualization dataset numpy pandas plotly python streamlit threat-intelligence

Last synced: 10 Nov 2025

https://github.com/cyprianfusi/world-happiness-report-for-2015-2019

World Happiness Report for 2019 with strange and unexpected results for Sub-Sahara African Countries! But it's data speaking...

data-visualization pandas-python

Last synced: 21 Mar 2025

https://github.com/aman-codde/credit-card-analytics

A full-stack dashboard for credit card users to analyze spending, track rewards, and download statements securely.

analytics dashboard data-visualization express fullstack jwt-authentication mongodb nodejs react recharts tailwindcss

Last synced: 09 Apr 2026

https://github.com/un-ocha/ai-prototypes

Exploratory AI‑assisted prototypes and visualisations. For learning and demonstration only – not for operational or decision‑making use.

ai data-visualization experimentation humanitarian prototypes

Last synced: 25 Jun 2026

https://github.com/ianjure/average-precipitation-map

A 3D data visualization of average precipitation using R.

data-visualization philippines r

Last synced: 16 Aug 2025

https://github.com/easonlai/eda_for_hk_covid19

This is a code sample of Exploratory Data Analysis (EDA) for COVID-19 cases in Hong Kong. Data is obtained from official data.gov.hk.

covid-19 covid19-data data-analytics data-science data-visualization eda matplotlib omicron pandas python python3 seaborn

Last synced: 09 Apr 2026

https://github.com/matthewandretaylor/csc207project

Forest Visualization 3D MVC. Using Data collected from the city of Kitchener

3d-graphics data-visualization

Last synced: 19 Apr 2026

https://github.com/ashraf-khabar/bank-marketing-data-analysis

This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.

data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn

Last synced: 09 Apr 2026

https://github.com/nitheshgoutham/phonepe-pulse-data

Phonepe Pulse Data Visualization and Exploration: A User-Friendly Tool Using Streamlit and Plotly

data-science data-visualization plotly python sql streamlit

Last synced: 09 Apr 2026

https://github.com/supsi-deass-cpps/multilingual_thematic_analysis

Modular R pipeline for multilingual survey analysis — translate, embed, cluster, and visualize open-ended responses using Google Cloud and tidyverse tools.

clustering data-visualization linguistics multilingual-analysis natural-language-processing qualitative-research r reproducible-research social-science survey-data text-mining thematic-analysis translation

Last synced: 04 Oct 2025

https://github.com/chandkund/loan-eligibility-prediction

This project is designed to predict the eligibility of loan applicants based on various factors such as income, credit history, and marital status. By analyzing historical loan application data, the model helps to determine whether a loan application should be approved or not.

data-analysis data-science data-visualization machine-learning-algorithms matplotlib numpy pandas python seaborn

Last synced: 09 Apr 2026

https://github.com/kplanisphere/analysis-of-political-texts

Analysis and Classification System of Political Texts Using Natural Language Processing - Final Project for the Information Retrieval Course

classification-model data-science data-visualization deep-learning information-retrieval machine-learning natural-language-processing neural-network nlp nlp-machine-learning

Last synced: 05 Oct 2025

https://github.com/guilherme-marcello/r-data-analysis-barplots

Reading RDS files, processing and presentation in bar plots

bar-plot data-visualization r

Last synced: 05 Oct 2025

https://github.com/gfav-cybergeek/prodigy_ml_01

A linear regression model to predict house prices based on square footage, number of bedrooms, and bathrooms. Includes feature engineering, preprocessing, and model evaluation.

ai airtificialintelligence algorithms algorithms-and-data-structures data-structures data-visualization jupyter jupyter-notebook jupyterlab machine-learning machine-learning-algorithms machine-learning-models python

Last synced: 05 Apr 2025

https://github.com/harshindcoder/online_retail_data_clustering_project

This marketing analytics project uses RFM (Recency, Frequency, Monetary) features for customer classification, inspired by the online retail mining paper. The RFM model helps segment customers, identify high-value ones, and optimize marketing strategies.

customer-segmentation data-analysis data-visualization market-analytics

Last synced: 17 Aug 2025

https://github.com/kowshik24/predictstock

🚀 StockSage: Predicting Tomorrow's Stocks, Today! 🌌 Dive deep into the future of stock prices with StockSage! Powered by LSTM networks, this repository is a treasure trove for those looking to explore the intricacies of stock price predictions. 📈✨ 🔗 Live App: https://stocksage.streamlit.app/

data-science data-visualization deep-neural-networks lstm stock-market streamlit tensorflow

Last synced: 18 Mar 2026

https://github.com/allanotieno254/powerbi-sales-dashboard

This repository provides a step-by-step guide to building a Power BI sales dashboard. It includes sample data, DAX measures, and visual examples

data-analytics data-visualization dax-expression power-bi sales-analysis

Last synced: 03 Jan 2026

https://github.com/subhadipsinha722133/credit-card-fraud-dection

Web application for detecting fraudulent credit card transactions using machine learning

data-visualization fraud-detection machine-learning matplotlib numpy pandas seborn sklearn streamlit

Last synced: 10 Apr 2026

https://github.com/saravanansuriya/phonepe-pulse-data-visualization-and-exploration

Creating a dashboard by using streamlit application. In this app visualizing the data taken from Phonepe pulse Github repository.

data-visualization github-cloning mysql-database pandas-dataframe plotly-express python streamlit-webapp

Last synced: 10 Apr 2026

https://github.com/davidzajac1/four-percent-rule-pandas-analysis

Analysis of the 4% Personal Finance Rule of Thumb

data-analysis data-visualization pandas python

Last synced: 20 Apr 2026

https://github.com/1dagord/spectrogram

Generates a spectrogram based on live audio input or a .wav file

audio-analysis data-visualization fft-analysis python python3 spectrogram

Last synced: 18 Aug 2025

https://github.com/mvinyard/vinplots

Michael E. Vinyard's python plotting assistant

data-visualization plotting python

Last synced: 05 Oct 2025

https://github.com/tclzcja/china-greenhouse-gas-mitigation

This is an environmental data visualization project that shows how much greenhouse gas China can decrease from emission by switching to natural gas energy from coal.

client-project data-visualization

Last synced: 05 Oct 2025

https://github.com/jpgiant/nyc_energy_prediction

A comprehensive code for predicting energy usage in NYC using Machine Learning Algorithms.

data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/izhaan0/predict-marks-based-on-study-hours

Student Marks Predictor is a machine learning project that predicts a student’s exam scores based on the number of study hours. It uses Linear Regression to learn the relationship between study hours and marks, and provides both command-line and interactive Streamlit web interfaces for prediction and visualization.

data-visualization joblib jupyter-notebook machine-learning machine-learning-algorithms matplotlib-pyplot numpy pandas pandas-dataframe pickle python scikit-learn seaborn

Last synced: 05 Apr 2026

https://github.com/hanannazri/predictive-customer-churn-modelling-with-price-sensitivity-insights-

As a part of BCG Data Science Project, I developed a predictive churn-risk model for XYZ energy utility by engineering price, consumption and contract features and training XGBoost and Random Forest models to identify customers most likely to churn at current prices.

bcgx data-visualization exploratory-data-analysis feature-engineering jupyter jupyter-notebook model-training-and-evaluation numpy pandas random-forest scikit-learn xgboost

Last synced: 19 Aug 2025

https://github.com/palakjainanalyst/ecommerce-customer-spending-analysis

An end-to-end Ecommerce analytics project uncovering customer spending trends using Excel, Python, SQL, and Power BI. From raw data to interactive dashboards, this project delivers deep insights on spending patterns, high-value customer segments - showcasing a complete data-to-decisions workflow.

data-analysis data-visualization database ecommerce excel jupyter-notebook powerbi python spending sql

Last synced: 06 May 2026

https://github.com/shadz23/smart-energy-dashboard

Power BI dashboard analyzing household electricity consumption to reveal usage patterns, peak hours, and estimated costs for smarter energy management and reduced bills. 🐙

chart data-analysis data-visualization dax energy-consumption hs110 hs300 ibm ibm-cloud influxdb jupyter-notebook kasa kp115 linuxone observability photovoltaics-dashboard plotly sense

Last synced: 19 Aug 2025

https://github.com/rahmamohammad/retail_project

Retail & Data analytics: KPIs, sales trends, Excel planning pack, forecasting & inventory tracking.

data-analysis data-visualization ecommerce excel jupyter-notebook matplotlib python retail-analytics storytelling

Last synced: 17 May 2026

https://github.com/marcellobb/cars-eda

🚘 apply exploratory data analysis to a car dataset

data-visualization jupyter-notebook

Last synced: 19 Aug 2025

https://github.com/ganesh774218/students-health-predictor

An end-to-end machine learning project that applies logistic regression to identify students at risk of depression based on demographic, academic, and lifestyle features. This repository includes data preprocessing, feature engineering, model training, evaluation metrics, and visualizations to provide actionable insights.

data-science data-visualization jupyter-notebook logistic-regression machine-learning machine-learning-algorithms python regression-models

Last synced: 19 May 2026

https://github.com/thytranx/datalens

Datalens aims to enable interactive 3D exploration of complex, multi-dimensional datasets. Designed for intuitive usability, it is accessible to users without advanced programming skills or specialized hardware.

3d-visualization data-visualization glfw3 opengl

Last synced: 18 Apr 2026

https://github.com/alvitachen/breathe-retrospective

A dashboard that visualizes AQI and PM job openings across target cities.

aqi beginner charts data-visualization personal-project react vite

Last synced: 10 Apr 2026

https://github.com/niniola-creator/niniola-creator

This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.

bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql

Last synced: 07 Apr 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/sreyashidey/scrape-analyze-visualize

A project for web scraping, data analysis, and visualization using Selenium, BeautifulSoup, and Python.

bs4 data-visualization selenium

Last synced: 03 May 2026