An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/divyashah0510/sales-insights-for-retail-company

This project is a data analysis project for a retail company. The company has dataset: sales_data.csv that contains the sales data for the company. The objective of this project is to analyze the sales data and provide insights to the company to improve their sales.

dash data-visualization pandas plotly sales sales-insights streamlit visualization

Last synced: 02 Jan 2026

https://github.com/bhavinpatel4199/machine-learning-framework

This repository, showcases various projects that explore key concepts in both supervised and unsupervised learning, with a focus on real-world applications. The projects utilize a range of machine learning techniques, including data preprocessing, feature selection, exploratory data analysis (EDA), and model optimization.

classification clustering data-science data-structures data-visualization exploratory-data-analysis machine-learning machine-learning-algorithms machine-learning-models pandas-dataframe predictive-modeling preprocessing-data sklearn supervised-learning unsupervised-learning

Last synced: 20 Jan 2026

https://github.com/lut-ful/e-commerce-sales-report

This dashboard provides a visual analysis of e-commerce sales data

data data-analytics data-science data-visualization power-bi statics

Last synced: 28 Jun 2025

https://github.com/timjjting/escaping-flatland-slides

Slides for techniques behind escaping flatland

data-visualization glsl lod octree threejs

Last synced: 14 May 2025

https://github.com/fazej99/u.s-climate-and-temperature-analysis

This project analyzes historical temperature trends in the U.S., explores their economic impacts, predicts future changes using machine learning, visualizes regional anomalies with GIS, and presents findings through a secure and interactive Streamlit dashboard.

data-analysis data-science data-visualization gis machine-learning streamlit

Last synced: 22 May 2026

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/sweta-kaundilya/adventureworks-cycles-powerbi-project

This project was completed to simulate real-world tasks that data professionals encounter every day on the job.

dashboarddesign data-visualization datamodeling dataprep dax exploratory-data-analysis powerbi powerquery

Last synced: 08 Mar 2026

https://github.com/bala-1409/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.

acf adf arima-model data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms pacf python python3 sarimax-model seasonality seasonality-analysis time-series time-series-forecasting trends

Last synced: 27 Apr 2026

https://github.com/bala-1409/rafik-s-kitchen-data-analysis

The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 06 May 2026

https://github.com/noor188/preswald-data-app

A data app to visualize and manipulate the graduate admission dataset

data-analysis data-visualization open-source

Last synced: 04 Jul 2025

https://github.com/sivkri/shiny-scatter-plot-app

This repository contains a Shiny app that allows users to create interactive scatter plots by selecting the X and Y axes and customizing the point color. The app utilizes the shiny package in R to provide a user-friendly interface and the ggplot2 package for creating visually appealing plots.

data-analysis data-visualization ggplot2 interactive-web-application r rprogramming scatter-plot shiny

Last synced: 22 Mar 2025

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/dbolotov/ts_smoothing_visualizer

Streamlit app for visualizing and comparing time series smoothing methods on real and synthetic datasets.

data-science data-visualization streamlit time-series

Last synced: 24 Jul 2025

https://github.com/tolumie/web-scraping-rest-api-stock-data-operations

Web Scraping, REST API & Stock Data Operations is a data-driven project that explores the power of web scraping, API interactions, and stock market analysis using Python. From extracting stock data and public records to analyzing real-world financial trends, this repository is a one-stop resource for data enthusiasts, traders, and analysts.

api-integration data-analysis data-cleaning data-visualization financial-data python rest-api sql-databases stock-data web-scraping

Last synced: 19 May 2026

https://github.com/stanleynguyen/so.cube

World map visualisation of World's Cube Association data 🌏

cas cube data-visualization leaftlet map

Last synced: 24 Jul 2025

https://github.com/netesf13d/expt-sequence-analysis

Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.

cold-atoms data-analysis data-visualization optical-tweezers

Last synced: 24 Jul 2025

https://github.com/swethajoseph/netflix-powerbi-interactive-dashboard

Created an interactive Netflix Power BI dashboard to analyze and visualize Netflix's content library, uncovering trends in content type, genre distribution, and global reach

data-analysis data-visualization interactive-visualizations powerbi powerbi-dashboards powerbi-report

Last synced: 03 Jan 2026

https://github.com/andersoncrs/clasificacion-propina-restaurante

Este informe desarrolla, de manera clara y práctica, un análisis completo del conocido conjunto de datos de propinas (tips), mostrando paso a paso cómo transformar la información cruda en modelos predictivos útiles.

clasification data-analysis data-visualization tips

Last synced: 26 Jul 2025

https://github.com/ptdewey/spotipy-wrapped

Make sense out of Spotify personal data

data-visualization jupyter-notebook python spotify

Last synced: 01 Aug 2025

https://github.com/syncfusionexamples/how-to-add-arrows-to-the-chart-axis-in-.net-maui-chart

Learn how to enhance MAUI charts by adding arrows to the chart axes using annotations for improved visualization and clarity.

axis-with-arrows chart-annotations chart-customization charting-library charts data-visualization line-annotation maui-charts

Last synced: 26 Jul 2025

https://github.com/faraazarsath/guvi---assignment_5_6_7

This repository is about Analysis of Cricket Chirps, Brain-Body Weight, and Salary Discrimination Data: Linear regression, visualization,R2 squared and correlation assessments.

analysis data-visualization linear-regression r2score

Last synced: 05 Oct 2025

https://github.com/jain1shh/solar-flare-prediction

This repository contains code and data for predicting solar flare energy ranges using machine learning, based on NASA's RHESSI mission data. It includes preprocessing of FITS files into a unified CSV dataset and implements models like Gradient Boosting, Random Forest, and Decision Tree classifiers, achieving accuracies up to 87%.

data-visualization machine-learning numpy pandas python scikit-learn solar-flare-prediction

Last synced: 09 Apr 2026

https://github.com/jakebrehm/geophotos

🗺 📍 A Python package to pull, analyze, and plot coordinates from various sources.

data-visualization gdal geopandas heatmap osgeo photos plot plotting python python-3

Last synced: 09 Jun 2026

https://github.com/lucas-mazzolim/superstore-bi

Project where I prepared two data sources for querying and created a BI visualization in Data Studio. Used tools as Mysql, Looker Studio, Google Spreadsheet and Python.

business-intelligence data-analysis data-visualization google-looker-studio mysql spreadsheet

Last synced: 27 Jul 2025

https://github.com/rajireddy15/employee-attrition-prediction-hr-analytics-

Employee Attrition Prediction (HR Analytics) helps organizations analyze employee data, identify factors driving turnover, and predict attrition using machine learning and visual dashboards, enabling data-driven HR decisions and retention strategies.

data-cleaning data-collection data-manipulation data-preprocessing data-science data-visualization eda feature-engineering imbalanced-data machine-learning mysql-database numpy pandas

Last synced: 04 May 2026

https://github.com/erictleung/tidytuesdays

:chart_with_upwards_trend: My attempts at #tidytuesday

data data-science data-visualization r rstats tables tidytuesday tidyverse

Last synced: 19 Sep 2025

https://github.com/christos-pelekis/harsourcerer

An inclusive MERN stack-based platform for comprehensive analysis and exploration of HTTP traffic data extracted from HAR (HTTP Archive) files.

data-visualization har-files http-traffic mern-stack

Last synced: 29 Jul 2025

https://github.com/archanakokate/eda_amazon_products_and_discounts_2023

Exploratory Data Analysis (EDA) on Amazon's 2023 Products and Discounts data

data-analysis data-mining data-visualization exploratory-data-analysis

Last synced: 03 Jan 2026

https://github.com/fawadeqbal/data-science

A comprehensive repository covering essential Data Science concepts using PyTorch, including anomaly detection, classification, clustering, regression, and more. Includes hands-on implementations and tutorials for each concept.

calculus calculus-2 data-science data-visualization dataset machine-learning python pytorch statistics

Last synced: 29 Jul 2025

https://github.com/malakasupun/crime-data-analysis-of-lapd

This project aims to explore and analyse crime patterns in Los Angeles using a dataset spanning from 2020 to the present. The primary focus is to extract meaningful insights by integrating structured data analysis and advanced techniques in SQL and Natural Language Processing (NLP).

data-analysis data-visualization llm nlp sql

Last synced: 29 Jul 2025

https://github.com/zborovskaanna/e-commerce-web-events-analysis

SQL project based on the Big Query public database 'The Look e-Commerce' and a dashboard in Looker Studio

analysis bigquery dashboard data-visualization looker-studio sql

Last synced: 03 Jan 2026

https://github.com/j4rviscmd/streamlit-advanced-dataframe

🚀 A powerful Streamlit custom component that extends st.dataframe with advanced features: filtering, sorting, row/cell selection, column resizing, virtual scrolling (60fps with 100K rows), and more. Built with React + TanStack Table v8.

data-table data-visualization dataframe pandas python react streamlit streamlit-component streamlit-custom-component tanstack-table typescript

Last synced: 09 Mar 2026

https://github.com/kartikey2807/bike-classification-1rt700

Binary classification problem involving Logistic regression, SMOTE and feature expansion.

data-analysis data-engineering data-visualization logistic-regression

Last synced: 30 Jul 2025

https://github.com/hooopo/ossinsight-pick

Handpicks, features, or highlights a selection of open-source repositories each week. We cherry-pick the best, trending, or otherwise interesting repositories, providing an in-depth analysis you won't find elsewhere, thus enabling developers to discover, learn from, and contribute to these noteworthy projects.

analytics data-visualization github open-source trending-repositories visualization

Last synced: 30 Jul 2025

https://github.com/sakshithbillava/expense-manager

A web-based expense tracking app built with Python and Streamlit, featuring real-time updates, data visualization, user authentication, and MongoDB integration.

authentication data-visualization expense-manager matplotlib mongodb numpy pandas personal-finance python streamlit webapp

Last synced: 09 Apr 2026

https://github.com/mr-chang95/twitter_datawrangling

Twitter Data Wrangling for Udacity's Data Analyst Nanodegree Program

data-visualization data-wrangling dogs matplotlib numpy pandas python twitter

Last synced: 09 Apr 2026

https://github.com/vishal-bhandary/sql-data-analytics

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics business-intelligence customer-segmentation dashboarding data-analysis data-reporting data-visualization data-warehouse etl kpi product-analysis sql sql-server star-schema t-sql

Last synced: 30 Jun 2026

https://github.com/thedevreda/jadaerospace

A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero

data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python

Last synced: 02 Aug 2025

https://github.com/azaz9026/eda

Exploratory Data Analysis (EDA) refers to the method of studying and exploring record sets to apprehend their predominant traits, discover patterns, locate outliers, and identify relationships between variables. EDA is normally carried out as a preliminary step before undertaking extra formal statistical analyses or modeling.

data-cleaning data-visualization encoding machine-learning matplotlib numpy pandas plotly python3 seaborn sklearn-library

Last synced: 15 Apr 2026

https://github.com/tashi-2004/apache-flink-spark-data-streaming

This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.

apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3

Last synced: 09 Feb 2026

https://github.com/drisskhattabi6/text-to-sql

Chat with DB : A powerful web application that transforms natural language questions into executable SQL queries against a PostgreSQL or MySQL database and visualizes the results, Using Langchain (Ollama and ChromaDB), LangGraph and Streamlit

ai-agent chat-with-db chromadb data-visualization gemini langchain langgraph mysql ollama openai postgresql streamlit text-to-sql text2sql txt2sql

Last synced: 09 Apr 2026

https://github.com/hari00887/analysis-of-global-terrorism

Analysis of Global Terrorism Using AHP A quantitative study of GTD data to assess attack severity and evolution across time and space.

data-analysis data-visualization powerbi

Last synced: 02 Mar 2026

https://github.com/joaopalmeiro/ctrlair

A toolbox for Altair.

altair data-visualization python

Last synced: 04 Aug 2025

https://github.com/sshehrozali/top-repo-visualizer

Program to generate visual graph of top most starred GitHub repos using PyGal and GitHub API.

api data-visualisation data-visualization github-api graph pygal python

Last synced: 05 Aug 2025

https://github.com/shrutiijoshi/corporate-campus-hiring-analysis

This project analyzes corporate campus hiring trends for fresh graduates in India.

dashboard data-analysis data-visualization excel powerbi

Last synced: 09 Mar 2026

https://github.com/luarmen/covid-19-testing-forecast-iceland-2020-2022-

Time series forecasting of daily COVID-19 testing in Iceland using R. Models compared include ETS, SARIMA, and Auto ARIMA with cross-country validation on UAE data.

arima covid-19 data-visualization ets forecasting pandemic-data r sarima statistical-modeling statistical-modeling-techniques time-series

Last synced: 07 Aug 2025

https://github.com/albertofaraujo/pbi_dashboard_anatel

Analisar o desempenho das operadoras no atendimento das reclamações registradas por usuários de telecomunicações no sistema Anatel Consumidor

data-visualization dax-studio power-query powerbi

Last synced: 19 Feb 2026

https://github.com/syncfusionexamples/how-to-collapse-the-visibility-of-specific-data-label-in-.net-maui-cartesian-chart

This article in the Syncfusion Knowledge Base explains how to collapse the visibility of specific data label in .NET MAUI Cartesian chart

charting-library charts column-chart data-label data-label-customization data-label-visibility data-visualization hide-data-label maui-charts

Last synced: 03 Oct 2025

https://github.com/oncs21/expenai

An open source modern and simple expense tracking app with numerous integrations

bootstrap css data-visualization database-management ejs express mindee mongodb mongoose node-js server-security

Last synced: 09 Apr 2026

https://github.com/sayamalt/flight-price-prediction

Successfully established a machine learning model to accurately predict the price of a flight in India based on several features such as duration, days left, arrival time, departure time and so on.

data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/saikiran76/titanicdata-analysis-eda

In this notebook, we're going to analyse the famous Titanic dataset from Kaggle. The dataset is meant for supervised machine learning, but we're only going to do some exploratory analysis at this stage. We'll try to answer some questions using metrics and EDA.:

analysis data-science data-visualization eda python

Last synced: 19 May 2026

https://github.com/sayamalt/quora-duplicate-question-pairs-identification

Successfully developed a machine learning model which can accurately detect whether any given pair of Quora questions are duplicate or not.

data-visualization machine-learning natural-language-processing nltk paraphrase-detection text-preprocessing

Last synced: 09 Nov 2025

https://github.com/1ayanabil1/100-days-of-python-bootcamp

Join me on my journey to code in Python every day for 100 days! 🐍 This challenge is designed to sharpen my programming skills, explore Python libraries, and build cool projects along the way.

data-structures data-structures-and-algorithms data-visualization django flask machine-learning matplotlib numpy pandas python seaborn web-development

Last synced: 09 Apr 2026

https://github.com/rajkumargara/bike_rental_data_analysis

Chicago bike rental data analysis for business insights using R programming

data-analysis data-visualization data-wrangling large-dataset machine-learning-algorithms

Last synced: 11 Aug 2025

https://github.com/riyanshibariyaa/Vehicle-Emission-Analysis_MACHINE_LEARNING_

Vehicle Emissions Analysis This project focuses on analyzing vehicle emissions data using various machine learning techniques. The dataset used for analysis contains information about vehicle emissions, including engine size, CO2 emissions, transmission type, smog level, and fuel consumption.

artificial-intelligence data-visualization exploratory-data-analysis feature-engineering linear-regression machine

Last synced: 12 Aug 2025

https://github.com/mukeshlilawat1/netflix-data-visualization

Netflix Data Visualization – This project explores the Netflix dataset using Pandas for data manipulation and Matplotlib for creating meaningful visualizations. It highlights trends in movies and TV shows, distribution by release year, ratings, duration, and categories, making the data easy to understand through graphical insights.

data-visualization matplotlib pandas pip python

Last synced: 09 Apr 2026

https://github.com/raghul-m/stock-price

Simple Stock Price App Using Streamlit and Yfinance

data-science data-visualization streamlit-webapp yfinance-library

Last synced: 04 Oct 2025

https://github.com/bessouat40/photo-gallery-viewer

This project is a photo gallery app 🎨 It leverages a CLIP model for powerful image search based on text keywords. You can easily filter through your images using AI-driven queries!

artificial-intelligence data-visualization elasticsearch embeddings image-gallery image-search mvc-architecture offline photo-gallery python

Last synced: 09 Apr 2026

https://github.com/aman-codde/credit-card-analytics

A full-stack dashboard for credit card users to analyze spending, track rewards, and download statements securely.

analytics dashboard data-visualization express fullstack jwt-authentication mongodb nodejs react recharts tailwindcss

Last synced: 09 Apr 2026

https://github.com/easonlai/eda_for_hk_covid19

This is a code sample of Exploratory Data Analysis (EDA) for COVID-19 cases in Hong Kong. Data is obtained from official data.gov.hk.

covid-19 covid19-data data-analytics data-science data-visualization eda matplotlib omicron pandas python python3 seaborn

Last synced: 09 Apr 2026

https://github.com/supsi-deass-cpps/multilingual_thematic_analysis

Modular R pipeline for multilingual survey analysis — translate, embed, cluster, and visualize open-ended responses using Google Cloud and tidyverse tools.

clustering data-visualization linguistics multilingual-analysis natural-language-processing qualitative-research r reproducible-research social-science survey-data text-mining thematic-analysis translation

Last synced: 04 Oct 2025

https://github.com/guilherme-marcello/r-data-analysis-barplots

Reading RDS files, processing and presentation in bar plots

bar-plot data-visualization r

Last synced: 05 Oct 2025

https://github.com/kowshik24/predictstock

🚀 StockSage: Predicting Tomorrow's Stocks, Today! 🌌 Dive deep into the future of stock prices with StockSage! Powered by LSTM networks, this repository is a treasure trove for those looking to explore the intricacies of stock price predictions. 📈✨ 🔗 Live App: https://stocksage.streamlit.app/

data-science data-visualization deep-neural-networks lstm stock-market streamlit tensorflow

Last synced: 18 Mar 2026

https://github.com/jpgiant/nyc_energy_prediction

A comprehensive code for predicting energy usage in NYC using Machine Learning Algorithms.

data-analysis data-science data-visualization folium jupyter-notebook machine-learning matplotlib numpy pandas python seaborn sklearn

Last synced: 10 Apr 2026

https://github.com/yashvs15112004/real-time-crisis-sentiment-analysis-and-mapping

🚨 An AI-powered web app for real-time crisis reporting. Analyzes submission urgency with an NLP model and visualizes critical alerts on a live map to aid first responders.

data-visualization emergency-response fastapi full-stack geolocation java leafletjs machine-learning microservice mongodb nlp python sentiment-analysis spring-boot

Last synced: 10 Apr 2026

https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application

The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.

data-analysis data-science data-visualization python python3 streamlit

Last synced: 18 Apr 2026

https://github.com/kristishqau/ApartmentRegressionAnalysis

This data science project aims to predict apartment prices through regression analysis. The dataset used contains information about apartments, and the project involves various steps such as data preprocessing, exploratory data analysis, feature engineering, and building a decision tree regression model.

apartment-prices data-preprocessing data-science data-visualization decision-tree-regression jupyter-notebook python3

Last synced: 22 Aug 2025

https://github.com/fernandogomesfg/sabores-aromas-analytics

Projecto Sabores & Aromas: um dashboard interativo desenvolvido no Power BI, focado em insights de vendas, desempenho por equipe e análise de rentabilidade para optimizar decisões estratégicas.

analise-de-dados data-science data-visualization dataanalytics powerbi storytelling-with-data vendas

Last synced: 13 Feb 2026

https://github.com/silent0wings/steganopixelcrypt

SteganoPixelCrypt is a C++ tool that encodes text into images by mapping characters to pixel colors. It provides a simple form of visual encryption, turning data into colored patterns that can be decoded back to text. Lightweight, fast, and ideal for basic steganography.

cli-tool cplusplus data-visualization encryption image-processing pixel-art steganography text-to-image utf-32 visualization-tool

Last synced: 26 Aug 2025