An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/devprnvk/realestateml

This Python program analyzes a dataset (HousePricePrediction.xlsx) containing information about house prices. It utilizes pandas for data manipulation, matplotlib for plotting, and seaborn for visualizing correlations and distributions.

data-science data-visualization datasets houses npm plotting prediction-model seaborn

Last synced: 30 Apr 2026

https://github.com/tashi-2004/global-ecommerce-retail-trends-analysis

The Global E-commerce & Retail Analysis project involves data preprocessing, dimensionality reduction with PCA, CLV calculation and What-If analysis . Key insights include effective PCA for data reduction, detailed CLV analysis across segments , and the impact of pricing strategies on sales.

boxplot clv-analysis data-science data-visualization dataintegration deep-learning dimensionality-reduction ecommerce heatmap machine-learning normalization outlier-detection outlier-removal pca-analysis preprocessing python scatter-plot whatif-analysis

Last synced: 30 Apr 2026

https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python

This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.

csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/bachtiarashidiqy/ecommercedashboard

An interactive e-commerce analytics dashboard built with Streamlit, providing visualizations for sales performance, product analysis, geographic insights, and delivery status. Includes date filtering, company branding, and comprehensive documentation.

analytics dashboard data-analysis data-visualization e-commerce matplotlib pandas python seaborn streamlit

Last synced: 30 Apr 2026

https://github.com/mxagar/eda_fe_summary

An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.

data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/diogojorgebasso/dataanalysis_r_minesnancy

Les codes et les matériaux des cours d'analyse de données en R à Mines de Nancy. Vous y trouverez également des scripts R, des notebooks et d'autres ressources pour chaque leçon.

analyse-data data-analysis data-science data-visualization estatistics r statistiques statistiques-descriptives

Last synced: 30 Apr 2026

https://github.com/fernandesotero/project-data-exploration

Student Performance Prediction with Data Science

data-visualization jupyter-notebook python

Last synced: 30 Apr 2026

https://github.com/cagandemirmr/airbnb_available_houses

In this repo, i create dashboard using Tableau.In this process, i use SQL and Python languages.

dashboard data-visualization dataprocessing python sql tableau

Last synced: 30 Apr 2026

https://github.com/srinibas-masanta/ibm-applied-data-science-capstone

This repository contains the work completed for the Applied Data Science Capstone Project offered by IBM on Coursera. The capstone project is the final course in the IBM Data Science Professional Certificate series and serves as an opportunity to apply the skills and knowledge gained throughout the series to a real-world data science problem.

capstone-project data-analysis data-science data-visualization machine-learning python web-scraping

Last synced: 30 Apr 2026

https://github.com/priyam-hub/covid-19-data-analysis

Explore COVID19 case numbers and deaths related to Coronavirus outbreak 2019/2020 in Pandas and in Jupyter notebook

analysis data data-visualization jupyter-notebook machine-learning python

Last synced: 08 Jun 2026

https://github.com/mayankfreelancer/advanced-sales-analytics-dashboard-power-bi-

This interactive Power BI dashboard provides a comprehensive analysis of sales data across regions, categories, and time periods. The project aims to uncover key trends in total sales, profit, quantity sold, and product performance, using advanced visualizations and forecasting techniques. 🛠 Tools & Techniques Used: Power BI

dashboard data-science data-visualization excel numpy pandas powerbi python sales-analysis sql

Last synced: 30 Apr 2026

https://github.com/samuelpillai/machine-learning-classification-regression-nlp

A curated collection of machine learning mini-projects covering classification, regression, and natural language processing (NLP). This project demonstrates model training, evaluation, feature engineering, and pipeline integration using real-world datasets and Python tools like Scikit-learn, pandas, and NLTK.

classification data-analysis data-science data-visualization feature-engineering jupyter-notebook machine-learning ml-pipeline model-evaluation nlp python regression-models scikit-learn supervised-learning text-mining

Last synced: 30 Apr 2026

https://github.com/tolumie/aviva-insurance-statistics-hypothesis-abtesting-modelling

This project explores the impact of demographic and lifestyle factors on insurance charges. Using statistical hypothesis testing (ANOVA, Chi-Square, T-tests) and predictive modeling (Elastic Net, Random Forest, Gradient Boosting). The analysis is deployed using Streamlit.

anova chi-square-test data-visualization eda gradient-boosting hypothesis-testing insurance-dataset machine-learning predictive-modeling python random-forest statistical-analysis streamlit

Last synced: 30 Apr 2026

https://github.com/rayxiang03/indeed-job-scraping

Python toolkit for scraping Indeed job listings, preprocessing data, and generating visualizations for market analysis.

cloudscraper data-visualization indeed job-analysis nlp pandas python web-scraping

Last synced: 30 Apr 2026

https://github.com/mmartin46/county-health-findings-project

Analyze the data set given by United Health Group(UHG) to determine the impact on race, social and demographic factors on health, survival, and mortality.

analysis data-science data-visualization linear-regression machine-learning pandas

Last synced: 30 Apr 2026

https://github.com/gerhynes/d3-births-pie-chart

A D3 pie chart showing UN birth data grouped by month and quarter. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization javascript

Last synced: 30 Apr 2026

https://github.com/01110011011101010110010001101111/tigercosmosbootstrapdash

Sample Repository of Visualisaing TigerGraph Data with Cosmos in a Bootstrap Dashboard

bootstrap cosmos data-visualization graph-visualization tigergraph

Last synced: 30 Apr 2026

https://github.com/the-clone-xyz/stats-lapas-pakam

Visualisasi data narapidana berdasarkan jenis kelamin di Lapas Lubuk Pakam menggunakan data BPS Deli Serdang secara otomatis via GitHub Actions.

bps-api data-visualization github-actions lubuk-pakam statistics

Last synced: 30 Apr 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

in this project i am going to apply data preprocessing technique on the dataset in order to clean the data using libraries, etc. make some insights/analyses to findout the hotpicks of the shopping along with some data visualsation libraries to get the trends and many more aspects in order to make a small contribution to the field of data science

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/realvuk/r-for-data-science-by-vuk

My exercise from the book R for Data Science: Import, Tidy, Transform, Visualize, and Model Data 2nd Edition

data-science data-visualization r rstats

Last synced: 13 Jun 2026

https://github.com/fazatholomew/marlboroplan

In order to contribute to a more inclusive sustainable energy program in Massachusetts, this project is part of my work for a nonprofit organization called All In Energy and undergraduate thesis for my degree.

data-analysis data-visualization energy jupyter-notebook massachusetts python

Last synced: 01 May 2026

https://github.com/falakrana/data-analysis-visualization

This repository showcases data analysis and visualization projects using Python and Tableau. It includes exploratory data analysis, interactive dashboards, and insightful visual stories derived from real-world datasets.

data-analysis data-visualization python tableau-public

Last synced: 01 May 2026

https://github.com/kivanc57/explaratory_analysis

Exploratory and Descriptive Data Analysis on Indonesian data using R. This project involves reading data, feature analysis, correlation analysis, logistic regression, PCA, MDS, and clustering. Visualizations include boxplots, scatter plots, corrgrams, and dendrograms. Comprehensive report available in report.docx.

clustering data-science data-visualization descriptive-statistics explanatory-data-analysis mds pca plot r

Last synced: 08 Jun 2026

https://github.com/codeofrahul/python_amazon_sales_analysis

In this repository, I have saved my Python_Amazon_sales_analysis Notebook. To do this Amazon_sales_analysis, I have done end to end process. cleaned the dataset, Did EDA, ploted graph and reached to the conclusion.

amazon analysis data-visualization eda exploratory-data-analysis matplotlib pandas-library python seaborn

Last synced: 01 May 2026

https://github.com/kevinandersontech/ecommerce_dashboard_streamlit

A Streamlit dashboard that reads daily revenue metrics from the data pipeline. Provides date filters, summary KPIs, line charts, and a table to explore revenue over time across different statuses (e.g. paid, refunded, failed).

charts dashboard data-visualization duckdb filters metrics python streamlit

Last synced: 01 May 2026

https://github.com/ishmal793/bi-dummy-

An interactive and beginner-friendly data dashboard built using Streamlit. Upload your own CSV or Excel file, apply filters, view key statistics, and generate beautiful visualizations with no coding required.

data-analytics data-visualization eda pandas plotly python-dashboard streamlit

Last synced: 01 May 2026

https://github.com/fbarffmann/project1

Analyzed factors influencing movie profitability using Python. Cleaned and visualized film industry data to uncover trends in budgets, sales, genres, and ratings.

box-office-analysis data-analysis data-visualization matplotlib movie-industry pandas python regression seaborn

Last synced: 01 May 2026

https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support

RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.

ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit

Last synced: 01 May 2026

https://github.com/jianxi-erin/bigdata-machinelearning-lab

本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。

data-analysis data-visualization hadoop machine-learning python spark sql

Last synced: 03 May 2026

https://github.com/samyomb/olist-ecommerce-analytics

Olist e-commerce performance & customer reviews — Python cleaning + BigQuery SQL + Looker Studio dashboard (2017 FY & 2018 YTD) with actionable insights

analytics bigquery brasil customer-experience dashboard data-visualization e-commerce looker-studio olist python review sql

Last synced: 05 Oct 2025

https://github.com/lukakerr/us-surnames

US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.

data data-visualization r

Last synced: 05 Oct 2025

https://github.com/trismald/eurosoccer1023

Data Analyst - European Soccer 2010 2023

data-analysis data-visualization jupyter-notebook pandas powerbi python

Last synced: 06 May 2026

https://github.com/tejas-jadhav01/blender-csv-visualizer

A Blender add-on for visualizing CSV data as 3D objects.

blender blender-addon csv data-visualization python

Last synced: 18 May 2026

https://github.com/dibsthegreat/titantic-dataset-analytics

DASC4850 Final Project where I did EDA to determine the survivability of Titanic guests depending on Age, Gender, Wealth, etc.

data-science data-visualization matplotlib numpy pandas python random-forest-classifier

Last synced: 13 Apr 2026

https://github.com/christophertrimboli/trends-vs-stock

A tool to browse culture's impact on stocks. Made in React, Node, Bootstrap and Charts.js to compare data from Google Trends and Alpha Vantage APIs.

bootstrap chartjs data-visualization google-trends intrinio nodejs react stocks

Last synced: 28 Apr 2026

https://github.com/subhamghimire/dataanavis

Learning Data analysis and visualization

data-analysis data-science data-visualization dataset

Last synced: 06 Oct 2025

https://github.com/davifeliciano/modern_physics_experiments

Collection of data analysis and visualization scripts developed in Python around some modern physics experiments

data-analysis data-visualization modern-physics physics physics-experiments

Last synced: 18 Jan 2026

https://github.com/gabboraron/biostatisztika_es_alkalmazasai

"A statisztika a matematika azon ága, melynek feladata, hogy eszközt adjon a politikusok kezébe, mellyel tetszőleges állítás és annak ellentéte is tudományos alapon igazolható"

biostatistics data-analysis data-visualization r statistics statistics-course

Last synced: 24 Oct 2025

https://github.com/meddhiachaouachi/browsetrack-datacollection

This full-stack web application streamlines data interaction with a simple and intuitive design. Built with Django and React, it offers secure and efficient tools for analysts to share insights, users to manage cookies, and clients to visualize real-time data effortlessly. The platform also tracks user browsing activity, providing valuable insights

cookies data-visualization datatracker social-network-analysis useractivity

Last synced: 07 Oct 2025

https://github.com/ljadhav25/world-population-analysis-1990-2023-

This repository contains data and analysis related to the world population from 1990 to 2023. The objective is to explore population trends, identify patterns, and visualize demographic changes across different countries and continents over the past few decades.

data-analysis-python data-visualization matplotlib numpy-library pandas-library seaborn

Last synced: 08 Oct 2025

https://github.com/sayamalt/mental-health-classification-using-fine-tuned-distilbert

Successfully established a multiclass text classification model by fine-tuning pretrained DistilBERT transformer model to classify several distinct types of mental health statuses such as anxiety, stress, personality disorder, etc. with an accuracy of 77%.

data-visualization deep-learning distilbert-fine-tuning distilbert-model model-evaluation model-inference model-training-and-evaluation multiclass-text-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 08 Oct 2025

https://github.com/omarsolieman/socialgiveawaydataanalysis

This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis

data-analysis data-science data-visualization instagram scraping threejs

Last synced: 14 May 2026

https://github.com/lixx21/tableau_netflix_movies_tvshows_2021

Visualize netflix movies and tv shows in 2021

data-visualization dataset netflix tableau

Last synced: 19 Jan 2026

https://github.com/mgckaled/rs_data-analytics

Repositório agregador do conteúdo da formação Data Analytics desenvolvido pelo Rocketseat.

data-analytics data-visualization python sql statistics

Last synced: 09 Oct 2025

https://github.com/nidhi-kekatpure/churn-prediction-for-subscription-customers

End-to-end customer churn prediction using AWS SageMaker, XGBoost, and Python - complete ML pipeline from data generation to real-time inference with 79% AUC score.

aws business-intelligence data-science data-visualization jupyter-notebook predictive-analytics

Last synced: 16 May 2026

https://github.com/rohanchawla972/business-analytics-and-visual-storytelling

Business analytics repository integrating multi-source datasets and applying advanced methods — EDA, clustering, panel regression, sentiment analysis, and data storytelling — to extract actionable insights across hospitality, innovation, and financial services.

business-analytics clustering data-visualization jupyter-notebook predictive-modeling python storytelling

Last synced: 18 May 2026

https://github.com/anandu-jpg/coffee-shop-sales-analysis

This project analyzes coffee shop sales data to identify trends, patterns, and insights that can help improve operations, boost revenue, and enhance the customer experience.

business-intelligence data-analysis data-visualization exploratory-data-analysis jupyter-notebook pandas phyton

Last synced: 18 May 2026

https://github.com/rightfulcode/customer-segmentation-rfm

This project performs customer segmentation using Recency, Frequency, and Monetary (RFM) metrics to identify key customer groups and provide actionable marketing insights.

data-analysis-python data-visualization elevvo-internship jupyter-notebook matplotlib pandas python rfm-analysis seaborn

Last synced: 08 May 2026

https://github.com/ratna-babu/generating-graphs

Generate, color, and visualize random graphs using Python's NetworkX and Matplotlib. Includes compression and storage of graph data with .gz and pickle. Ideal for exploring graph coloring and greedy algorithms in graph theory.

data-visualization erdos-renyi graph-coloring graph-theory greedy-algorithm matplotlib networkx python random-graph random-graph-generation

Last synced: 10 Oct 2025

https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project

My project aims to practice Data Analysis and Data Visualization on DataCamp

data-analysis data-visualization datacamp matplotlib pandas python seaborn

Last synced: 04 May 2026

https://github.com/frankelavsky/security-dash-challenge

I had two 8 hour days to create a visualization dashboard for three datasets. Tab one: Voronoi overlay on line graph. Tab two: Data partitioning method keeps in-memory usage low. Tab three: deals with "Failed" vs "Successful" attempts as positive/negative barcharts over time. I used d3.js, require, MVC pattern, and vanilla js.

client-side complexity css3 d3 d3js dashboard data-analysis data-structures-algorithms data-visualization frontend-app html5 interactive-visualizations javascript modular network-analysis network-monitoring network-security security single-page-app visualization

Last synced: 14 Apr 2026

https://github.com/controldata23/automobiles-data-exploration

An Exploratory Data Analysis done on an Automobiles dataset from kaggle

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Jan 2026

https://github.com/gorlix/sp-grafana-bridge

A lightweight, real-time data bridge to export Super Productivity tasks to InfluxDB v2 for advanced analytics on Grafana. Transform your time-tracking into actionable insights.

data-visualization grafana influxdb-v2 plugin productivity-tool super-productivity superproductivity time-tracking

Last synced: 31 May 2026

https://github.com/itskshitija/hr-data-analysis

The HR Data Analytics Dashboard project uses Power BI to analyze employee data, visualizing key HR metrics and KPIs to support data-driven decisions for improving workforce management, employee satisfaction, and organizational growth.

analytics data-science data-visualization dataanalysis dataanalytics hrdataanalysis powerbi-desktop powerbidashboard

Last synced: 21 Jan 2026

https://github.com/alexmcvay/uber-data

UBER sql clone

data data-visualization sql

Last synced: 19 Jan 2026

https://github.com/bhavinpatel4199/machine-learning-programming

This repository serves as a central hub for various machine learning projects and experiments. It contains multiple sub-repositories, each focusing on different aspects of machine learning, from data preprocessing to advanced deep learning techniques.

data-structures data-visualization machine-learning machine-learning-algorithms pandas-dataframe python3 sklearn

Last synced: 19 Jan 2026

https://github.com/saifalibaig/covid-19-death-rate-analysis-using-python

Analysis of Covid-19 data along with the world happiness report to identify if there is any relationship between death rate and happiness rate of countries all over the world.

data-analysis data-visualization numpy pandas python3 sns visualization

Last synced: 03 May 2026

https://github.com/benst099/circlesplot

Visualize proportions with circles in a plot

cran cran-r data-science data-visualization proportions r visualization

Last synced: 11 Oct 2025

https://github.com/vinay-jose/territorial-sales-dashboard

EDA was carried out in the sales data of Atliq Technologies and a Dashboard was created in PowerBI to draw insights.

data-analysis data-visualization powerbi-desktop sql

Last synced: 11 Oct 2025

https://github.com/mitgar14/etl-workshop-2

Workshop #2 (ETL process using Airflow) for the ETL course using Apache Airflow to build a data pipeline.

airflow data-engineer data-engineering data-visualization etl pandas postgresql powerbi python sqlalchemy

Last synced: 14 Apr 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/ahsankhizar5/titanic-eda-visualization

Exploratory Data Analysis and Visualization on the Titanic Dataset using Python, Pandas, Matplotlib, and Seaborn to uncover survival patterns.

data-analysis data-science data-visualization eda kaggle machine-learning matplotlib pandas python seaborn titanic-dataset

Last synced: 31 May 2026

https://github.com/ahsankhizar5/retail-sales-analysis-python-powerbi

A complete retail sales analytics project using Python for data cleaning and EDA, and Power BI for dashboard visualization. Built as a capstone for the Business Analytics Bootcamp by CourseMea.

business-analytics capstone-project coursemea dashboard data-visualization eda exploratory-data-analysis powerbi python python3 retail-data

Last synced: 31 May 2026

https://github.com/archanakokate/kkbox_music_recommendations

Predicting the chances of a user listening to a song repetitively after the first observable listening event.

data-visualization exploratory-data-analysis machine-learning statistical-analysis

Last synced: 11 Oct 2025

https://github.com/timjjting/task-lineage-generator

A simple Task Lineage Diagram Generator

d3 dag data-visualization golang graphviz-dot lineage task

Last synced: 21 Jan 2026

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 19 Jan 2026

https://github.com/tzerk/esr

R package 'ESR' for plotting and analysing ESR spectra in dating applications

data-analysis data-visualization electron-spin-resonance geochronology r

Last synced: 13 Mar 2026

https://github.com/agarwalrachit399/fifa-19-analysis

Analysis of one of the most popular games ,FIFA-19 on tableau and a detailed report for the same

dashboard data-visualization fifa19 tableau

Last synced: 19 Jan 2026

https://github.com/sebastian-gregoricchio/rseb

An R-package for daily tasks required to handle biological data as well as avoid re-coding of small functions for quick but necessary data management.

atac-seq bedtools chip-seq cutandtag daily-tasks data-visualisation data-visualization datamining deeptools genomics ngs qpcr qpcr-analysis r rna-seq statistics

Last synced: 31 May 2026

https://github.com/luzmo-official/temperature-increase

A web app displaying Global temperature rises since 1961 based on the dataset made public by FAOSTAT

climate dashboard data-visualization temperature

Last synced: 19 Jan 2026

https://github.com/moustafamohamed01/san-francisco-salaries-data-analysis

Salary Data Analysis 💰📊 An analysis of salary data to uncover trends, distributions, and anomalies in employee compensation. This project involves data cleaning, visualization, and statistical analysis to explore salary trends, job roles, and correlations. Insights are presented using Python and Jupyter Notebook. 🚀

data-cleaning data-visualization jupyter-notebook matplotlib pandas python seaborn

Last synced: 08 May 2026

https://github.com/louisfernando1204/websocket-benchmark

A comprehensive performance testing and analysis suite designed to evaluate and compare different WebSocket server implementations across various programming languages and libraries.

benchmarking broadcast-test coder-websocket csv data-analysis data-visualization echo-test golang gorilla-websocket nodejs python3 socket-io websocket-client websocket-server ws

Last synced: 09 Apr 2026

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 19 Jan 2026

https://github.com/tashi-2004/Apache-Spark-Geospatial-Air-Quality-Analysis

This project analyzes air quality data across regions to identify improvement areas, track trends, and classify similar regions using clustering. Leveraging PySpark, it processes sensor data, calculates Air Quality Index (AQI), and visualizes results with histograms and geographic maps to highlight areas with good air quality.

aqi aqi-prediction clustering data-science data-visualization geospatial-visualization kmeans-clustering predictive-modeling sensor-data time-series-analysis

Last synced: 13 Oct 2025

https://github.com/achronus/data-graphing-tool

A tool for finding the perfect graph that fits your CSV data.

data-visualization matplotlib numpy pandas python3

Last synced: 13 Oct 2025