An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026

https://github.com/amanyadav-07/customer-churn-prediction

Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.

accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier

Last synced: 11 Apr 2026

https://github.com/raghavendranhp/phonepe-pulse-data-visualization-and-exploration

This code clones PhonePe data from GitHub. After processing the data, it is displayed in an appealing manner to gain insights from PhonePe's information. This can be used to increase productivity, profits, and focus specifically on business development.

data-visualization githubclone mysql mysqlconnector pandas plotly plotly-dash python sqlalchemy streamlit visualization

Last synced: 11 Apr 2026

https://github.com/miteshgupta07/streamlit-machine-learning-app

A Streamlit application for interactive exploratory data analysis (EDA) and data visualization, offering dynamic tools to analyze and visualize machine learning datasets.

data-visualization python streamlit

Last synced: 27 Apr 2026

https://github.com/sharoonjoseph321/liver_cirrhosis

This project aim to understanding the factors contributing to liver cirrhosis, analyzing its impact, and possibly predicting disease outcomes using machine learning. It might also explore survival analysis or risk stratification for liver cirrhosis patients.

analytics data-science data-visualization dataanalysis machine-learning machine-learning-algorithms predictive-analytics predictive-modeling python random-forest-classifier visualization

Last synced: 15 Mar 2025

https://github.com/emcramer/clockplot

Plotting utility for a "clockplot" that puts groups into a time-ordered heterogeneity visualization

biology data-analysis data-visualization heterogeneity pseudotemporal-ordering

Last synced: 10 Mar 2026

https://github.com/khushi-sabarad/web_scraping

This project is a Python-based web scraper that extracts the menu from a cafe and saves it to an Excel file. It was created to automate the process of retrieving and updating menu prices, a task that was observed to be done manually at the hostel.

beautifulsoup data-analysis data-visualization market-analysis pandas python requests web-scraping wordcloud

Last synced: 29 Apr 2026

https://github.com/bilgeswe/datascience

Using Statistics and Data Science Project in Python using Google Colab tools, CVS and XLSX

box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx

Last synced: 29 Apr 2026

https://github.com/vedantshi/coffee-sales-dashboard

This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.

business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends

Last synced: 05 Jan 2026

https://github.com/akhdandann/itutilizationdashboard-powerbi

Interactive Power BI dashboard for monitoring IT utilization, application uptime, and infrastructure performance at PT PLN (2014-2018).

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/hhlitval/siemens-cashflow-analysis

Financial data engineering and analysis project extracting cash flow metrics from Siemens annual reports and presenting insights through a static, data-driven web dashboard.

cashflow chartsjs data-engineering data-visualization duckdb etl financial-analysis javascript pdf-extraction python

Last synced: 26 May 2026

https://github.com/naomiwolfe/golden-isles-dashboard2

Interactive tourism analytics dashboard for Georgia's Golden Isles

analytics chartjs dashboard data-visualization georgia golden-isles tailwindcss tourism

Last synced: 05 Oct 2025

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/tsopermon/comparison-ml-algorithms

This repository compares the performance of Adaline, Logistic Regression, and Perceptron models on binary classification tasks using linearly, non-linearly, and marginally separable datasets from the Iris dataset. It includes MATLAB implementations, 10-fold cross-validation, and visualizations of decision boundaries and MSE histories.

adaline binary-classification classification-accuracy cross-validation data-visualization decision-boundaries iris-dataset logistic-regression machine-learning matlab mse neural-networks perceptron

Last synced: 15 Mar 2025

https://github.com/aphp/jupyter-eds-notebooks

jupyter-eds-notebooks provides Docker images with preconfigured Jupyter environments for clinical and health data analysis, tailored for AP‑HP Datalabs and the HELIX platform.

data-analysis data-science data-visualization healthcare lab

Last synced: 13 Jan 2026

https://github.com/jleung51/visualizations

Javascript & D3.js visualizations of data.

d3js data-visualization javascript

Last synced: 27 Mar 2025

https://github.com/usk2003/vnrvjiet-lab-work

This repository contains my lab work for the B.Tech CSE-AIML program (2022-2026) under the R22 regulation at VNR Vignana Jyothi Institute of Engineering and Technology. It includes various subjects like Machine Learning, OS, Data Structures, C Programming, and more, showcasing my practical learning and implementations.

c-programming compiler-design computer-networks data-engineering data-structures data-visualization dbms engineering-drawing java machine-learning operating-system python software-engineering

Last synced: 11 Apr 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/yaser-123/energy-consumption-dashboard

A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.

dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi

Last synced: 05 Jan 2026

https://github.com/arslanr369/eda-journey

Exploratory data analysis (EDA) and visualization projects focusing on diverse datasets, including Bitcoin price trends and Indian restaurant reviews. Each notebook aims to provide insights and showcase data storytelling through visual exploration.

bitcoin data-science data-visualization eda

Last synced: 14 Mar 2025

https://github.com/msikorski93/meteorite-landings

Basic data analysis focused mainly on visualizing geospatial data worldwide with cartopy.

cartopy data-visualization geopandas gis mapping meteorite-landing-sites shapefile

Last synced: 16 May 2026

https://github.com/jofaval/melbourne-temperature-timeseries

Timeseries Data Analysis and Forecasting of the daily min temperature in Melbourne from 1981 to 1990

data-analysis data-science data-visualization deep-learning google-colab melbourne python temperature tensorflow timeseries timeseries-analysis

Last synced: 29 Apr 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/aykutsahinn/carpredictapp

İkinci El Araçların Analizi | Jupyter Notebook

analysis data-visualization jupyter-notebook pyhton streamlit

Last synced: 29 Apr 2026

https://github.com/hazz-i/e-commerce-analysis

FP Dicoding Analisis data dengan python

data-visualization jupyter-notebook python

Last synced: 29 Apr 2026

https://github.com/akhdandann/squadevaluationdashboard-powerbi

A Power BI dashboard that visualizes squad evaluation metrics including happiness, contribution, commitment, delivery, and agile behavior across tribes at PT. XL Axiata Tbk. (with dummy data)

business-intelligence dashboard data-visualization power-bi reporting

Last synced: 26 Jan 2026

https://github.com/franloza/contratosdemadrid

This project is an interactive web application for exploring and analyzing public contracts in the Community of Madrid. It allows users to search for companies and view their contract details, aiming to promote transparency and facilitate access to public information.

data-visualization duckdb evidence open-data

Last synced: 23 Jun 2026

https://github.com/rakeshdabbikar4/sales-performance-dashboard-powerbi

Interactive Sales Performance Dashboard built using Power BI to analyze revenue, orders, profit, trends, and regional performance.

business-analytics business-intelligence data-analytics data-visualization dax powerbi sales-dashboard

Last synced: 13 Jan 2026

https://github.com/pat8901/diskanalyzer-cli

Processes a pdf file holding storage utilization data to automatically create graph visualizations revealing the true demographics hidden in large data.

data-visualization graphs-generation matplotlib

Last synced: 27 Dec 2025

https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches

a data-driven exploration of international men's and women's football (soccer) match results using Python

data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization

Last synced: 05 May 2026

https://github.com/lotfiferaga/google-play-store-sentiment-analysis

Perform sentiment analysis on Google Play Store reviews using Python. Analyze user feedback to determine the overall sentiment (positive, negative, or neutral) towards various apps. Gain insights to aid developers and businesses in understanding user satisfaction levels and improving their products.

data-analysis data-visualization googleplayservices python reviewsanalysis-nlp

Last synced: 26 Feb 2025

https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics

Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.

beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping

Last synced: 25 Feb 2025

https://github.com/shivam5509/power-bi-project

Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.

advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server

Last synced: 11 Apr 2026

https://github.com/itrauco/data-dirtying-tool

a simple command line tool to generate dirty data and do common data things in google cloud

data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning

Last synced: 24 Feb 2025

https://github.com/maettuu/project-beatblend

Repository for the Master's Project 2024 on Visualizing and Explaining Sequential Song Recommendations through Data Humanism

audio-features aws ci-cd content-based-recommendation data-visualization discogs-api docker fastapi full-stack jwt-token masters-project postgres python recommendation-system redis rest-api spotify-api visual-data vuejs websocket

Last synced: 11 Apr 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/shariqayan/diwali_sales_analysis_python

The Diwali Sales Analysis project focuses on analyzing sales data during the Diwali festival to gain insights into customer behavior, improve customer experience, and optimize sales strategies.

data-visualization matplotlib numpy pandas python seaborn

Last synced: 29 Apr 2026

https://github.com/gunjanmimo/d3-visualization

D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework

d3js data data-science data-visualization reactjs

Last synced: 29 Apr 2026

https://github.com/jfaccioli/citi-bike-tableau

A data analysis of Citi Bike users in Jersey City using Tableau

data-analysis data-visualization tableau tableau-public

Last synced: 26 Jan 2026

https://github.com/guyabel/chord-afcon

Visualizing bilateral links between AFCON squads and players clubs

afcon africa chord-diagram data-visualization data-viz dataviz football rstats visualization

Last synced: 10 Jun 2026

https://github.com/sanand0/booksviz

LLM-generated visual insights from the GoodReads 100K dataset

data-visualization llm

Last synced: 20 Jan 2026

https://github.com/azaz9026/data_cleaning

Welcome to the Data Cleaning repository! This collection is dedicated to showcasing techniques and methods for cleaning and preparing datasets for analysis.

data-analysis data-engineering data-structures data-visualization eda feature-engineering machine-learning numpy outliers pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/kklassa/evolutionary-algorithm

implementation of an evolutionary algorithm for function minimization and a testing and result visualization environment

ai algorithm artificial-intelligence data-science data-visualization evolutionary-algorithm evolutionary-algorithms optimization python

Last synced: 20 Jun 2025

https://github.com/sayamalt/life-expectancy-prediction

Successfully established a machine learning model which can accurately predict the expected life duration of a human being based on several demographic features such as alcohol consumption per capita, average BMI of entire population, etc.

cross-validation data-cleaning-and-preprocessing data-visualization docker end-to-end-pipeline exploratory-data-analysis feature-engineering github-actions-workflow hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation

Last synced: 04 May 2026

https://github.com/sayamalt/employee-attrition-prediction

Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.

binary-classification continuous-deployment continuous-integration cross-validation data-exploration-and-preprocessing data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation

Last synced: 08 Oct 2025

https://github.com/0xarchit/covid-data-dashboard

This repo consists files related to Data Visualization Covid Data Dashboard Assignment

covid-19 covid19-data dashboard data-visualization streamlit

Last synced: 10 Apr 2026

https://github.com/jiyanshgarg/delhivery-logistics-data-analysis

This project analyzes Delhivery's logistics delivery dataset to understand delivery performance, route efficiency, and operational patterns using data analytics techniques. The analysis focuses on transforming raw segment-level logistics data into meaningful trip-level insights that can help improve delivery efficiency and route planning.

business-insights-and-recommendations data-analysis data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-extraction feature-selection hypothesis-testing outlier-detection outlier-treatment

Last synced: 12 Jun 2026

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/sayamalt/twitter-sentiment-analysis

Successfully established a machine learning model which can accurately classify the sentiment of any particular tweet into either positive, negative or neutral category.

data-visualization exploratory-data-analysis nlp sentiment-analysis supervised-learning text-processing

Last synced: 09 Nov 2025