An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/shuddha2021/ai-document-analyzer

An interactive, client-side AI Document Summarizer & Analyzer built with HTML, CSS, and JavaScript. Features summarization, entity extraction, insights, file parsing (TXT, CSV, XLSX, HTML), and visualizations, all in-browser.

artificial-intelligence chartjs client-server client-side-ai css d3js data-visualization document-summarization file-parser html javascript search-functionality sheetjs text-analysis

Last synced: 19 May 2026

https://github.com/ashwathnakate/sales-dashboard

A sales dashboard made purely in Python and deployed using shinyapps

dashboard data-visualization python shiny-apps

Last synced: 05 Oct 2025

https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning

An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.

class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote

Last synced: 02 May 2026

https://github.com/viztruth/google-play-store-data-analysis

This repository contains all the materials of my final project 'Google Play store Data Analysis' for the 'Telling Stories with Data' course at PES University.

data-analysis data-visualization

Last synced: 21 Aug 2025

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/aymane-maghouti/mobile-data-hive-insights

This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in a Hive data warehouse (the data is actually stored in the Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (HiveQL), a language close to SQL. The data is then visualized in Power BI.

apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi

Last synced: 09 Mar 2026

https://github.com/abbasi0abolfazl/commentanalyzer

A machine learning project designed to analyze Instagram comments for sentiment detection, question identification, and topic modeling. Utilizing algorithms such as LDA, LSA, NMF, and BERT, CommentAnalyzer provides valuable insights into user interactions, helping brands and researchers understand audience sentiments and trends.

data-visualization instagram-data-analysis machine-learning question-detection sentiment-analysis social-media-analytics topic-modeling

Last synced: 30 Aug 2025

https://github.com/paul019/pappe

A CLI to draw your data on top of millimeter paper

automation data-visualization diagram diagram-generator python

Last synced: 05 Mar 2025

https://github.com/Zen204/airbnb-availability

A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.

binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning

Last synced: 02 Apr 2025

https://github.com/s-araromi/smart-device-data-analysis-bellabeat_casestudy

This repository contains a comprehensive analysis of smart device data for Bellabeat, focusing on user activity, sleep, calorie burn, and weight management. Insights from Fitbit data are used to inform Bellabeat's marketing strategy and product recommendations.

bellabeat data-analysis-in-r data-visualization fitbit-smart-devices fitness-tracking r-programming sleep-analysis smart-devices

Last synced: 25 May 2026

https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing

Examined factors influencing demand for micro-mobility shared electric cycles. Performed exploratory analysis and hypothesis testing, revealing the distinct influence of weather-season association on hourly counts.

colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn

Last synced: 12 Apr 2026

https://github.com/ninadpatil09/bankcard-analytics---credit-debit-card-usage-monitoring

This project is a comprehensive data analysis initiative aimed at extracting valuable insights from bank card usage data. The tools and techniques include Python, Excel, Tableau, web scraping, and pandas. It centers on understanding and visualizing trends and patterns in credit and debit card usage across multiple banks.

data-cleaning data-visualization excel python tableau tableau-public web-scraping

Last synced: 18 Apr 2026

https://github.com/greed2411/ndl

Numbers Don't Lie: an attempt at data analysis using pandas and Matplotlib.

cities data-analysis data-science data-visualization india kaggle

Last synced: 19 Apr 2026

https://github.com/willie-conway/little-lemon-database-capstone-project

This repository contains the capstone project for the Meta Database Engineer Professional Certificate 🎓, showcasing a comprehensive database design 🗃️, SQL implementation 💻, and data analytics 📊 for the fictional restaurant "Little Lemon" 🍋.

analytics-python big-data business-intelligence customer-analytics data-analytics data-integration data-management data-mining data-modeling data-retrieval data-visualization database-engineering mysql performance-analysis python relational-databases restaurant-management sql-data-analysis stored-procedures tableau

Last synced: 13 Apr 2026

https://github.com/odeyiany2/flit-apprenticeship-data-science-projects

This repo contains all my projects for my FLiT Apprenticeship

data-analysis data-science data-visualization machine-learning sql

Last synced: 17 May 2026

https://github.com/phac-nml/nf-sequenoscope

Streamlined Nextflow wrapper for the Sequenoscope toolkit. Simplifies complex metagenomic workflows with automated batch processing, allowing efficient comparative analysis from raw reads to visualization.

adaptive-sampling batch-processing data-visualization dsl2 metagenomics nextflow sequenoscope

Last synced: 16 Jan 2026

https://github.com/sunnybibyan/call_centre_power_bi_dashboard

Create a dashboard in Power BI to visualize relevant KPIs and metrics that will help the call center manager understand trends.

call-centre-analysis dashboard data-analysis data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/ondrejhruby/airbnb-analysis-machine-learning

A comprehensive end-to-end machine learning project analyzing Airbnb listings data. This project includes exploratory data analysis, model training, optimization, and model interpretability, using a randomly generated dataset for demonstration purposes.

airbnb-data data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning model-interpretability python regression-analysis

Last synced: 20 Jul 2025

https://github.com/gracysapra/r-in-data-science

This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.

data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science

Last synced: 19 Apr 2026

https://github.com/poc275/datavisusercontrols

Data Visualisation User Controls for Windows Store Apps

c-sharp data-visualisations data-visualization metro-application user-controls

Last synced: 27 May 2026

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 11 Apr 2026

https://github.com/filiplangiewicz/fastfooddatavisualization

🍟 Project for Data Visualization Techniques course about fast food consumption

collaboration data-science data-visualization dplyr ggplot2 poster

Last synced: 15 Mar 2025

https://github.com/fatihilhan42/stock-market-analysis-with-pandas-python

Hello, today I will walk you through the details of a stock market analysis project in Python.

data-science data-visualization stock-market

Last synced: 23 Mar 2025

https://github.com/sandravizz/visual-data-science-r

Visual exploratory analysis in R from scratch, mainly using ggplot

data-science data-visualization ggplot r

Last synced: 16 May 2025

https://github.com/flower-of-the-bridges/va-project

Project for the Visual Analytics course from the Engineering in Computer Science master's degree at Sapienza

d3-js d3-visualization data-visualization data-viz visual-analytics

Last synced: 21 Feb 2026

https://github.com/chandkund/spam-email-detection

This project focuses on detecting spam emails using a fine-tuned DistilBERT model, a lighter version of the BERT model. The model is trained to classify email text into two categories: spam (1) and not spam (0). The dataset consists of email texts labeled as either spam or non-spam.

data-visualization datapreprocessing matplotlib pandas python pytorch sklearn transformer

Last synced: 20 Jan 2026

https://github.com/javitocor/d3-data-visualization

Different projects to visualize data from APIs using the D3 library

d3-visualization d3js data-visualization freecodecamp javascript practice-project

Last synced: 30 Apr 2026

https://github.com/katiesaund/market_size_app

An interactive dashboard to plot a company's estimated market size.

data-visualization finance r rshiny shiny shinyapp

Last synced: 24 Mar 2025

https://github.com/guruakaashjn/te_project_microsoft_ai

AI-based statistical analysis of land-use plastic pollution in India using AI/ML techniques.

artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi

Last synced: 27 Feb 2025

https://github.com/noturlee/imdb-dataanalysis

A data model that predicts the IMDb rating of a movie based on features like genre, director, and actors, using regression techniques to tackle the problem.

data-analysis data-cleaning data-modeling data-science data-visualization

Last synced: 08 Apr 2025

https://github.com/GZ430/global-christianity-dataviz-jp

A web app built in R Shiny for users to explore global Christianity data from Joshua Project, World Watch List, and others.

christianity data-visualization r shiny

Last synced: 10 Mar 2025

https://github.com/dev-owdenmag/dataflow-manager

A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.

data data-management data-management-platform data-visualization python

Last synced: 30 Mar 2025

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 30 Mar 2025

https://github.com/natanast/immunovisual

A collection of charts made with the R programming language, focusing on immunogenetics analyses. Different chart types are organized into multiple sections, each accompanied by its reproducible code. The gallery spotlights the use of prominent R packages such as tidyverse, data.table, and ggplot2.

data-visualization ggplot2 quarto r-programming

Last synced: 11 Mar 2026

https://github.com/hendersontrent/hotter

R package for webscraping, statistical analysis, and data visualisation of the Triple J Hottest 100 Countdown and related data.

data-visualisation data-visualization r triple-j webscraping

Last synced: 08 Apr 2025

https://github.com/kuroko1t/geoview

A lightweight, browser-based GIS data viewer built with Streamlit and GeoPandas. Visualize shapefiles, GeoJSON, and more instantly.

data-visualization folium geojson geopandas gis shapfile streamlit

Last synced: 29 May 2026

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app uses React and react-chartjs-2.

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 11 May 2026

https://github.com/ineelhere/shinyDwight

A dashboard depicting the great battle of Dwight Schrute and the Computer | The Office

bslib data-visualization imola r rshiny rshiny-application sass server shiny-apps ui-components

Last synced: 01 Apr 2025

https://github.com/ax-va/interactive-data-visualization-with-d3-dale-2023

These examples on Interactive Data Visualization with D3.js in the web browser are compiled with some modifications from the book "Data Visualization with Python and JavaScript: Scrape, Clean, Explore, and Transform Your Data", Second Edition, written by Kyran Dale and published by O'Reilly Media in 2023.

ax-va d3 data-science data-visualization dataviz frontend javascript web

Last synced: 12 Jun 2025

https://github.com/yusuf-abol/alumni-interaction-and-conversation-dynamics-nlp

This Natural Language Processing (NLP) project took a dive into chat engagement dynamics within the University of Ilorin’s Class of 2018 Statistics alumni group. By applying Latent Dirichlet Allocation (LDA) for topic modeling and network analysis, I uncovered communication patterns, topic distributions, and member interactions.

alumni-network anonymization conversation data-science data-visualization engagement machine-learning network-analysis nlp python-3 sentiment-analysis statistics whatsapp

Last synced: 05 May 2026

https://github.com/kurosawaxyz/covid4eu-sorbonne

Economy: “Analysis of Labor Market decisions of men and women during the COVID-19 pandemic in the 4EU+ countries”.

covid-19 data-analysis data-science data-visualization pandas

Last synced: 04 Jul 2025

https://github.com/rani-sikdar/pwc-virtual-internship-powerbi

Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.

business-analytics business-intelligence data-analysis data-cleaning data-visualization interactive interactive-visualizations powerbi

Last synced: 07 Jan 2026

https://github.com/gracysapra/pandas-numpy-data-visualisation

This repository contains essential Python scripts and notebooks for data analysis and visualization. It includes pandas (data manipulation and analysis, including operations on Series and DataFrames), NumPy (efficient numerical computations and array processing), and data visualization (creating insightful visualizations using Matplotlib and Seaborn).

data-science data-visualization matplotlib numpy numpy-arrays pandas pandas-dataframe pandas-series seaborn

Last synced: 07 May 2026

https://github.com/whitehathackerpr/data-visualization-tool

This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations

data data-analysis data-visualization python python3

Last synced: 05 Sep 2025

https://github.com/isatyamks/chatsense

This is my first machine learning model, designed to predict the mood and behavior of users by analyzing their WhatsApp chat archives.

analysis artificial-intelligence behavior data-visualization machine-learning matplotlib pandas prediction seaborn vercel-deployment whatsapp-chat wordcloud

Last synced: 07 Jan 2026

https://github.com/ddihora1604/iitk_esg

Researching and analyzing key ESG (Environmental, Social, Governance) metrics and their impact on stock performance and market behavior. Leveraging AI techniques (like machine learning and NLP) in finance to extract insights from ESG disclosures, enhancing financial predictions and sustainable investment strategies.

data-analysis data-visualization esg python yahoo-finance

Last synced: 24 Apr 2025

https://github.com/vincent-tran-94/dataviz_tweets_chatgpt

A Streamlit application for analyzing and visualizing data and tweets about the release of ChatGPT. This project covers data management, sentiment analysis, emerging trends, and potential applications of ChatGPT.

data-management data-visualization python streamlit text-mining twitter

Last synced: 08 May 2026

https://github.com/gappeah/nike_web_crawler

This project involves web scraping Nike's product pages to extract product names, prices and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.

beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup

Last synced: 04 Jul 2025

https://github.com/open-data-plan/ava-react

React Component wrapper for AVA

auto-chart ava chart data-visualization g2plot react

Last synced: 10 Jun 2025

https://github.com/aravindnathan02/sales-and-customer-analytics

This is a repository for a sales and customer performance Tableau dashboard.

customer-dashboard dashboard data-analysis data-visualization sales-analysis sales-dashboard tableau

Last synced: 08 Jan 2026

https://github.com/owengombas/r_place

🖼 Data analysis on r/place 2022

data-science data-visualization rplace

Last synced: 18 Jan 2026

https://github.com/shantoroy/data-visualization-streamlit-app

Simple CSV data analytics platform built using Python and Streamlit

data-analytics data-visualization machine-learning python streamlit

Last synced: 13 Apr 2026

https://github.com/shiweihe0713/data-science-for-business-techincal

TECH-GB_2336 will teach you how to think about data-based problems in the business world through the lens of data analytics. We will focus on data-analytic thinking, how to approach problems, how to develop insights using data, how to apply machine learning and other analytic techniques...

data-science data-visualization nyu pandas python stern

Last synced: 09 May 2026

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/gracysapra/heart-disease-prediction-using-logistic-regression

This project uses Logistic Regression to predict the likelihood of heart disease based on medical attributes such as age, cholesterol levels, and blood pressure. It includes model training, evaluation, and an interactive Gradio interface for real-time heart disease risk prediction.

classification data-preprocessing data-science data-visualization gradio-interface heart-disease-prediction logistic-regression machine-learning

Last synced: 11 Jun 2026

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 16 Mar 2025

https://github.com/dhrupad17/ibm-data-analyst-professional-certificate

Prepare for a career as a data analyst. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.

assignment-solutions coursera data-analytics data-science data-visualization excel ibm pandas professional-certificate professional-certificates python quiz updated-2024

Last synced: 13 Apr 2026

https://github.com/archanakokate/movielens-case-study-eda-prediction-

Exploratory data analysis on MovieLens data files and model building using Decision Tree Classifier, Random Forest Classifier, and XGBoost.

data-visualization dataengineering exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/emilhvitfeldt/ggtetris

Create Tetris Chart Visualizations in R

data-visualization datavisualization dataviz ggplot2 r rstats

Last synced: 05 Apr 2025

https://github.com/pedroscaff/sensor-platform-data-analysis

Data analysis and visualization for data collected with Sensor Platform

data-visualization heremaps maps

Last synced: 07 Oct 2025

https://github.com/deliprofesor/ridge-regression-for-sales-prediction-model-evaluation-and-hyperparameter-tuning

This project builds and optimizes a model on a dataset using Ridge regression and polynomial features. Model accuracy is enhanced through regularization and polynomial transformations. Grid search and cross-validation are used to find the best parameters, and the model's performance is evaluated.

cross-validation data-science data-visualization grid-search machine-learning model-optimization mse overfitting-prevention polynomial-regression python r2-score regression-analysis regularization ridge-regression rmse scikit-learn

Last synced: 03 May 2026

https://github.com/czheluo/fst-manhattan

Fst Manhattan plot

data-visualization fst

Last synced: 14 Apr 2026

https://github.com/martialhimanshu/motion-detector-camera

An application that detects the motion of any object using any camera device and analyzes the output data using a plot and a CSV file

bokehplots data-visualization dataframe-library motion-detection opencv2 python

Last synced: 09 Oct 2025

https://github.com/kamanhang/sqldatawarehousedataengineeringproject

This project delivers a modern data warehouse, focusing on building a clean, organized data pipeline that covers important aspects such as ETL pipeline development, data cleaning, data modelling, and data analytics.

customer-analytics data-analysis data-cleaning data-engineering data-modeling data-pipeline data-visualization datascience etl-pipeline postgresql powerbi powerbidashboard sales-analysis sql

Last synced: 10 Oct 2025

https://github.com/alifeee/co2-stacked

A visualisation of CO2 levels as a vertically stacked graph, with days going upwards. Using @tomhazledine's <stacked-sparklines> web component.

co2-monitoring data-visualization web-component

Last synced: 19 Jan 2026

https://github.com/listiangr/ecommerce_sales_data_analysis

This project analyzes e-commerce sales data to help businesses understand sales trends, product performance, and customer segments. Its main goal is to provide insights that can improve marketing strategy and product management.

dashboard data-analysis data-cleaning data-collection data-penjualan data-visualization exploratory-data-analysis microsoft-excel

Last synced: 19 Jan 2026

https://github.com/faizantkhan/regression-project-bangalore-property-price-prediction

🏠 Bangalore Property Price Prediction is a comprehensive project designed to accurately predict property prices in Bangalore. Leveraging advanced regression techniques and a dataset sourced from Kaggle, the model undergoes meticulous feature engineering, data cleaning, and parameter tuning to ensure high accuracy.

backend-api css data-cleaning data-science data-visualization eda flask html javascript machine-learning-algorithms numpy pandas project project-repository property python regression-models server

Last synced: 14 Apr 2026

https://github.com/ryanfranklin237/data-visualization-spreadsheets

Data visualization done with Microsoft Excel and Google spreadsheets

data-analysis data-science data-visualization google-spreadsheets microsoft-excel

Last synced: 22 Feb 2026

https://github.com/clever-boy/productclassification

Comprehensive product analysis and recommendation system with JSON data processing, visual analytics, and machine learning.

data-visualization json-processing machine-learning product-analysis python recommendation-system

Last synced: 14 Apr 2026

https://github.com/vineet416/chronic-kidney-disease-prediction

This repository contains the code for the Chronic Kidney Disease Detection Prediction Project. The goal of this project is to predict chronic kidney disease using parameters like diabetes mellitus, blood urea, sugar, and hypertension. I used multiple machine learning algorithms with hyperparameter tuning, reaching a highest accuracy score of 97.5.

data-visualization data-wrangling exploratory-data-analysis feature-engineering feature-selection hyperparameter-tuning machine-learning matplotlib numpy pandas plotly pre-processing python seaborn sklearn-library statsmodels

Last synced: 14 Apr 2026

https://github.com/tuni56/matplotlib_introduction

Want to bring trigonometry to life? With Matplotlib, you can easily plot the sine and cosine functions on the same graph, creating an intuitive visualization of their periodic nature.

data-science data-visualization matplotlib python trigonometry-visualisation

Last synced: 16 Oct 2025