An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/melogabriel/nubank-expenses-analysis

This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.

data-analysis data-visualization googlesheets lookerstudio pandas python

Last synced: 02 May 2026

https://github.com/stephanabitter534/flowy.blazor

🌐 Build responsive web applications effortlessly with VIOVNL.Flowy.Blazor, a Blazor component for smooth drag-and-drop functionality.

blazor component-library csharp data-visualization dotnet event-driven flowchart frontend interactive-graphics open-source single-page-application ui-framework user-interface web-application web-development

Last synced: 01 May 2026

https://github.com/mohd-faizy/learn_matplotlib

Matplotlib is a powerful plotting library in Python that allows you to create static, animated, and interactive visualizations. It’s widely used for representing data graphically, making it easier to analyze and understand.

data-visualization matplotlib matplotlib-heatmap matplotlib-pyplot matplotlib-python matplotlib-styles matplotlib-tutorial plots-graphs plots-in-python

Last synced: 26 Feb 2026

https://github.com/msikorski93/visualizing-lastest-usgs-earthquakes

This notebook contains an introduction to the use of Python and cartopy to visualize data concerning earthquakes. We will first read a file with earthquake locations (latitudes, and longitudes), magnitudes in Richter scale, and depths, and other descriptors and then overlay it on a worldwide map.

cartopy data-visualization folium map

Last synced: 09 Jun 2026

https://github.com/gerhynes/d3-data-dashboard

A D3 data dashboard showing CO2 emissions per country. Built for The Advanced Web Developer Bootcamp.

d3 data-visualization jaavscript

Last synced: 02 May 2026

https://github.com/msthamizh/phonepe-pulse-data-visualization-and-exploration

Developing a Streamlit application that allows users to explore and analyze transaction data from the PhonePe Pulse dataset. The project aims to provide insights into digital payment trends across India.

data-analysis data-visualization dataframe mysql pandas plotly python streamlit

Last synced: 02 May 2026

https://github.com/code-taweezy/thermos-ai

An AI-powered thermal optimization and anomaly detection system for energy-efficient data centers. Note: this is a MVP

ai case-study data-visualization lovable-ai machine-learning matpplotlib mvp numpy pandas project python sckiit-learn seaborn solution tensorflow

Last synced: 02 May 2026

https://github.com/fybex/chatgpt-conversations-analysis

Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.

chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis

Last synced: 02 May 2026

https://github.com/amauryval/gdf2bokeh

Gdf2Bokeh, a library to map quickly your geographic data with the bokeh library

bokeh data-visualization geometry geopandas gis-development map mapping pandas python shapely

Last synced: 10 Feb 2026

https://github.com/sreejabethu/sales-data-analysis-forecasting

Welcome to the Sales Data Analysis & Forecasting project! 🚀 This repository showcases my data analysis skills through exploratory data analysis (EDA), data cleaning, and visualization of sales and customer feedback data. The goal is to extract actionable insights to drive business decisions.

analysis barchart data-visualization datacleaning exploratory-data-analysis forecasting histogram matplotlib-pyplot numpy-library pandas-library pycharm-ide sales-analysis salesdata salesdataanalysis seaborn-plots transformation

Last synced: 19 May 2026

https://github.com/sandravizz/global-inequality-data

This repository includes the data wrangling about the Global Inequality dataviz project using vanilla js and the library arquero.js

arquero d3js data-visualization data-wrangling javascript world-inequality-database

Last synced: 10 Feb 2026

https://github.com/jfraziz/jabar.pages.dev

Improving Market Business Strategy and Poverty Alleviation Through Granular Socio-Economic Mapping: Multisource Remote Sensing and Other Geospatial Big Data Approach​

clustering data-visualization gis remote-sensing research-project socio-economic

Last synced: 27 Jul 2025

https://github.com/prem-ium/tax-merge

Consolidate multiple 1099 brokerage tax forms, visualize realized gains/losses, and track data for Fidelity, Chase, Vanguard, Schwab, and others!

1099 accounting chase data-science data-visualization data-wrangling desktop-application fidelity financial-analysis firstrade graphical-user-interface schwab tastytrade tax tax-forms taxes vanguard

Last synced: 02 Feb 2026

https://github.com/zoliqua/venn-diagram-lab

🟢 🟤 Interactive Venn diagram viewer & editor — 44 SVG models (2–9 sets), pre-computed region paths, React + TypeScript + Vite

adelaide bioinformatics data-visualization edwards-venn euler hamilton interactive manawatu massey palmerton-north python-package react set-theory svg typescript upsetplot venn venn-diagram venndiagram victoria

Last synced: 05 May 2026

https://github.com/baggiponte/ta-statistics-for-big-data-2022

🎓 Introduction to Python and Machine Learning [UniMi • AY 2021/2022]

clustering data-science data-visualization machine-learning python scikit-learn

Last synced: 03 May 2026

https://github.com/rohithay/customer-segmentation

Clean and Apply RFM technique to rank and group clusters to identify the best customers and perform targeted marketing campaigns, using real online transaction data

data-cleaning data-science data-visualization datetime marketing-analytics pandas python3 user-segmentation

Last synced: 03 May 2026

https://github.com/noturlee/titanic-datamodel

This project demonstrates the process of data preprocessing, model training, evaluation, and tuning in building a predictive model for a classic dataset. The Random Forest model, with its ability to handle complex relationships and interactions between features, proved to be the most effective in this case.

data-modeling data-science data-visualization python

Last synced: 08 Apr 2025

https://github.com/noturlee/imdb-dataanalysis

A data model that predicts the IMDb rating of a movie based on features like genre, director, and actors. Using regression techniques to tackle this problem.

data-analysis data-cleaning data-modeling data-science data-visualization

Last synced: 08 Apr 2025

https://github.com/tushard48/product-cluster-analysis

This project performs clustering analysis on a product dataset to identify and group similar products. The analysis includes data preprocessing, application of various clustering algorithms, and visualization of results to gain insights into product patterns. Key techniques used are K-Means, Mini Batch K-Means, evaluated using metr

data-visualization excel machine-learning powerbi streamlit unsupervised-learning

Last synced: 03 May 2026

https://github.com/wizardoftrap/football-team-analytics

This Jupyter notebook, created on Kaggle, analyzes football player and team statistics for the 2024-2025 season. It provides insights into player performance, team metrics, and playing styles across major European leagues using data from the dataset players_data-2024_2025.csv.

data-analysis data-visualization jupyter-notebook pandas python

Last synced: 05 May 2026

https://github.com/flazefy2/ds-customer_shopping

https://www.kaggle.com/datasets/bhadramohit/customer-shopping-latest-trends-dataset

data-science data-visualization jupiter-notebook numpy python sales shopping squarify statistics

Last synced: 05 May 2026

https://github.com/kiranmayi5/python-projects

A collection of Python projects showcasing skills in data analysis and visualization.

data-analysis data-visualization machine-learning nlp python

Last synced: 05 May 2026

https://github.com/saeedkohansal/chart.js-tutorial-with-examples

Chart.js is a lightweight, open-source JavaScript library for creating stunning and interactive charts using HTML5 Canvas. It supports various chart types like bar, line, and pie charts, is easy to use, and offers customization options to suit any data visualization needs. Perfect for modern web applications!

barchart canvas chart chart-js chartjs charts css data-visualization datavisualization gilgeekify html html-chart html5 javascript linechart piechart programming scatterchart web-development webdevelopment

Last synced: 05 May 2026

https://github.com/ascender1729/urban-soundscape-harmonizer

Real-time noise level monitoring and analysis for major Indian cities using IoT simulation and data visualization.

data-visualization iot-simulation noise-pollution python react-fastapi urban-planning

Last synced: 27 Jul 2025

https://github.com/md-emon-hasan/ml-projects-telcom-customer-churn-prediction

📱 Customers are likely to leave a telecom service, enabling companies to take measures for retention and create accurate churn prediction models.

boostrap5 customer-churn customer-segmentation data-engineering data-science data-visualization logestic-regression machine-learning telco-customer-churn-prediction telcom-churn

Last synced: 05 May 2026

https://github.com/msohaill/wrappedify

Full stack implementation of Wrappedify

data-visualization express music socket-io spotify sveltekit

Last synced: 05 May 2026

https://github.com/shoyeb45/avltreevisualizer

This is data structure project. In this project I've built AVL tree visualisation application using javafx library of java. This project have solidified my understanding of AVL tree data structure and learnt a lot from this project

data-structures data-visualization java javafx javafx-application visualization

Last synced: 27 Feb 2026

https://github.com/romerorodriguezd/housing-market-tracker

Jupyter Notebook to store and evaluate long-run evolution of house sales, based on Idealista website.

data-visualization matplotlib pandas python scraping scraping-python sqlite3

Last synced: 06 May 2026

https://github.com/drcbeatz/machine-learning-tool

Machine Learning Tool - Train and test supervised ML algorithms (incl. binary classification and regression) on custom data sets and visualize your results without knowing how to code.

data-science data-visualization django machine-learning python scikit-learn

Last synced: 06 May 2026

https://github.com/kiranmayi5/tableau-projects

A collection of interactive Tableau dashboards showcasing data visualization and storytelling skills.

business-intelligence data-storytelling data-visualization interactive-visualizations tableau

Last synced: 27 Feb 2026

https://github.com/scarblase/homeless-animals-analysis

A data-driven exploration of homeless animal statistics 🐶🐱. Analyze age distribution, shelter dynamics, and adoption patterns using Python, Pandas, and Seaborn.

animals data-analysis data-mining data-science data-science-projects data-visualization matplotlib matplotlib-pyplot numpy pandas plotly python python3 ukraine

Last synced: 06 May 2026

https://github.com/edseldim/FirstRoundElectionsFr

A data visualization spreadsheet on Excel

data-analysis data-visualization excel pandas python

Last synced: 02 Aug 2025

https://github.com/GZ430/global-christianity-dataviz-jp

A web app built in R Shiny for users to explore global Christianity data from Joshua Project, World Watch List, and others.

christianity data-visualization r shiny

Last synced: 10 Mar 2025

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 06 May 2026

https://github.com/amirhosseinhonardoust/customer-sentiment-intelligence-platform

An enterprise-grade NLP + Streamlit + SQL platform for analyzing customer feedback. Performs automated sentiment detection, stores labeled reviews in SQLite, and delivers real-time dashboards with probability insights to support business, marketing, and product optimization decisions.

community-project cost-of-living dashboard data-analysis data-visualization economic-analysis inflation-tracking local-data open-data pandas price-tracker public-insight python sqlite streamlit

Last synced: 06 May 2026

https://github.com/fatihilhan42/eda-on-data-science-salary-with-python

You can access the files of this project, which analyzes people working in the field of data science according to countries and working wages.

analysis data-science data-scientists data-visualization jupyter-notebook pyhton salary

Last synced: 23 Mar 2025

https://github.com/shazeus/vizflow-cli

Data visualization pipeline tool for schema inspection, charts, dashboards, and export

charts cli dashboard data-visualization flask pandas plotly python

Last synced: 09 Jun 2026

https://github.com/namratha2301/best-selling-books

Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance.This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.

data-analysis data-visualization matplotlib pandas sckiit-learn seaborn

Last synced: 06 May 2026

https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost

Titanic Survival Prediction Project (93% Accuracy)🛳️ In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model & Hyperparameter tunning.

classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost

Last synced: 06 May 2026

https://github.com/treyhamilton/stat-project-1

A compilation of various programming concepts written in R covering the topics listed below

data-visualization exploratory-data-analysis regression-models

Last synced: 28 Jun 2025

https://github.com/basedrhys/global-stock-vis

Visualisation of financial stock data using CesiumJS

cesiumjs data-visualization nodejs python

Last synced: 06 May 2026

https://github.com/flexmonster/svelte-flexmonster

Svelte wrapper for Flexmonster Pivot Table & Charts

data-analysis data-visualization frontend pivot-tables svelte sveltekit

Last synced: 27 Feb 2026

https://github.com/shuklayash02/sales_dashboard_powerbi

Created interactive dashboard to track and analyze online sales data Used complex parameters to drill down in worksheet and customization using filters and slicers

data-visualization datacleaning excel powerbi

Last synced: 19 Mar 2026

https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis

Compltete AnalysisData Cleaning,processing and data analysis with interactive dashboard

analysis data data-visualization datacleaning excel excel-vba

Last synced: 19 Mar 2026

https://github.com/scarblase/portfolioprojects

A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊

csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql

Last synced: 06 May 2026

https://github.com/bladealex9848/presion_arterial

Aplicación para el seguimiento de la presión arterial, permitiendo el registro y visualización de mediciones de presión sistólica y diastólica.

blood-pressure-monitor chronic-disease-management data-visualization e-health e-healthcare healthcare-application medical-data-analysis patient-management python sqlite streamlit

Last synced: 06 May 2026

https://github.com/gitjeff05/nih-grant-awards

Analysis of NIH grant awards from ExPORTER data files

data-visualization jupyter-notebook jupyterlab matplotlib pandas python3 seaborn

Last synced: 06 May 2026

https://github.com/himanchalchandra/science-canvas

Repo containing projects I did during a four months bootcamp on Data Science and Machine Learning organized by Science Canvas India.

data-mining data-science data-visualization machine-learning-algorithms mysql nlp-machine-learning

Last synced: 06 May 2026

https://github.com/mxagar/statistics_with_python_coursera

My personal notes done while following the Coursera Specialization "Statistics with Python", from the University of Michingan, hosted by Dr. Brenda Gunderson.

data-modeling data-science data-visualization hypothesis-testing machine-learning pandas python statistics

Last synced: 06 May 2026

https://github.com/allanotieno254/data-analytics-with-tableau

Repository showcasing projects and insights generated through Tableau. Contains visualizations, dashboards, and analytical reports on various datasets,

analytics-intelligence business-intelligence dashboards-tableau data-analytics data-storytelling data-visualization tableau

Last synced: 19 Mar 2026

https://github.com/komed3/moneylens

MoneyLens helps you see money in context. Convert any amount of money into tangible comparisons — from gold weight and coin stacks to purchasing power and global wealth rankings.

data-visualization economics finance money visualization

Last synced: 12 Feb 2026

https://github.com/vietdoo/real-estate-marketplace

a webapp that can visualize homes for sale on a cluster map. Data is continuously fetched from MongoDB, build filtering functions and APIs to find homes.

api bigdata data-visualization flask maps mongodb reactjs webscraping

Last synced: 07 May 2026

https://github.com/ssreeramj/binod-detector

Scrapes comments of a youtube video and shows distribution of comments having 'binod' in it

data-visualization heroku python streamlit youtube-api

Last synced: 16 May 2026

https://github.com/amirhosseinhonardoust/ai-personal-study-tracker

An AI-driven productivity tracking app built with Python, Streamlit, SQLite, and Machine Learning. It logs and analyzes study sessions, predicts productivity using Random Forest models, and visualizes key insights to help learners improve focus, habits, and overall academic efficiency.

ai data-analytics data-visualization education learning-analytics machine-learning productivity python random-forest self-improvement sqlite streamlit student-success study-tracker time-management

Last synced: 07 May 2026

https://github.com/sivas-2/coffee-sales-visualization

This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.

data data-analysis data-science data-visualization python visualization

Last synced: 07 May 2026

https://github.com/rembertdesigns/smart-vinyl-catalog

AI-powered vinyl cataloging and music discovery platform leveraging BigQuery’s generative AI. Processes mixed-format data to deliver personalized recommendations, collection analytics, and intelligent search. Created for the Kaggle BigQuery AI Challenge to showcase real-world, scalable AI solutions for music lovers.

ai bigquery data-science data-visualization generative-ai hackathon kaggle kaggle-competition machine-learning music-analytics music-recommendation-algorithm python recommender-system vinyl

Last synced: 07 May 2026

https://github.com/tushard48/analyzing-usa-market-trends-a-financial-overview

In-depth analysis of US market trends, encompassing economic indicators, industry performance, and financial data

data data-visualization powerbi

Last synced: 19 Mar 2026

https://github.com/willjw3/virus-spy

A simple, easy-to-follow Covid-19 tracker Jamstack app.

data-visualization jamstack javascript react

Last synced: 07 May 2026

https://github.com/gutyoh/netflix-dashboard

Dashboard created with Python 🐍 and the Dash 📊 framework to visualize Netflix’s content distribution and classification.

dash dashboard data-visualization plotly python

Last synced: 08 Apr 2025

https://github.com/alejo1630/whatsapp_chat_analysis

A Jupyter Notebook with the analysis for a Whatsapp Chat using several techniques of data wrangling, EDA and Sentiment Analysis

data-visualization data-wrangling emojis exploratory-data-analysis jupyter-notebook sentiment-analysis whatsapp-chat

Last synced: 28 Feb 2026

https://github.com/ismail-mouyahada/lodscroljs-library

LodScrolJS Documentation LodScrolJS is a lightweight, fast, and secure JavaScript library designed to load any type of content from APIs on scroll, helping to avoid loading too much data at once. It works seamlessly with various JavaScript frameworks

data data-visualization load-on-scroll loading loading-spinner loadonscroll scroll

Last synced: 13 Feb 2026

https://github.com/tsu2000/bondscope_sg

Web app to visualise and predict the yields of different types of Singapore T-Bills and SGS Bonds.

dashboard data-science data-visualization finance r-shiny singapore time-series-analysis treasury-bonds

Last synced: 11 Aug 2025

https://github.com/backdoorali/insider-threat-detection-project

Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.

cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence

Last synced: 07 May 2026

https://github.com/garcane/london-housing-price-dashboard

This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.

data data-analysis data-visualization excel visual

Last synced: 13 Feb 2026

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 07 May 2026

https://github.com/owengombas/r-place

🖼 Data analysis on r/place 2022

data-science data-visualization rplace

Last synced: 13 Feb 2026

https://github.com/thevinh-ha-1710/diabetes-predictive-model

This project aims to train a predictive model to diagnose diabetes on women patients.

data-analysis data-science data-visualization model-training-and-evaluation python

Last synced: 13 Feb 2026

https://github.com/zayarhtet/visualising-election-with-data

Visualisation of Myanmar Election 2020 with past data

data-mining data-science data-visualization

Last synced: 13 Feb 2026

https://github.com/vishvamporwal/pharmassist

A progressive web app made with Flask for Industrial pharmaceutical management and Analysis, to improve efficiency and make management easier.

data-visualization flask html-css-javascript python

Last synced: 07 May 2026

https://github.com/y-india/retail-sales-analysis-project

Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.

ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit

Last synced: 07 May 2026

https://github.com/divyanshu-rawat/data-visualization-highmaps

Built Using Highcharts JavaScript API to Visualize Data !:mortar_board:

bootstrap data-visualization geolocation-api highcharts javascript jquery

Last synced: 07 May 2026

https://github.com/1ayanabil1/iris-visualization

This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.

analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python

Last synced: 07 May 2026

https://github.com/mustika-putri-m/excel-data-storytelling-pt-tumbuh-bersama-growian

PT. Tumbuh Bersama merupakan salah satu perusahaan di bidang retail yang menjual banyak sekali perlengkapan rumahan ke berbagai negara. Beberapa minggu kedepan perusahaan ini ingin mengadakan town hall untuk menginformasikan performance bisnis yang mereka miliki. Sebagai bagian dari team data, kalian diminta untuk melakukan analisis secara bebas

data-visualization excel vizualisation

Last synced: 19 Mar 2026

https://github.com/oldhero5/talent_track

TalentTrack is an open‐source recruitment analytics web application built with Flask and Python. It leverages advanced machine learning techniques—such as Product Quantization (PQ) for candidate ranking and SHAP for model interpretability—to help HR teams and recruitment professionals identify high-quality candidates efficiently.

active-learning analytics candidate-ranking data-visualization faiss flask hrtech machine-learning open-source python recruitment shap talent-analytics

Last synced: 07 May 2026

https://github.com/amanpatel-1234/automatic_data_analyzer_app

📊 Automatic Data Analyzer is a Streamlit web app that performs complete data analysis with just a single file upload. Simply upload your CSV file, and the app automatically generates: Data Overview: Preview, summary statistics, and data types. Visual Analysis: Histograms, distribution plots, and interactive charts for numeric columns.

app data-science data-visualization matplotlib-pyplot numpy pandas python3 seaborn seaborn-plots streamlit

Last synced: 09 Apr 2026

https://github.com/garcane/beverage-sales-analytics

This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.

data data-visualization powerbi powerbi-report powerbi-visuals

Last synced: 19 Mar 2026

https://github.com/garcane/insight-ventures-inc-analysis

This report provides a comprehensive analysis of Insight Ventures Inc.'s financial performance based on data from 2013 and 2014. It includes insights into sales performance, profitability, and market trends across North America and Europe.

data-visualization excel excel-dashboard excel-report financial-analysis

Last synced: 19 Mar 2026

https://github.com/pm25/youbike-station-finder

🚲 Display and visualize real-time information for Taipei YouBikes.

css3 d3js data-visualization html5 javascript visualization website youbike

Last synced: 08 May 2026

https://github.com/huonglarne/self-study-d3

Self studying Data Visualization with d3.js

dashboard data-visualization

Last synced: 13 Feb 2026

https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo

This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.

crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web

Last synced: 08 May 2026

https://github.com/manhtdxxx/batch-and-stream-pipeline-via-lakehouse

This project demonstrates a modern Lakehouse architecture supporting both streaming and batch data pipelines, built on Apache Iceberg tables.

airflow batch-processing data-engineering data-visualization docker elt-pipeline hive-metastore iceberg kafka lakehouse medallion-architecture spark stream-processing superset trino

Last synced: 08 May 2026

https://github.com/darkmenacer/google-geolocation-bias

Tool to scrape web links for different queries from different cities across the world, all from your home

data-visualization postgresql python3 selenium webscraping

Last synced: 08 May 2026

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 08 May 2026