An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/thinkong/notetaiker

notetAIker is a local-first, AI-powered Markdown note-taking system that uses background agents to automatically tag, link, and organize atomic notes. It features semantic search, a 2D graph visualization for exploring connections, and supports both cloud and local AI models.

ai data-visualization knowledge-graph knowledge-management local-first notetaking self-hosted

Last synced: 18 Feb 2026

https://github.com/viseshrp/community_health_indicator

Android app to fetch,organize and represent NYC health data

android data-analysis data-visualization health

Last synced: 03 Mar 2025

https://github.com/mastersign/mastersign-datascience

High level helpers for data science in Python with Pandas.

data-science data-visualization database-access pandas python

Last synced: 05 May 2026

https://github.com/uba/cmap2png

Useful script to convert a color map definition to PNG image file.

colormap converter cpt data-science data-visualization matplotlib plot png python

Last synced: 17 Sep 2025

https://github.com/gui-sitton/churn-finalproject

predict its customers' churn. If it is discovered that a user is planning to switch operator, the company will offer them promotional codes and special plan options.

churn-prediction data-analysis-python data-science data-visualization

Last synced: 26 Jul 2025

https://github.com/jfraziz/jabar.pages.dev

Improving Market Business Strategy and Poverty Alleviation Through Granular Socio-Economic Mapping: Multisource Remote Sensing and Other Geospatial Big Data Approach​

clustering data-visualization gis remote-sensing research-project socio-economic

Last synced: 27 Jul 2025

https://github.com/ayushverma135/accenture-data-analytics-and-visualization

This program provided practical experience in advising a hypothetical social media client as a Data Analyst at Accenture. The simulation involved cleaning, modeling, and analyzing multiple datasets, culminating in the creation of a PowerPoint deck and video presentation to communicate key insights.

accenture analytics data data-visualization forage presentation

Last synced: 19 Sep 2025

https://github.com/sanmeet007/pandas_prototype

A package that with implementation of famous Pandas library in my way , allows you to visualize data using data tables

data-science data-visualization pandas-prototype prototyping python

Last synced: 19 Sep 2025

https://github.com/leandrocollares/with-or-without-curry-playoffs

An interactive visualization that allows users to learn when Curry's performance helped Golden State Warriors win games in the 2019 NBA playoffs

d3 data-visualization lodash node-sass react react-annotation react-popper

Last synced: 03 Jan 2026

https://github.com/bzbradford/wibee-dashboard

Data visualization app for WiBee pollinator surveys.

data-visualization pollinators shiny

Last synced: 30 Jul 2025

https://github.com/hetuvpatel/ml-diabetes-risk-progression-stage

Machine learning project analyzing diabetes risk progression using K-Means and Hierarchical clustering techniques on the Pima Indian Diabetes dataset. 🧠📊

cluster-analysis data-visualization heirarchical-clustering kmap kmeans machine-learning matplotlib sckit-learn seaborn

Last synced: 23 Sep 2025

https://github.com/qtle3/logistic-regression

A Python implementation of Logistic Regression to classify social network ads based on age and estimated salary, featuring data visualization and performance metrics such as confusion matrix and accuracy score.

data-visualization feature-scaling logistic-regression logistic-regression-algorithm model-evaluation

Last synced: 01 Aug 2025

https://github.com/athul64/exploratory-data-analysis

To preprocess and analyze the given employee dataset, present the findings graphically, and derive meaningful insights to help better understand the company’s workforce.

colab-notebook data-analysis data-visualization matplotlib numpy pandas python seaborn statistical-analysis

Last synced: 25 Feb 2026

https://github.com/jahnavigupta06/zepto-delivery-customer-analytics

Real-time SQL + Power BI Analytics Project replicating Zepto's customer & delivery insights.

business-intelligence churn-analysis customer-segmentation data-analysis data-visualization powerbi sql-server

Last synced: 02 Aug 2025

https://github.com/vatshayan/hiring

Hiring for Data Science, Cryptography, Machine Learning and Artificial Intelligence.

cryptography data-science data-visualization datasets hiring internship internship-task machine-learning machine-learning-algorithms

Last synced: 09 Mar 2026

https://github.com/t-mohamed-shafeek/data-analysis-on-tamil-nadu-road-accidents

The "Data Analysis on Tamil Nadu Road Accidents" is a project deals with analysis of data on Road Accidents encountered by Tamil Nadu ( one of the states of India ) in the year of 2020 and 2021. But the dataset is most recently created (created on February 15, 2023 with source form TN Police).

dashboard data-analysis data-science data-visualization jupyter-notebook tableau

Last synced: 03 Aug 2025

https://github.com/xilinjia/csv-analyser

Visualize and analyze data in csv files. A multiplatform project runnable on Linux, Android, Windows, MacOS

android compose csv data-science data-visualization kotlin kotlin-multiplatform linux macos multiplatform windows

Last synced: 09 Apr 2026

https://github.com/tom-draper/call-graph-viz

A tool to visualise Python function calls and code complexity.

call-graph call-graphs data-visualisation data-visualization function-calls graph python vizjs

Last synced: 09 Mar 2026

https://github.com/simranjeet97/ipl-dataanalysis

Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.

artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python

Last synced: 08 May 2026

https://github.com/leocornus/leocornus-visualdata

JavaScript libraries to make data visualization simpler and easier.

data-analysis data-mining data-visualization data-visualization-simpler javascript-library

Last synced: 10 Aug 2025

https://github.com/judahpaul16/plume

Map how user information flows through any codebase or infrastructure into a readable graphic

charts cli data-flow data-visualization documentation flow observability pii reports tooling

Last synced: 25 Jun 2026

https://github.com/ascender1729/urban-soundscape-harmonizer

Real-time noise level monitoring and analysis for major Indian cities using IoT simulation and data visualization.

data-visualization iot-simulation noise-pollution python react-fastapi urban-planning

Last synced: 27 Jul 2025

https://github.com/centrefordigitalhumanities/digital-atlas

Interface to visualise connections between postcolonial intellectuals

data-visualization postcolonialism

Last synced: 20 Apr 2026

https://github.com/charliecm/meteorite-landings

Data visualization of meteorite landings on Earth.

astronomy d3 data data-visualization mapbox space visualization

Last synced: 18 Apr 2026

https://github.com/myself-aas/quantium_data_analytics_forage

This project analyzes retail customer chip purchasing behavior using Python, focusing on customer segmentation and key spending drivers to provide data-driven insights for strategic category management recommendations.

data-analysis data-engineering data-science data-visualization feature-engineering forage internship-project matplotlib-pyplot numpy-library pandas-dataframe pearson-correlation python quantium-virtual-experience scipy-stats seaborn

Last synced: 31 Jul 2025

https://github.com/soumyaco/hotel-price-data-analysis

Data analysis on Indian hotels price. A beginners guide to data analysis.

data-analysis-python data-science data-visualization matplotlib seaborn-plots

Last synced: 01 Aug 2025

https://github.com/shubhamgoyal575/diwali-sankranti-promotion-sales

This Power BI dashboard analyzes sales performance during Diwali and Sankranti festivals. It provides insights into revenue trends, top-selling products, regional sales distribution, and customer purchasing behavior to help optimize festive season sales strategies. 🚀

buisness-intelligence dashboard data-analysis data-visualization diwali-sankranti-sales-analysis excel fast-moving-consumers-goods fmcg microsoft-power-bi mysql power-query powerbi revenue-insights sales-dashboard sales-insights sql

Last synced: 02 Mar 2026

https://github.com/shriram-vibhute/data-analysis

This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.

data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn

Last synced: 02 Aug 2025

https://github.com/sayamalt/fraudulent-transactions-prediction

Successfully trained a machine learning model which can predict whether a given transaction is fraud or not.

data-visualization exploratory-data-analysis imblearn machine-learning model-based-testing model-building predictive-analytics sklearn

Last synced: 29 Apr 2026

https://github.com/dianaow/singapore_stories

Exploring and visualizing datasets related to Singapore issues

d3 d3js d3v4 data-visualization maps singapore singapore-government

Last synced: 11 Aug 2025

https://github.com/amanpatel-1234/automatic_data_analyzer_app

📊 Automatic Data Analyzer is a Streamlit web app that performs complete data analysis with just a single file upload. Simply upload your CSV file, and the app automatically generates: Data Overview: Preview, summary statistics, and data types. Visual Analysis: Histograms, distribution plots, and interactive charts for numeric columns.

app data-science data-visualization matplotlib-pyplot numpy pandas python3 seaborn seaborn-plots streamlit

Last synced: 09 Apr 2026

https://github.com/virajbhutada/walmart-retail-analyzer

Gain valuable insights into retail sales with the "Walmart Retail Performance Dashboard" in MS Excel. This user-friendly tool facilitates an in-depth analysis of key sales metrics, providing a comprehensive view of Walmart's performance. Make data-driven decisions for informed and strategic business outcomes.

analytics data-analysis data-science data-visualization excel insights interactive-visualizations performance-analysis retail-sales walmart

Last synced: 04 Mar 2026

https://github.com/prathameshdhande22/data-visualization-tutorial

Repository Contains all the stuff required for Data Visualization. Numpy Tutorial, Pandas Tutorial, Matplotlib Tutorial, Seaborn Tutorial

data-visualization jupyter-notebook matplolib numpy pandas pandas-dataframe python seaborn tutorial

Last synced: 09 Apr 2026

https://github.com/stimulsoft/samples-dashboards.web-for-asp.net-angular

ASP.NET MVC and ASP.NET Core samples for Dashboards.ANGULAR data analysis tool, Visual Studio C# projects, and .NET Framework 4.5.2, 4.6, 4.7, 4.8 versions, and .NET Core 3.1, .NET 5.0, .NET 6.0, .NET 7.0, .NET 8.0 dashboard engine

analysis analytics angular angular17 charts client-server dashboard dashboards data-visualization embedded interactive kpi native net-core pivot sql tables tools typescript webapp

Last synced: 09 Apr 2026

https://github.com/fawaz-ahmed/js-infinite-median

Calculate median of a stream of numbers using heap sort (with nlogn comlpexity)

crypto data-structures data-visualization graphs infinite math median median-heap numbers pricing streaming-data

Last synced: 15 Aug 2025

https://github.com/dylan-stewart/capstone

Cloud-Based Analytics Application: CSV Check ~ Data Visualization for Inexperienced Users

automation cloud css csv data-visualization exploratory-data-analysis file-processing flask gcp gcs html javascript machine-learning python visualization web-app

Last synced: 09 Apr 2026

https://github.com/mariam-badr-mb/student-score-prediction

This project predicts students' exam scores based on study-related and demographic factors using machine learning models.

data-analysis data-visualization explore linear-regression machine-learning mean-square-error student-project supervised-learning

Last synced: 16 Aug 2025

https://github.com/shuddha2021/ai-document-analyzer

An interactive, client-side AI Document Summarizer & Analyzer built with HTML, CSS, and JavaScript. Features summarization, entity extraction, insights, file parsing (TXT, CSV, XLSX, HTML), and visualizations, all in-browser.

artificial-intelligence chartjs client-server client-side-ai css d3js data-visualization document-summarization file-parser html javascript search-functionality sheetjs text-analysis

Last synced: 19 May 2026

https://github.com/tiesen243/data-analyst

A simple data visualization with python

data-visualization jupyter-notebook

Last synced: 18 Aug 2025

https://github.com/yuvraj0412s/proactive-fraud-detection-using-machine-learning

An end-to-end machine learning project for detecting financial fraud using LightGBM, featuring in-depth EDA, advanced feature engineering, and a focus on actionable business insights.

class-imbalance classification-model data-analysis data-science data-visualization exploratory-data-analysis feature-engineering fintech fraud-detection jupyter-notebook lightgbm machine-learning pandas python scikit-learn smote

Last synced: 02 May 2026

https://github.com/marcramonmoreno/historian_app

Historian Data Tool - A secure internal web application for processing and visualizing historical data, built with React frontend and Flask backend. Features include data processing with configurable time ranges, tag configuration management, and CSV export capabilities. Access requires Tailscale VPN connection and proper permissions.

csv-export data-processing data-visualization docker docker-compose flask historian python react web-application

Last synced: 10 Apr 2026

https://github.com/prpriesler/three_industry_project

This project encompasses three distinct but inter connected projects, each focused on leveraging machine learning techniques to address critical challenges in different domains.

data-science data-visualization r rprogramming rstudio

Last synced: 20 Aug 2025

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 10 Apr 2026

https://github.com/leegeunhyeok/visual-electron

📈 Electron + Vue.js based data visualization tool

data-visualization electron vue

Last synced: 24 Aug 2025

https://github.com/aveygo/nuclearsmoke

Smoke fallout forecasting in NSW

data-visualization rust weather

Last synced: 27 Aug 2025

https://github.com/sjcobb/perses-embed-example

Perses embeddable dashboard & visualization framework example app (built with MUI + Next.js)

charting-library dashboards data-visualization echarts mui perses prometheus

Last synced: 27 Aug 2025

https://github.com/abbasi0abolfazl/commentanalyzer

machine learning project designed to analyze Instagram comments for sentiment detection, question identification, and topic modeling. Utilizing algorithms such as LDA, LSA, NMF, and BERT, CommentAnalyzer provides valuable insights into user interactions, helping brands and researchers understand audience sentiments and trends.

data-visualization instagram-data-analysis machine-learning question-detection sentiment-analysis social-media-analytics topic-modeling

Last synced: 30 Aug 2025

https://github.com/paul019/pappe

A CLI to draw your data on top of millimeter paper

automation data-visualization diagram diagram-generator python

Last synced: 05 Mar 2025

https://github.com/ipanalytics/vpn-infrastructure-atlas

Interactive VPN infrastructure atlas: aggregate provider footprint by reported endpoint country, ASN, and hosting network. No raw configs or per-IP feed.

asn-analysis cybersecurity data-visualization fraud-detection github-pages ip-reputation network-intelligence proxy-detection threat-intelligence vpn-intelligence

Last synced: 25 May 2026

https://github.com/ipanalytics/tor-radar

The project collects public Tor relay data once per hour, stores compact snapshots in the repository, and renders an interactive dashboard without a database or backend.

asn data-visualization network-intelligence onionoo osint threat-intelligence tor tor-exit-nodes tor-relays

Last synced: 25 May 2026

https://github.com/karlyndiary/amazon-sentiment-analysis-eda

Amazon Customer Reviews Sentiment Analysis utilizing Python for Data Extraction and Pre-Processing, with real-time data from the Amazon API and Tableau for dashboard visualizations.

amazon-api api-data-extraction dashboard data-cleaning data-pipeline data-visualization etl roberta-sentiment-analysis sentiment-analysis tableau-dashboards vader-sentiment-analysis

Last synced: 22 Mar 2025

https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel

This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.

capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql

Last synced: 20 Jun 2025

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 25 Feb 2025

https://github.com/sayamalt/black-friday-sales-prediction

Successfully established a machine learning regression model which can estimate the gross Black Friday sales for a particular customer, based on a distinct set of related and meaningful features, to a fair level of accuracy.

artificial-neural-networks data-visualization deep-learning exploratory-data-analysis feature-engineering machine-learning model-deployment model-training-and-evaluation regression-analysis regression-modelling regression-testing supervised-learning

Last synced: 09 Nov 2025

https://github.com/mariorodeghiero/datopian

Henry Hub Natural Gas - Daily Prices

csv-files d3 data-visualization nodejs react styled-components

Last synced: 10 Apr 2026

https://github.com/s-araromi/smart-device-data-analysis-bellabeat_casestudy

This repository contains a comprehensive analysis of smart device data for Bellabeat, focusing on user activity, sleep, calorie burn, and weight management. Insights from Fitbit data are used to inform Bellabeat's marketing strategy and product recommendations.

bellabeat data-analysis-in-r data-visualization fitbit-smart-devices fitness-tracking r-programming sleep-analysis smart-devices

Last synced: 25 May 2026

https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing

Examined factors influencing demand for micro-mobility shared electric cycles Performed exploratory analysis and hypothesis testing, revealing the distinct influence of weather-season association on hourly counts

colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn

Last synced: 12 Apr 2026

https://github.com/aryansharma5/data-visualization-and-thorough-analysis

comprehensive guide for data analysis and visualization

data-analysis data-visualization

Last synced: 18 Mar 2025

https://github.com/amirnaghibi/companion

find the safest way back home with a trusted companion

data-visualization django-framework pathfinding python3 webapp

Last synced: 10 Apr 2026

https://github.com/nouman6093/advanced-statistical-models

in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl

data data-analysis data-science data-visualization

Last synced: 11 Jun 2026

https://github.com/mardavsj/weather-prediction

Weather prediction model which mainly focuses on visualization.

data-analysis data-visualization matplotlib numpy pandas pandas-dataframe

Last synced: 10 Apr 2026

https://github.com/lcvriend/laserbeans

Toolbox for data exploration

altair data-exploration data-visualization

Last synced: 15 Mar 2025

https://github.com/datamart/platform-management

📁 Platform management dashboard

charts dashboard data-visualization visualization

Last synced: 20 Mar 2025

https://github.com/gauff/belgianelectriccarmarketanalyser

Python tool for analyzing the belgian second hand electric car market by scraping and visualizing data from multiple car listing websites. Features parallel web scraping, price tracking, and interactive dashboards.

automotive beautifulsoup4 car-market dash data-analysis-python data-cleaning data-visualization electric-vehicles market-analysis pandas parallel-processing plotly price-comparison price-monitoring selenium web-scraping-python

Last synced: 30 Apr 2026

https://github.com/willie-conway/global-superstore-data-modeling-analysis

A comprehensive data modeling and analysis project for the 🌍Global Super Store, focusing on database design 🗃️, sales data analysis 📊, and interactive visualizations 📍 using MySQL 🖥️ and Tableau 📈.

business-analytics business-intelligence data-exploration data-modeling data-preprocessing data-restructuring data-visualization database-design er-diagram geographic-analysis interactive-dashboard mysql profit-analysis sales-analysis sales-performance sales-trends sql star-schema tableau time-series-analysis

Last synced: 21 Jul 2025

https://github.com/ryanfranklin237/data-visualization-python

A tool that allows you to visualize data from a csv or excel file in a graph or charts form

data-analysis data-science data-visualization matplotlib pandas-dataframe python

Last synced: 11 Jun 2026

https://github.com/odeyiany2/flit-apprenticeship-data-science-projects

This repo contains all my projects for my FLiT Apprenticeship

data-analysis data-science data-visualization machine-learning sql

Last synced: 17 May 2026

https://github.com/samjtro/tri-map

EPA-TRI Data Visualization & Analysis

data-visualization emissions environment python

Last synced: 01 Sep 2025

https://github.com/ondrejhruby/countries-of-the-world

Explore global data with this repository, featuring insights, visualizations, and Python code examples on countries worldwide—perfect for enhancing your data analysis and visualization skills.

data-analysis data-science data-visualization geography jupyter-notebook machine-learning matplotlib pandas python statistics

Last synced: 16 Apr 2026

https://github.com/ryannapp12/quant_trading_engine

A modular, and scalable quantitative trading engine built in Python. This project demonstrates efficient data caching with SQLite, concurrent backtesting, and advanced risk analytics, showcasing best practices in clean code architecture and performance optimization.

algorithmic-trading backtesting dash data-analysis data-visualization fintech lstm machine-learning numpy pandas plotly python quantitative-finance real-time risk-management sqlite technical-analysis tensorflow time-series-analysis trading-strategies

Last synced: 11 Apr 2026

https://github.com/avidlearnerinprogress/text_analytics_101

Coursework solutions for the course COMP47600 - Text Analytics

data-science data-visualization natural-language-processing nlp python3 r text-analytics

Last synced: 05 May 2026

https://github.com/rupeshrb/project_data_visualization_django

Data visualization is important concept which apply in this project with help of d3.js, pandas,python etc library

bootstrap d3js data-visualization django html-css-javascript pandas python sqlite

Last synced: 11 Apr 2026

https://github.com/vikhram-s/electoral-bond-data-analysis-using-python

Electoral Bond Data Analysis Using Python : This repository contains a comprehensive analysis of electoral bonds in India using Python. The project aims to understand the distribution, trends, and impact of electoral bonds on political funding. It includes data cleaning, exploratory data analysis, visualization, and reporting.

data-visualization dataanalysisusingpython electoralbonds exploratory-data-analysis

Last synced: 15 May 2025

https://github.com/ahmednasef3/udemy-courses-full-eda

Exploratory Data Analysis on the factors that can affect the promotions and earnings in Udemy Courses and the perfect way to make a good saled course in Udemy.

data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project

Last synced: 01 May 2026

https://github.com/phac-nml/nf-sequenoscope

Streamlined Nextflow wrapper for the Sequenoscope toolkit. Simplifies complex metagenomic workflows with automated batch processing, allowing efficient comparative analysis from raw reads to visualization.

adaptive-sampling batch-processing data-visualization dsl2 metagenomics nextflow sequenoscope

Last synced: 16 Jan 2026

https://github.com/luabagg/worldwide-trends

Worldwide Google Trends visualization and classification

data-analysis data-visualization google-trends trends

Last synced: 03 Feb 2026

https://github.com/zeinhasan/eksploration-and-data-visualization-course-material

Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials

data-analysis data-visualization statistics

Last synced: 24 Jun 2026

https://github.com/mitevpi/vue-d3-bar-chart

Reusable, reactive, animated bar chart using D3 + Vue.js. Written in idiomatic Vue, rather than D3 syntax.

d3 data data-visualization frontend interactive svg vue web

Last synced: 18 May 2026

https://github.com/zane/plot

A tiny Clojure library for plotting things at the REPL.

clojure data-science data-visualization repl statistics

Last synced: 01 Sep 2025

https://github.com/mastercruelty/gokart-data-hub

It manages data about gokart races and plot graphs about your times!

data-analysis-python data-science data-visualization gokart matplotlib pandas race

Last synced: 15 Apr 2025

https://github.com/sankalp130/call-center_data-analysis

This repository contains Tableau dashboard, visualization, and data analysis projects I have created.

data-cleaning data-schema data-visualization tableau

Last synced: 05 Jan 2026

https://github.com/hamdaniqhmqd/project-predict-saham-bbri

Repository Project-Predict-Saham-BBRI is a group assignment project that uses Streamlit, scikit-learn, and related technologies to build a BBRI stock price prediction application based on day, week, and month input.

data-visualization numpy pandas python sklearn streamlit

Last synced: 11 Apr 2026

https://github.com/filiplangiewicz/fastfooddatavisualization

🍟 Project for Data Visualization Techniques course about fast food consumption

collaboration data-science data-visualization dplyr ggplot2 poster

Last synced: 15 Mar 2025

https://github.com/tim-richter/spinne

Spins a web of component relationships for react projects

abstract-syntax-tree components data-visualization design-system jsx react stats tsx usage

Last synced: 10 Mar 2026

https://github.com/chandkund/spam-email-detection

This project focuses on detecting spam emails using a fine-tuned DistilBERT model, a lighter version of the BERT model. The model is trained to classify email text into two categories: spam (1) and not spam (0). The dataset consists of email texts labeled as either spam or non-spam.

data-visualization datapreprocessing matplotlib pandas python pytorch sklearn transformer

Last synced: 20 Jan 2026