An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/ondrejhruby/countries-of-the-world

Explore global data with this repository, featuring insights, visualizations, and Python code examples on countries worldwide—perfect for enhancing your data analysis and visualization skills.

data-analysis data-science data-visualization geography jupyter-notebook machine-learning matplotlib pandas python statistics

Last synced: 16 Apr 2026

https://github.com/piratesmanx1/tutor-management-system-2020

A group assessment that required individuals to implement their specific data structure within the system, such as the likes of Array List and Linked List. The assessment required no knowledge of database which is making the overall assessment tricky to handle as we need to rely on the memory addresses that act as a temporary storage to get the job done. In this assessment we've learned the concept of C++, Data Structures, and numerous algorithm that could exploit in other programming languages, which increased our critical thinking skill on applying the system's feature.

c-plus-plus data-structures data-visualization datastructures doubly-linked-list doublylinkedlist linked-list linked-lists

Last synced: 11 Oct 2025

https://github.com/abhi227070/whatsapp-chat-analyzer

The WhatsApp Chat Analyzer is a project that leverages machine learning and natural language processing techniques to analyze chat data from WhatsApp conversations. It provides insights such as message statistics, sentiment analysis, word clouds, and more.

artificial-intelligence data-analysis data-visualization machine-learning machine-learning-algorithms python-3 python-programming

Last synced: 29 Jun 2026

https://github.com/webmobiledev/d3-visuals

There are some data visualizations like treechart, bubblechart, mbostock, mapchart using d3.js.

bootstrap d3js data-visualization highcharts mbostock-d3

Last synced: 27 Apr 2026

https://github.com/fimbres/astro-3-experimental

A basic project to test out the new features of Astro 3.0, specially View Transitions API, using React.js, TailwindCSS, TypeScript, Zod, and many more technologies!

astro data-visualization react single-page-applications tailwindcss typescript view-transitions-api zod

Last synced: 11 Apr 2026

https://github.com/sarthak-0-sach/drivermasterdata_database_table

This code enables data integration from multiple sources and ensures a single source for all driver-related attributes. Designed for scalability and pipeline compatibility, this project supports clean data transformations, validations, and storage-ready outputs. Ideal for quick analytics, created using python & airflow, automated using cronjob.

apache-airflow-etl-pipeline data-analysis data-visualization database-management python

Last synced: 27 Apr 2026

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app is utilizing React and React-Chartjs-2

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 11 May 2026

https://github.com/whitehathackerpr/data-visualization-tool

This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations

data data-analysis data-visualization python python3

Last synced: 05 Sep 2025

https://github.com/linuxto5re/salesandguestmanagementmldotnet

welcome to our Sales and Guest Projection repository! Discover precise guest predictions via ML.NET, historical data, and advanced tech. This model also applies to sales forecasts, fueled by ML.NET's capabilities. In addition, we've added data visualization.

csharp data-visualization machine-learning mldotnet mvvm-architecture oxyplot sql-server

Last synced: 27 Apr 2026

https://github.com/hendersontrent/hotter

R package for webscraping, statistical analysis, and data visualisation of the Triple J Hottest 100 Countdown and related data.

data-visualisation data-visualization r triple-j webscraping

Last synced: 08 Apr 2025

https://github.com/samjtro/tri-map

EPA-TRI Data Visualization & Analysis

data-visualization emissions environment python

Last synced: 01 Sep 2025

https://github.com/kplanisphere/plotted-3d-environment

Plotted 3D Environment is a graphical project inspired by Minecraft, designed to demonstrate 3D object creation, animation, and interaction using OpenGL. It features first-person navigation, texture mapping, and collision detection within a dynamic 3D environment filled with obstacles and enemies - Final project for the Graphing course.

3d-graphics animation camera-movement collision-detection computer-graphics cpp data-visualization educational-project opengl texture-mapping

Last synced: 03 May 2026

https://github.com/dsnchz/solid-plotly

SolidJS wrapper for Plotly.js – reactive and performant charts powered by Plotly, built for Solid.

analysis analytics charting-library charts data-visualization solidjs visualization

Last synced: 02 Apr 2026

https://github.com/ryanfranklin237/data-visualization-python

A tool that allows you to visualize data from a csv or excel file in a graph or charts form

data-analysis data-science data-visualization matplotlib pandas-dataframe python

Last synced: 11 Jun 2026

https://github.com/cainmagi/dash-json-grid

Dash porting version of the react project React JSON Grid. Provide structured and nested grid table view of complicated JSON objects/arrays.

dash data-visualization json json-table json-viewer plotly-dash python python-dash python-libary python3

Last synced: 23 Oct 2025

https://github.com/kibotu/git-history-timeline

Visualize your complete GitHub contribution history across all repositories and branches. See every commit from every repo and every branch—not just the default branch. Self-contained HTML output perfect for portfolios and sharing.

analytics commit-history contribution-graph contributions data-visualization developer-tools embeddable git github github-api github-pages github-stats html nodejs open-source portfolio self-hosted static-site timeline visualization

Last synced: 03 Apr 2026

https://github.com/souravbhadra/maplapse

A python library to create animated timelapse of maps

animation data-visualization geospatial-visualization gif gis map mapping python spatial-data video

Last synced: 15 Mar 2025

https://github.com/Rayyan9477/Calorie-Burnage-Exploratory-Data-Analysis

Calorie Burnage is the measure of calories burned during physical activity or exercise, crucial for weight management and fitness goals. This project focuses on analyzing a dataset that includes information on duration, pulse rates, and calories burned during exercise sessions.

data-analytics data-science data-visualization exploratory-data-analysis linear-regression r-language r-programming

Last synced: 29 Apr 2025

https://github.com/zoliqua/venn-diagram-lab

🟢 🟤 Interactive Venn diagram viewer & editor — 44 SVG models (2–9 sets), pre-computed region paths, React + TypeScript + Vite

adelaide bioinformatics data-visualization edwards-venn euler hamilton interactive manawatu massey palmerton-north python-package react set-theory svg typescript upsetplot venn venn-diagram venndiagram victoria

Last synced: 05 May 2026

https://github.com/archanakokate/diabetes_prediction_capstoneproject

Analyzing and Modelling National Institute of Diabetes(NIDDK) dataset for accurate prediction of Diabetes in Patients, with Tableau dashboard visualization.

data-engineering data-visualization exploratory-data-analysis machine-learning-algorithms tableau-dashboards

Last synced: 17 Mar 2025

https://github.com/sandk21/detection_faux_billets

Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions

data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit

Last synced: 03 Apr 2026

https://github.com/pavankethavath/dataspark-illuminating-insights-for-global-electronics

DataSpark is a retail analytics project for Global Electronics leveraging Python, SQL, and Power BI. It uncovers customer insights, sales trends, and store performance to optimize marketing, inventory, and operations. Features include clean datasets, SQL-driven analysis, and interactive dashboards, driving data-driven growth and decision-making.

data-engineering data-visualization dataanalytics powerbi python retail-data sql

Last synced: 27 Apr 2026

https://github.com/tushard48/product-cluster-analysis

This project performs clustering analysis on a product dataset to identify and group similar products. The analysis includes data preprocessing, application of various clustering algorithms, and visualization of results to gain insights into product patterns. Key techniques used are K-Means, Mini Batch K-Means, evaluated using metr

data-visualization excel machine-learning powerbi streamlit unsupervised-learning

Last synced: 03 May 2026

https://github.com/lilivalgo/coal-production-colombia

Data analysis that includes information on annual coal production, royalties generated, and climate variables. Descriptive analysis and visual analysis techniques were used

analysis data-visualization dataframes insights manipulation matplotlib python seaborn transformation

Last synced: 27 Apr 2026

https://github.com/mustika-putri-m/analysis-of-sales-transactions-in-an-online-shop---london

Crucial Question 1. How was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. What are the most profitable segment customers? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?

data data-analysis-python data-analytics data-visualization ecommerce

Last synced: 24 Oct 2025

https://github.com/ddihora1604/iitk_esg

Researching and Analyzing key ESG (Environmental, Social, Governance indicators) metrics and their impact on stock performance and market behavior. Leveraging AI techniques (like Machine Learning and NLP) in finance to extract insights from ESG disclosures, enhancing financial predictions and sustainable investment strategies.

data-analysis data-visualization esg python yahoo-finance

Last synced: 24 Apr 2025

https://github.com/qtle3/polynomial-regression

This project demonstrates the use of both Linear and Polynomial Regression models to predict salaries based on the position level of employees. By using a dataset that contains position levels and their corresponding salaries, this project showcases the differences between linear and polynomial models in fitting data and making predictions.

data-visualization linear-regression model-comparison polynomial-regression salary-prediction

Last synced: 08 Apr 2025

https://github.com/md-emon-hasan/ml-projects-telcom-customer-churn-prediction

📱 Customers are likely to leave a telecom service, enabling companies to take measures for retention and create accurate churn prediction models.

boostrap5 customer-churn customer-segmentation data-engineering data-science data-visualization logestic-regression machine-learning telco-customer-churn-prediction telcom-churn

Last synced: 05 May 2026

https://github.com/avijit-jana/redbus-data_scraping_and_filtering_with_streamlit_app

A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.

automation dashboard data-analysis data-visualization datadrivendecisions python3 redbus selenium-python streamlit-application webscrapping

Last synced: 15 Mar 2025

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 11 Apr 2026

https://github.com/filiplangiewicz/fastfooddatavisualization

🍟 Project for Data Visualization Techniques course about fast food consumption

collaboration data-science data-visualization dplyr ggplot2 poster

Last synced: 15 Mar 2025

https://github.com/fatihilhan42/stock-market-analysis-with-pandas-python

Hello, today I will tell you the details of the stock market analysis project with python.

data-science data-visualization stock-market

Last synced: 23 Mar 2025

https://github.com/singh-dhruv/data-visualization-with-accuracy-metrics

Welcome to this repository! Here we loaded the iris_dataset , upon training, testing and perform operations on it resulted in beautiful data visualizations along with accuracy metrics. Data Visualization and accuracy metrics are provided for some of the models that are used.

accuracy-metrics data-visualization hyperparameter-optimization knn-model logistic-regression svm-model

Last synced: 02 Jul 2025

https://github.com/MinaRavi/first-practice-data-cleaning-visualization

Basic Practise of Data-cleaning & Data-visualization

data-analytics data-cleaning data-visualization git

Last synced: 31 Mar 2025

https://github.com/chandkund/spam-email-detection

This project focuses on detecting spam emails using a fine-tuned DistilBERT model, a lighter version of the BERT model. The model is trained to classify email text into two categories: spam (1) and not spam (0). The dataset consists of email texts labeled as either spam or non-spam.

data-visualization datapreprocessing matplotlib pandas python pytorch sklearn transformer

Last synced: 20 Jan 2026

https://github.com/katiesaund/market_size_app

An interactive dashboard to plot a company's estimated market size.

data-visualization finance r rshiny shiny shinyapp

Last synced: 24 Mar 2025

https://github.com/GZ430/global-christianity-dataviz-jp

A web app built in R Shiny for users to explore global Christianity data from Joshua Project, World Watch List, and others.

christianity data-visualization r shiny

Last synced: 10 Mar 2025

https://github.com/gutyoh/netflix-dashboard

Dashboard created with Python 🐍 and the Dash 📊 framework to visualize Netflix’s content distribution and classification.

dash dashboard data-visualization plotly python

Last synced: 08 Apr 2025

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 23 Mar 2025

https://github.com/leosimoes/uerj-tcc-analisador-dados-texto

Texto do trabalho de conclusão de curso (TCC) em engenharia de computação. Aplicativo Web para análise de dados.

data-analysis data-science data-visualization python streamlit

Last synced: 24 Mar 2025

https://github.com/dev-owdenmag/dataflow-manager

A dynamic and versatile web application for managing, collecting, and presenting data with an integrated printing feature.

data data-management data-management-platform data-visualization python

Last synced: 30 Mar 2025

https://github.com/natanast/immunovisual

A collection of charts made with the R programming language, focusing on immunogenetics analyses. Different charts types are being organized into multiple sections, each accompanied by its reproducible code. The gallery spotlights the utilization of prominent R packages such as tidyverse, data.table, and ggplot2.

data-visualization ggplot2 quarto r-programming

Last synced: 11 Mar 2026

https://github.com/neerajcodes888/whatsapp-chat-analyzer

A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱

analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud

Last synced: 12 Apr 2026

https://github.com/ksmooi/mscs_data_visualization

This series of reports demonstrates the process of creating, refining, and evaluating interactive data visualizations using Altair, focusing on Seattle weather data, with an emphasis on accessibility, usability, and effectively communicating insights.

altair data-visualization

Last synced: 01 Apr 2025

https://github.com/ahmetzamanis/bayesianusedcars

Predicting used car prices using Bayesian Linear Regression, visualizing results and comparing with OLS regression.

bayesian car-prices data-science data-visualization linear modeling r regression rmarkdown

Last synced: 16 Dec 2025

https://github.com/alejo1630/chicago_crimes

A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium

data-analysis data-visualization folium pandas python seaborn

Last synced: 12 Apr 2026

https://github.com/geetisha/sales_insight_data_analysis_using_sql_and_tableau-etl-

Sales Insights - A Data Analysis Project performed on Tableau & SQL Topics

dashboard data-analysis data-visualization mysql project sales-analysis sql tableau

Last synced: 07 Jan 2026

https://github.com/md-emon-hasan/3-eda-basketball-ml-app

A ML application focused on EDA and basketball analytics, showcasing data visualization and insights using Python and relevant libraries.

basketball-analysis csv data-visualization eda exploratory-data-analysis exploratory-data-analysis-eda ml-app

Last synced: 12 Apr 2026

https://github.com/sanam2405/chatinfo

Analysing the WhatsApp Chat with my crush over a 6M period

data-analysis data-visualization python

Last synced: 27 Apr 2026

https://github.com/gauranshgoel123/predictive-demand-analysis

Demand Forecasting Project A web application for predicting future demand for part numbers based on historical data. Built with React for the frontend and FastAPI with Python for the backend, this application visualizes demand trends and allows users to input additional data for improved accuracy. In render analyzer is frontend analysis is backend

chartjs data-analysis data-science data-visualization dataset deployment full-stack machine-learning numpy pandas predictive-analysis prophet-model python reactjs render

Last synced: 13 Apr 2026

https://github.com/whis99/userfunnelanalysis

An ecommerce user funnel conversion data analysis with matplotlib & python.

data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python

Last synced: 13 Apr 2026

https://github.com/mgobeaalcoba/matplotlib_y_seaborn

Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.

data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn

Last synced: 13 Apr 2026

https://github.com/guidodipietro/fmcranks

Raw text input to nicely formatted XLSX file + PNG image of the range

data-analysis-python data-visualization python

Last synced: 26 Jan 2026

https://github.com/prernarohra/mental-health-prediction

This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.

data-analysis data-science data-visualization machine-learning mental-health python

Last synced: 20 May 2026

https://github.com/hhejo/pyviz

Data visualization using Python

data-visualization python

Last synced: 02 Apr 2025

https://github.com/garcane/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 01 Mar 2026

https://github.com/vi/csvdimreduce

Command-line tool to run a dimensionality reduction algorithm on CSV files

cli command-line-tool csv csv-files data-science data-visualization dimension-reduction dimensionality-reduction

Last synced: 14 Apr 2025

https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation

Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.

clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning

Last synced: 16 Mar 2025

https://github.com/robinmillford/analytics_for_fashion_supply_management

This Streamlit dashboard provides a comprehensive analysis of supply chain data, focusing on key metrics such as production volumes, stock levels, order quantities, revenue, manufacturing costs, lead times, shipping costs, transportation routes, risk factors, and sustainability factors

dashboard data-analysis data-visualization streamlit supply-chain-management

Last synced: 07 Sep 2025

https://github.com/dhrupad17/ibm-data-analyst-professional-certificate

Prepare for a career as a data analyst. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.

assignment-solutions coursera data-analytics data-science data-visualization excel ibm pandas professional-certificate professional-certificates python quiz updated-2024

Last synced: 13 Apr 2026

https://github.com/karthikraja95/fika

Automate Data Science Workflow from Exploratory Data Analysis to Model Deployment

automation automl-python data-science data-visualization eda machine-learning nlp pipeline python

Last synced: 14 Jan 2026

https://github.com/jcaperella29/financial_analyis_pipeline

An AI-powered financial analysis pipeline that scrapes stock data, calculates Piotroski F-Score & Valuation, and generates trend plots—all in one click!

ai automation cli data-science data-visualization deep-learning finance financial-analysis financial-modeling investing investment-research machine-learning natural-language-processing pdf-generation python quantitative-finance selenium web-scraping

Last synced: 08 Jun 2026

https://github.com/paumatas/dpa-visual-analytics-tool

Visual analysis application for the data collected by a Formula Student car during driving, aiming to evaluate and compare the drivers of the team.

car-telemetry data-driven-decisions data-science data-visualization formula-student motorsport sports-analytics user-interface

Last synced: 17 Mar 2025

https://github.com/shridhar1504/power-bi-visualization-project

This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.

dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides

Last synced: 21 Jan 2026

https://github.com/pedroscaff/sensor-platform-data-analysis

Data analysis and visualization for data collected with Sensor Platform

data-visualization heremaps maps

Last synced: 07 Oct 2025

https://github.com/danielsobrado/galaxy-nodes

Three.js component library for exploring dense graph data in 3D as an interactive “galaxy” of nodes, clusters, and weighted relationships. It provides a polished React API for GPU-accelerated point clouds, cluster filament visuals, camera controls, search/filter workflows, and node/edge inspection.

3d-visualization data-visualization data-viz graph-data graph-visualization interactive-ui network-graph npm-package react threejs typescript typescript-library webgl

Last synced: 31 May 2026

https://github.com/martialhimanshu/motion-detector-camera

An application to detect motion of any object using any camera device and analysis of output data using plot and CSV file

bokehplots data-visualization dataframe-library motion-detection opencv2 python

Last synced: 09 Oct 2025

https://github.com/shubhamgoyal575/hospitality-domain

This project analyzes AtliQ Grands' historical revenue data to identify trends, optimize performance metrics, and support strategic decision-making. It includes data analysis, dashboard design in Power BI, and insight generation to help regain market position. Built with Excel and Power BI for visualization and business intelligence. 🚀

business-intelligence dashboard data-modeling data-visualization hospitality-analytics microsoft-power-bi power-query powerbi revenue-analysis trend-analysis

Last synced: 17 Feb 2026

https://github.com/jabhij/eda_experiments

In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.

ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis

Last synced: 14 Apr 2026

https://github.com/moindalvs/learn_eda_house_price_dataset

Data Set: House Prices: Advanced Regression Techniques Exploratory Data Analysis on more than 80 features

cardinality data-analysis data-science data-structures data-visualization missing-values

Last synced: 10 Oct 2025

https://github.com/andrewheiss/wcnpd18

Workshop given at the 2018 West Coast Nonprofit Data Conference

data-visualization talk workshop

Last synced: 19 Jan 2026

https://github.com/nomadsdev/financial-trend-analyzer

FinancialTrendAnalyzer helps analyze and visualize sales data to uncover financial trends. It uses Python to calculate total sales, track changes, and generate insightful charts for better decision-making.

business-intelligence data-analysis data-visualization financial-analysis matplotlib numpy pandas python revenue-trends sales-data seaborn time-series-analysis

Last synced: 19 Jan 2026

https://github.com/joshuatownsend/chrome-history-explorer

Local-first explorer for your browser history (Chrome, Edge, Brave, Firefox, Safari, or Google Takeout): full-text + semantic search, research-session/rabbit-hole detection, insights (on-this-day, routines, trends), an AI interest map, and liveness checks. Optional pluggable AI; private/LAN data never leaves your machine.

browser-history bun chrome-history data-visualization hono local-first privacy react self-hosted semantic-search sqlite tailwindcss typescript vite

Last synced: 31 May 2026

https://github.com/tuni56/matplotlib_introduction

Want to bring trigonometry to life? With Matplotlib, you can easily plot the sine and cosine functions on the same graph, creating an intuitive visualization of their periodic nature.

data-science data-visualization matplotlib python trigonometry-visualisation

Last synced: 16 Oct 2025

https://github.com/nathanvilbert/visualization-of-flood-in-jakarta-with-tableau

The Flood Intensity Visualization and Priority Identification System for Jakarta provides an interactive platform to analyze flood patterns, severity, and impacted regions in Jakarta from 2013 to 2020.

data-visualization exploratory-data-analysis tableau twb

Last synced: 19 Mar 2026

https://github.com/lhzn-io/kanoa

AI-powered interpretation of data science outputs with multi-backend support (Molmo, Gemini, Claude, OpenAI)

ai claude data-science data-visualization gemini interpretability jupyter llm machine-learning matplotlib molmo multimodal openai python vision-language-model vlm

Last synced: 01 Jun 2026

https://github.com/amr-yasser226/datagovernanceworkflow

Comprehensive data governance pipeline for SSH honeypot logs—covering data profiling, cleansing, quality assurance, encryption, classification, and GDPR/CCPA/HIPAA compliance. Built with Pandas, Pandera, YData Profiling, and cryptography, with simulated Caesar cipher attacks to demonstrate practical data-security techniques.

caesar-cipher ccpa cryptography cybersecurity data-cleaning data-encryption data-governance data-profiling data-quality data-validation data-visualization gdpr hipaa honeypot-analysis open-source pandas privacy-compliance python ssh-logs

Last synced: 05 Feb 2026