Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/sayantanidalui/indian-government-budget-analysis
A complete end to end data analysis project using Python, SQL, and Power BI based on a Kaggle dataset. Built to explore trends, allocations, and insights from India’s Union Budget (2021–24) for practice purposes.
data-analysis mysql pandas powerbi storytelling
Last synced: 07 May 2026
https://github.com/thevinh-ha-1710/diabetes-predictive-model
This project aims to train a predictive model to diagnose diabetes on women patients.
data-analysis data-science data-visualization model-training-and-evaluation python
Last synced: 13 Feb 2026
https://github.com/allanotieno254/powerbi-chocolate-sales-analysis-dax-calculations-80-
This Power BI project analyzes **chocolate sales performance using advanced DAX calculations and interactive visualizations. The report provides insights into monthly revenue, top-selling products, sales trends, and market performance.
business-intelligence data-analysis dax powerbi powerbi-dashboards powershell-module sales-analysis visualization
Last synced: 13 Feb 2026
https://github.com/allanccwang/electronic_projects
implement the circuit with microcontroller
arduino circuit-analysis circuit-simulations circuits-and-electronics cpp data-analysis microcontroller physics python wemos
Last synced: 07 May 2026
https://github.com/dsrodrigovieira/houserocketsales
Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python
data-analysis data-visualization heroku kaggle-dataset python
Last synced: 29 Apr 2026
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/asifdotexe/quickvu
Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.
automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit
Last synced: 06 Jun 2026
https://github.com/eduardoedubox/health_data_analysis
Health data analysis using Jupyter Notebook
data-analysis data-science database jupyter-notebook pandas python
Last synced: 07 May 2026
https://github.com/hebaqaisar/movie-recommender-system
AI Recommender System - Recommends you similar movies based on Directors, Tags, Name, Type, Actors, Genre etc
artificial-intelligence data-analysis data-mining data-science jupyter-notebook machine-learning machine-learning-algorithms ml movies-rate pycharm python
Last synced: 17 Apr 2026
https://github.com/hamzacham/data_set_projet-5
data-analysis data-science database dataset jupyter jupyter-notebook paython training
Last synced: 07 May 2026
https://github.com/pradipece/weather_forecast_data_analysis
Using decision trees and random forest algorithms to solve real-world data analysis. "sklearn_decision_trees_random_forests"
data-analysis data-science data-visualization git github python python3
Last synced: 19 Apr 2026
https://github.com/pxaris/expenditure-analyzer
Application for analyzing expenditure data over time
data-analysis data-visualization docker python statistics
Last synced: 29 Apr 2026
https://github.com/shsiddhant/memory.fm
A Python library, CLI tool, and web-based dashboard for exploring music listening history from Last.fm and Spotify.
analytics data-analysis data-visualization memories music
Last synced: 04 Apr 2026
https://github.com/idaraabasiudoh/knn-customer-classification
Labels telecommunication customer base to respective groups to determine service type required for each customer.
data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn
Last synced: 07 May 2026
https://github.com/sayedgamal99/data-science
This is a repository for Data Science Projects.
data-analysis data-science deep-learning machine-learning python regression supervised-learning
Last synced: 07 May 2026
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In the current project I have extracted graphics card current prices from an authorizer retailer in India and performed analysis
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 04 Mar 2026
https://github.com/backdoorali/insider-threat-detection-project
Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.
cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence
Last synced: 07 May 2026
https://github.com/gorodroz/crypto-tracker
Realtime Bitcoin price tracker using Binance WebSocket and REST API. Logs prices to CSV and supports Pandas for data analysis.
binance bitcoin crypto csv-logger data-analysis pandas python rest-api websocket
Last synced: 07 May 2026
https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis
Last synced: 07 May 2026
https://github.com/md-emon-hasan/data-science
Data science tutorials, including data preprocessing, analysis, visualization, project deployment, machine learning and deep learning algorithms.
artificial-intelligence data-analysis data-engineering data-science deep-learning machine-learning-algorithms python
Last synced: 07 May 2026
https://github.com/0xjeremy/me-18-final
Data collection and Analysis tools for IMUs
data-analysis imu raspberry-pi
Last synced: 03 May 2026
https://github.com/jofaval/mlai-portfolio
My portfolio about Data Analysis, Machine Learning and A.I.
computer-science data-analysis data-science machine-learning portfolio python
Last synced: 25 Apr 2026
https://github.com/dogan-the-analyst/web_scraping_job_vacancies
data-analysis python web-scraping
Last synced: 07 May 2026
https://github.com/y-india/retail-sales-analysis-project
Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.
ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit
Last synced: 07 May 2026
https://github.com/1ayanabil1/iris-visualization
This repository focuses on visualizing the Iris dataset using various data visualization techniques. It includes histograms, scatter plots, box plots, pie charts, bubble charts, and KDE plots to provide insights into the dataset’s structure. The project utilizes Matplotlib, Seaborn, Plotly, and Scikit-learn to generate insightful visualizations.
analytics clustering data-analysis data-science data-visualization datavisualization-project datavisualizations eda exploratory-data-analysis machine-learning machinelearning-python python
Last synced: 07 May 2026
https://github.com/shivshah19/movie-recommendation-system
This Movie Recommendation System is designed to provide personalized movie recommendations based on user preferences.
cosine-similarity data-analysis machine-learning pandas python streamlit
Last synced: 03 May 2026
https://github.com/faezeh-gholamrezaie/coronavirus-d3-chart
Tracing the spread of the coronavirus from the White House under the title of diagram D3
coronavirus coronavirus-analysis coronavirus-outbreak-tracking d3 data-analysis datavisualization white-house-coronavirus-outbreak-tracking
Last synced: 29 Apr 2026
https://github.com/alfikiafan/air-quality-analysis
This repository contains a comprehensive data analysis project on Air Quality Dataset, covering the complete data analysis process from data gathering, cleaning, exploratory data analysis (EDA), to building a fully interactive dashboard using Streamlit.
air-quality data-analysis dicoding
Last synced: 17 Apr 2026
https://github.com/bassamn/titanic-data-analysis
Exploratory data analysis (EDA) of the Titanic dataset using Python. Analyzed survival patterns by age, gender, and class with visualizations (seaborn/matplotlib). Non-ML focus—highlighting insights with statistics and plots.
data-analysis eda pandas python seaborn titanic visualization
Last synced: 08 May 2026
https://github.com/imartinezl/bicing-analysis
cplex data-analysis matlab optimization python spyder
Last synced: 28 Feb 2026
https://github.com/nomadsdev/sys-moninsight
System Monitoring and Analysis Tool is a utility for real-time performance tracking. It logs CPU, memory, and disk usage, provides visual graphs, and offers performance recommendations. Perfect for optimizing system efficiency.
automation cpu-usage data-analysis data-visualization disk-usage matplotlib memory-usage performance-analysis performance-optimization psutil python real-time-monitoring resource-management sys-moninsight system-metrics
Last synced: 19 Jun 2026
https://github.com/robcyberlab/machine-learning-classifier
🤖Machine Learning Classifier⚙️
ai artificial-intelligence classifiers data-analysis data-science deep-learning digit-recognition machine-learning pca-algorithm python svm-classifier
Last synced: 31 Mar 2025
https://github.com/m-faizan-mahmood/detailed-exploratory-data-analysis-eda-marketing-recomendations.
This project focuses on cleaning, preprocessing, and analyzing data using Pandas and NumPy. Key steps include handling missing values, removing outliers, feature engineering, and exploratory data analysis (EDA). Visualizations with Matplotlib and Seaborn highlight trends in customer spending, campaign performance, and product sales.
big-data data-analysis data-processing data-science eda exploratory-data-analysis numpy pandas python
Last synced: 11 Apr 2026
https://github.com/oguzgn/fully-automated-performance-marketing-dashboard
This project integrates data from multiple ad platforms with Google Analytics to track marketing campaigns. It uses a structured naming system and UTM tags. Data is visualized in Looker Studio dashboards to analyze campaign performance and ad spend.
bigquery data-analysis data-engineering data-modeling marketing-analytics marketing-automation marketing-data-science marketingdata sql
Last synced: 24 Mar 2025
https://github.com/carterlasalle/sportsarbfinder
Sports Betting Arbitrage Finder: Python tool for identifying profitable arbitrage opportunities across bookmakers. Features multi-region support, customizable profit margins, interactive calculator, and web interface. Uses real-time odds data from The Odds API. Ideal for betting enthusiasts, analysts, and educational purposes.
arbitrage-betting betting-strategy data-analysis finance gambling odds-api python sports-analytics sports-betting
Last synced: 31 Mar 2025
https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-
# 🎧 Top Hits Spotify (2000 - 2019)Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻
analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits
Last synced: 01 Jul 2025
https://github.com/akash1070/freecodecamp-data-analysis-with-python-
contains study notes and assignments from freecodecamp of Data Analysis With Python
data-analysis demographic-analysis mean-variance-standard-calculator medical-data-visualisation numpy-library pandas-library python3 sea-level-predictor time-series-analysis
Last synced: 01 May 2026
https://github.com/akash1070/data-science-virtual-internship-by-anz
Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.
data-analysis data-science predictive-analytics presentation-slides
Last synced: 24 Mar 2025
https://github.com/emredurukn/data-analysis
Example notebooks for analyzing data
data-analysis data-visualization python
Last synced: 12 May 2026
https://github.com/eshaagarwa/sales_insight_project
Sales insights project using Powerbi and SQL
data-analysis data-visualization databse datacleaning datamodeling microsoft-power-bi mysql-database powerbi sales-insights sql
Last synced: 08 Aug 2025
https://github.com/beallio/wherewolf
Wherewolf is a production-grade, local SQL workbench designed for data engineers and analysts to query local files (CSV, Parquet, JSON) with ease. Built with Streamlit, it provides a unified interface to execute SQL against either DuckDB or PySpark engines without requiring complex setup.
big-data data-analysis data-engineering etl parquet performance pyspark python spark-sql sql uv
Last synced: 28 Apr 2026
https://github.com/arnikz/piqmie
Proteomics Identifications & Quantitations Data Management & Integration Service
data-analysis data-management data-visualisation mass-spectrometry peptide-identification protein-inference protein-quantification proteomics silac web-application
Last synced: 03 Feb 2026
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 27 Feb 2025
https://github.com/tjpalanca/ph-elections-2016-analysis
Analysis of Philippines Election Results 2016
analysis data-analysis data-science philippines-election voter-turnout
Last synced: 11 Jun 2025
https://github.com/timbeechey/opa
Ordinal pattern analysis R package
cran data-analysis hypothesis-testing longitudinal ordinal r r-package rcpp repeated-measures rstats statistics
Last synced: 21 Feb 2026
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 12 Apr 2026
https://github.com/saeun-park/lg-aimers-4th
MQL 데이터 기반 B2B 영업기회 창출 예측 모델 개발
b2b data-analysis data-science machine-learning mql
Last synced: 08 Apr 2025
https://github.com/aaryan-agr/canadian-energy
This project analyzes Canada's energy trade, focusing on imports, exports, and market trends in the energy sector.
data-analysis data-cleaning data-manipulation data-processing data-science data-vizualisation energy-sector time-series-analysis
Last synced: 10 Jun 2025
https://github.com/fatma-moanes/machine-learning-labs
My implementation for the labs of the Machine Learning course that I studied in my university, Zewail City.
bootstrap data-analysis data-science deep-learning keras knn-classification linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib ml neural-networks numpy pandas pca preprocessing python seaborn svm-classifier
Last synced: 12 Apr 2026
https://github.com/jinkogule/multi-analyst
O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 12 Apr 2026
https://github.com/guruakaashjn/te_project_microsoft_ai
AI based statistical analysis of land-use plastic pollution in India using AI/ML techniques.
artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi
Last synced: 27 Feb 2025
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/ivanildobarauna-dev/currency-quote
Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 27 Oct 2025
https://github.com/hamada-khairi/pfda-hamada
A comprehensive R-based data analysis project that examines housing rental patterns across multiple cities, utilizing statistical methods and visualization techniques to analyze 4,746 properties' data points including rent prices, locations, and amenities. The project employs various R libraries to clean, process, and visualize rental market trends
apu data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-processing-and-analysis data-science data-visualization-project ggplot2 house-rent-prediction r-programming-projects r-statistics r-studio real-estate-analytics
Last synced: 16 Mar 2025
https://github.com/nirmalvatsyayan/data-analyst-nanodegree
Udacity data analyst nanodegree project submissions and learning
data-analysis numpy pandas python statistics udacity-data-analyst-nanodegree
Last synced: 12 Apr 2026
https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel
This script takes a single audio file and collect all the sound features in it including (Frequency - mean - variance - minimum value - maximum value - Median - Kurtosis - Skewness) and save it in an array and import this array in a row in the datasheet.
automation data-analysis data-science dataset-generation excel-import signal-processing
Last synced: 18 Apr 2026
https://github.com/noturlee/imdb-dataanalysis
A data model that predicts the IMDb rating of a movie based on features like genre, director, and actors. Using regression techniques to tackle this problem.
data-analysis data-cleaning data-modeling data-science data-visualization
Last synced: 08 Apr 2025
https://github.com/hordiales/redpanal-db-analysis
Analysis of the RedPanal.org music database
creative-commons data-analysis dataset etl machine-learning music music-information-retrieval statistical-analysis
Last synced: 10 Mar 2025
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
recognizing handwritten numbers & comparing the Life Expectancy vs Fertility in 1960 & 2013 of regions
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 12 Apr 2026
https://github.com/willie-conway/datavista
DataVista is a comprehensive, production-grade data analysis and machine learning platform that combines real-time data ingestion from live APIs, interactive visualizations, statistical analysis, hypothesis testing, and machine learning model training — all in a unified, professional-grade interface. Built with React and Recharts.
analytics-platform api-integration classification coingecko-api csv-import data-analysis data-cleaning-and-preprocessing data-pipeline data-science data-visualizations etl hypothesis-testing json-export machine-learning-models open-meteo react recharts regression statistics world-bank
Last synced: 30 May 2026
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 23 Mar 2025
https://github.com/samuelsoaress/python-study-datascience-ia
My data science and AI studies
data-analysis data-crawler data-mining data-science deep-learning machine-learning-algorithms
Last synced: 13 May 2026
https://github.com/ultrasage-danz/weather-data-analysis
Weather Data Analysis notebook project. Created using Google collab
collaboration data-analysis data-science dataset google google-colab-notebook project
Last synced: 24 Mar 2025
https://github.com/leosimoes/uerj-tcc-analisador-dados-texto
Texto do trabalho de conclusão de curso (TCC) em engenharia de computação. Aplicativo Web para análise de dados.
data-analysis data-science data-visualization python streamlit
Last synced: 24 Mar 2025
https://github.com/leosimoes/udacity-starbucks
Project 3 of the Udacity Machine Learning Engineer Nanodegree Program. Data analysis and machine learning application to Starbukcs data.
aws-iam aws-s3 aws-sagemaker data-analysis data-science machine-learning python
Last synced: 24 Mar 2025
https://github.com/olob0/badwords-pt-br
💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 06 Jan 2026
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 16 Mar 2025
https://github.com/john-science/data_science_by_example
Examples of Data Science Tools & Libraries
data-analysis data-science ipython pandas
Last synced: 12 May 2025
https://github.com/ifibla/adsdb-project
Algorithms, Data Structures and Databases Project
data-analysis data-engineering python
Last synced: 12 Apr 2026
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Menginput data Mahasiswa Yang Melakukan Pelanggran yang siap di data dan di hukum Dan juga siap Terkena Sanksi
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project
This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.
apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres
Last synced: 03 Jul 2025
https://github.com/sing-group/bew
Public repository for Biofilmfs Experiment Workbench (BEW).
aibench data-analysis data-management java jfreechart workbench
Last synced: 03 Jul 2025
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 01 Apr 2025
https://github.com/rekha0suthar/e-commerce-shopper-s-behaviour-understanding
Understand the online shopper purchasing pattern through Machine learning
data-analysis data-preprocessing data-visualization logistic-regression machine-learning numpy pandas python3 scikit-learn seaborn-plots
Last synced: 12 Apr 2026
https://github.com/mayankagg9722/movie-recommendation
Collaborative Filtering is performed over Movie Lens Dataset.
collaborative-filtering data-analysis jupyter-notebook movie-recommendation python-script website
Last synced: 29 Jul 2025
https://github.com/m-faizan-mahmood/house-price-prediction-machine-learning-model
Implemented a Multiple Linear Regression model to predict house prices based on square footage, number of bedrooms, and age of the house.
artificial-intelligence data-analysis data-science data-visualization machine-learning machine-learning-algorithms matplotlib neural-network numpy pandas predictive-modeling python regression-models seaborn sklearn
Last synced: 12 Apr 2026
https://github.com/thomascenni/anfavea-data-analysis
Data analysis with Pandas and Datapane.
Last synced: 06 May 2026
https://github.com/rani-sikdar/python-data-structures
A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀
data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3
Last synced: 30 Mar 2025
https://github.com/antonioscardace/mri-brainage
Showing Accelerated Brain Ageing in Alzheimer's Patients.
alzheimers-disease brain-age classification data-analysis medical-analysis predictive-modeling regression
Last synced: 18 Jan 2026
https://github.com/andr3w03/employee-attrition-problem
Employee Attrition Problem Analysis and Prediction
data-analysis data-science data-visualization dicoding gradient-boosting-classifier machine-learning problem-solving python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/foggy-projects/foggy-data-mcp-bridge
MCP Data Bridge for Java. Enabling safe Text-to-Query via a semantic layer, making enterprise data accessible to AI Agents.
agent data-analysis java llm mcp semantic-layer spring-boot text-to-sql
Last synced: 16 Mar 2026
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 23 Mar 2025
https://github.com/avijit-jana/redbus-data_scraping_and_filtering_with_streamlit_app
A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.
automation dashboard data-analysis data-visualization datadrivendecisions python3 redbus selenium-python streamlit-application webscrapping
Last synced: 15 Mar 2025
https://github.com/neerajcodes888/whatsapp-chat-analyzer
A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱
analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud
Last synced: 12 Apr 2026
https://github.com/rajnish93/jpandas
A lightweight JavaScript library for working with tabular data, inspired by Pandas in Python. Built with TypeScript, it provides an intuitive API for data manipulation and analysis.
data-analysis data-analytics data-manipulation data-science dataframe javascript pandas stream-processing table typescript
Last synced: 11 Jun 2025
https://github.com/supersjgk/data-analysis-dns-over-https
A Data Analytics + ML project to classify Benign and Malicious DNS-over-HTTPS traffic
classification-model data-analysis data-analysis-python data-analytics datamining decision-trees dns dns-over-https doh gradient-boosting knn machine-learning random-forest
Last synced: 19 Mar 2025
https://github.com/ansh420/mcdonald_case-study
It is basically depend on the market Segment Analysis. It is a case study of mcDonald.
algorithms-implemented data-analysis python3 segmentation
Last synced: 12 Apr 2026
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/azmainadel/twitter-data-neo4j
Playing with graph database on a large dataset of twitter data.
data-analysis data-visualization neo4j-database snap
Last synced: 06 Apr 2025
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/victor-lis/regression-ai-model
ai data-analysis python regression-model
Last synced: 01 Apr 2025
https://github.com/macdon112/layoff-analysis
SQL data cleaning & analysis of global layoffs
data-analysis data-cleaning data-exploration sql
Last synced: 21 Feb 2026
https://github.com/gholamrezadar/most-profitable-actors
Finds the list of actors with the most boxoffice profit using TMDB API.
Last synced: 16 Jan 2026
https://github.com/vi/rendercsv
Tool to convert CSV table to a picture.
animation csv csv2pic csv2png data-analysis picture png table table-renderer visualization
Last synced: 01 Apr 2025