Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-22 00:07:31 UTC
- JSON Representation
https://github.com/rani-sikdar/data-visualization-tools-python
data-analysis data-visualization pandas python
Last synced: 07 May 2026
https://github.com/eesunmoon/on-device_multimodal_er
[Research - MINES Lab] Multimodal Emotion Recognition for On-device AI
artificial-intelligence data-analysis deep-learning embedded-systems emotion-recognition heart-rate-analysis multimodal-fusion npu on-device python speech-processing speech-recognition tensorflow wearable-devices
Last synced: 08 Feb 2026
https://github.com/edseldim/FirstRoundElectionsFr
A data visualization spreadsheet on Excel
data-analysis data-visualization excel pandas python
Last synced: 02 Aug 2025
https://github.com/viseshrp/community_health_indicator
Android app to fetch,organize and represent NYC health data
android data-analysis data-visualization health
Last synced: 03 Mar 2025
https://github.com/ebowwa/chatgpt-export-processor
🤖 Extract, analyze & search your ChatGPT conversations locally | Privacy-first tool for OpenAI ChatGPT data export processing | Python CLI with embeddings support
ai-tools chatgpt chatgpt-export chatgpt-tools cli conversation-analysis data-analysis data-extraction embeddings local-first nlp openai openai-api privacy python
Last synced: 19 May 2026
https://github.com/hemanthkumarsunkari27/pmay_analysis_project
Built for the 1st AI for Good Hackathon by Snowflake, this project uses data analytics and AI to explore housing and sanitation trends in India under PMAY. Using Snowflake and Streamlit, it provides interactive insights into regional disparities, helping guide sustainable infrastructure development.
data-analysis data-visualization pmay-analysis sanitation-coverage snowflake-integration streamlit-dashboard sustainable-development
Last synced: 26 Mar 2025
https://github.com/thecoderpinar/customer-segmentation-clv-analysis
Optimize marketing strategies and enhance decision-making. Explore customer data, segment behavior, calculate CLV, analyze demographics, and visualize insights. 🚀
clv-analysis customer-segmentation data-analysis data-science data-visualization jupyter-notebook machine-learning marketing-strategy python
Last synced: 03 Apr 2025
https://github.com/defrecord/value-alignment-toolkit
A comprehensive toolkit for implementing, analyzing, and validating AI value alignment based on Anthropic's 'Values in the Wild' research.
ai anthropic data-analysis ethics privacy python simulation value-alignment
Last synced: 20 Jul 2025
https://github.com/salman-khan-mohammed/predicting-the-intent-of-online-shoppers
This project aims to predict online shoppers' purchase intentions using browsing history and user data from e-commerce sites. By analyzing clickstream and session information, the goal is to create a machine learning model that accurately forecasts customers' likelihood of making a purchase.
cluster-analysis data-analysis data-pre eda outliers prediction
Last synced: 31 Oct 2025
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/sourabh-kumar04/numpy-basic
Numpy-Basic is a structured learning repo covering NumPy from basics to advanced. It includes arrays, indexing, reshaping, filtering, vector ops, angle functions, stats, and .npy file handling. Each concept is explained with code, examples, and Matplotlib visualizations in both light and dark modes. Ideal for students and data learners.
data-analysis data-science data-visualization learning learning-resources machine-learning matplotlib numerical-computing numpy python python-library python-programming
Last synced: 10 May 2026
https://github.com/giordano-lucas/tesco-extension
Products clustering and interactive visualization
clustering data-analysis data-visualization tesco
Last synced: 17 Jun 2026
https://github.com/vitia-fritelle/ipynb_converter
Jupyter notebook to Python file conversor
data-analysis data-science jupyter-notebook python
Last synced: 28 Apr 2026
https://github.com/patilni3/seaborn-in-depth
Python's Seaborn Library for Data Analysis, Machine Learning, Data Science and many more...
data-analysis data-reporting data-representation data-science data-visualization plots-in-python powerbi seaborn sns
Last synced: 03 Apr 2025
https://github.com/mksingh431/python-project
Learn Pandas with exercises and sample projects
data-analysis data-science data-visualization project projects python
Last synced: 03 May 2026
https://github.com/luminati-io/amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 03 Jan 2026
https://github.com/tynoee/covid19_data_analysis
This is an analysis of Covid 19 dataset using multiple SQL queries. The dataset used for this analysis includes various information regarding COVID-19 cases such as confirmed cases, deaths, and recoveries, segmented by different geographical locations and time periods.
data-analysis excel sql sqlserver-2019 tableau tableau-public
Last synced: 16 Feb 2026
https://github.com/utkarsh-284/house_price_prediction
Python Project of predicting prices of Houses
data-analysis data-science data-visualization feature-engineering feature-extraction jupyter-notebook linear-regression machine-learning python
Last synced: 19 May 2026
https://github.com/vasilikizarkadoula/data-analysis-auth-2020
Εxercises for Data Analysis course in Faculty of Engineering of Aristotle's University of Thessaloniki
confidence-intervals correlation-coefficient data-analysis dimensionality-reduction hypothesis-testing probability-distribution pvalues regression uncertainty
Last synced: 25 Jul 2025
https://github.com/muneeb1030/eda-of-physionets-ecg
EDA of Physionet Data set regarding "A Large Scale 12 Lead Electrocardiogram Database for Arrhythmia Study 1.0.0". This project focuses on the preprocessing of electrocardiogram (ECG) signals and utilizes Principal Component Analysis (PCA) for dimensionality reduction
12-lead-ecg data-analysis ecg-signal eda pca python3 wfdb
Last synced: 25 Jul 2025
https://github.com/balapriyac/python-data-analysis
Code along to simple data science and analysis projects and tutorials
data-analysis data-science python
Last synced: 25 Jul 2025
https://github.com/patilni3/numpy-in-depth
Python's NumPy Library for Data Analysis, Machine Learning, Data Science and many more...
data-analysis data-engineering data-science machine-learning numpy pandas
Last synced: 10 May 2026
https://github.com/frankelavsky/political-polarization-challenge
I had 8 hours to build a solution to the research claim that "politics have become more divided in the past 50 years." You can navigate views of congressional voting patterns using arrows. I used d3, require, MVC pattern, and vanilla js. Pre-processed the data in node.js. Data is from DW-NOMINATE: ftp://k7moa.com/junkord/HANDSL01114A20_STAND_ALONE_30.DAT
client-side css d3 d3js data-analysis data-visualization frontend frontend-app html interactive interactive-visualizations javascript modular nodejs political-science politics requirejs research single-page-app visualization
Last synced: 06 Apr 2026
https://github.com/priyanshubiswas-tech/customer-churn-analysis-eda-
This project analyzes customer churn in a telecom company using Python, Pandas, SQL, and data visualization. It identifies key factors like contract type, payment method, and tenure to provide insights for improving retention. The skills gained are applicable in customer retention, user behavior analysis, fraud detection, and HR analytics.
data-analysis data-visualization dataset explo exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas-dataframe python seaborn seaborn-python
Last synced: 08 May 2026
https://github.com/cosmoduende/r-uber-trips-analyisis
Explore your activity on Uber with R: How to analyze and visualize your personal data history. Find out how you consume the Uber App using a copy of your data.
analisis-de-data data-analysis data-analytics data-science data-visualisation data-visualization data-viz eda flexdashboard ggmap ggplot2 mobility-as-a-service qmplot r-language r-programming ridesharing uber uber-data visualizacion-de-datos
Last synced: 14 Jul 2025
https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction
Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.
breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn
Last synced: 17 May 2026
https://github.com/adolbyb/data-science-python
An Introduction to Data Science and Data Visualization with the FAU Data Science and Machine Learning Club
data-analysis data-science data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/phillbertnevinemmanuel/coviddeathvaceda
an exploratory data analysis based on dataset of covid statisics from 2020-2022
Last synced: 09 Apr 2025
https://github.com/sahaavi/uber-vs-lyft
Advance Predictive Modeling in R
data-analysis data-science eda machine-learning predictive-modeling r
Last synced: 13 Sep 2025
https://github.com/hariyebk/eplinsights
English Premier League 2018/2019 Data Analysis
class-composition data-analysis filesystem-library
Last synced: 26 Jul 2025
https://github.com/fatihilhan42/web_scraping_football_statistics_per_game_data-main
In this notebook I will describe the process of scraping data from web portal understat.com that has a lot of statistical information about all games in top 5 European football leagues.
data-analysis data-manipulation data-science data-scraping data-visualization jupyter-notebook python
Last synced: 19 May 2026
https://github.com/patilni3/matplotlib-in-depth
Python's Matplotlib Library for Data Analysis, Machine Learning, Data Science and many more...
data-analysis data-representation data-science data-visualization matplotlib matplotlib-pyplot plots-in-python powerbi seaborn
Last synced: 03 Apr 2025
https://github.com/rayyan9477/diamond-price-forecasting
This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree
data-analysis data-science decision-trees eda linear-regression machine-learning
Last synced: 26 Jul 2025
https://github.com/chengkangzai/malaysia-pandemic-dashboard
covid-19 data-analysis pandemic-dashboard
Last synced: 29 Mar 2025
https://github.com/incubrain/awesome-maharashtra-data
A collection of datasets specific to Maharashtra, India. WIP
ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi
Last synced: 23 May 2026
https://github.com/gad-dimnt-cptec/scanplot
Um sistema de plotagem simples para o SCANTEC
data-analysis jupyter-notebook pandas python scantec
Last synced: 17 Jan 2026
https://github.com/subhojit45/python3-iphones-x-flipkart-sales-analysis
A simple six questions and their insights derived from iphone sales on Flipkart dataset.
data-analysis jupyter-notebook python3 visual-studio-code visualization
Last synced: 19 May 2026
https://github.com/geobatpo07/office-hours-bootcamp
Practical case studies and labs from the Akademi 2025 Data Science & AI Bootcamp office hours.
artificial-intelligence data-analysis data-science data-visualization database deep-learning learning learning-by-doing machine-learning statistics
Last synced: 07 Mar 2026
https://github.com/windjammer6/8.-star-wars-data-analysis-python
A personal project to analyse data from a Star Wars survey. Python libraries used: Pandas, Matplotlib
Last synced: 27 Jul 2025
https://github.com/graphieros/data-visualisation
data visualisation solutions in vanilla js
data-analysis data-visualization pure-javascript svg-manipulating
Last synced: 07 Sep 2025
https://github.com/yernaz-togizbayev/microsoft_store_data-analysis
Microsoft Store
data data-analysis data-visualization jupyter-notebook python3
Last synced: 15 May 2026
https://github.com/pranabdas/suvtools
Python library for analyzing and visualizing SSLS SUV Beamline data.
data-analysis data-visualisation python
Last synced: 07 May 2025
https://github.com/rishabhraj43/diwali-sales-analysis
A Data Analysis project made in Python
Last synced: 01 May 2026
https://github.com/adrianycmc/introducaoadatascience
Explorando dados: Utilizando Python, Pandas e o Colaboratory do Google.
data-analysis data-science jupyter pandas python
Last synced: 30 Apr 2026
https://github.com/grypesc/graduateadmissions
Visualization, analysis and predictive modeling of a Kaggle graduate admissions dataset.
data-analysis data-mining data-science data-visualization dataset
Last synced: 08 Jul 2025
https://github.com/nysportsfan/Gun-Violence-in-the-US
This repository contains all the relevant files for my first capstone project as part of the Springboard Data Science Career Track.
data-analysis data-science data-visualization machine-learning python3 statistics
Last synced: 10 May 2025
https://github.com/celineboutinon/drinking-water-for-all
OpenClassrooms Data Analyst 2022-2023 - Projet 8 using PowerBI Desktop
data-analysis data-analytics data-structures data-visualisation database-design database-schema databases mysql-connector-python mysql-workbench power-bi-dashboard python sql
Last synced: 27 Apr 2026
https://github.com/tbep-tech/ecometab-r-training
Website materials for R training on ecosystem metabolism
data-analysis open-science workshop
Last synced: 19 Feb 2026
https://github.com/tbep-tech/verified-wbids
Materials for evaluation of verified WBIDs in the Tampa Bay watershed
data-analysis open-science tampa-bay tbep water-quality
Last synced: 19 Feb 2026
https://github.com/airdac/mva-bank_india
Multivariate Analysis of an Indian bank's dataset about loan paybacks in R. Team project from UPC's Master's Degree in Data Science
data-analysis data-science multivariate-analysis r upc
Last synced: 26 May 2026
https://github.com/birkkarlsen/beam_dynamics_tools
Repository filled with functions related to the analysis of longitudinal beam dynamics measurements and simulations
accelerator-physics beam-dynamics data-analysis
Last synced: 19 Sep 2025
https://github.com/anas436/extreme-weather-forecasts-ai
Improving Extreme Weather Forecasts Using AI
artificial-intelligence data-analysis data-science data-visualization keras-tensorflow machine-learning neural-network time-series-forecasting
Last synced: 22 Jul 2025
https://github.com/vkbo/osirisanalysis
Matlab toolbox for analysing simulation results from Osiris 3
data-analysis matlab matlab-gui physics-simulation
Last synced: 10 May 2025
https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program
The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program
data-analysis data-science machine-learning-algorithms
Last synced: 16 May 2026
https://github.com/tbep-tech/red-tide-twitter
Supplementary materials to accompany Skripnikov et al. red tide Twitter analysis
ccmp-li1 ccmp-wq3 data-analysis open-science tampa-bay tberf water-quality
Last synced: 19 Feb 2026
https://github.com/jasoncobra3/whatsapp_chat_analyzer
WhatsApp Chat Analyzer is a powerful tool that provides insightful analytics from your WhatsApp conversations. Whether you're curious about your chatting habits, want to analyze group dynamics, or need to extract meaningful data from your conversations, this tool has got you covered!
data-analysis data-science data-visualization machine-learning streamlit streamlit-webapp whatsapp-chat whatsapp-chat-analyzer
Last synced: 31 Jan 2026
https://github.com/codingprivacy/feedback-portal-system
AI based Feedback Portal System which takes periodic feedbacks from users via highly human friendly chat-bot, analyse the responses through NLP and sentiment analysis and visualize the analysis on the portal website.
artificial-intelligence bokeh chatbot data-analysis flask mysql-database nlp portal python sentiment-analysis visualization website
Last synced: 19 Sep 2025
https://github.com/jcaperella29/financial-data-scraper
Financial Data Scraper is a Python-based web scraping tool using Selenium to extract financial data from Stock Analysis. It scrapes Income Statement, Balance Sheet, Cash Flow, and Ratios for multiple companies and saves them as CSV files.
automation data-analysis finance financial-statements investment python selenium stock-market web-scraping
Last synced: 28 Jul 2025
https://github.com/ibensusan/wine-properties-assessment
Wine Properties Assessment using Microsoft Excel
data-analysis data-visualization excel
Last synced: 20 Mar 2026
https://github.com/ncasuk/decades-pp
Post processing library for the data from the FAAM aircraft
atmospheric-sciences data-analysis data-processing meteorology science
Last synced: 07 Mar 2026
https://github.com/sen2pi/bloquinho
Uma WebApp do genero de notion para correr on permises
agenda data-analysis data-visualization database education markdown note-taking notebook notebooks notepad notes notes-app notion obsidian password password-generator password-manager password-safety passwords personal-blog
Last synced: 09 Apr 2026
https://github.com/andystmc/nextflownyc
Developed a machine learning model (Bidirectional LSTM) to forecast NYC traffic volumes using 10 years of automated traffic count data. Achieved strong predictive accuracy, demonstrating the power of deep learning for urban traffic analysis.
data-analysis data-cleaning data-science data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning jupyter-notebook lstm-neural-networks machine-learning numpy pandas predictive-modeling python3 scikit-learn tensorflow-keras traffic-flow-forecasting
Last synced: 07 Apr 2026
https://github.com/sharathsphd/coffee_causality
Data-driven analysis of coffee shop sales using correlation, regression, and causal inference. A Jupyter Book project exploring foot traffic, weather patterns, and business analytics.
business-analytics causal-inference correlation data-analysis foot-traffic forecasting github-pages jupyter-notebook machine-learning open-source python regression retail-analytics statistics storytelling time-series visualization weather-analysis
Last synced: 18 May 2026
https://github.com/deepanshkhurana/cloudsimplifier
Simple helper functions to fetch and read data from various formats stored on Amazon AWS S3 Buckets. Most functions are essentially wrapping over cloudyR.
amazon aws cloudyr data-analysis data-fetching data-science package r rpackage s3
Last synced: 20 May 2026
https://github.com/fbraza/paris_airbnb
Analysis of Paris AirBnB data using R and Shiny
analysis data data-analysis paris-airbnb r shiny
Last synced: 21 Mar 2025
https://github.com/2003harsh/house-price-prediction-using-machine-learning
This project features a web app that predicts house prices using a linear regression model. Users can input details like location, square footage, bathrooms, and bedrooms through an HTML form. I've added a CI/CD pipeline with GitHub Actions, unit testing with pytest, and automated Docker containerization to improve deployment and robustness.
ci-cd data-analysis docker-image flask linear-regression machine-learning matplotlib mlops-workflow requests scikit-learn
Last synced: 04 Jan 2026
https://github.com/tiwarishubham635/uber-data-analysis-using-r
Analyzes the Uber Cab data using plots, heatmaps and dataframes
data-analysis data-visualization r
Last synced: 14 Apr 2025
https://github.com/michellepellon/jobx
A modern, powerful job scraper for LinkedIn, Indeed and beyond.
compensation data data-analysis indeed indeed-scraping jobs jobsearch linkedin linkedin-scraper
Last synced: 17 Jan 2026
https://github.com/ahmednurabdii/data-analytics-portfolio-superstore
My first portfolio project showcasing data cleaning, analysis, and visualization of Superstore sales data.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas portfolio-project python sales-analysis scipy seaborn superstore-dataset
Last synced: 07 Apr 2026
https://github.com/numbersprotocol/dyda
Dynamic data pipeline framework
ai artificial-neural-networks data-analysis data-science
Last synced: 07 Nov 2025
https://github.com/yashodatta15/maven_unicorn_company_challenge
An analysis on Unicorn companies.
data-analysis data-cleaning data-visualization powerbi unicorn-companies
Last synced: 19 Feb 2026
https://github.com/archived-blueprints/amazonathena-blueprints
Simplified blueprints for building data pipelines with Amazon Athena.
amazon-athena athena cli data-analysis data-engineering data-science elt etl
Last synced: 29 Jul 2025
https://github.com/tim-hub/python-course
A new Python Course, a new trial to offer MOOC style learning resources and content for python learners
Last synced: 17 Mar 2025
https://github.com/aldomann/tropical-cyclones
Scripts to replicate the analyses and figures from "Scaling of tropical-cyclone dissipation" by Corral et al.
bachelor-thesis data-analysis hurricanes mathematical-statistics ocean studies
Last synced: 19 May 2026
https://github.com/rayyan9477/youtube-spam-detection-with-flask-and-machine-learning
This is a web application built using Flask that detects spam comments on YouTube using a Naive Bayes classifier. It leverages techniques such as CountVectorizer for feature extraction and scikit-learn for machine learning. The application reads data from a CSV file and predicts whether a comment is spam or not.
data-analysis data-science machine-learning nlp-machine-learning spam-detection
Last synced: 21 Sep 2025
https://github.com/atharvbyadav/expensemate
A simple, lightweight personal finance tracker built with Streamlit and SQLite. Log expenses, visualize spending habits, manage budgets, and download reports – all through an interactive web interface.
budgeting data-analysis data-visualization expense-tracker finance-app open-source pandas personal-finance plotly python sqlite streamlit streamlit-webapp
Last synced: 28 Apr 2026
https://github.com/mynenik/xyplot-win32
XYPLOT Plotting and Data Analysis Program for 32-bit Windows
cpp data-analysis data-manipulation data-visualization forth mfc windows-app
Last synced: 18 Mar 2025
https://github.com/mariam-badr-mb/gtc-land-type-classification
This project develops a machine learning model to classify land cover types in Egypt using Sentinel-2 satellite imagery. The system detects categories such as agriculture, water bodies, urban areas, deserts, roads, and tree cover.
data-analysis data-visualization deep-neural-networks eda machine-learning model-architecture streamlit
Last synced: 12 Jun 2026
https://github.com/reusjimenez/python-data-analysis
Casos completos y ejercicios prácticos de análisis de datos. 📊
data-analysis data-visualization jupyter-notebook machine-learning matplotib numpy pandas python sklearn
Last synced: 12 Apr 2026
https://github.com/priyanshubiswas-tech/data-analysis-with-python
This repository showcases Python projects completed for a Data Analysis with Python certification, demonstrating skills in data manipulation, visualization, and statistical analysis using libraries like NumPy, Pandas, Matplotlib, Seaborn, and SciPy.
data-analysis demographic-data-analyzer mean-variance-standard-deviation-calculator medical-data-visualizer page-view-time-series-visualizer python scipy-stats sea-level-predictor seaborn
Last synced: 07 May 2025
https://github.com/jrschmidtt/csv-to-html
Convert csv file to html table in javascript.
body-parser csv data-analysis javascript nodejs oop
Last synced: 13 May 2026
https://github.com/sumitgirwal/procoder-public
"ProCoder", which is a web-based application providing massive open online courses for both professionals and students. It aims to offer a platform for learning coding skills online, accessible to anyone who is interested in learning programming or enhancing their coding knowledge. ProCoder provides courses on various programming languages, tools.
blog-platform bootstrap-4 chat-application css3 data-analysis django-crud django-project html5 javascript numpy-library pandas-library python3
Last synced: 05 Mar 2026
https://github.com/lmuffato/project-restaurant-orders-trybe
Projeto restaurant orders - Projeto avaliativo da Trybe do Bloco 36: Estrutura de Dados I: Arrays, Hashmaps e Sets
array array-set csv data data-analysis hashmap python set trybe trybe-projects
Last synced: 13 Sep 2025
https://github.com/gappeah/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 31 Jul 2025
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 31 Jul 2025
https://github.com/banyc/dfsql
SQL REPL/lib for Data Frames
cli csv data-analysis jsonl ndjson repl sql
Last synced: 31 Jul 2025
https://github.com/devandrenicolas/analise-de-vendas
This project is a comprehensive data analysis tool designed to analyze sales performance data. It includes modules for generating fake sales data, cleaning and preprocessing the data, and performing exploratory data analysis (EDA) with advanced visualizations.
data-analysis data-visualization faker-generator matplotlib pandas python
Last synced: 07 May 2026
https://github.com/myself-aas/quantium_data_analytics_forage
This project analyzes retail customer chip purchasing behavior using Python, focusing on customer segmentation and key spending drivers to provide data-driven insights for strategic category management recommendations.
data-analysis data-engineering data-science data-visualization feature-engineering forage internship-project matplotlib-pyplot numpy-library pandas-dataframe pearson-correlation python quantium-virtual-experience scipy-stats seaborn
Last synced: 31 Jul 2025
https://github.com/priyanshubiswas-tech/airflow_dbt_superset_project
End-to-end ITSM data engineering pipeline using PostgreSQL, DBT, Airflow, and Superset. Covers ingestion, cleaning, transformation, orchestration, and visualization, validated across Docker Toolbox and Docker Desktop environments.
apache-airflow apache-superset dags data-analysis dbt docker etl etl-automation etl-pipeline postgresql
Last synced: 07 May 2025
https://github.com/pseudomanifold/us-inauguration-speeches
Data & feature extraction for U.S. inauguration speeches
data-analysis data-science inauguration politics speech speeches
Last synced: 15 May 2025
https://github.com/bretsw/subreddits-over-time
Study of the r/Teachers and r/education subreddits over time
Last synced: 12 Jan 2026
https://github.com/simranjeet97/google-cloud-access-using-python
Google Drive Access using Python, Interact Programmatically and Manipulate accordingly
data-analysis data-science data-structures data-visualization gcp gcp-cloud-functions gcp-compute gcp-compute-engine gcp-projects gcp-storage google googlecloud googlecloudplatform python python3 visualization
Last synced: 23 May 2026
https://github.com/dalyanalytics/nonprofit-analytics-tools
Free analytics tools built specifically for nonprofits 🤝
analytics board-governance data-analysis donor-retention fundraising-app grant-research nonprofit rshiny shinylive
Last synced: 07 Mar 2026
https://github.com/listiangr/product_sales_data_analysis
Proyek ini menganalisis data penjualan untuk memberikan wawasan tentang tren penjualan, profitabilitas, dan permintaan produk, guna membantu perusahaan merencanakan strategi harga, promosi, dan pengelolaan inventaris yang lebih efektif.
corrplot data-analysis data-preprocessing data-visualization dplyr ggcorrplot ggplot2 product-sales r-language rstudio
Last synced: 03 Apr 2025
https://github.com/adityakumarsingh01/customer-purchase-behaviour-analysis
A data analysis project exploring online consumer behavior and FOMO effects using EDA on survey data.
consumer-behavior data-analysis eda fomo online-shopping python survey-data
Last synced: 25 Apr 2026
https://github.com/imannoferesti/3d_datavisualizer_vr
🚀 3D Scatter Plot Visualizer in Unity VR: Explore large datasets in 3D with dynamic point dispersion and value grouping (round, ceil, floor). Perfect for trend analysis and dense data visualization! 🎨📊
3d-scatter-plot big-data continuous-data-handling customizable-visualization data-analysis data-science-tools data-visualization dense-data-visualization discrete-data-handling exploratory-data-analysis immersive-analytics interactive-visualization overlapping-data-points trend-analysis unity value-grouping virtual-reality vr vr-data-visualization
Last synced: 03 May 2026
https://github.com/vhtua/group4_data_analysis
Hierarchical Cluster Analysis: Movie Genres Preferences
data-analysis hierarchical-clustering r unsupervised-learning
Last synced: 29 Mar 2025