Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/rafiulgits/data-analysis
Data Analysis with python programming language
classification data-analysis data-mining data-visualization machine-learning mglearn regression regression-models sklearn
Last synced: 08 May 2026
https://github.com/ipanalytics/vpn-provider-overlap-intelligence
Aggregate VPN provider infrastructure overlap analysis: exact-IP overlap, shared /24 prefixes, hosting dependency, and provider relationship clusters. No raw VPN IP lists.
anti-fraud asn cybersecurity data-analysis fraud-detection infrastructure ip ip-intelligence ip-reputation network-analysis network-intelligence osint proxy-detection threat-intelligence vpn vpn-detection
Last synced: 25 May 2026
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/mehulcode12/atliq-bank_creditcard_transaction_analysis
The credit card project at Atliq Bank comprises two key phases: market identification and trial. This initiative aims to leverage mathematical and statistical concepts to analyze data related to demographics, income, credit scores, and spending patterns in order to identify the target audience for the credit card.
codebasics data-analysis data-science data-visualization mathematics python python3 statistics
Last synced: 30 Apr 2026
https://github.com/phillbertnevinemmanuel/movieindustryanalysis-correlation
This project is a comprehensive data analysis endeavor within the Movie Industry, spanning from Data Cleaning to Exploratory Data Analysis, Correlation Analysis, and Temporal Analysis. The dataset was sourced from Kaggle, purportedly scraped using the IMDb API. Python was the primary tool utilized for analysis.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/karthikudyawar/passwordometer
To predict the strength of the password
cybersecurity data-analysis data-visualization dataset docker exploratory-data-analysis-eda fastapi jupyter-notebook mongodb password-security password-strength-meter
Last synced: 30 Apr 2026
https://github.com/alcestide/scianalytics
Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.
csv data-analysis data-science data-visualization pandas plotly python science-research statistics
Last synced: 30 Apr 2026
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
recognizing handwritten numbers & comparing the Life Expectancy vs Fertility in 1960 & 2013 of regions
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 12 Apr 2026
https://github.com/mgobeaalcoba/matplotlib_y_seaborn
Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.
data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/mariam-badr-mb/gtc-ml-project1-hotel-bookings
The goal of this project is to build a robust data preprocessing pipeline for a hotel booking cancellation prediction model. The focus is not on training the final machine learning model but on ensuring that the dataset is clean, consistent, and ML-ready.
cleaning-data data-analysis exploratory-data-analysis
Last synced: 05 Sep 2025
https://github.com/bcko/ud-da-stroopeffect
Udacity Data Analyst Nanodegree Project : Test a Perceptual Phenomenon (Stroop Effect)
data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree
Last synced: 04 Jul 2025
https://github.com/rani-sikdar/pwc-virtual-internship-powerbi
Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.
business-analytics business-intelligence data-analysis data-cleaning data-visualization interactive interactive-visualizations powerbi
Last synced: 07 Jan 2026
https://github.com/komailmk/instagram-reach-forecasting
This repository provides a Python-based solution for forecasting Instagram reach using historical data and SARIMA modeling techniques.
data-analysis data-visualizations machine-learning
Last synced: 05 Oct 2025
https://github.com/nmsby/pca-machine-learning-lab
Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.
data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization
Last synced: 01 May 2026
https://github.com/muhammadhilmyputrarisma/ab-test
Python code for A/B testing on Cookie Cats game data. This project analyzes the impact of moving the first gate from level 30 to level 40 on player retention and game rounds, helping to evaluate if delaying the gate improves player engagement and gameplay experience.
ab-testing cookie-cats data-analysis data-visualization game-analytics python statistics
Last synced: 18 May 2026
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 01 May 2026
https://github.com/riddhis2226/titanic-survival-data-analysis
Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.
data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3
Last synced: 01 May 2026
https://github.com/ajimaulana123/e-commerce-data-analis
Analisis dataset e-commerce guna menjawab kebutuhan product mana yang paling laris dibeli customer
Last synced: 28 Apr 2026
https://github.com/pngo1997/axa-xl-insurance-bi-dashboard
Provides a comprehensive analysis of insurance submissions, approvals, compliance rates, and profitability for AXA XL Insurance.
bi-analytics bi-dashboard business-analytics data-analysis filtering performance-analysis powerbi segmentation visualization
Last synced: 08 Feb 2026
https://github.com/priboy313/pandasflow
A set of custom python modules for friendly workflow on pandas
catboost data-analysis data-science pandas phik python scikit-learn shap
Last synced: 20 Jan 2026
https://github.com/archie-cm/credit_risk_model_vix_id-x_partners
The objective project is to decrease the company's losses by up to 30% through bad loans by creating a machine learning system to assist in automating loan assessments
credit-risk data-analysis data-visualization machine-learning scorecard
Last synced: 01 May 2026
https://github.com/muneeb1030/webscrapper_mastodon
The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.
data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping
Last synced: 09 Oct 2025
https://github.com/willie-conway/datavista
DataVista is a comprehensive, production-grade data analysis and machine learning platform that combines real-time data ingestion from live APIs, interactive visualizations, statistical analysis, hypothesis testing, and machine learning model training — all in a unified, professional-grade interface. Built with React and Recharts.
analytics-platform api-integration classification coingecko-api csv-import data-analysis data-cleaning-and-preprocessing data-pipeline data-science data-visualizations etl hypothesis-testing json-export machine-learning-models open-meteo react recharts regression statistics world-bank
Last synced: 30 May 2026
https://github.com/shrawans007/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer's Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments hotel-service ms-excel ms-excel-data-analytics pivot-tables sentiment-analysis tableau tableau-public text-reviews treemap
Last synced: 22 Mar 2025
https://github.com/y-india/project-road-accident-severity-prediction-system
see README below , please.
application data data-analysis data-classification data-cleaning data-science data-visualization data-visualization-project machine-learning ml pandas project real-world-problem-solving real-world-project road-project streamlit-webapp
Last synced: 02 May 2026
https://github.com/melogabriel/nubank-expenses-analysis
This project consolidates monthly credit card statement data from Nubank into a single CSV file using Python, enabling data visualization through a Google Sheets dashboard in Looker Studio.
data-analysis data-visualization googlesheets lookerstudio pandas python
Last synced: 02 May 2026
https://github.com/mauriceling/sipy
Python-Based Statistical Graphical User Interface for Python
data-analysis julia julia-language jupyter jupyter-kernels pandas pandas-python python python3 r r-packages r-project r-stats scikit-learn scipy scipy-stats statistical-analysis statistical-tests statistics
Last synced: 15 Apr 2026
https://github.com/danhenriquex/data-science-project
The main goal of this project was to apply the concepts of data visualization and analysis.
data-analysis data-science numpy pandas python
Last synced: 12 Apr 2026
https://github.com/fybex/chatgpt-conversations-analysis
Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.
chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis
Last synced: 02 May 2026
https://github.com/phammings/sales-management-analysis
Sales management analysis and Power BI dashboard for sample business request and user stories
data-analysis excel powerbi sql
Last synced: 01 Feb 2026
https://github.com/prernarohra/mental-health-prediction
This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.
data-analysis data-science data-visualization machine-learning mental-health python
Last synced: 20 May 2026
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/shakhthi/deep-learning
All Materials, Practice codes and Projects related ML & DL
data-analysis deep-learning machine-learning
Last synced: 09 Apr 2025
https://github.com/swarnim1812/crime_project
AI-Driven Crime Forecasting Across Indian States — A pioneering machine learning project that harnesses time series modeling (SARIMAX, Ridge Regression) to uncover patterns and forecast crime trends using real-world multi-state temporal and socio-economic data.
analytics crime-locator crime-prediction data-analysis deep-learning machine-learning prophet-facebook sarimax-model time-series-forecasting
Last synced: 31 Jan 2026
https://github.com/ferrangarciarovira/premier-league-betting-analysis
Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.
betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics
Last synced: 03 May 2026
https://github.com/mirseo/pandas_learning
pandas_learning
data-analysis data-analysis-python data-science data-visualization numpy numpy-example pandas pd python python-3 python3
Last synced: 03 May 2026
https://github.com/savinrazvan/heredity
An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.
ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python
Last synced: 27 Feb 2025
https://github.com/jossimmar/ensa-scripts_py
Repositorio destinado al manejo de datos de consumo de los Clientes Mayores de ENSA del Grupo Distriluz.
data-analysis electrical-engineering python sqlite
Last synced: 10 May 2026
https://github.com/allanotieno254/codsoft
This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.
classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression
Last synced: 15 May 2025
https://github.com/leosimoes/uerj-tcc-analisador-dados-texto
Texto do trabalho de conclusão de curso (TCC) em engenharia de computação. Aplicativo Web para análise de dados.
data-analysis data-science data-visualization python streamlit
Last synced: 24 Mar 2025
https://github.com/gustavo-zamai/shop_data_analisys
Analysis diferents shopping mall sells
data-analysis openpyxl pandas python3 pywin32
Last synced: 01 Mar 2025
https://github.com/0xjeremy/me-18-final
Data collection and Analysis tools for IMUs
data-analysis imu raspberry-pi
Last synced: 03 May 2026
https://github.com/soumya-thoutam/revenue-and-demand-forecasting-analysis
End-to-end data analysis project with SQL and Power BI for revenue and demand forecasting in bike-sharing.
data-analysis data-visualization powerbi sql sql-server
Last synced: 16 Mar 2025
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
This is a responsive streamlit covid 19 Dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/azmainadel/twitter-data-neo4j
Playing with graph database on a large dataset of twitter data.
data-analysis data-visualization neo4j-database snap
Last synced: 06 Apr 2025
https://github.com/anshmnsoni/pizza-market-analysis
data-analysis powerbi-visuals powerbidashboard
Last synced: 05 Feb 2026
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 04 May 2026
https://github.com/odeyiany2/flit-apprenticeship-data-science-projects
This repo contains all my projects for my FLiT Apprenticeship
data-analysis data-science data-visualization machine-learning sql
Last synced: 17 May 2026
https://github.com/equicirco/cirquant
Code and data delivering for quantifying circularity through open data and digital innovation.
circular-economy data-analysis database julialang official-statistics
Last synced: 13 Jan 2026
https://github.com/stastnypremysl/lsql-csv
lsql-csv is a tool for small CSV file data querying from a shell with short queries. It makes it possible to work with small CSV files like with a read-only relational databases. The tool implements a new language LSQL similar to SQL, specifically designed for working with CSV files in a shell. LSQL aims to be a more lapidary language than SQL.
csv data-analysis data-processing haskell language linux-shell lsql lsql-csv new-language query-language relational-database sql unix-command unix-philosophy unix-shell
Last synced: 25 Feb 2026
https://github.com/macdon112/layoff-analysis
SQL data cleaning & analysis of global layoffs
data-analysis data-cleaning data-exploration sql
Last synced: 21 Feb 2026
https://github.com/ryanfranklin237/data-visualization-python
A tool that allows you to visualize data from a csv or excel file in a graph or charts form
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 11 Jun 2026
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 12 Jun 2026
https://github.com/bgr8/bokeh-ile-veri-gorsellestirme
Data visualization with Bokeh Library
bokeh color data-analysis data-visualization hbar html python vbar
Last synced: 30 Oct 2025
https://github.com/seekinginfiniteloop/fedcal
A feature-rich Python calendar that enables time series analyses of changes in federal workforce schedules and shifts in executive department funding status.
data-analysis data-science econometrics economic-data economics federal federal-government hr pandas pandas-library pandas-python pydata python
Last synced: 15 Apr 2026
https://github.com/angelgardt/wlm-sdarp-old
World of Linear Models: Statistics & Data Analysis in R for Psychologists
data-analysis data-visualization gh-pages manim-animations quarto r rstudio statistics
Last synced: 04 May 2026
https://github.com/olob0/badwords-pt-br
💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 06 Jan 2026
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 16 Mar 2025
https://github.com/apache/cloudberry-devops-release
DevOps and Release for Apache Cloudberry (Incubating)
ai big-data cloudberry data-analysis data-warehouse database devops distributed-database greenplum mpp olap postgres postgresql
Last synced: 04 Sep 2025
https://github.com/unnatmalik/dattavism-ai-powered-data-insight-generator-
Dattavism is an AI-powered data insight platform that transforms raw CSV files into comprehensive, contextualized reports—complete with visualizations, statistical summaries, and natural language insights. Dattavism is designed to handle datasets across diverse domains. it is Built using Python, Streamlit, Gemini API, Pandas, Matplotlib, NumPy,
data-analysis python streamlit
Last synced: 24 Jul 2025
https://github.com/kimtth/agent-data-analyst-stream-chainlit
⚡️Chainlit-based Data Analyst Chat Agent (Responses API, Server Sent Events) 📈
agent azure-openai chainlit code-interpreter data-analysis server-sent-events stream-response
Last synced: 09 Jun 2026
https://github.com/reusjimenez/powerbi-data-analysis
Dashboards interactivos desarrollados en Power BI, orientados al análisis de datos y visualización efectiva. 📊
business-intelligence dashboards data-analysis dax power-query powerbi
Last synced: 28 Jan 2026
https://github.com/vhawk19/ambaan
just wants the average analyst to be happi
data-analysis duckdb-wasm sql vue
Last synced: 01 Mar 2026
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 21 Jan 2026
https://github.com/boardgameanalytics/bga-notebooks
Exploratory notebooks using the BGG dataset
data-analysis data-exploration data-visualization ipython-notebook python
Last synced: 28 Jan 2026
https://github.com/randomshek/Working-With-Excel
Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot
data-analysis excel power-pivot power-query
Last synced: 20 Jul 2025
https://github.com/foggy-projects/foggy-data-mcp-bridge
MCP Data Bridge for Java. Enabling safe Text-to-Query via a semantic layer, making enterprise data accessible to AI Agents.
agent data-analysis java llm mcp semantic-layer spring-boot text-to-sql
Last synced: 16 Mar 2026
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Menginput data Mahasiswa Yang Melakukan Pelanggran yang siap di data dan di hukum Dan juga siap Terkena Sanksi
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/hifza-khalid/book-management-system-sql
A Book Management System SQL project 📚 featuring tables for Authors ✍️, Books 📖, Customers 👤, and Orders 🛒. Includes sample queries for tracking book sales 💰, pricing by genre 🎭, and customer order history 📅.
book-management data-analysis database-management sql sql-queries
Last synced: 03 Feb 2026
https://github.com/abdul-wahab-318/pakistani-news-sentiment-analysis
This project involves performing sentiment analysis on Pakistani news articles collected over the past month (August-September 2024). The primary goal is to understand media sentiments regarding various topics and events covered in the news. A total of 800+ articles were scraped from multiple news sources.
data-analysis machine-learning pakistan pakistani-politics sentiment-analysis
Last synced: 26 Oct 2025
https://github.com/jabhij/fbi_nics-firearm-background-checks
This project is a try to showcase the use of guns across the US.
data-analysis data-analytics data-science data-visualization tableau
Last synced: 23 Feb 2026
https://github.com/vipul2001/cousera-courses
This repo covers the solution to the assignments of various courses on algorithm,deep learning and data Analytics
coursera-courses data-analysis data-analytics-ibm deep-learning divide-and-conquer neural-network
Last synced: 29 May 2026
https://github.com/ayenpure/stockmeup
This is a class project for 'CIS 610 : Data Science' where I try and validate Stock Market recommendations.
data-analysis data-mining data-science java mapreduce mapreduce-java
Last synced: 24 Oct 2025
https://github.com/sing-group/bew
Public repository for Biofilmfs Experiment Workbench (BEW).
aibench data-analysis data-management java jfreechart workbench
Last synced: 03 Jul 2025
https://github.com/dcs-training/null-hypothesis-testing-with-r
This two-class course will focus on developing theoretical and practical skills for null hypothesis testing in R. Go to the readme file
data-analysis data-wrangling r statistics
Last synced: 24 Oct 2025
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/tunjis/global-superstore_dashboard_tableau
Tableau dashboard with 4 different types of visualisations
charts dashboard data-analysis data-visualisation excel tableau
Last synced: 23 Jan 2026
https://github.com/kirkalyn13/opensignal_autogenerate_report
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 01 Apr 2025
https://github.com/discdiver/new-belgium-ratings
Find the most popular New Belgium beers of all time!
beautifulsoup data-analysis pandas python seaborn webscraping
Last synced: 10 Apr 2026
https://github.com/atymri/linqsimulator
LINQ Simulator is an interactive C# console application designed to let you experiment with LINQ queries in real time.
console csharp data data-analysis linq query sql
Last synced: 23 Oct 2025
https://github.com/cano1998/eda-survival-of-the-titanic
This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.
data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration
Last synced: 21 Jun 2026
https://github.com/souravsuvarna/whatsapp-chat-analyzer-api
The WhatsApp Chat Analyzer API is a public api specifically designed for frontend enthusiasts who are interested in building a WhatsApp Chat Data Visualizer project. Built on FastAPI, this API offers a seamless and efficient method to process chat data and returns the processed result data in JSON format.
api data-analysis data-science fastapi publicapi python
Last synced: 20 Jun 2026
https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation
Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.
clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning
Last synced: 16 Mar 2025
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 16 Mar 2025
https://github.com/eslamdyab21/imdb-data-analysis
This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue
data-analysis pandas python udacity-data-analyst-nanodegree
Last synced: 06 May 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus Data-Analysis with Live Data Streaming from the Website and Made a DASH Web-App at Last.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/prime-infinity/type-one
Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues
data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/ssreeramj/youtube_channels_analysis
This web app gives a detailed analysis of the videos uploaded in a particular youtube channel.
data-analysis heroku pandas python streamlit youtube
Last synced: 29 Apr 2026
https://github.com/hamzacham/data_set_projet-5
data-analysis data-science database dataset jupyter jupyter-notebook paython training
Last synced: 07 May 2026
https://github.com/ayaanjawaid/google_playstore_data_analysis
This project provides an in-depth analysis of Google Play Store apps and user reviews, focusing on understanding app performance, user sentiment, and key trends in app categories. Using Python, I performed data cleaning, feature engineering, and exploratory data analysis (EDA) on app data and reviews.
data-analysis eda html numpy pandas-dataframe plotly python vizualisation
Last synced: 24 Feb 2026
https://github.com/idaraabasiudoh/knn-customer-classification
Labels telecommunication customer base to respective groups to determine service type required for each customer.
data-analysis jupyter-notebook machine-learning pyhton3 scikit-learn
Last synced: 07 May 2026
https://github.com/as16082023/coffee-bean-sales-analysis
Analyzing coffee bean sales data to optimize consumer targeting, product offerings, and strategic marketing in the coffee industry.
coffee-bean-sales dashboard data-analysis data-visualization ms-excel
Last synced: 22 Jan 2026
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/y-india/retail-sales-analysis-project
Analysis and preprocessing of retail store sales data. Includes data loading, merging, and initial inspection. 📌 Recommended: See README.md for detailed project progress and dataset information.
ai dashboard data-analysis data-science data-visualization jupiter-notebook machine-learning matplotlib python real-world-problem-solving real-world-project retail-analytics sales-analysis seaborn sklearn-library streamlit
Last synced: 07 May 2026
https://github.com/rekha0suthar/e-commerce-shopper-s-behaviour-understanding
Understand the online shopper purchasing pattern through Machine learning
data-analysis data-preprocessing data-visualization logistic-regression machine-learning numpy pandas python3 scikit-learn seaborn-plots
Last synced: 12 Apr 2026