Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/deva-246/datacleaning-excel-powerqueryeditor
data-analysis data-science excel powerquery
Last synced: 04 Jan 2026
https://github.com/27ahmad/ibm-data-science-capstone
The Capstone is the final course in the IBM Data Science Professional Certificate program. It's a project that combines all the skills and knowledge you've gained throughout the specialization.
data-analysis data-science folium-maps machine-learning plotly-dash python sql
Last synced: 26 May 2026
https://github.com/audy21/datacamp
Learning portfolio documenting my progress, while taking Data Analyst & Data Science certifications from DataCamp.
data-analysis data-science machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 11 Apr 2026
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/shreyaamenon/data-analysis-aiml-mini-projects
mini projects to help me grow skills in data analysis, artificial intelligence and machine learning.
ai data-analysis jupyter-notebook machine-learning python
Last synced: 11 Apr 2026
https://github.com/docuvesta/youtube-api-fragrance-channel-analytics
Engagement metrics analysis of perfume Youtube channel using Youtube API 🎀
analysis beauty-products comments data-analysis data-analysis-python engagement-metrics insights jupyter-notebook likes-count marketing marketing-analytics perfume python views-count youtube youtube-api youtube-api-v3
Last synced: 03 May 2026
https://github.com/ianfelps/jornada_python
Projetos realizados durante a Jornada Python da Hashtag Treinamentos em maio de 2024.
artificial-intelligence automation data-analysis python
Last synced: 28 Apr 2026
https://github.com/andersoncrs/analisis_exploratorio_de_datos-eda-_rendimiento_estudiantil
Este análisis exploratorio de datos (EDA) realizado sobre el conjunto de datos de rendimiento estudiantil tiene como objetivo identificar y comprender los factores que influyen en el desempeño académico de los estudiantes. A través de la limpieza, transformación y visualización de datos, se busca descubrir patrones y relaciones significatvas.
data-analysis data-exploration data-exploration-and-preprocessing data-visualization seaborn
Last synced: 30 Mar 2025
https://github.com/malucor/livros
Programa em Python para fazer uma análise de dados sobre livros, a partir de um arquivo Excel.
analise-de-dados book books bookshelf data-analysis ipynb jupyter-notebook livro livros python
Last synced: 16 May 2026
https://github.com/dpbm/diabetes-analysis
simple diabete analysis with python
analysis data-analysis data-science data-science-projects data-set diabetes-detection diabetes-prediction machine-learning pandas python
Last synced: 11 Apr 2026
https://github.com/javedali99/geospatial-and-earth-science-data
A comprehensive collection of global earth science and geospatial datasets 🌍
data-analysis dataset earth-observations earth-science earth-sciences earthscience geography geospatial geospatial-analysis geospatial-analytics geospatial-data open-datasets satellite-data
Last synced: 05 Jan 2026
https://github.com/omnipotence-eth/manufacturing-quality-analytics
SQL + Python pipeline for semiconductor NCR analysis — supplier performance, defect Pareto, yield trends
analytics data-analysis etl manufacturing matplotlib pandas postgresql python quality sql
Last synced: 11 Apr 2026
https://github.com/jaseel342/pizza_sales_report
This Pizza Sales dashboards provide valuable insights, including sales trends, pizza category breakdown, size distribution, top-selling, and least-selling pizzas, enabling data-driven decisions to boost sales and business performance.
data-analysis dax-query power-query powerbi sql sql-server-management-studio visualization
Last synced: 05 Jan 2026
https://github.com/jbizzlefoshizzle/crowdfunding-trends-excel
Excel project examining funding trends for Kickstarter projects
category-breakdown data-analysis excel kickstarter kickstarter-campaigns line-graph pivot-charts pivot-tables trends
Last synced: 05 Jan 2026
https://github.com/jatin-s16/hr_mysql_powerbi
This repository contains raw HR data along with key business questions. I performed data cleaning using MySQL queries and wrote analytical queries to extract meaningful insights. The results were then visualised using Power BI to enhance business understanding.
data-analysis data-science data-visualization mysql powerbi
Last synced: 29 May 2026
https://github.com/haonamnguyen/costumer-shopping-trends-analysis
This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.
data-analysis data-visualization sql tableau
Last synced: 05 Jan 2026
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 11 Apr 2026
https://github.com/jedrzej-wydra/data-analysis-associate
Associate Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/mohit01chugh/edu_sql_analysis
SQL queries used to analyze student data.
data-analysis database education plpgsql postgresql sql
Last synced: 17 May 2026
https://github.com/chanmeng666/advanced-neural-network-applications
Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.
classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression
Last synced: 03 May 2026
https://github.com/takshshah-16/spotify_eda
Spotify data analytics and advanced querying
data-analysis eda pgadmin4 postgresql
Last synced: 30 Oct 2025
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/benami171/ml_knn_decision-trees
A ml implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.
classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python
Last synced: 30 Jun 2025
https://github.com/mahmoudwal27/powerbi-projects-for-data-analysis
This project leverages Power BI for data visualization, DAX for custom calculations, and integrates SQL and Excel for data preprocessing, analysis, and reporting, enabling dynamic and interactive insights.
data-analysis data-analysis-project data-analytics-project project
Last synced: 07 Mar 2026
https://github.com/first-coding/smart_analysis
Smart Analysis is an AI-powered data analysis tool that leverages large language models (LLMs) to generate SQL queries from natural language prompts. Upload CSV files, explore the data schema, and retrieve insights with ease. The system ensures error correction in SQL queries, delivering detailed reports and visualizations in a streamlined workflow
data-analysis llm openai prompt-engineering python
Last synced: 08 Mar 2025
https://github.com/sun-lab-nbb/sl-shared-assets
A Python library that stores assets shared between multiple Sun (NeuroAI) lab data acquisition and processing repositories.
data-analysis data-collection data-processing experiment sunlab
Last synced: 10 Mar 2026
https://github.com/upes-open/open-cryptocurrency-analysis
A web app to visualise and predict the cryptocurrency’s impact by using Web scraping, data exploration, EDA and Data Visualization.
analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization
Last synced: 15 Apr 2025
https://github.com/akansharajput280799/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged Big Query and MySQL to analyze 100K records for sales optimization, trend identification, and enhancing customer satisfaction for a retail brand in South America and to provide insights and recommendations to improve their userbase and improve their services
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 19 May 2026
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 02 Jan 2026
https://github.com/lvmalware/lsm-module
A simple statistics module, which provides 4 basic types of regressions using the Least Squares Method (LSM)
data-analysis least-square-regression regression regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/syarwinaaa09/hypothesis-testing-with-mens-and-womens-soccer-matches
a data-driven exploration of international men's and women's football (soccer) match results using Python
data-analysis data-visualization football jupyter-notebook men-vs-women pandas python soccer sports-analytics visualization
Last synced: 05 May 2026
https://github.com/dacrol/filterdataset
Filters a dataset based on attributes
data-analysis dataset deep-learning machine-learning python python3
Last synced: 25 Jul 2025
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/sahilmb/social-media-data-analysis
A social media platform chat analysis system built using python for root level analysis of huge blocks of text
analytics data-analysis python3 streamlit
Last synced: 31 Mar 2025
https://github.com/rcv911/cluster_generation
Generation of cluster test data
cluster cluster-analysis cluster-generation clustering clustering-algorithm clusters data-analysis machine-learning
Last synced: 18 Jan 2026
https://github.com/jfaccioli/citi-bike-tableau
A data analysis of Citi Bike users in Jersey City using Tableau
data-analysis data-visualization tableau tableau-public
Last synced: 26 Jan 2026
https://github.com/bhavinpatel4199/artificial-intelligence---ai-for-decision-making
Artificial Intelligence for Decision Making is a collection of projects focused on applying AI and machine learning techniques to solve decision-making challenges. It includes projects on wine quality prediction, Cassandra data modeling, and text classification, showcasing a range of data science and machine learning applications.
artificial-intelligence cassandra-cql data-analysis data-engineering data-preprocessing data-structures decision-making deep-learning feature-selection machine-learning-algorithms sentiment-analysis text-classification
Last synced: 20 Jun 2026
https://github.com/leosimoes/datascienceacademy-powerbi-clinicadebi
Atividades do curso Análise de Dados com Microsoft Power BI e Clínica de BI da Data Science Academy.
dashboards data-analysis data-visualization microsoft-power-bi power-bi
Last synced: 05 Jan 2026
https://github.com/richardwarepam16/learning-statistics-using-python-for-data-analysis
Data Analyst: Learning Statistics using Python (Descriptive Stats, Inferential Stats, ML)
data-analysis data-analysis-python data-science descriptive-statistics inferential-statistics statistical-learning statistics
Last synced: 15 Mar 2025
https://github.com/rodrigojunqueiradev/rodrigojunqueiradev.github.io
Professional Portfolio - Rodrigo Junqueira
analytics artificial-intelligence data-analysis data-engineering data-science data-visualization machine-learning mathematics nosql powerbi python r sql statistics
Last synced: 15 May 2026
https://github.com/amlanmohanty1/genai-data-analysis-report-generator
Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.
ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python
Last synced: 28 Jan 2026
https://github.com/vyjayanthipolapragada/marketing_statistical_analysis
Statistical analysis of customer data and their impact on the sales of products based on marketing campaigns
customer-data data-analysis dataframes marketing matplotlib numpy pandas python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/manumoolimani/data-analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 21 Feb 2026
https://github.com/tolumie/exploratory-data-analytics-projects
Exploratory Data Analytics – A collection of projects covering data exploration, feature engineering, hypothesis testing, and predictive modeling across diverse datasets, including insurance, real estate, laptops, cars, COVID-19, and the Olympics.
data-analysis data-visualization data-wrangling exploratory-data-analysis-eda feature-engineering hypothesis-testing machine-learning matplotlib numpy pandas predictive-modeling python seaborn statistical-analysis
Last synced: 11 Apr 2026
https://github.com/vanshuchaudhary/retail-sale
project uses MySQL to analyze retail sales data, focusing on customer behavior, sales trends, and product performance. The dataset includes transactions, customer demographics, and purchase details, helping businesses optimize strategies. Key Insights: 📊 Revenue Analysis – Total sales, top-spending customers 📅 Sales Trends
business-intelligence customer-behavior customer-behavior-analysis data-analysis mysql predictive-analytics retail-analytics sales-analysis sql-queries
Last synced: 23 Mar 2025
https://github.com/vishal-verma-96/pre-owned-car-price-prediction-using-streamlit-app
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/ilovenooodles/probstat-water-potability
Tugas Besar Probabilitas dan Statistika 1
csv data-analysis jupyter-notebooks python
Last synced: 03 May 2026
https://github.com/chardyb/prob-and-stats-bmi6106
A repository for Spring 2025 BMI 6106: Statistics and Probability. This repository contains coursework, code examples, and projects exploring statistical methods and probabilistic models in biomedical informatics.
biomedical-informatics data-analysis data-science probability r statistical-modeling
Last synced: 02 Sep 2025
https://github.com/nimomach/skateboarding-in-olympics
Skateboarding made its debut in Olympics at the 2020 Summer Olympics. This is a dashboard focused on "Skateboarding in the Olympics" representing a comprehensive overview of the sport's performance, popularity, and key metrics during the Olympic Games.
data-analysis data-visualization olympics paris skateboarding tokyo
Last synced: 10 Mar 2026
https://github.com/jbalooshie/movies-etl
Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.
data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy
Last synced: 11 Apr 2026
https://github.com/shru924/ecommerce_customer_behavior_analysis
A machine learning project that analyzes and segments e-commerce customers based on behavior patterns using Python, Random Forest, and data visualization.
customer-segmentation data-analysis jupyter-notebook machine-learning matplotlib pandas python scikit-learn
Last synced: 11 Apr 2026
https://github.com/rosanafss/r-journey
Diving into to wonderful see of DATA
Last synced: 19 Nov 2025
https://github.com/geoninja/reddit_data_analysis
Data analysis application presented at the 2016 NTC (Non-profit Technology Conference) in San Jose, CA.
data-analysis python reddit-data-analysis text-analysis
Last synced: 03 May 2026
https://github.com/alejandrolara11/machinelearningcourse
Machine Learning Basics: From Setup to Clustering
data-analysis data-science machine-learning numpy pandas plotly preprocessing-data python scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/codewithmayank-py/covid19-data-analysis-using-python
COVID-19 and Happiness Analysis
data-analysis data-analysis-python data-visualization dataset jupyter-notebooks numpy pandas python3 seaborn
Last synced: 11 Apr 2026
https://github.com/anderson-andre-p/wine-data-analysis
This repository contains a data analysis project that focuses on a series of wine data. The project was completed using Python libraries such as NumPy, Pandas, Seaborn, and Matplotlib. The goal of this project was to gain insights into the characteristics of the wines and to practice data analysis skills.
data-analysis data-science data-science-portfolio pandas-dataframe wine-dataset
Last synced: 15 Mar 2025
https://github.com/anderson-andre-p/exploratory-data-analysis.roller-coaster
This repository contains an exploratory data analysis (EDA) project focused on roller coasters. The project involved organizing, cleaning, and visualizing the data to gain insights into roller coasters' characteristics and performance.
data-analysis eda exploratory-data-analysis exploratory-data-visualizations notebook
Last synced: 15 Mar 2025
https://github.com/aya-jafar/python
Practice files & exercises during the journey of Python leaning 🐍
Last synced: 16 May 2025
https://github.com/agrdatasci/climmob-analysis
Workflow for data analysis applied on ClimMob.net
citizen-science data-analysis workflow
Last synced: 24 Jun 2025
https://github.com/jimohola/movielens_data_analysis
Movielens Data Analysis
data-analysis data-visualization exploratory-data-analysis pyhton3
Last synced: 11 Jun 2025
https://github.com/mylena13s/electric-vehicle-data-analysis
data-analysis data-science programming pyspark-notebook
Last synced: 15 Mar 2025
https://github.com/hfzdzakii/dicoding-solvinghrproblem
This repo is a master submission for my Dicoding Final Project. Employee Attrition & Performance Dataset was being used to fulfill the submission. Feel free to explore and I hope my work give you some insight!
data-analysis data-visualization
Last synced: 16 May 2025
https://github.com/kzon94/torn-market-analyzer
Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.
data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api
Last synced: 11 Apr 2026
https://github.com/vriv06/btk-trials-data-analysis
Data analysis of Bioteksa plant nutrition trials for measure nutrient efficacy, resistance against biotic and abiotic factors, etc.
agriculture-research confluence crops data-analysis quarto r
Last synced: 23 Mar 2025
https://github.com/trim0500/fe-stats-classifier
An experiment to create a machine learning model via PyTorch to classify select Fire Emblem unit base stat distributions.
creational-patterns data-analysis data-science data-visualization design-patterns excel jupyter jupyter-notebook matplotlib-pyplot numpy pandas python python-modules python3 pytorch singleton
Last synced: 11 Apr 2026
https://github.com/mborrillo/ranking-ciudades-espana
Sistema end-to-end de análisis multicriterio que evalúa 50 ciudades españolas en calidad de vida mediante datos oficiales
business-intelligence data-analysis multi-criteria-decision-analysis pandas python3 quality-of-life ranking-system scikit-learn scoring-models
Last synced: 13 Jan 2026
https://github.com/firdevstorlak/maritime-signature-lab
Prototyp einer maritimen Signaturdatenbank (Akustik, Magnetik, RCS, IR) mit Python, SQLite und einfacher Computer-Vision.
acoustic-signatures cli-tool computer-vision data-analysis demo-project engineering-prototype infrared-imaging maritime opencv python radar rcs relational-database scientific-computing signal-processing sqlite synthetic-data
Last synced: 07 May 2026
https://github.com/deeksha-dhawan/pizza-outlet-analysis-using-sql
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
coding-challenge data-analysis data-science microsoft my portfolio-project programming project projects sql sql-analysis sql-project sqlproject sqlserver
Last synced: 23 Mar 2025
https://github.com/hemangsharma/dataanalysis
This repo contains analysis like a dashboard and time series forecast on NASDAQ data
analysis data data-analysis data-visualization python
Last synced: 10 Mar 2026
https://github.com/ezmiller/esd-viz
Visualization of European Social Survey (http://www.europeansocialsurvey.org/data/)
clojure data-analysis visualization
Last synced: 28 May 2026
https://github.com/riju18/data-analysis-and-visualizaton
Most complex data analyzing for clustering, preparing, complex calculation, joining, cross-over & more for Data science.
data-analysis data-mining data-science data-visualization powerbi tableau
Last synced: 04 Jan 2026
https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy
In this project, we analyze the space flight data of Spacex space research company Falcon 9 rocket.
data-analysis data-science data-visualization eda elonmusk spacex
Last synced: 23 Mar 2025
https://github.com/wittyicon29/kritika-iit-b-2023
Seletcion task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/manisharora96/instagram-reach-analysis
This project provides a detailed approach to analyzing Instagram reach and engagement metrics. By leveraging the code and tools shared here, you can gain valuable insights into your Instagram content's performance and optimize your strategy to grow your audience effectively
data-analysis data-visualization instagram-reach python-tools
Last synced: 23 Mar 2025
https://github.com/kernelshreyak/kaggle-notebooks
Collection of my Kaggle notebooks for data analysis and machine learning on a variety of datasets
data-analysis data-science data-visualization kaggle kaggle-competition machine-learning
Last synced: 27 Apr 2026
https://github.com/nikhil-donthusaram/heartdiseaseprediction
Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.
data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode
Last synced: 11 Apr 2026
https://github.com/loginchik/mid_contracts
Анализ контрактов государственных закупок МИДа РФ
data-analysis dataset pandas python
Last synced: 17 Apr 2025
https://github.com/felpzreiz/stockdata_pipeline
Este projeto consiste no desenvolvimento de um pipeline de dados que consome informações financeiras de uma API da Bolsa de Valores Americana (StockData.org) para análise e tratamento. Utilizando Python e bibliotecas como pandas, matplotlib e pyarrow
api data-analysis data-science jupyter-notebook pandas python
Last synced: 19 Apr 2026
https://github.com/walid0912/rfm_analysis
RFM Analysis is employed to comprehend and categorize customers according to their purchasing patterns. RFM, an acronym for recency, frequency, and monetary value, comprises three essential metrics that offer insights into customer involvement, allegiance, and significance to a business.
data-analysis data-visualization python rfm-analysis
Last synced: 02 Sep 2025
https://github.com/al-ghaly/iti-project
ITI Final/Graduation Project.
data-analysis data-cleaning data-visualization data-warehousing machine-learning power-bi python-data-analysis sql statistical-analysis
Last synced: 15 Mar 2025
https://github.com/taralas209/moscow-programmer-salaries-analysis-dvmn
A Python script analyzing the average salaries of programmers in Moscow by popular programming languages using data from HeadHunter and SuperJob.
api data-analysis headhunter job-market-analysis python superjob
Last synced: 15 Mar 2025
https://github.com/kailenroa/dashboad-excel-huisprijzen
This project focuses on developing a dashboard powered by Funda to visualize house pricing in the Netherlands. The dashboard simplifies the home-buying process by allowing users to compare prices, energy labels, number of rooms, and square meters across different provinces, all in one interactive platform..
dashboard data-analysis excel house-prices
Last synced: 05 Jan 2026
https://github.com/abelarduu/power_bi_analyst
Projeto Power BI para relatório de dados financeiros, com navegação intuitiva e recursos interativos. Oferece uma experiência completa ao usuário, combinando apresentação sofisticada e funcionalidade eficaz para análise de dados.
dashboard data-analysis data-analytics modelagem-de-dados powerbi tratamento-de-dados
Last synced: 08 Sep 2025
https://github.com/adilshamim8/eda-on-health-and-sleep-data
Exploratory Data Analysis (EDA) on health and sleep data, uncovering patterns and insights using Python and visualization tools.
data-analysis data-visualization eda health healthcare sleep sleep-analysis
Last synced: 15 Mar 2025
https://github.com/shahriarha/sql
Structured query language
data-analysis mysql mysql-database sql
Last synced: 02 Sep 2025
https://github.com/alphatwirl/qtwirl
qtwirl (quick-twirl), one-function interface to AlphaTwirl
alphatwirl data-analysis data-frame pandas r root-cern
Last synced: 11 Apr 2026
https://github.com/reddyprasade/r-program
R is a programming language and free software environment for statistical computing and graphics supported by the R Foundation for Statistical Computing. The R language is widely used among statisticians and data miners for developing statistical software and data analysis.
data-analysis data-science r-programming
Last synced: 11 Apr 2026
https://github.com/syed-amjad-ali/-bank-churn-ml
Predicting bank customer churn using machine learning. This project includes exploratory data analysis (EDA), feature engineering, classification models (Logistic Regression, Random Forest), and customer segmentation using K-Means clustering.
classification data-analysis data-science eda jupyter-notebook k-means-clustering machine-learning ml python segmentation
Last synced: 09 Mar 2025
https://github.com/sasanthns/sql_data_warehouse_project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver
Last synced: 24 Mar 2025
https://github.com/mradovic38/cnn-dominant-color-recognition
Detecting a dominant color of Tulip images using various convolutional neural networks.
artificial-intelligence cnn computer-vision data-analysis deep-learning dominant-color kmeans-clustering machine-learning neural-network pytorch regression resnet resnet-18 squeeze-and-excitation
Last synced: 01 Jul 2026
https://github.com/yixin0829/web-data-scraping-project
Python web scraping script and MySQL database design :bug:
beautifulsoup4 data-analysis data-visualization database gender-api matplotlib python website
Last synced: 19 Apr 2026
https://github.com/thbaylson/datascience
All of my past data science assignments put into one singular notebook. Most of this comes from my Machine Learning course.
data-analysis data-science data-visualization decision-tree jupyter-notebook k-nearest-neighbors linear-regression machine-learning neural-network pandas-library python3 scikit-learn
Last synced: 09 May 2026
https://github.com/prakashjha1/new-analysis-using-llm-locally
An interactive news analysis tool built with Streamlit and local LLMs. This app allows users to analyze and gain insights from the latest news articles using advanced language models, all running locally. Explore trends, sentiment, and key topics with an intuitive interface.
artificial-intelligence data-analysis data-science llms ollama python streamlit
Last synced: 14 Mar 2025
https://github.com/lucaso21/euro-2021-player-stats-analysis
A short project analyzing stats for players at the Euro 2021 tournament.
data-analysis data-science r rvest tidyverse
Last synced: 16 Mar 2025
https://github.com/aliiahmadi/data-manipulation
Algorithms and solutions
algorithm algorithms data-analysis data-science
Last synced: 30 Jun 2025
https://github.com/tolumie/rfm-marketing-analysis
This project focuses on RFM (Recency, Frequency, and Monetary) Analysis, a powerful customer segmentation technique used in marketing and business analytics. The analysis helps businesses identify their most valuable customers, potential loyalists, at-risk customers, and churned users.
business-analytics customer-behavior-analysis customer-loyalty customer-retention customer-segmentation-analysis data-analysis data-driven-decisions ecommerce marketing-analytics python
Last synced: 18 May 2026
https://github.com/lunafrost-lab/berry-donut
Exploring berry combinations to produce Donut in Pokémon Legends: Z-A: Mega Dimensions.
data-analysis data-filtering parquet pokemon winforms
Last synced: 13 Jan 2026
https://github.com/wojtekdomino/titanic-eda
Exploratory Data Analysis (EDA) of Titanic dataset using Pandas, Matplotlib, and Seaborn.
data-analysis eda matplotlib pandas python seaborn
Last synced: 10 Jun 2025