Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/chaitanyaprasad60/sql-queries
This is a list of complex SQL Queries I have practiced.
data-analysis sql window-functions
Last synced: 03 Mar 2026
https://github.com/abelarduu/power_bi_analyst
Projeto Power BI para relatório de dados financeiros, com navegação intuitiva e recursos interativos. Oferece uma experiência completa ao usuário, combinando apresentação sofisticada e funcionalidade eficaz para análise de dados.
dashboard data-analysis data-analytics modelagem-de-dados powerbi tratamento-de-dados
Last synced: 08 Sep 2025
https://github.com/nmelgar/lego_my_data
Data visualization project to sell LEGO bulks.
csv data-analysis data-visualization data-viz google-sheets tableau
Last synced: 08 Jan 2026
https://github.com/gintuvedula/crime-data-analysis-with-mysql-and-python
This project aims to analyze crime data using MySQL for database management and Python for data analysis and visualization. The objective is to uncover crime trends, hotspots, and patterns to support law enforcement and urban planning efforts.
data-analysis data-exploration database mysql python
Last synced: 05 May 2026
https://github.com/nurulashraf/customer-segmentation-hierarchical-clustering
A customer segmentation project using hierarchical clustering to group customers based on their spending behaviour and demographics. This helps businesses identify patterns and create targeted marketing strategies.
business-analytics clustering-algorithm customer-segmentation data-analysis hierarchical-clustering machine-learning python unsupervised-learning
Last synced: 18 Apr 2025
https://github.com/kimiaak/f1-data-analysis
data-analysis fastf1 formula1 python
Last synced: 08 Jul 2025
https://github.com/mxagar/data_science_udacity
My personal notes, code and projects of the Udacity Data Science Nanodegree.
dashboard data-analysis data-engineering data-science machine-learning-pipelines
Last synced: 09 Apr 2025
https://github.com/robinmillford/hr-analytics-employee-performance-analysis
HR Analytics: Unveiling Employee Performance - A comprehensive exploration of employee data using SQL and Power BI, uncovering key insights for strategic HR decision-making.
data-analysis data-visualization jupyter-notebook powerbi python3 sql
Last synced: 20 Apr 2026
https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application
The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.
data-analysis data-science data-visualization python python3 streamlit
Last synced: 18 Apr 2026
https://github.com/pkjjoshi/behind-the-menu-uncovering-insights-from-restaurant-data
Discover hidden patterns in dining data — from popular cuisine pairings to geographic restaurant clusters
data-analysis data-visualization insights jupyter-notebook pandas python restaurant-data
Last synced: 05 Jul 2025
https://github.com/shriansh8619/sql_eda
Explored relational databases using SQL to perform comprehensive Exploratory Data Analysis (EDA), covering database exploration, segmentation, trend analysis, and performance ranking. Developed reusable SQL scripts to analyze dimensions, measures, and time-based metrics, helping uncover key business insights.
data-analysis exploratory-data-analysis mysql
Last synced: 20 Aug 2025
https://github.com/vetronics/data_analisys_by_pandas
piccolo script in python per analisi dei dati sugli incidenti del 2019
accident accidents-analysis car data-analysis data-science data-visualization dataset github github-actions istat maplotlib pandas python python3 scripts windows-11
Last synced: 11 Apr 2026
https://github.com/aravindnathan02/bi-projects
Data Analysis and Visualization projects involving only BI tools (Power BI, Tableau, MS Excel).
data-analysis data-visualisation ms-excel powerbi tableau
Last synced: 08 Jan 2026
https://github.com/mishaa931/amazon-sales-dashboard-power-bi
This project features a dynamic Power BI dashboard built on dummy Amazon sales data. It visualizes key business metrics such as revenue trends, top-selling categories, discount impact, and geographic performance. The dashboard is designed to help stakeholders make data-driven decisions through clear, interactive visuals.
data-analysis data-quality data-visualization microsoftpowerbi
Last synced: 05 Feb 2026
https://github.com/vatshayan/youtube-user-analysis
Analysis of Youtube Users about their choice and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/joaquinmoron/airbnb-eda-python
EDA de Airbnb — limpieza, exploración y visualización en Python (pandas, matplotlib, seaborn).
airbnb data-analysis eda matplotlib pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/kailenroa/dashboad-excel-huisprijzen
This project focuses on developing a dashboard powered by Funda to visualize house pricing in the Netherlands. The dashboard simplifies the home-buying process by allowing users to compare prices, energy labels, number of rooms, and square meters across different provinces, all in one interactive platform..
dashboard data-analysis excel house-prices
Last synced: 05 Jan 2026
https://github.com/elrf3lipes/ramon-s_portfolio
I'm passionate about Cloud and DevOps, and for the moment I'm posting some of my work and personal projects here to showcase that. If its useful for you, feel free to integrate or contribute!
api-integration biopython clinical-trials data-analysis data-extraction data-parsing django docker entrez ipython medline-xml pandas pubmed-parser requests rest-api
Last synced: 27 Mar 2026
https://github.com/manojrathod0777/loan-prediction
Predict loan approval status using machine learning techniques. This project demonstrates data preprocessing, feature engineering, model training, and evaluation, along with an interactive Streamlit app for real-time predictions. Ideal for financial decision-making.
classification-models data-analysis data-science financial-analytics jupyter-notebook loan-prediction machine-learning predictive-modeling python streamlit-app
Last synced: 13 Apr 2026
https://github.com/marianamartiyns/rfm-cluster-analysis
Customer behavior and sales analysis, including data cleaning, RFM calculation, churn analysis and customer clustering.
cluster-analysis data-analysis data-cleaning data-visualization pyhton
Last synced: 16 Mar 2025
https://github.com/abdullahashfaqvirk/powerbi-dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 10 Mar 2026
https://github.com/annnieglez/nlp-stock-market-and-news
This project focuses on detecting fake news from news headlines using advanced Natural Language Processing (NLP) techniques. It combines sentiment analysis with news headlines embeddings, generated from Hugging Face transformer models, to train a binary classification model that distinguishes between real and fake news.
classification-model data-analysis embeddings machine-learning machine-learning-models nlp nlp-deep-learning nlp-machine-learning python scraping-websites sentiment-analysis
Last synced: 25 Apr 2026
https://github.com/deller23/hotel_booking_data_cleaning
Efficiently transforming raw hotel booking data into actionable insights! This project leverages Python and Pandas for advanced data cleaning—handling missing values, detecting outliers, and optimizing features—ensuring a high-quality dataset ready for analysis and modeling.
data-analysis data-cleaning data-preprocessing data-visualization data-wrangling pandas python
Last synced: 31 Mar 2025
https://github.com/marina-gal/elderly-care-ranking
Data analysis and scoring model for elderly care homes, including data cleaning, transformation, 0–100 scoring, and ranking across multiple quality dimensions.
Last synced: 30 May 2026
https://github.com/vedanty3/supermarket-sales-data-analysis
This project contains data visualization techniques (using pandas and matplotlib) to explore different aspects of supermarket sales data of 3 months.
data-analysis data-science jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/taralas209/moscow-programmer-salaries-analysis-dvmn
A Python script analyzing the average salaries of programmers in Moscow by popular programming languages using data from HeadHunter and SuperJob.
api data-analysis headhunter job-market-analysis python superjob
Last synced: 15 Mar 2025
https://github.com/al-ghaly/iti-project
ITI Final/Graduation Project.
data-analysis data-cleaning data-visualization data-warehousing machine-learning power-bi python-data-analysis sql statistical-analysis
Last synced: 15 Mar 2025
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 09 Apr 2025
https://github.com/k178412/sql-data-warehouse-project
A hands-on data warehouse project using SQL Server, covering ETL processes, and data modeling.
bronze-layer data-analysis data-analytics data-cleaning data-engineering data-warehouse database datalake dataset datawarehouse etl etl-pipeline etl-process gold-layer silver-layer sql sql-query sql-server sqlserver
Last synced: 25 Apr 2026
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 09 Apr 2025
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 09 Apr 2025
https://github.com/hess125/data-visualizations
A repository of data visualization projects
data data-analysis data-science data-visualization powerbi projects sql sqlite tableau
Last synced: 31 Aug 2025
https://github.com/chaganti-reddy/weather-prediction-australia
Creating a fully-automated system that can use today's weather data for a given location to predict whether it will rain at the location tomorrow.
data-analysis logistic-regression machine-learning prediction-model python3
Last synced: 13 Apr 2026
https://github.com/mohammad-malik/covid-visualizations-d3
This project provides a dashboard with five different perspectives on the pandemic, from patient-infection relationships to regional trends and hierarchical distributions. This was developed as part of a project for the course Data Analysis and Visualization (DS3001).
covid-19 d3 d3-visualization d3js data data-analysis data-analytics data-science visualization
Last synced: 28 May 2026
https://github.com/leandrocollares/nyc-film-permits
NYC film permits: an exploratory data analysis
data-analysis data-visualization pandas plotly
Last synced: 05 Jul 2025
https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights
This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.
clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality
Last synced: 14 Jun 2025
https://github.com/walid0912/rfm_analysis
RFM Analysis is employed to comprehend and categorize customers according to their purchasing patterns. RFM, an acronym for recency, frequency, and monetary value, comprises three essential metrics that offer insights into customer involvement, allegiance, and significance to a business.
data-analysis data-visualization python rfm-analysis
Last synced: 02 Sep 2025
https://github.com/anas436/student-performance-analysis
In this project I have constructed a Machine Learning System which will analyis students performance with about their academic records. Note that, this project will work with any students recods which you want to provide.
data-analysis jupyter-notebook matplotlib numpy pandas python3 seaborn
Last synced: 16 Apr 2026
https://github.com/profasem/logistics-performance-analysis
Power BI dashboard analyzing logistics performance, delivery delays, carrier efficiency, and regional risk.
business-intelligence dashboard data-analysis logistics powerbi python supply-chain
Last synced: 21 Apr 2026
https://github.com/ejw-data/pandas-school
Analysis of school data with Pandas
Last synced: 08 May 2026
https://github.com/hari7261/playwithdata-python
This is one of the repository where I have put lot of data science and machine learning related questions on their solutions I hope you will find something better than some other platforms. Thank you Happy exploring
data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn
Last synced: 13 Apr 2026
https://github.com/dina-hosny/investigate-a-dataset-project
Investigate a Dataset project - Udacity Data Analysis Professional Track. In this project, I used the Python libraries NumPy, Pandas and Matplotlib to analyze a dataset and communicate my findings about it.
csv data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 16 Apr 2026
https://github.com/shellynagar27/good-cabs-data-analysis-project
This project is part of CodeBasics Challenge #13, where the goal was to provide actionable insights to the Chief of Operations at Goodcabs, a cab service provider in tier-2 cities of India. The project focused on analyzing key metrics like trip volume, repeat passenger rate, and passenger satisfaction.
critical-thinking data-analysis data-visualization excel exploratory-data-analysis power-bi presentation problem-solving sql storytelling
Last synced: 25 Jan 2026
https://github.com/code-jl/dna-sequence-analyzer
A robust Python-based bioinformatics tool for comprehensive DNA sequence analysis and manipulation.
bio-tools bioinformatics biological-data computational-biology data-analysis dna-analysis dna-sequencing fasta gc-content gene-detection genetics genomics molecular-biology motif-finding nucleotide-analysis python python3 scientific-computing sequence-analysis sequence-manipulation
Last synced: 11 Mar 2025
https://github.com/felpzreiz/stockdata_pipeline
Este projeto consiste no desenvolvimento de um pipeline de dados que consome informações financeiras de uma API da Bolsa de Valores Americana (StockData.org) para análise e tratamento. Utilizando Python e bibliotecas como pandas, matplotlib e pyarrow
api data-analysis data-science jupyter-notebook pandas python
Last synced: 19 Apr 2026
https://github.com/loginchik/mid_contracts
Анализ контрактов государственных закупок МИДа РФ
data-analysis dataset pandas python
Last synced: 17 Apr 2025
https://github.com/shellynagar27/business-insights-360-project
A comprehensive Dashboard which provides better understanding of the business's market standing, key focus areas for optimization, underperforming customers, and year-wise financial insights, aiding in better inventory planning and performance tracking. Further it can be used in answering n number of why questions based on the situations.
dashboard data-analysis data-visualization dax-languague dax-studio excel performance-optimization power-bi reporting sql storage-manager
Last synced: 27 Jan 2026
https://github.com/shellynagar27/candy-market-share-analysis
Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.
critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis
Last synced: 03 Feb 2026
https://github.com/steviecurran/dashboards
Compilation of Links to the dashboards in the other repositories
dashboard data-analysis data-science data-visualization pandas powerbi python-dash tableau
Last synced: 21 Feb 2026
https://github.com/nikhil-donthusaram/heartdiseaseprediction
Heart Disease Prediction App is a machine learning web application that predicts the likelihood of heart disease based on user medical inputs. Built using a Decision Tree Classifier and deployed with Streamlit for an interactive, user-friendly interface.
data-analysis descision-tree joblib jupyter-notebook machine-learning matplotlib numpy pandas python3 seaborn sklearn streamlit vscode
Last synced: 11 Apr 2026
https://github.com/mahmoudwal27/e-commerce-data-analysis
A collection of data analysis and visualization projects focused on ecommerce datasets. Using Python in Google Colab for analysis and Excel for exploration, these projects uncover key insights and trends, showcasing expertise in data manipulation and visualization to inform business decisions.
analytics data-analysis data-analysis-python data-set google-cloud python
Last synced: 21 Apr 2026
https://github.com/mrprajapati18/100-days-of-code-data-science
100 Days of Code Challenge to learn Data Science from scratch! 📊🔍
anaconda-navigator data-analysis data-science data-visualization machine-learning-algorithms pyhton-library python-3
Last synced: 18 Apr 2026
https://github.com/pujolsluis/businessintelligencecourse
Repository for my BI Course projects
business-intelligence data-analysis data-mining data-warehouse
Last synced: 27 Mar 2026
https://github.com/omkar2503/credit-risk-dashboard
A SQL-based Credit Risk Scoring System visualized using Metabase
credit-risk dashboard data-analysis data-analytics metabase postgresql sql
Last synced: 01 Jul 2025
https://github.com/rrrodzilla/net_promoter_score
A Rust crate for calculating Net Promoter Score (NPS) from survey responses.
customer-analytics customer-experience customer-feedback customer-insights customer-satisfaction data-analysis feedback-analysis feedback-management sentiment-analysis survey-analytics survey-tool user-engagement user-feedback user-insights user-loyalty user-research user-retention user-satisfaction user-survey
Last synced: 25 Mar 2025
https://github.com/kernelshreyak/kaggle-notebooks
Collection of my Kaggle notebooks for data analysis and machine learning on a variety of datasets
data-analysis data-science data-visualization kaggle kaggle-competition machine-learning
Last synced: 27 Apr 2026
https://github.com/BAMresearch/Utah-SAXS-Tools
The Utah SAXS Tools (USToo), adapted for Python 3, originally by David P. Goldenberg, 2009-2012
data-analysis saxs small-angle-scattering small-angle-xray-scattering
Last synced: 16 Jan 2026
https://github.com/manisharora96/instagram-reach-analysis
This project provides a detailed approach to analyzing Instagram reach and engagement metrics. By leveraging the code and tools shared here, you can gain valuable insights into your Instagram content's performance and optimize your strategy to grow your audience effectively
data-analysis data-visualization instagram-reach python-tools
Last synced: 23 Mar 2025
https://github.com/jedrzej-wydra/improving-accuracy
Improving accuracy of age estimates for insect evidence—calibration of physiological age at emergence (k) using insect size but without “k versus size” model
Last synced: 02 Sep 2025
https://github.com/vinayakarannil/covid-19-india-analysis
EDA of covid-19 data in India
covid-19 covid-19-india data-analysis eda
Last synced: 14 Jun 2025
https://github.com/saifalibaig/-online-retail-exploratory-data-analysis-with-python
Exploratory Data Analysis of an online retail store
data-analysis data-visualization matplotlib numpy pandas python3
Last synced: 11 Apr 2026
https://github.com/pratanup/bank-customer-churn
A prediction model based on ML as well as DL and compare their performances to find Churned Customers
adaboost-classifier ann churn-prediction data-analysis data-visualization decision-tree-classifier deep-learning deep-learning-algorithms gaussian-naive-bayes-classification gradient-boosting-classifier k-nearest-neighbours logistic-regression machine-learning machine-learning-algorithms random-forest-classifier svc svm-classifier xgboost-classifier
Last synced: 10 Mar 2026
https://github.com/wittyicon29/kritika-iit-b-2023
Seletcion task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/rupashi03/fitbit-user-eda-case-study
Performed Exploratory Data Analysis (EDA) on Fitbit users' data to uncover trends in activity and health metrics.
business-analysis case-study consumer-insights data-analysis exploratory-data-analysis health-data r user-behavior-analytics
Last synced: 25 Mar 2025
https://github.com/karlyndiary/adidas-sales-analysis
Analyzed Adidas' product sales performance, top retailers, monthly trends, yearly growth, regional distribution, and pricing insights. Performed ETL from Python (Pandas) to SQL Server, extracted data with SQL, and visualized key insights in Excel.
adidas-sales-analysis adidas-sales-dashboard dashboard data-analysis data-cleaning data-pipeline data-visualization etl excel-dashboard microsoft-excel microsoft-sql-server python
Last synced: 10 Feb 2026
https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy
In this project, we analyze the space flight data of Spacex space research company Falcon 9 rocket.
data-analysis data-science data-visualization eda elonmusk spacex
Last synced: 23 Mar 2025
https://github.com/ravi-prakash1907/covid-19-china
A data-science research work to understand the growth rate of the novel Coronavirus.
china coronavirus covid-19 data-analysis data-mining data-science mathematical-modelling project r research research-paper
Last synced: 06 Sep 2025
https://github.com/allanotieno254/powerbi-dax-filter-context
This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.
business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization
Last synced: 08 Jan 2026
https://github.com/paul0vinicius/ad2
Repositório da disciplina de Análise de Dados 2 (Data Analysis II)
Last synced: 08 Jan 2026
https://github.com/ibrahimceyisakar/hotel-finder-streamlit-dashboard
Streamlit dashboard of hotel-finder
data-analysis data-science data-visualization pandas plotly python streamlit
Last synced: 16 Apr 2026
https://github.com/chiragkumargohil/co2-emissions-data-analysis
A Python programme that analyses CO2 emission data from 1997 to 2010. This programme prints data, provides brief of a given year, displays and compares Year vs. Emission graphs for chosen countries, and generates a separate data file for chosen countries. It was a self-paced project that Guru 99 provided.
co2-emission data-analysis matplotlib python
Last synced: 28 Aug 2025
https://github.com/jcm-ai/Standard-Bank-Data-Science-Virtual-Experience-Programme
This repository has all of the assignments I had to do for the Standard Bank Data Science Virtual Experience Program. 📉👨💻📊📈
automl business-analysis business-solutions client-communication data-analysis data-mining data-science data-visualization machine-learning machine-learning-algorithms matplotlib-pyplot model-evaluation model-interpretation power-point presentation-slides programming-language python3 seaborn sql statical-analysis
Last synced: 19 Aug 2025
https://github.com/fatihilhan42/turkey_earthquake_analysis_1915-2021_python
In this project, earthquakes in Turkey from 1915 to 2021 were analyzed. The data taken from the data set, which you can find in the repo, was first organized using data cleaning algorithms. Afterwards, these cleaned data were printed out as graphics and animation using data visualization algorithms.
data-analysis data-cleaning data-visualization jupyter-notebook
Last synced: 23 Mar 2025
https://github.com/82luli02/sakila_dvd_rental_database_analysis
Analysis of the Sakila DVD Rental database using SQL
data data-analysis data-science data-visualization sql
Last synced: 10 Mar 2026
https://github.com/riju18/data-analysis-and-visualizaton
Most complex data analyzing for clustering, preparing, complex calculation, joining, cross-over & more for Data science.
data-analysis data-mining data-science data-visualization powerbi tableau
Last synced: 04 Jan 2026
https://github.com/ezmiller/esd-viz
Visualization of European Social Survey (http://www.europeansocialsurvey.org/data/)
clojure data-analysis visualization
Last synced: 28 May 2026
https://github.com/pawlo77/smarty
End-to-End Data Science tool
data-analysis data-processing pandas pipeline
Last synced: 08 May 2026
https://github.com/anushkundu/student-performance-analysis
Exploring Student Performance Factors
classification-algorithm clustering-algorithm data-analysis data-science exploratory-data-analysis machine-learning matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/hemangsharma/dataanalysis
This repo contains analysis like a dashboard and time series forecast on NASDAQ data
analysis data data-analysis data-visualization python
Last synced: 10 Mar 2026
https://github.com/jesuserro/ab-testing-ui-redesign-vanguard
A/B testing analysis to evaluate the impact of a user interface redesign at Vanguard.
a-b-testing data-analysis eda exploratory-data-analysis testing ui-design ux-design
Last synced: 08 Jul 2025
https://github.com/rishitabansal9/adult-census-income-prediction
This is a project made for data analysis and income prediction using random forest classifier with 91% accuracy.
data data-analysis data-science feature-engineering random-forest-classifier
Last synced: 25 Mar 2025
https://github.com/diligencefrozen/dcinside-data
Analyzing the Dcinside Frozen Gallery Dataset. #디시
Last synced: 30 May 2026
https://github.com/soumya-kushwaha/uber-analysis
data-analysis data-science data-visualization uber-analysis
Last synced: 16 Apr 2026
https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models
This repo contains a model for determining the rainfall patterns and another for medical expense prediction model
data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning
Last synced: 30 Aug 2025
https://github.com/asghar-rizvi/eda_student_dataset
This repository contains the results of data analysis and exploratory data analysis (EDA) conducted on the Student_Dataset. The analysis focuses on understanding various factors affecting student grades and visualizing these relationships using Matplotlib and Seaborn.
data-analysis data-analysis-python data-science jupyter-notebook python3
Last synced: 16 Apr 2026
https://github.com/dvaser/world-happiness-expanatory-data-analysis
DATA ANALYSIS
data-analysis data-visualization dataset jupyter jupyter-notebook kaggle python
Last synced: 03 Mar 2026
https://github.com/saro0307/exploratory-data-analysis-terrorism
Phase 1 of Data Science project (program) to perform Exploratory Data Analysis on Terrorism using Python On Google Colab for Coderscave Internship sept 2023
colaboratory data-analysis datascience machine-learning numpy pandas python seaborn skit-learn visualization
Last synced: 13 Apr 2026
https://github.com/rubyyy1118/share-price-analysis
The assignment in my MSc Business Analytics course
data-analysis data-preprocessing data-science data-visualization matplotlib numpy pandas python seaborn
Last synced: 10 Apr 2026
https://github.com/deeksha-dhawan/pizza-outlet-analysis-using-sql
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
coding-challenge data-analysis data-science microsoft my portfolio-project programming project projects sql sql-analysis sql-project sqlproject sqlserver
Last synced: 23 Mar 2025
https://github.com/filip-kustura/statistics-olympics-analysis
A group seminar analyzing the relationship between citizens' average height and a country's Olympic success. The project involved data collection, descriptive statistics and statistical testing. Created and presented as part of the mandatory undergraduate Statistics course in spring 2021.
correlation-analysis data-analysis data-visualization descriptive-statistics group-project hypothesis-testing olympic-games r-programming research sports-analytics statistical-testing statistics university-project
Last synced: 05 Jan 2026
https://github.com/1401dev/customer-lifetime-value-prediction
A data science project leveraging Python and Scikit-Learn to build predictive models that estimate customer lifetime value (CLV). Includes data cleaning, feature engineering, and model selection to identify key drivers of CLV, supporting strategic decision-making in customer retention and marketing.
clv clv-analysis customer-retention data-analysis dataprocessing feature-engineering machine-learning marketing-analytics predictive-modeling python regression-analysis scikit-learn
Last synced: 06 May 2026
https://github.com/bagusperdanay7/fcc-da-mean-variance-standard-deviation-calculator
One of Data Analysis with Python (freecodecamp) task, created a Mean Variance Standard Deviation Calculator.
data-analysis freecodecamp-project numpy python
Last synced: 06 May 2026
https://github.com/tillbiskup/trepr
A Python package based on the ASpecD framework for handling TREPR data.
data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science spectroscopy time-resolved
Last synced: 06 Sep 2025
https://github.com/steno-aarhus/mediation-analysis-course
Modern mediation analysis for basic, clinical and epidemiological research in diabetes and endocrinology
data-analysis data-analysis-in-r diabetes diabetes-epidemiology mediation-analysis open-educational-resource
Last synced: 03 Mar 2026
https://github.com/firdevstorlak/maritime-signature-lab
Prototyp einer maritimen Signaturdatenbank (Akustik, Magnetik, RCS, IR) mit Python, SQLite und einfacher Computer-Vision.
acoustic-signatures cli-tool computer-vision data-analysis demo-project engineering-prototype infrared-imaging maritime opencv python radar rcs relational-database scientific-computing signal-processing sqlite synthetic-data
Last synced: 07 May 2026
https://github.com/tatilimongi/first_python_project
Este repositório contém um estudo de caso de automação de planilhas em Python para análise de vendas de carros por fabricante ao longo dos anos
data-analysis email-sending file-manipulation graphical-visualization spreadsheet-automation
Last synced: 26 Mar 2025
https://github.com/mborrillo/ranking-ciudades-espana
Sistema end-to-end de análisis multicriterio que evalúa 50 ciudades españolas en calidad de vida mediante datos oficiales
business-intelligence data-analysis multi-criteria-decision-analysis pandas python3 quality-of-life ranking-system scikit-learn scoring-models
Last synced: 13 Jan 2026
https://github.com/iliyasalve/tiktok_claim_classification_model
Develop a predictive model for classifying videos with claims to reduce the backlog of user reports and optimize the content moderation process.
data-analysis machine-learning python regression-models tiktok
Last synced: 21 May 2026