Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/anshmnsoni/pizza-market-analysis
data-analysis powerbi-visuals powerbidashboard
Last synced: 05 Feb 2026
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/mayankagg9722/movie-recommendation
Collaborative Filtering is performed over Movie Lens Dataset.
collaborative-filtering data-analysis jupyter-notebook movie-recommendation python-script website
Last synced: 29 Jul 2025
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/aravind-selvam/bikeshare-company-analysis
Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 22 Apr 2026
https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system
The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes product price prediction system using regression techniques.
beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application
Last synced: 18 Apr 2026
https://github.com/madeiradata/microsoft-data-analysts-club
Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts
data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report
Last synced: 19 Mar 2026
https://github.com/ganeshkumartk/ncov-2019
[EDA] Statistical modelling of Novel Coronavirus breakout nCoV-2019
corona data-analysis ncov ncov-2019 statistics wuhan wuhan-coronavirus wuhan-virus
Last synced: 05 Jun 2026
https://github.com/chandansoren/diabetics_prediction
Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.
data-analysis machine-learning python svm
Last synced: 06 Jun 2026
https://github.com/savinrazvan/degrees
A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.
actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon
Last synced: 24 Apr 2026
https://github.com/noodleslove/house-of-representative-analysis-i
This project uses public data about the stock trades made by members of the US House of Representatives.
data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading
Last synced: 18 Apr 2026
https://github.com/athityakumar/btp
btech btp daru data-analysis networkx nlp project python ruby
Last synced: 24 Apr 2026
https://github.com/keganedwards/housing-prices-exploration
Using machine learning algorithms to explore housing prices
data-analysis data-science python school-project
Last synced: 24 Apr 2026
https://github.com/serhatderya/medical_examination_research
This repository contains a research about medical examinations (such as body measurements, results from various blood tests, and lifestyle choices).
catplot data-analysis data-analytics data-cleaning data-preparation data-preprocessing data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations heatmap jupyter-notebook medical preprocessing python research seaborn
Last synced: 24 Apr 2026
https://github.com/bretsw/beds
Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists
analytics data-analysis data-analytics data-science
Last synced: 31 Mar 2025
https://github.com/robcyberlab/linear-regression-application
🔢Linear Regression Application💻
artificial-intelligence data-analysis data-science data-visualization linear-regression machine-learning python python-programming regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/tomasoak/datahopper
Python package for data engineering and data wrangling
data data-analysis data-engineering data-mining data-science data-structures data-wrangling datascience pandas python
Last synced: 12 Mar 2026
https://github.com/supersjgk/data-analysis-dns-over-https
A Data Analytics + ML project to classify Benign and Malicious DNS-over-HTTPS traffic
classification-model data-analysis data-analysis-python data-analytics datamining decision-trees dns dns-over-https doh gradient-boosting knn machine-learning random-forest
Last synced: 19 Mar 2025
https://github.com/faezeh-gholamrezaie/visual-google-scholar-search
A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.
academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud
Last synced: 25 Apr 2026
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/tejas-130704/whatsapp-analyser
ChatMate is a web app that analyzes WhatsApp chats, providing insightful visualizations like word clouds, heatmaps, and activity timelines. It calculates total messages, words, media, links, and more, helping you understand chat patterns for groups or individuals with ease. Simply upload your chat file and get detailed reports instantly!
data-analysis data-visualization python streamlit web-application whatsapp-analysis
Last synced: 26 Apr 2026
https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation
Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.
clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning
Last synced: 16 Mar 2025
https://github.com/ssreeramj/youtube_channels_analysis
This web app gives a detailed analysis of the videos uploaded in a particular youtube channel.
data-analysis heroku pandas python streamlit youtube
Last synced: 29 Apr 2026
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/adrija-debnath/ideas-isi-data-science-internship
Topic of the Project - Predictive Maintenance Analysis, Data Science Internship at IDEAS - Institute of Data Engineering, Analytics and Science Foundation Technology Innovation Hub at Indian Statistical Institute, Kolkata.
data-analysis data-science predictive-analytics predictive-maintenance streamlit
Last synced: 27 Apr 2026
https://github.com/mengyaohuang/data-manipulation-and-analysis
Data processing implementation with tools in Python
data-analysis nlp-machine-learning pandas-dataframe python
Last synced: 27 Apr 2026
https://github.com/rekha0suthar/e-commerce-shopper-s-behaviour-understanding
Understand the online shopper purchasing pattern through Machine learning
data-analysis data-preprocessing data-visualization logistic-regression machine-learning numpy pandas python3 scikit-learn seaborn-plots
Last synced: 12 Apr 2026
https://github.com/sathyasris27/time-series-and-spectral-analysis-
The aim of this project involves the analyses the data, removing trends and seasonal effects, identifying the underlying process, understanding the dominant frequencies, and using the residuals to make predictions.
data-analysis data-visualization forecasting r spectral-analysis time-series-analysis
Last synced: 07 Jun 2026
https://github.com/chigwell/partnershipparser
partnershipparser extracts and structures key info from tech partnership news for easy analysis of companies, focus areas, and impacts
business-analysts collaborating-companies competitive-advantages consistency data-analysis data-integration focus-area goals investors market-opportunities potential-impact researchers standardized-output strategic-alliances strategic-partnerships structured-data tech-industry technology-sector text-extraction trends
Last synced: 14 Jan 2026
https://github.com/mrsamsonn/monolithic-polylithic-crystal-segmentation
A grid segmentation algorithm for clustering crystal structures using diffraction patterns. Useful in material science and nanotechnology, this code enables detailed analysis of crystals for research and industrial applications.
clustering crystal-structure crystallography data-analysis diffraction-patterns grid-segmentation image-processing k-means machine-learning matertial-science nanotechnology python research-project research-tools scientific-computing
Last synced: 28 Apr 2026
https://github.com/vyjayanthipolapragada/early_sepsis_detection_ml
An end-to-end project leveraging clinical datasets (PhysioNet, MIMIC-IV, MIMIC-IV-ED) to develop and compare ML and LSTM-based models for early sepsis prediction.
data-analysis data-visualization deep-learning healthcare jupyter-notebook keras-tensorflow lstm-neural-networks machine-learning neural-network python
Last synced: 28 Apr 2026
https://github.com/alexandrelamarre/fission
Data analytics & Structured streaming optimized for the Edge
data-analysis data-engineering rust structured-data unstructured-data
Last synced: 08 Jun 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/dsrodrigovieira/houserocketsales
Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python
data-analysis data-visualization heroku kaggle-dataset python
Last synced: 29 Apr 2026
https://github.com/robinmillford/analytics_for_fashion_supply_management
This Streamlit dashboard provides a comprehensive analysis of supply chain data, focusing on key metrics such as production volumes, stock levels, order quantities, revenue, manufacturing costs, lead times, shipping costs, transportation routes, risk factors, and sustainability factors
dashboard data-analysis data-visualization streamlit supply-chain-management
Last synced: 07 Sep 2025
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 01 Mar 2025
https://github.com/programmer-rd-ai/dimensionality-reduction
DimRed is a comprehensive Python toolkit for advanced dimensionality reduction, integrating with major machine learning libraries and featuring real-time performance monitoring to enhance data analysis and model efficiency.
analytics data-analysis data-science lightgbm machine-learning matplotlib numpy pandas programming python python3 sklearn university xgboost
Last synced: 01 Mar 2025
https://github.com/as16082023/hotel-booking-analysis-eda-
Exploratory Data Analysis on hotel booking data using Python
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/nafisalawalidris/hotel-reservation-analysis
This project analyses hotel reservation data from Resort Hotels and City Hotels to uncover booking trends and insights. Utilising Microsoft Excel for initial data cleaning, PostgreSQL for data analysis and Tableau for creating visualisations, the project aims to deliver a comprehensive dashboard that highlights key metrics such as booking status.
data-analysis data-cleaning data-visualisation hotel-reservations microsoft-excel postgresql sql tableau tableau-dashboards tableau-desktop tableau-public
Last synced: 06 Jul 2025
https://github.com/joannescode/data-series-with-kaggle
Repositório de notebooks práticos sobre tratamento e análise de datasets
data-analysis matplotlib pandas python
Last synced: 13 Mar 2025
https://github.com/dataideaorg/dataidea-science-backup
The Complete Programming For Data Science Course
data-analysis data-science data-visualization deep-learning machine-learning nbdev programming python
Last synced: 10 Jan 2026
https://github.com/derrickbaruga7/mapping-median-age-europe
An R project that creates an interactive map of the median age across European regions using Eurostat data and spatial visualization packages.
data-analysis data-science data-visualization datascience european-union mapping r
Last synced: 25 Mar 2025
https://github.com/datalopes1/ds_salaries2024_eda
Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) a partir do dataset Data Science Salaries 2024, que pode ser encontrado no Kaggle, com licensa Database: Open Database e enviado por Sazidul Islam.
data-analysis data-visualization eda exploratory-data-analysis jupyter-notebook python
Last synced: 29 Apr 2026
https://github.com/alemalvarez/data-analysis-web-project
Web-app providing a simple interface for data storage,
data-analysis data-science javascript react webapp
Last synced: 29 Apr 2026
https://github.com/ismailtekin05/caloriedetectingai
🍎🔍 Smart AI system that identifies food items in photos and calculates their calorie content automatically. Built with TensorFlow, YOLOv8, CUDA and computer vision for accurate nutrition tracking.
ai aimodel calorie-calculator computer-vision cuda data-analysis data-science data-segmentation data-visualization dataset dataset-generation image-processing image-recognition python segmentation-models tensorflow ultralytics yaml yolo yolov8
Last synced: 29 Apr 2026
https://github.com/muhammadhilmyputrarisma/ab-test
Python code for A/B testing on Cookie Cats game data. This project analyzes the impact of moving the first gate from level 30 to level 40 on player retention and game rounds, helping to evaluate if delaying the gate improves player engagement and gameplay experience.
ab-testing cookie-cats data-analysis data-visualization game-analytics python statistics
Last synced: 18 May 2026
https://github.com/freebirdscrew/dataanlaysis_and_datasets
Data Analysis on the Datasets that are Provided by the Govt., Kaggle and Other Data Source Providers.
data-analysis data-science datanalysis datasets deep-learning govt kaggle kaggle-competition kaggle-dataset kaggledatasets machine-learning machine-learning-algorithms neural-networks
Last synced: 18 Apr 2026
https://github.com/savinrazvan/heredity
An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.
ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python
Last synced: 29 Jun 2026
https://github.com/pradipece/weather_forecast_data_analysis
Using decision trees and random forest algorithms to solve real-world data analysis. "sklearn_decision_trees_random_forests"
data-analysis data-science data-visualization git github python python3
Last synced: 19 Apr 2026
https://github.com/ayu-hack/ayu-hack
Enthusiastic learner passionate about building software and exploring the world of technology. Eager to contribute to open-source projects and collaborate with the developer community. Continuously developing my skills in Python,SQL,HTML,CSS,PowerBI, MacOS. Always open to feedback and excited to keep growing!
config css data-analysis github-config html powerbi-desktop python3 sql
Last synced: 30 Apr 2026
https://github.com/colburncodes/se_pudding_2023
This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app is utilizing React and React-Chartjs-2
chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs
Last synced: 11 May 2026
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 01 Apr 2025
https://github.com/celineboutinon/bottleneck
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Projet 5
data-analysis data-analytics data-visualisation dataframes market-intelligence marketing-analytics matplotlib-pyplot missingno numpy pandas python seaborn
Last synced: 07 Sep 2025
https://github.com/lmuffato/project-ting-trybe
Projeto ting - Projeto avaliativo da Trybe do Bloco 37: Estrutura de Dados II: Listas, Filas e Pilhas
data data-analysis python queue read-file stack trybe trybe-projects
Last synced: 12 Jun 2025
https://github.com/alcestide/scianalytics
Playground for Data Analysis and Visualization for Research and Scientifical Purposes with Pandas and Plotly.
csv data-analysis data-science data-visualization pandas plotly python science-research statistics
Last synced: 30 Apr 2026
https://github.com/alrza2003/alrza2003.github.io
This repository contains the source files for my personal portfolio website. It highlights my background as a data analyst and radiology student, and showcases real-world projects, tools I use, and ways to connect with me. The site is based on a pre-built template that I customized to reflect my profile and experience.
data data-analysis data-visualization portfolio portfolio-website python
Last synced: 30 Apr 2026
https://github.com/ajimaulana123/e-commerce-data-analis
Analisis dataset e-commerce guna menjawab kebutuhan product mana yang paling laris dibeli customer
Last synced: 28 Apr 2026
https://github.com/affec-ds/dashboard-ventas-vinilos
Dashboard interactivo de ventas para tienda de vinilos. Análisis visual, KPIs clave y filtros dinámicos para decisiones comerciales.
business-intelligence data-analysis data-visualization ipywidgets jupyter-notebook kpis matplotlib music-industry notebook-project python retail-analytics sales-analysis seaborn vinyl-records
Last synced: 30 Apr 2026
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/pedestriandynamics/cloudfast-dl4pude
A Cloud-based Deep Learning System for Improving Crowd Safety at Event Entrances
anomaly-detection artificial-intelligence cloud-environment computer-vision convolutional-neural-network crowd-behavior-analysis data-analysis data-visualisation deep-learning live-camera machine-learning
Last synced: 01 May 2026
https://github.com/reza-saeedi-coding/netflix-data-analysis
A complete end-to-end Netflix dataset analysis using Python, SQL, and Matplotlib. Explores genres, content ratings, and trends using exploratory data analysis and visualizations.
data-analysis data-cleaning eda matplotlib netflix pandas portfolio-project python sql sqlite
Last synced: 17 Apr 2026
https://github.com/lisa-ho/breadit
Respository for scraping and analysing data from the Reddit/Sourdough community to explore lockdown baking trends.
data-analysis data-viz nltk python reddit-api sentiment-analysis web-scraping
Last synced: 01 May 2026
https://github.com/renatomaynard/statistical-modeling-and-regression-analysis-life-expectancy
Statistical Modeling and Regression Analysis for Life Expentancy
data-analysis healthcare linear-regression machine-learning predictive-modeling r regression-analysis statistical-models statitics
Last synced: 23 Mar 2025
https://github.com/com-480-data-visualization/project-2023-choo-choo-data-darlings
This repository contains the source code for our data visualization project, an interactive platform designed to explore the intricate Swiss transportation network. Developed by the Choo Choo Data Darlings team at EPFL, the project provides an in-depth view into the vast array of Swiss transportation operations, including trains, buses, and trams.
boats buses data-analysis data-science data-visualisation data-visualization epfl metro public-transport public-transportation switzerland trains trams
Last synced: 01 May 2026
https://github.com/dcs-training/intromachinelearning
This course is aimed at providing an introduction to machine learning for those with some beginner level python skills. Go to the readme file
data-analysis data-wrangling machine-learning python statistics
Last synced: 06 Mar 2026
https://github.com/shridhar1504/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.
dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides
Last synced: 21 Jan 2026
https://github.com/vre-hub/science-projects
VRE example science projects
dark-matter data-analysis docker extreme-universe jupyter-notebook
Last synced: 18 Jan 2026
https://github.com/henrylin03/china-gdp
Analysis and visualisation of China GDP data using Python.
data data-analysis data-visualisation dataset kaggle pandas
Last synced: 01 May 2026
https://github.com/eshaagarwa/sales_insight_project
Sales insights project using Powerbi and SQL
data-analysis data-visualization databse datacleaning datamodeling microsoft-power-bi mysql-database powerbi sales-insights sql
Last synced: 08 Aug 2025
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 06 Oct 2025
https://github.com/ifibla/adsdb-project
Algorithms, Data Structures and Databases Project
data-analysis data-engineering python
Last synced: 12 Apr 2026
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/raccoon-hero/gender-equality-tracker
A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.
chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api
Last synced: 07 May 2026
https://github.com/cs-joy/pandasv2.0.3
learn data analysis with pandas
data-analysis pandas pandas-learning
Last synced: 03 May 2026
https://github.com/vipul2001/cousera-courses
This repo covers the solution to the assignments of various courses on algorithm,deep learning and data Analytics
coursera-courses data-analysis data-analytics-ibm deep-learning divide-and-conquer neural-network
Last synced: 29 May 2026
https://github.com/shubham5027/kisanai--the-ultimate-ai-ml-powered-platform-smart-farming-platform
KisanAI – The Ultimate AI/ML-Powered Smart Farming Platform KisanAI leverages AI/ML to optimize farming practices, enhance crop yields, and empower small-scale farmers with data-driven insights.
ai api aws chatbot crm data-analysis deep-learning deplyment farming llm mapping ml nodejs predictive-modeling reactjs supabase sustainability
Last synced: 30 May 2026
https://github.com/theairbend3r/mice-memory-response
Effect of memory on current response in mice using methods from computational neuroscience and machine learning.
computational-neuroscience data-analysis data-science machine-learning neuroscience python
Last synced: 09 Jun 2026
https://github.com/shivshah19/movie-recommendation-system
This Movie Recommendation System is designed to provide personalized movie recommendations based on user preferences.
cosine-similarity data-analysis machine-learning pandas python streamlit
Last synced: 03 May 2026
https://github.com/sunnybibyan/exploratory-data-analysis-eda
Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.
data-analysis data-visualization jupyter-notebook python titanic-dataset
Last synced: 18 Jan 2026
https://github.com/randomshek/Working-With-Excel
Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot
data-analysis excel power-pivot power-query
Last synced: 20 Jul 2025
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 04 May 2026
https://github.com/madhuresh2011/career-aspiration-of-gen-z-project-using-excel
Career Aspiration Of Gen-Z ,To explore the industries, roles, and pathways using Excel .
dashboards data-analysis data-visualization datacleaning dataset designing excel functional-dashboard gen-z kpi pivot-charts pivot-tables project-using-excel
Last synced: 27 Jan 2026
https://github.com/cliffordnwanna/climate_data_analysis_and_visualization
The Climate Data Visualization and Analysis project showcases a comprehensive exploration of African climate data, with an emphasis on identifying patterns and trends in average temperatures across different countries and time periods. It serves as a practical demonstration of my data analysis, data visualization, and problem-solving skills.
data-analysis data-science mathplotlib pandas plotly visualization
Last synced: 04 May 2026
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 21 Jan 2026
https://github.com/steno-aarhus/legliv
Substitution of red meat with legumes and risk of primary liver cancer in UK Biobank participants: A prospective cohort study
cancer-research data-analysis epidemiology nutritional-epidemiology nutritional-science open-science reproducibility reproducible-research rstats ukbiobank
Last synced: 03 Mar 2026
https://github.com/apache/cloudberry-devops-release
DevOps and Release for Apache Cloudberry (Incubating)
ai big-data cloudberry data-analysis data-warehouse database devops distributed-database greenplum mpp olap postgres postgresql
Last synced: 04 Sep 2025
https://github.com/ruchit0807/heart_disease_prediction
An interactive ML-powered web app that predicts the risk of heart disease based on clinical inputs like age, chest pain, cholesterol, ECG, and more. Built using Python, Streamlit, and scikit-learn, it offers early risk assessment in a simple and accessible way—just enter your health metrics and get instant feedback.
data-analysis data-science knn-regression pandas streamlit
Last synced: 04 May 2026
https://github.com/sandergi/designbuildfly
Useful tools made for Design Build Fly at UW, hosted on Glitch so teammates can easily access. Check out our optimization tools here: https://github.com/JPaonaskar/DBF-Optimization
data-analysis inav-blackbox webapp
Last synced: 01 Apr 2025
https://github.com/elkronos/stat_py
Statistics functions for python
assumption-check data-analysis data-visualization python regression statistical-analysis statistical-inference statistical-models statistical-tests statistics
Last synced: 10 Oct 2025
https://github.com/kimtth/agent-data-analyst-stream-chainlit
⚡️Chainlit-based Data Analyst Chat Agent (Responses API, Server Sent Events) 📈
agent azure-openai chainlit code-interpreter data-analysis server-sent-events stream-response
Last synced: 09 Jun 2026
https://github.com/saeun-park/data-analysis
데이터 분석 프로젝트 및 공모전
anova-test data-analysis data-visualization statistics
Last synced: 21 Jan 2026
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/cyberoctane29/unicorn-companies-analysis
This project explores unicorn companies, private startups valued at over $1 billion, using Python for data analysis. It covers industry trends, geographic distribution, and investment patterns through EDA, including data cleaning, handling missing values, datetime transformations, and visualizations to uncover key insights.
data-analysis eda numpy pandas python
Last synced: 02 May 2026
https://github.com/rcv911/lyapunov-indicators
Calculating Lyapunov indicators with multiprocessing in Python
data-analysis lyapunov lyapunov-indicators multiprocessing
Last synced: 18 Jan 2026