Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/isaacmaffeis/imad-2023
Model Identification and Data Analysis (IMAD) | University course
data data-analysis data-science model model-identification
Last synced: 09 May 2026
https://github.com/agrdatasci/climmob-analysis
Workflow for data analysis applied on ClimMob.net
citizen-science data-analysis workflow
Last synced: 24 Jun 2025
https://github.com/shafaq-aslam/data-analytics-dairy
A comprehensive repository for Data Analytics learning and projects. It includes MySQL, Python, Power BI, Tableau, and Excel. The goal is to analyze data, generate insights, and create compelling visualizations for real-world datasets.
data-analysis data-visualization excel excel-based-data-analysis powerbi python-scripts sql sql-queries sql-queries-for-data-manipulation sql-query-for-data-visualization tableau
Last synced: 20 Jan 2026
https://github.com/marina-gal/sql-business-questions
A collection of SQL queries designed to strengthen analytical problem-solving skills using the AdventureWorks2019 sample database. tested and optimized in SQL Server Management Studio (SSMS).
adventureworks data-analysis data-analyst interview-preparation learning microsoft-sql-server practice sql sql-queries
Last synced: 30 May 2026
https://github.com/anderson-andre-p/exploratory-data-analysis.roller-coaster
This repository contains an exploratory data analysis (EDA) project focused on roller coasters. The project involved organizing, cleaning, and visualizing the data to gain insights into roller coasters' characteristics and performance.
data-analysis eda exploratory-data-analysis exploratory-data-visualizations notebook
Last synced: 15 Mar 2025
https://github.com/hyperentangledqubit/shellplot
shellplot -- Generate plot(s) directly from terminal via matplotlib or ggplot2 (plotnine)!
data-analysis ggplot2 graphics matplotlib plotnine plotting pyplot terminal
Last synced: 10 May 2026
https://github.com/tiagocavalcante/nesfit
NES 2024 Practical and Research Work - Group 2
Last synced: 09 Jun 2026
https://github.com/anderson-andre-p/wine-data-analysis
This repository contains a data analysis project that focuses on a series of wine data. The project was completed using Python libraries such as NumPy, Pandas, Seaborn, and Matplotlib. The goal of this project was to gain insights into the characteristics of the wines and to practice data analysis skills.
data-analysis data-science data-science-portfolio pandas-dataframe wine-dataset
Last synced: 15 Mar 2025
https://github.com/codewithmayank-py/covid19-data-analysis-using-python
COVID-19 and Happiness Analysis
data-analysis data-analysis-python data-visualization dataset jupyter-notebooks numpy pandas python3 seaborn
Last synced: 11 Apr 2026
https://github.com/dineshh912/analysis_stock_price_data
Experiment analysis of stock price data with python3
data-analysis data-visualization financial-data python3 scikit-learn stock-price-prediction
Last synced: 24 Apr 2026
https://github.com/srinibas-masanta/electric-vehicle-analysis-dashboard
This repository features an interactive Tableau dashboard that visualizes electric vehicle (EV) adoption trends in the U.S. 🚗⚡ Explore EV growth, top manufacturers, regional distribution, and the impact of incentives—all in one dynamic view. 📊 Use filters to dive deeper into the data and uncover key insights! 🚀
dashboards data-analysis data-visualization tableau
Last synced: 15 Jan 2026
https://github.com/srinibas-masanta/zomato-customer-and-restaurant-analysis
This repository contains a comprehensive analysis of Zomato's platform, focusing on various aspects of customer behavior, restaurant performance, and market trends. The analysis leverages data-driven insights to answer key questions that can guide business strategies, enhance customer satisfaction, and optimize operational efficiency.
business-analytics data-analysis data-science data-visualization
Last synced: 02 Apr 2025
https://github.com/alejandrolara11/machinelearningcourse
Machine Learning Basics: From Setup to Clustering
data-analysis data-science machine-learning numpy pandas plotly preprocessing-data python scikit-learn seaborn streamlit
Last synced: 11 Apr 2026
https://github.com/salma-mamdoh/investigating-netflix-movies-and-guest-stars-in-the-office
My Project to learn the Basics of Analysis & Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python
Last synced: 11 Apr 2026
https://github.com/rosanafss/r-journey
Diving into to wonderful see of DATA
Last synced: 19 Nov 2025
https://github.com/sehgal-vishal/ev-vehicle-market-analysis-dashboard
This Dashboard is related to EV vehicles adoption
clean-energy data-analysis data-visualization electricvehicles future-technologies
Last synced: 04 Mar 2026
https://github.com/beyzabasarir/brazilian-e-commerce-analysis
Brazilian E-Commerce Dataset By Olist PostgreSQL Analysis
data-analysis data-visualization sql
Last synced: 08 Jan 2026
https://github.com/chitranjan806/predicting-on-time-premium-deposits
A Predictive analysis project to predict the success rate of On-Time deposits of Premiums by Policy Holders.
analytics-vidhya analytics-vidhya-competition catboostregressor data-analysis data-science linear-regression logistic-regression python3
Last synced: 16 May 2026
https://github.com/mahmoudwal27/sql-data-analysis
This project demonstrates SQL operations for managing student enrollments, including creating tables, inserting data, updating records, and running queries to analyze student and course information. It showcases skills in data manipulation, aggregation, and advanced query formulation.
analysis data-analysis sql sql-data-analysis sql-queries
Last synced: 13 Feb 2026
https://github.com/jofaval/game-of-thrones
Data Analysis and Predictions of the Game of Thrones' character's survivance from 2016
classification data-analysis data-science data-visualization deep-learning game-of-thrones google-colab kaggle keras machine-learning matplotlib python scikit-learn seaborn tensorflow xgboost
Last synced: 11 Apr 2026
https://github.com/yashpaneliya/bank-loan-default-analysis
Analyze and understand the driving factors (or driver variables) behind loan default, i.e. the variables which are strong indicators of default.
data-analysis loan-default-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/codeslash21/wrangle-twitter-archive
Wrangle Twitter Archive WeRateDog. WeRateDog has 8M followers and they rate the dogs with funny comments and unique rating system. Also use dog-breed classifier to predict dog's breed in the tweets.
data-analysis data-wrangling neural-networkt twitter-api twitter-archive
Last synced: 10 Apr 2025
https://github.com/codeslash21/wrangle_twitter_archive
Wrangle Twitter Archive WeRateDog. WeRateDog has 8M followers and they rate the dogs with funny comments and unique rating system. Also use dog-breed classifier to predict dog's breed in the tweets.
data-analysis data-wrangling nanodegree-project neural-network twitter-api twitter-archive
Last synced: 10 Apr 2025
https://github.com/samruddhi3012/public-health-data-analysis
Hi! This repo involves analyzing the Healthcare analytics using Advanced Microsoft Excel.
dashboard data-analysis data-visualization healthcare microsoft-excel pivot-chart pivot-tables vlookup
Last synced: 05 Feb 2026
https://github.com/satyam4229/prediction-of-cement-compressive-strength
Prediction of cement compressive strength is a model which is based on Regression model, Here we predict that how much is the compressive strength of the particular cement has with variety of mixtures of its component.
data-analysis data-science data-visualization jupyter-notebook kaggle python
Last synced: 13 Apr 2026
https://github.com/robinmillford/cardiac-care-performance-dashboard
This project presents a comprehensive data analysis and interactive dashboard focused on Cardiac Surgery and Percutaneous Coronary Interventions (PCI) performance by hospital, spanning from 2008 onwards.
cardiac data-analysis data-visualization plotly-express streamlit-dashboard tableau tableau-public
Last synced: 07 Sep 2025
https://github.com/vanshuchaudhary/retail-sale
project uses MySQL to analyze retail sales data, focusing on customer behavior, sales trends, and product performance. The dataset includes transactions, customer demographics, and purchase details, helping businesses optimize strategies. Key Insights: 📊 Revenue Analysis – Total sales, top-spending customers 📅 Sales Trends
business-intelligence customer-behavior customer-behavior-analysis data-analysis mysql predictive-analytics retail-analytics sales-analysis sql-queries
Last synced: 23 Mar 2025
https://github.com/abhinav330/customer-behavior-analysis-linear-regression
This repository explores customer behavior data for an NYC clothing company with both a mobile app and website. They want to understand which platform drives higher sales.
data-analysis data-science data-visualization eda exploratory-data-analysis jupyter jupyter-notebook linear-regression machine-learning machine-learning-algorithms machinelearning-python numpy pandas python regression-analysis
Last synced: 06 May 2026
https://github.com/as16082023/motor-vehicle-thefts
Using SQL to analyze vehicle theft patterns across New Zealand, focusing on trends related to specific times and locations.
Last synced: 10 Apr 2025
https://github.com/mikma03/datascience_python_datacamp
DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.
data-analysis data-science datacamp datascience numpy pandas python
Last synced: 06 May 2026
https://github.com/manumoolimani/data-analysis
Data Analysis Projects
data-analysis data-visualization excel
Last synced: 21 Feb 2026
https://github.com/amlanmohanty1/genai-data-analysis-report-generator
Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.
ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python
Last synced: 28 Jan 2026
https://github.com/farhannirzhor/vrinda_store_excel_project
This project is about excel analysis and visualization. In this project, I analyzed Vrinda Store's sales and made an annual sales report
data-analysis data-cleaning data-preprocessing data-visualization microsoft-excel reporting
Last synced: 05 Jan 2026
https://github.com/esther-poniatowski/multitask-context-dependent-behavior
Data analysis of neuronal recordings in naive and trained animals performing multiple tasks in active and passive attentional states
cognitive-neuroscience computational-neuroscience data-analysis data-visualization information-processing
Last synced: 26 Mar 2025
https://github.com/matheusafonseca/c111
Este repositório é dedicado ao armazenamento e organização dos códigos desenvolvidos na disciplina C111 - Análise de Dados, oferecida pelo Instituto Nacional de Telecomunicações (INATEL).
data-analysis matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/mattholy/haka
HaKa is an out-of-the-box tool system designed for data engineers and data analysts in medium-sized enterprises. It is easy to deploy and scale.
celery data-analysis data-engineering fastapi python uvicorn-gunicorn
Last synced: 19 May 2026
https://github.com/rcv911/cluster_generation
Generation of cluster test data
cluster cluster-analysis cluster-generation clustering clustering-algorithm clusters data-analysis machine-learning
Last synced: 18 Jan 2026
https://github.com/samruddhi3012/rfm-sales-analysis
Hi there! In this project I have performed Sales Analysis (RFM Analysis) using SQL and Tableau.
data-analysis data-visualization mssqlserver rfm-analysis segmentation tableau
Last synced: 12 Mar 2025
https://github.com/manditacaos/hypefemme-analise-vendas
Projeto de análise de dados e visualização no Power BI da loja fictícia Hype Femme.
data-analysis jupyter-notebook portfolio powerbi python
Last synced: 10 Apr 2025
https://github.com/nimomach/cafe-sales
This analysis focuses on evaluating the sales performance of a cafe by examining key metrics such as total revenue, sales by product category, peak sales times, and many more.
cafe data-analysis data-visualization sales
Last synced: 12 Mar 2026
https://github.com/aalekhpatel07/statcan
StatCAN dataset fetcher and cleaner.
census data-analysis data-science statcan
Last synced: 02 Apr 2025
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/abhipatel35/svm-hyperparameter-optimization-for-breast-cancer
Utilizing SVM for breast cancer classification, this project compares model performance before and after hyperparameter tuning using GridSearchCV. Evaluation metrics like classification report showcase the effectiveness of the optimized model.
breast-cancer cancer-diagnosis classification data-analysis data-science gridsearchcv healthcare hyperparameter-tuning jupyter-notebook machine-learning medical-imaging pycharm python scikit-learn support-vector-machine svm
Last synced: 05 Feb 2026
https://github.com/friendotjava/air-quality-analysis
Data Analysis for Air Quality in different District in Beijing.
data-analysis explanatory-data-analysis exploratory-data-analysis jupyter-notebook pytho streamlit
Last synced: 06 May 2026
https://github.com/edanur-y/airline-customer-satisfaction-prediction-with-multiple-logistic-regression
Performing multiple logistic regression analysis on airline and customer data to predict the satisfaction. 🔵R
data-analysis missing-values-analysis multiple-logistic-regression optimal-cut-off-points r
Last synced: 09 Jun 2026
https://github.com/arsalan-dev-engineer/ai-repository
A repository that contains AI related projects, notes, practice files and documentations.
ai algorith beginner-friendly data-analysis data-preprocessing developer jupyterlab matplotlib matplotlib-pyplot natural-language-processing numpy pandas python unsupervised-learning visualization
Last synced: 12 Apr 2026
https://github.com/hanzopgp/lolanalysis
League Of Legends game data engineering, analysis, visualization and machine learning. Business intelligence project.
data-analysis data-cleaning data-engineering data-visualization dataiku deep-learning etl machine-learning scraping university
Last synced: 27 May 2026
https://github.com/nero103/airbnb-destination
This is and end-to-end project to uncover the ideal destination based on listings and hosts. Strategy included: Data workflow-SQL analysis-Data modeling-Data Visualization-Findings
data-analysis data-modeling data-visualization etl etl-pipeline excel microsoft-sql-server powerpoint sql tableau
Last synced: 27 Mar 2026
https://github.com/chokzb/covid19_vaccination_analysis
An EDA project examining global COVID-19 vaccination progress. The notebook investigates vaccination trends by country, daily vaccination rates, timeline patterns, and dose distribution. The project includes visualisations created with Matplotlib, Seaborn, and Plotly.
covid-19 data-analysis data-visualization jupyter-notebook matplotlib numpy pandas plotly python seaborn vaccination
Last synced: 07 May 2026
https://github.com/chitranjan806/greyatom_learning_repo
A Collection of Projects, Tasks and Challenges as part of Data Science Masters - Transition Program at GreyAtom.
data-analysis data-science greyatom python3
Last synced: 29 Jun 2026
https://github.com/bitcoin-apps-suite/bitcoin-spreadsheet
Open source Bitcoin-powered spreadsheet application with blockchain data integration, smart contract calculations, and collaborative financial modeling | By THE BITCOIN CORPORATION LTD
bitcoin bitcoin-sv blockchain bsv cryptocurrency dapp data-analysis decentralized excel-alternative nextjs spreadsheet typescript web3-spreadsheet
Last synced: 05 May 2026
https://github.com/jianxi-erin/bigdata-machinelearning-lab
本项目是一个综合性的大数据与机器学习实验平台,包含两个主要任务,每个任务涵盖三个关键技术模块:大数据处理、数据分析和机器学习。项目基于真实的竞赛设计,提供完整的数据处理模拟和建模实践。
data-analysis data-visualization hadoop machine-learning python spark sql
Last synced: 03 May 2026
https://github.com/mahmoud2abdallah/improvado-marketing-homework
This Looker Studio dashboard provides a comprehensive analysis of marketing performance for August 2024, transforming raw data into actionable insights for data-driven decision making.
bigquery business-intelligence data-analysis looker-studio marketing
Last synced: 05 Oct 2025
https://github.com/ankitwalimbe/ecommerce-funnel-analysis
SQL-based analysis of the Olist e-commerce dataset — building an order funnel (purchase → approval → delivery) with breakdowns by payment type, product category, region, and monthly trend. Includes insights, CSV exports, and Tableau dashboard.
bigquery business-intelligence data-analysis ecommerce funnel-analysis sql tableau-public
Last synced: 05 Oct 2025
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/manishkaa/google_data_analytics_capstone_case_study
This case study is a part of Google Data Analytics Capstone Project
bigquery data-analysis sql tableau
Last synced: 05 Oct 2025
https://github.com/ayushsiloiya619/spotify-song-analysis
Data Analytics with Python
data-analysis matplotlib-pyplot pandas-dataframe python3 seaborn
Last synced: 08 May 2026
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 21 Jan 2026
https://github.com/subhamghimire/dataanavis
Learning Data analysis and visualization
data-analysis data-science data-visualization dataset
Last synced: 06 Oct 2025
https://github.com/takshshah-16/spotify_eda
Spotify data analytics and advanced querying
data-analysis eda pgadmin4 postgresql
Last synced: 30 Oct 2025
https://github.com/chanmeng666/douban-review-scraper
【One star = One happy developer doing a little dance 💃⭐️】A robust Python scraper for collecting and analyzing movie reviews from Douban.com, featuring comprehensive data processing and analysis capabilities.
beautifulsoup4 data-analysis data-processing douban movie-reviews pandas python sentiment-analysis text-mining web-scraping
Last synced: 02 May 2026
https://github.com/ilaxi/lomicontadores
data management tool in reference to number of actions per day in a year
data-analysis gdscript godot godot4 python
Last synced: 19 Apr 2026
https://github.com/myles/notebooks
Some of my random Jupyter Notebooks.
data-analysis data-science jupyter-notebooks
Last synced: 18 Jan 2026
https://github.com/nuriadevs/informes-powerbi
Este repositorio contiene informes elaborados con Power BI.
Last synced: 18 Feb 2026
https://github.com/michaelcurrin/yahoo-finance-reports
Use the Yahoo Finance API to get info on shares of interest and report on them
data-analysis data-science python reporting shares stock-market yahoo-finance yahoo-finance-api
Last synced: 07 Oct 2025
https://github.com/eharshit/end-to-end-vendor-insights
End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.
analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale
Last synced: 07 Oct 2025
https://github.com/jedrzej-wydra/data-analysis-pro
Professional Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/haonamnguyen/costumer-shopping-trends-analysis
This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.
data-analysis data-visualization sql tableau
Last synced: 05 Jan 2026
https://github.com/npodlozhniy/podlozhnyy-module
One place for the most useful methods for work
data-analysis data-science pypi
Last synced: 21 Jan 2026
https://github.com/aymanmomin/excel-coffee-data-analytics-exploring-coffee-orders-dataset
This project utilizes a coffee orders dataset to perform comprehensive data analytics and gain insights into customer preferences, popular items, and sales trends. The analysis aims to provide valuable information for coffee shop owners and enthusiasts, facilitating data-driven decision-making and improved customer satisfaction.
data-analysis data-visualization excel project
Last synced: 18 Jan 2026
https://github.com/rita94105/ethereum-fraud-detection
This project focuses on detecting fraudulent transactions in the Ethereum network using both traditional machine learning models and deep learning techniques. By analyzing transaction attributes and interaction patterns, we aim to develop an effective fraud detection model.
data-analysis deep-learning ethereum fraud-detection machine-learning
Last synced: 01 May 2026
https://github.com/harryrlk/data_analysis_showcase
This repository showcases my data analysis and visualization projects using Excel, Python, R, and Tableau. Some projects are under NDA, so key figures and specific numbers are not included, but brief overviews and methodologies are provided. Feel free to explore and contact me for further details.
data-analysis data-science data-visualization excel portfolio python r tableau
Last synced: 06 May 2026
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 11 Apr 2026
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 15 May 2025
https://github.com/alexquilis1/spanish-fuel-stations-analysis
Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons
data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse
Last synced: 08 Oct 2025
https://github.com/inddrsingh/e-commerce_orders
ETL project, with Python for Data cleaning and MySQL for Data analysis
data-analysis etl-pipeline mysql python
Last synced: 18 Apr 2026
https://github.com/bhavanachitragar/data-analysis-using-pyspark
Working with pyspark module in python and using google colab environment in order to apply some queries to the dataset. The dataset consist of two csv files listening.csv and genre.csv. Also, visualizing query results using matplotlib.
data-analysis google-colab pyspark-sql
Last synced: 30 Jun 2025
https://github.com/jlee9503/telecommunication-churn
Analyze key factors influencing customer churn using Python data analytics technique. Explore key factors through data preprocessing, exploratory data analysis (EDA), and predictive modeling.
data-analysis data-visualization matplotlib pandas python scikit-learn
Last synced: 18 Jan 2026
https://github.com/faisal-khann/ipl-analysis
The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.
data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn
Last synced: 08 May 2026
https://github.com/docuvesta/youtube-api-fragrance-channel-analytics
Engagement metrics analysis of perfume Youtube channel using Youtube API 🎀
analysis beauty-products comments data-analysis data-analysis-python engagement-metrics insights jupyter-notebook likes-count marketing marketing-analytics perfume python views-count youtube youtube-api youtube-api-v3
Last synced: 03 May 2026
https://github.com/izzyl3333/mosquito_analysis
An exercise using Python and statistical analysis in mosquito data to understand the relationship between the different variables and the mosquito number.
chicago data-analysis data-science exploratory-data-analysis mosquitoes python statistical-analysis west-nile-virus
Last synced: 19 Jan 2026
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/superpandas-ai/superpandas
Adding LLM integration to Pandas library
Last synced: 06 May 2026
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 06 May 2026
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/amanyadav-07/customer-churn-prediction
Machine Learning project to predict customer churn using Logistic Regression, Random Forest, and XGBoost. Includes data preprocessing, feature engineering, SMOTE balancing, model training, evaluation, and business insights.
accuracy-metrics data-analysis data-visualization logistic-regression machine-learning matplotlib numpy pandas python3 random-forest-classifier seaborn sklearn xgboost-classifier
Last synced: 11 Apr 2026
https://github.com/sabdikay/analysis-of-biodiversity
This project analyzes biodiversity data from the National Parks Service, focusing on species in various park locations. Conducted in Jupyter Notebook, it uses pandas, matplotlib, NumPy, seaborn, and chi2_contingency for analysis and visualization.
data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 14 Apr 2026
https://github.com/ibromeat/road-accident-risk
Exploratory Data Analysis of road accident risk predictions — visualizing model stability and distribution of predicted probabilities.
data-analysis jupyter-notebook matplotlib python traffic-data visualization
Last synced: 18 May 2026
https://github.com/delabrov/jwstoolkit
A python package for handling JWST observations
astronomy-astrophysics data-analysis data-cube data-visualization imagery jwst python3 spectroscopic-data
Last synced: 26 May 2026
https://github.com/karsterr/repeated-measurement
An R-based workflow for conducting repeated measures ANOVA using the ez package, with data wrangling via tidyverse and visualization through ggplot2. Includes data import, transformation to long format, statistical analysis, and graphical summary.
anove data-analysis experimental-design ezanove ggplot2 r repeated-measurements rstats statistics tidyverse
Last synced: 18 Sep 2025
https://github.com/amirreza81/kaggle-pandas-course-solutions
Kaggle Pandas Course - Solved exercises in another way of sample solution
data data-analysis data-cleaning data-manipulation data-science dataframe jupyter-notebook kaggle machine-learning open-source pandas
Last synced: 14 Apr 2026
https://github.com/imrandil/excel_learning_dir
Excel learning practice with some data, the doing
Last synced: 27 Jan 2026
https://github.com/cyberoctane29/diamonds-anova-analysis
This project uses ANOVA in Python to analyze how diamond color and cut affect pricing. By testing for statistical significance and running post hoc comparisons, it reveals key pricing patterns. Built with pandas, statsmodels, and Seaborn, the findings help inform diamond valuation and purchasing decisions.
anova-test data-analysis data-analytics data-science diamonds-dataset regression-analysis statistical-analysis tukey-hsd
Last synced: 10 Oct 2025
https://github.com/27ahmad/netflix_sql_project
The Netflix SQL Project analyzes the Netflix dataset using SQL queries to gain insights into its content, identify trends, and address business problems related to movies and TV shows.
data-analysis postgresql-database sql
Last synced: 03 Feb 2026
https://github.com/abhay-sinha-0/carpricepredictionproject
A machine learning project that predicts the selling price of a car based on its features such as year, mileage, fuel type, transmission, and more. This model can assist individuals and dealerships in estimating fair market prices for used cars.
artificial-intelligence data-analysis data-science data-visualization exploratory-data-analysis machine-learning-algorithms matplotlib-pyplot mysql-database numpy-library pandas-library python skit-learn sklearn-library
Last synced: 15 May 2025
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/priyanshubiswas-tech/deloitte-daikibo-telemetry-analysis-task-1
Tableau dashboard analyzing Daikibo telemetry data. Tracks downtime by factory/device with interactive filters. Deloitte task solution with JSON processing.
data-analysis data-visualization deloitte json tableau tableau-public
Last synced: 11 Oct 2025
https://github.com/jiwookseo/natural_language_analysis
api sample for google natural language and ECOS(한국은행 경제통제시스템)
data-analysis google-natural-language-api text-analysis
Last synced: 11 Oct 2025