Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly
Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool
blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly
Last synced: 18 Apr 2026
https://github.com/shubh-bharadwaj/income-dataset-analysis
data-analysis data-science pandas python
Last synced: 18 Apr 2026
https://github.com/mksingh431/free-data-science-courses
Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.
course data data-analysis data-science data-visualization free freecou python
Last synced: 19 Apr 2026
https://github.com/shyamkumarnagilla/ai-powered-forecasting-for-agricultural-productivity
AI Powered Forecasting for Agricultural Productivity is a project that utilizes machine learning to predict crop yields and optimize farming practices. By harnessing historical and real-time data, this model empowers farmers with data-driven insights to enhance productivity and sustainability in agriculture.
data-analysis data-visualization deep-learning flask neural-network
Last synced: 19 Apr 2026
https://github.com/rodriguesl1/analise-ibovespa-fiap
Modelo de previsão do índice IBOVESPA utilizando técnicas de séries temporais. O projeto inclui análise exploratória, decomposição sazonal, testes de estacionariedade e modelagem com Prophet, AutoARIMA e outros modelos estatísticos para apoiar decisões de investimento.
autoarima b3 brasil data-analysis economia finance forecasting ibovespa pandas prophet python statsmodels time-series
Last synced: 19 Apr 2026
https://github.com/yuvrajsaraogi/unemployment-analysis-with-python
Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.
big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql
Last synced: 19 Apr 2026
https://github.com/tsffarias/my-books
Exploratory analysis of my Dataset 'All_the_Books_I_read' which contains all the books I've read
books data-analysis python tableau
Last synced: 19 Apr 2026
https://github.com/leosimoes/nexoseducacao-imersao-powerbi
Atividades realizadas na Imersão PowerBI pela Nexos Educação com Karine Lago e Leticia Smirelli em Setembro de 2023.
business-intelligence dashboards data-analysis microsoft-power-bi
Last synced: 06 Jan 2026
https://github.com/souraevshing/data-science-01
Data analysis using jupyter notebook.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 19 Apr 2026
https://github.com/mugambi645/eda-projects
A list of EDA projects
data-analysis eda matplotlib numpy pandas plotly seaborn webscraping
Last synced: 19 Apr 2026
https://github.com/mlucifer27/bilateral-visualization
Streamlit app visualizes bilateral relationship scores between 100 countries from 1945 to 2024. It supports interactive heatmaps, network graphs, pairwise comparisons, and more.
d3blocks data-analysis data-visualization plotly-python python streamlit
Last synced: 04 Jun 2026
https://github.com/akash-v7/telecom_customer_churn_prediction
A machine learning project to predict customer churn in the telecom industry using data analysis and classification models. The project includes data preprocessing, exploratory data analysis (EDA), model building, and insights to help telecom companies improve customer retention strategies.
data-analysis data-science data-visualization jupyter-notebook machine-learning predictive-modeling python
Last synced: 20 Apr 2026
https://github.com/leftcoastnerdgirl/introduction_to_python
This project provides an introduction to data analysis using Python.
data-analysis data-analysis-python data-analytics data-comparison data-import for-loop jupyter-notebook min-max python
Last synced: 20 Apr 2026
https://github.com/nikolaos-mavromatis/etf-data-analysis-dashboard
Insights into SPY ETF performance with an interactive Streamlit dashboard powered by Alpha Vantage data.
api data-analysis data-visualization financial-analysis pandas plotly python streamlit
Last synced: 20 Apr 2026
https://github.com/namratha2301/carprice_analysisandprediction
This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.
data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn
Last synced: 20 Apr 2026
https://github.com/natnaelhhaile/text-similarity-analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 20 Apr 2026
https://github.com/lucashomuniz/Project-05
Statistical Analysis of Hospitalization Costs: Leveraging SQL and R for Insights
anova-analysis anova-test data-analysis finance-analysis-data language-r linear-regression sql stastistical-model statistical-analysis
Last synced: 20 Oct 2025
https://github.com/jbalooshie/school_district_analysis
Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.
data-analysis data-science dataframes jupyter-notebook numpy pandas python
Last synced: 20 Apr 2026
https://github.com/hilalguleryuz/powerbi_hotelbooking_data_analysis_project
Hotel Booking Data Analysis with Power BI
dashboard data-analysis data-visualization dax hotel-booking powerbi
Last synced: 06 Jan 2026
https://github.com/xre22zax/roller-coaster
Explore award-winning wood and steel coasters from 2013-2018 Golden Ticket Awards & Captain Coaster, all powered by Python and interactive visualizations.
analytics data-analysis data-visualization pandas python python-lambda python3 visualization
Last synced: 20 Apr 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/abinashsahoo007/project-bankruptcy-prevention
The project is to create a classification model that predicts the chances of a business facing bankruptcy based on the key feature like Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, Operating Risk.
data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit
Last synced: 20 Apr 2026
https://github.com/an4pdm/relatorio-de-vendas
O presente projeto foi feito através das ferramentas oferecidas pelo Power BI afim de aprimorar meus conhecimentos sobre ETL. Os dados utilizados foram de origem do site "Kaggle".
data-analysis data-visualization database etl powerbi
Last synced: 20 Jun 2026
https://github.com/adrianlardies/from-data-to-insight
This project creates and manages a MySQL database to analyze the performance of Bitcoin, Gold, and the S&P 500 in response to economic factors. It integrates historical data, executes advanced SQL queries, and visualizes key insights, showcasing the power of SQL and Python in financial analysis.
data-analysis data-science matplotlib pandas python seaborn sql
Last synced: 12 Apr 2026
https://github.com/ayushbaid/football_stats
Analysing the competitiveness in different European football leagues
Last synced: 03 Apr 2025
https://github.com/mahmoudwal27/e-commerce-data-analysis
A collection of data analysis and visualization projects focused on ecommerce datasets. Using Python in Google Colab for analysis and Excel for exploration, these projects uncover key insights and trends, showcasing expertise in data manipulation and visualization to inform business decisions.
analytics data-analysis data-analysis-python data-set google-cloud python
Last synced: 21 Apr 2026
https://github.com/codesaadumair/exploratory-data-analysis
A centralized repository showcasing various Exploratory Data Analysis (EDA) projects using Jupyter notebooks, visualizations, and accompanying documentation.
data-analysis data-science data-visualization eda jupyter-notebook jupyterlab python
Last synced: 24 Mar 2025
https://github.com/florence-nyokabi/house-power-consumption
Machine Learning: Exploring Regression Analysis
data-analysis data-cleaning data-science data-visualization feature-engineering jupyter-notebook jupyterlab machine-learning pandas-python regression-analysis regression-models
Last synced: 05 Jun 2026
https://github.com/rachel-xmr/data-analysis-in-health-set-csc3062
CSC3062 Data Analysis and visualization
classification-algorithm data-analysis data-visualization model-evaluation nmf pca python svm t-sne visualization
Last synced: 05 Jun 2026
https://github.com/prady2309/air-traffic-analysis
data-analysis data-science data-visualization flights jupyter-notebook python3
Last synced: 21 Apr 2026
https://github.com/juliuspinsker/bioconductor-learning-container
🧬 Containerized development environment for Harvard's Professional Certificate in Data Analysis for Genomics (PH525.x series). Streamlined setup for Bioconductor, R, and genomic data analysis with RStudio and DevContainer support.
bioconductor bioinformatics chip-seq data-analysis data-science devcontainer dna-methylation docker edx functional-genomics genomics harvard harvardx ph525 ph525x r reproducible-research rna-seq rstudio single-cell-rna-seq
Last synced: 14 May 2026
https://github.com/meerantajalli/networksecuritydefense
This Network Security defense systems acts as an indicator against SMP Floods, UDP Floods, ICMP Floods. This model is trained using packets from wireshark and can easily differentiate between normal network traffic and traffic that has been targetted on the machine by an attacker using the rate of packets transfer and using the source IP.
anomaly-detection classification cyber-security data-analysis ddos-detection icmp-flood intrusion-detection machine-learning network-security packet-analysis python random-forest security smp-flood udp-flood wireshark
Last synced: 21 Apr 2026
https://github.com/anderson-andre-p/uber-data-analysis
This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.
data-analysis data-science data-visualization python
Last synced: 15 Jun 2026
https://github.com/aliiahmadi/data-manipulation
Algorithms and solutions
algorithm algorithms data-analysis data-science
Last synced: 30 Jun 2025
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks done testing out JetBrains DataLore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 22 Apr 2026
https://github.com/prgermux/yield-reporter
This Python application provides a graphical user interface (GUI) for analyzing and visualizing production data from various machines. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data.
automation data-analysis python reporting
Last synced: 22 Apr 2026
https://github.com/rajesh9943/sentiment-analysis-of-consumer-opinions-on-amazon-products
Developed a comprehensive Sentiment Analysis System aimed at classifying Amazon product reviews into positive, neutral, and negative sentiments. The project leveraged advanced Natural Language Processing (NLP) techniques alongside machine learning algorithms to deliver accurate and actionable insights from customer feedback
amazon data-analysis data-manipulation data-preprocessing data-presentation data-visualization machine-learning nlp nlp-library nltk product-reviews-analysis sentiment-analysis sklearn-library word-cloud-generator-in-python-3
Last synced: 05 Jun 2026
https://github.com/kgotsosm/epl-analysis
Preparing data for machine learning algorithms to predict English Premier League match winners.
data-analysis data-cleaning data-modeling
Last synced: 22 Apr 2026
https://github.com/leabrodyheine/california-schools-data-visualization
This front-end project provides interactive visualizations of learning models adopted by California schools during the pandemic. Using D3.js and Mapbox, it dynamically presents data through bar charts, bubble charts, heatmaps, and geographic maps, allowing users to explore trends across school types, sizes, and districts.
d3-visualization d3js data-analysis data-visualization mapbox openai plotly
Last synced: 22 Apr 2026
https://github.com/ayushi-gajendra/restaurant-order-analysis-sql
End-to-end SQL analysis of 12,266 restaurant transactions to identify high-performing menu items, revenue concentration, bulk ordering behavior, and strategic growth opportunities.
analytics-portfolio business-intelligence case-study customer-segmentation data-analysis data-analytics database-analysis menu-engineering mysql revenue-analysis sql sql-project
Last synced: 05 Jun 2026
https://github.com/floffah/my-listening
Various ways to analyse your Spotify extended streaming history data
convex data-analysis listening-history spotify
Last synced: 23 Apr 2026
https://github.com/al-ogr/sf_pr1_job_analysis_hh
SkillFactory DataScience PROJECT-1. Анализ резюме из HeadHunter
data-analysis data-science ipynb plotly python
Last synced: 23 Apr 2026
https://github.com/tranngoca5039/bigquery-a5y
📊 Streamline your data analysis with bigquery-a5y, a powerful tool for optimizing BigQuery performance and improving query efficiency.
analytics api big-data bigquery cloud-computing data-analysis data-integration data-management data-pipeline data-visualization data-warehouse google-cloud machine-learning serverless sql
Last synced: 05 Jun 2026
https://github.com/thc1006/nycu_timtable_crawler
🎓 NYCU Course Data Crawler & Timetable System | 國立陽明交通大學課程爬蟲與選課系統 - Python web scraper for course schedules, syllabi & educational data analysis. Crawls 18K+ courses with 98% success rate. Features: interactive timetable, JSON API, Google Colab support, batch processing, resume capability.
academic course course-selection crawler data-analysis education educational-data google-colab json-api nycu open-data python schedule student-tools syllabus taiwan timetable university web-automation web-scraping
Last synced: 24 Apr 2026
https://github.com/misszeferino/cyclistic-bike-share-analysis
Data Analysis using R
data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 06 Jun 2026
https://github.com/suhas-005/eda-indian-startup-funding
Exploratory Data Analysis on Indian Startup Funding (2015-2020)
data-analysis data-analytics data-science data-visualization exploratory-data-analysis matplotlib pandas python seaborn startup-funding
Last synced: 09 May 2026
https://github.com/arunabhagit/inventory-misalignment-and-revenue-loss-in-multi-store-bike-retail
This project focuses on identifying the inventory and demand mismatch causing stagnant sales and lost revenue in a bike retail chain. By analyzing store-level performance and regional customer preferences, the project aims to detect underperforming products.
data-analysis data-visualization powerbi python
Last synced: 24 Apr 2026
https://github.com/lfariello/atmospheric_reentry
Matlab code for the determination of the reentry trajectory, deceleration profiles, and heat flux of the ARD capsule during orbital reentry into Earth's atmosphere.
data-analysis heat-flux-prediction heat-transfer hypersonic hypersonic-capsule matlab-programming trajectory-prediction
Last synced: 23 Mar 2025
https://github.com/voidnire/redditviralmysteryposts
Análise de posts de subreddits de mistério. O que define um post viral neste tipo de sub?
data-analysis data-visualization mysteries mystery nlms python-3 reddit
Last synced: 24 Apr 2026
https://github.com/muthukumar0908/youtube-data-harvesting-and-warehousing-using-sql-mongodb-and-streamlit
Create a simple and intuitive user interface using Streamlit, From the youtube getting and extracting the data by using API key. That data stored in database.
data-analysis mongodb-atlas python sqldatabase streamlit-webapp youtube-api
Last synced: 24 Apr 2026
https://github.com/jpcadena/ventas-facturas
Ventas con facturas
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 12 Apr 2026
https://github.com/cyberoctane29/python-for-data-analysis
A repository dedicated to learning Python for data analysis, data science, and data analytics. This collection of Jupyter notebooks covers practical exercises and concepts from the Google Advanced Data Analytics Professional Certificate program.
data data-analysis data-analytics data-science python
Last synced: 24 Apr 2026
https://github.com/tsbarr/toronto-open-data
Analysis of Toronto's open data initiatives. 🌆 Exploring Toronto's urban systems through data science 📊 Python-based analyses of public datasets 🔍 Focus on community impact and urban patterns 🎓 Academic rigour meets practical insights 🔄 Regularly updated with new analyses
api-integration civic-tech ckan-api data-analysis data-cleaning data-science data-visualization exploratory-data-analysis jupyter-notebook open-data pandas public-data python tableau toronto urban-analytics
Last synced: 09 May 2026
https://github.com/amlanmohanty1/zepto-sql-data-analysis-project
Complete Data Analysis on Zepto Inventory data using SQL
data-analysis database inventory-management postgresql sql zepto
Last synced: 24 Apr 2026
https://github.com/mikeesto/ausvotes19
:bird: A collection of 67,284 public tweets published on the night of the 2019 Australian election
australia data-analysis data-visualization elections open-data twitter
Last synced: 06 Apr 2025
https://github.com/pedrohdosanjos/economic-data-analysis
This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.
api data-analysis data-visualization jupyter-notebook python
Last synced: 24 Apr 2026
https://github.com/ismielabir/pycsvsummarizer
A lightweight tool to summarize CSV files using various features.
csv data-analysis data-summary python
Last synced: 25 Apr 2026
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Safra Taşı Hastalığı (Gallstone-1) Veri Seti Analizi (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/tmoulik/bikeshare-python
Analysis of Bikeshare data from three major cities
data-analysis data-visualization python udacity-nanodegree
Last synced: 25 Apr 2026
https://github.com/itsharshparmar/uber_data_analysis
Data analysis on Uber ride data using Python and visualization libraries.
analytics-projects business-analysis colab-notebook data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis matplotlib pandas python python-project real-world-data seaborn time-series-analysis transportation uber uber-data-analysis
Last synced: 09 May 2026
https://github.com/avilash1212/sales-dashboard-using-google-looker-
Sales dashboard project using Google looker studio
data-analysis data-visualization jupyter-notebook python sql
Last synced: 25 Apr 2026
https://github.com/urmesthamondal/data_analysis_projects
Portfolio Data analysis projects built using Excel, Python, SQL and for visualization used Power bi .
data-analysis pivot-tables powerbi python sql sql-server visualisation
Last synced: 09 May 2026
https://github.com/marielachirinosr/bellabeat-wellness-data-trends
Analyzing smart device data for insights on user activity patterns to optimize interventions for better health outcomes.
data data-analysis data-visualization pandas python python3 tableau tableau-public
Last synced: 25 Apr 2026
https://github.com/viniciusds2020/streamlit_app_adult
Protótipo APP - Machine learning - Streamlit
app data-analysis data-science front-end joblib machine-learning python streamlit
Last synced: 25 Apr 2026
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 25 Apr 2026
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 25 Apr 2026
https://github.com/devexpress-examples/wpf-pivotgrid-customize-the-cell-template
This example demonstrates how to customize the cell appearance in Pivot Grid for WPF.
data-analysis dotnet dxpivotgrid pivot-grid pivot-grid-for-wpf wpf
Last synced: 26 Apr 2026
https://github.com/abdullahashfaqvirk/powerbi-dashboards
A collection of Microsoft Power BI dashboards and reports designed to address business challenges and support data driven decision-making.
dashboards data-analysis data-driven data-science microsoft powerbi reports visualization
Last synced: 10 Mar 2026
https://github.com/rociobenitez/happiness-index-data-processing
Repository for Big Data Processing - Contains Jupyter Notebooks and Datasets for data analysis and processing tasks related to Big Data.
big-data big-data-processing data-analysis data-processing happiness-index happiness-report jupyter-notebook matplotlib pandas seaborn
Last synced: 15 May 2026
https://github.com/prakashjha1/new-analysis-using-llm-locally
An interactive news analysis tool built with Streamlit and local LLMs. This app allows users to analyze and gain insights from the latest news articles using advanced language models, all running locally. Explore trends, sentiment, and key topics with an intuitive interface.
artificial-intelligence data-analysis data-science llms ollama python streamlit
Last synced: 14 Mar 2025
https://github.com/ys1f/geothermal_project
Geothermal Data Analysis & Visualization for Texas – well data, temperature gradients & zone mapping
bht bottom-hole-temperature data-analysis folium geopandas geospatial geothermal gis interpolation irena jupyter-notebook mapping python rasterio spatial-analysis temperature-gradient texas visualization well-data zone-mapping
Last synced: 26 Apr 2026
https://github.com/swapnanildutta/prediction-with-python
The projects are made using Jupyter Notebook
data-analysis jupyter-notebook machine-learning prediction python regression-models
Last synced: 27 Apr 2026
https://github.com/noturlee/iris-dataanalyis
This project aims to classify Iris flowers into three species—setosa, versicolor, and virginica—based on their sepal and petal measurements using machine learning techniques. The dataset comprises 150 samples evenly distributed among these species
data-analysis data-modeling data-science data-structures-and-algorithms data-visualization
Last synced: 08 Apr 2025
https://github.com/pararang/nams-thesis-fuzzy
A specialized data processing tool designed to help with Fuzzy Delphi Method calculations for thesis research data analysis. Then extended with some new features for data processing with different method.
data-analysis dematel hacktoberfest hacktoberfest-accepted house-of-quality python sustainability vibecoding
Last synced: 27 Apr 2026
https://github.com/arush-codes/paris-olympic-de
data engineering project on paris olympics 2024
azure data-analysis data-engineering microsoft-azure olympics2024 pipeline
Last synced: 27 Apr 2026
https://github.com/mumtaz4118/amazon-iphone-12-data-scrapped
Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.
data-analysis data-extraction data-science data-scraping html mark-up python
Last synced: 27 Apr 2026
https://github.com/lit26/novel-corona-virus-2019
Data Analysis for Novel Corona Virus 2019
analysis coronavirus-case data-analysis sir-model
Last synced: 10 Jun 2025
https://github.com/sohamb21/analysis-of-superstore-dataset
I completed the IBM SkillsBuild Data Analytics Internship Program to develop my Data Analytics skills and apply them to a real-world problem by working on this project.
Last synced: 27 Apr 2026
https://github.com/manasashetty01/regulatory-affairs-of-road-accidents
Regulatory Affairs of Road Accidents in Million-Plus Cities (India, 2020)
data-analysis data-science data-visualization exploratory-data-visualizations jupyter-notebook numpy pandas python
Last synced: 27 Apr 2026
https://github.com/anburocky3/cbse-schools-data
Fetch CBSE Schools in seconds and use it for your data projects
cbse data data-analysis data-science grabber nextjs
Last synced: 24 Jun 2026
https://github.com/edanur-y/laptop-price-prediction-with-regression-models
Comparing the performances of multi-layer perceptron, k-nearest neighbors, random forest, gradient boosting and extreme gradient boosting regression and on laptop data to predict the price.
data-analysis data-transformation feature-importance hyperparameter-tuning python
Last synced: 27 Apr 2026
https://github.com/jofaval/supertsore-timeseries
Timeseries Data Analysis and Forecast of the sales from a superstore in 2015-2018
data-analysis data-science deep-learning deep-neural-networks forecasting google-colab lstm openml python tensroflow time-series time-series-analysis time-series-forecasting
Last synced: 27 Apr 2026
https://github.com/banyc/dfplot
Summarize a data frame by plotting. `cargo install --git https://github.com/Banyc/dfplot.git`.
csv data-analysis plotly plotting statistics
Last synced: 27 Apr 2026
https://github.com/airdac/ml-palmerpenguins
Classification and analysis of the palmerpenguins dataset in Python. Team project from UPC's Master's Degree in Data Science
classification data-analysis data-science machine-learning palmer-penguin python upc
Last synced: 07 Jun 2026
https://github.com/lotfiferaga/hotel-reviews-sentiment-analysis
Efficient Python-driven sentiment analysis for hotel reviews, providing insightful evaluations.
data-analysis data-visualization nlp python
Last synced: 07 Jun 2026
https://github.com/josedanielchg/1990s-netflix-movie-insight
Small exploratory analysis of Netflix movie data from the 1990s. This project is part of the DataCamp Associate Data Scientist in Python program and focuses on filtering, visualizing, and extracting insights from a dataset using Python. Analyze trends in movie durations and count short action films to practice key data science skills!
Last synced: 27 Apr 2026
https://github.com/tillscode/personal-finance-ml-analysis
Machine learning analysis of personal financial data with predictive modeling and interactive dashboard
dashboard data-analysis finance machine-learning python scikit-learn
Last synced: 28 Apr 2026
https://github.com/sweta2501/netflix_dataanalysis
With the help of Netflix Data, I have done some Data Analysis.
data-analysis data-science jupyter-notebook python
Last synced: 28 Apr 2026
https://github.com/analysisbyvivek/road-accident
Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.
data-analysis data-visualization eda jupyter-notebook kaggle tableau-public
Last synced: 19 Jun 2026
https://github.com/mktechai-0786/data-analysis-on-dr-visits
Data Analysis On Dr. Visits dataset
data-analysis matplotlib-pyplot numpy pandas seaborn
Last synced: 09 May 2026
https://github.com/simranshaikh20/diwali-sales-analysis-for-business-insights
A data analyst project on diwali sales . In this state according state , gender, age we are able to know how much sale it done.
data-analysis data-visualization python
Last synced: 28 Apr 2026
https://github.com/stefagnone/movies-dataset-analysis-project
Comprehensive analysis of the Movies dataset, exploring genre trends, comparisons, and qualitative insights using Python, Pandas, and visualizations. Designed to uncover actionable findings for stakeholders.
data-analysis data-visualization exploratory-data-analysis matplotlib movies-analysis pandas python seaborn storytelling-with-data
Last synced: 28 Apr 2026
https://github.com/datalopes1/warehouse_rfv
Neste projeto será realizada uma análise do tipo RFV (Recência, Frequência e Valor) com dados que encontrei neste video no Youtube do canal Jie Jenn.
analise-rfv data-analysis data-science kmeans python rfm-analysis
Last synced: 28 Apr 2026
https://github.com/dcs-training/data-wrangling-and-vis-pandas
Introduction to analyzing structured data with the Python libraries pandas, for CSV and TSV data, and ElementTree, for XML data. Go to the readme file
data-analysis data-visualisation data-wrangling python
Last synced: 16 Jun 2026
https://github.com/rajivaleaakash/customer-churn-prediction
A machine learning project focused on predicting customer churn using various data analysis and modeling techniques. The repository includes data preprocessing, feature engineering, exploratory data analysis (EDA), model training, evaluation, and visualization to help businesses identify customers at risk of leaving.
churn-prediction classification customer-churn data-analysis data-science gridsearchcv imblearn machine-learning numpy pandas pyhton randomsearchcv scikit-learn
Last synced: 28 Apr 2026
https://github.com/abdeldjalilchafai/us-flight-delay-eda
Structured EDA on 2015 US flight delay data. Clean, reproducible notebook using a 6-step data analysis framework for real-world datasets.
data-analysis data-cleaning eda exploratory-data-analysis flight-delays kaggle matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/yixin0829/web-data-scraping-project
Python web scraping script and MySQL database design :bug:
beautifulsoup4 data-analysis data-visualization database gender-api matplotlib python website
Last synced: 19 Apr 2026