Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-29 00:07:38 UTC
https://github.com/andersoncrs/aprendizaje_no_supervisado_kmeans_customers
This repository contains an analysis of shopping mall customer data using unsupervised learning techniques, specifically K-Means and hierarchical clustering. The goal of the project is to segment customers into homogeneous groups to better understand their behaviors and characteristics.
data-analysis kmeans-clustering matplotlib numpy seaborn visualization
Last synced: 10 May 2026
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose: The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/davidmalko87/steam-library-exporter
Python script to export your Steam game library to CSV — playtime, genres, reviews, metacritic scores, prices, tags & estimated owners via Steam Web API + Store API + SteamSpy
csv-export data-analysis game-data metacritic playtime-tracker python steam steam-api steam-games steam-library steamspy
Last synced: 04 Apr 2026
https://github.com/vasilescur/decsci-group-9
DECSCI 101 - Final Project (Group 9)
data-analysis decision-science qualtrics statistics visualization
Last synced: 17 Apr 2026
https://github.com/souravxbera/credit-card-approval-predictor
End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture
credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit
Last synced: 15 May 2026
https://github.com/mozeel-v/spam-detection
ML-powered SMS Spam Classifier using NLP and Scikit-learn. Detects and filters spam messages with interactive Streamlit UI.
classification data-analysis mnb streamlit
Last synced: 10 May 2026
https://github.com/sanam2405/ahs
Analysis of the results of the AHS Madhyamik Examination 2022
data-analysis data-visualization jupyter-notebook python
Last synced: 18 Apr 2026
https://github.com/dina-hosny/retail-store-data-modeling-and-analysis-using-datastage
The project implements a star-schema data warehousing flow, then uses IBM InfoSphere DataStage to develop efficient ETL pipelines that create data marts and run analyses on them.
data-analysis datastage datawarehousing etl extract ibm load transform
Last synced: 06 Mar 2026
https://github.com/pawlo77/airline-performance-data-analysis
Preprocessing of structured data - part of IAD study program, Faculty of Mathematics and Information Science, Warsaw University of Technology
data-analysis data-science visualization
Last synced: 10 May 2026
https://github.com/rajeev2806/retail-order-data-analysis
Dataset downloaded via the Kaggle API, followed by data cleaning and analysis
data-analysis data-cleaning postgresql
Last synced: 18 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/vagnerbellacosa/116_usandoamazontextractocrextracaodadosdynamodb
Using Amazon Textract as OCR for Data Extraction into DynamoDB
amazon-textract aws data-analysis data-extraction digital-innovation-one dio dynamodb lab ocr python
Last synced: 18 Apr 2026
https://github.com/danpoynor/data-analysis-of-video-game-sales-2000-2015
This analysis reviews sales for the top 100 video games from the years 2000-2015 to gather insights. Within the notebook I use Python’s Pandas, Matplotlib, and Seaborn libraries to interact with the data and create graphs.
data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots video-game-sales
Last synced: 18 Apr 2026
https://github.com/revtpark/teamseas_scrapper
Scraping Team Seas for data analysis and visualization.
chartjs data-analysis python webscraping
Last synced: 28 Mar 2025
https://github.com/scailfin/rob-webapi-flask
Default RESTful Web API implementation for the Reproducible Open Benchmarks for Data Analysis Platform (ROB) using the Flask web framework.
benchmarks data-analysis reproducibility webapi
Last synced: 17 Mar 2026
https://github.com/manalisbhavsar/mall-customers-clustering
Applies K-Means clustering to mall customer data, segmenting customers by annual income and spending score to identify patterns and group customers for targeted marketing.
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 18 Apr 2026
https://github.com/prakhar-ff13/finding-donors-for-charityml
Udacity Machine Learning Engineer Nanodegree project 2
data-analysis data-science machine-learning supervised-learning udacity udacity-machine-learning-nanodegree udacity-nanodegree
Last synced: 05 Apr 2026
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 05 Apr 2026
https://github.com/al-ghaly/prosper-loans-analysis
A statistical analysis project examining a finance company’s loan data using Python packages (pandas, NumPy, seaborn, matplotlib)
data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics
Last synced: 18 Apr 2026
https://github.com/vl1507/data_science_pro_course
Course "Data Analyst PRO (PRO DA-6)"
da data-analysis data-science ds jupyter-notebook machine-learning ml pro-da python
Last synced: 18 Apr 2026
https://github.com/vetrivel07/flight-price-prediction
Developed a flight price prediction model using Python, analyzing historical data to forecast airfare prices and help travelers make informed booking decisions
data-analysis data-visualization jupyter-notebook numpy pandas python
Last synced: 15 Jun 2025
https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly
Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool
blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly
Last synced: 18 Apr 2026
https://github.com/robinmillford/sales-metrics-dashboard-streamlit
This Streamlit dashboard provides an interactive and comprehensive analysis of customer behavior, regional sales trends, and revenue insights. The dashboard enables businesses to identify key performance metrics, customer segments, and revenue drivers, supporting data-driven decision-making.
dashboard data-analysis data-visualization duckdb sales-analysis sales-dashboard streamlit-dashboard
Last synced: 19 Apr 2026
https://github.com/scanf-s/basic_dataanalysis
data-analysis jupyter-notebook matplotlib pandas python
Last synced: 19 Apr 2026
https://github.com/tsffarias/my-books
Exploratory analysis of my Dataset 'All_the_Books_I_read' which contains all the books I've read
books data-analysis python tableau
Last synced: 19 Apr 2026
https://github.com/cassiofb-dev/fide-rating-analysis
The plot speaks for itself
chess data-analysis fide hans rating
Last synced: 15 Jun 2025
https://github.com/vyjayanthipolapragada/data_analytics_medical_appointments
Analysis of a medical appointments data set to draw insights about patient no-shows
data-analysis data-analytics data-cleaning data-visualization data-wrangling jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 19 Apr 2026
https://github.com/12danielll/neurogenomics_project
This project focuses on analyzing sequencing data to understand molecular mechanisms of neurological diseases and predict the effectiveness of immunotherapy in breast cancer patients. It integrates Python and R scripts for data processing, statistical analysis, and visualization, alongside a comprehensive report detailing methods and findings.
bioinformatics biostatistics clustering clustering-algorithms data-analysis data-visualization deseq2 differential-gene-expression functional-analysis immune-therapy machine-learning neurological-disease neuroscience pca-analysis python r seurat single-cell-analysis
Last synced: 06 Apr 2026
https://github.com/akash-v7/telecom_customer_churn_prediction
A machine learning project to predict customer churn in the telecom industry using data analysis and classification models. The project includes data preprocessing, exploratory data analysis (EDA), model building, and insights to help telecom companies improve customer retention strategies.
data-analysis data-science data-visualization jupyter-notebook machine-learning predictive-modeling python
Last synced: 20 Apr 2026
https://github.com/kmbuki/uk_police_data
R programming - Using open data about crime and policing in England, Wales and Northern Ireland.
data-analysis data-visualization r
Last synced: 04 Jun 2026
https://github.com/namratha2301/carprice_analysisandprediction
This project analyzes factors influencing vehicle prices using a dataset of various attributes, including Engine capacity, Power, Mileage, and Seating capacity.
data-analysis data-visualization exploratory-data-analysis machine-learning pandas predictive-modeling random-forest-classifier regression scikit-learn seaborn
Last synced: 20 Apr 2026
https://github.com/misaghmomenib/soccer-match-analysis
This project predicts football match outcomes (home win, away win, or draw) using historical match data. It involves data preprocessing, exploratory analysis, and training a Random Forest model to predict results based on features like shots, possession, and passes.
data-analysis git open-source python
Last synced: 20 Apr 2026
https://github.com/jerinpious/movie-recommendation-system
A content-based movie recommendation system built using Python. The system processes movie data, extracts relevant features, and provides recommendations based on user preferences
content-based-recommendation data-analysis jupyter-notebook machine-learning pandas python streamlit
Last synced: 20 Apr 2026
https://github.com/pentalpha/bti-performance-study
A series of analyses of a large data set of student grades in the Information Technology course at UFRN
analysis big-data clustering data-analysis data-science data-visualization ipynb ipython jupyter-notebook performance-analysis plot python python3
Last synced: 15 May 2026
https://github.com/szuzick/us-immigration-presidential-analysis
Power BI dashboard analyzing 40 years of U.S. immigration data across presidential administrations (1981-2020)
dashboard data-analysis data-visualization government-data immigration powerbi powerbi-dashboards powerbi-visuals presidential-analysis
Last synced: 10 Jun 2026
https://github.com/pipe199x/end-to-end-prediction-california
End-to-end prediction project using various technologies to predict housing prices in California.
california-housing data-analysis machine-learning python
Last synced: 11 May 2026
https://github.com/salfaris/toy-data-analysis
Random toy data projects. For my portfolio data projects, see linked website
Last synced: 20 Apr 2026
https://github.com/robinmillford/hr-analytics-employee-performance-analysis
HR Analytics: Unveiling Employee Performance - A comprehensive exploration of employee data using SQL and Power BI, uncovering key insights for strategic HR decision-making.
data-analysis data-visualization jupyter-notebook powerbi python3 sql
Last synced: 20 Apr 2026
https://github.com/mtholahan/advanced-mysqlquery-tuning-mini-project
Analyzed EuroCup 2016 data with advanced SQL queries. Imported CSV datasets into MySQL, designed schema with match, player, and referee details, and implemented queries covering match outcomes, penalty shootouts, player stats, bookings, substitutions, and referee activity to explore tournament dynamics.
bootcamp data-analysis data-engineering data-modeling database eurocup football mysql queries soccer sports springboard sql
Last synced: 15 May 2026
https://github.com/mahmoudwal27/e-commerce-data-analysis
A collection of data analysis and visualization projects focused on ecommerce datasets. Using Python in Google Colab for analysis and Excel for exploration, these projects uncover key insights and trends, showcasing expertise in data manipulation and visualization to inform business decisions.
analytics data-analysis data-analysis-python data-set google-cloud python
Last synced: 21 Apr 2026
https://github.com/danpoynor/pet-shelter-data-analysis-notebook
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data
Last synced: 21 Apr 2026
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL Server, ETL processes, data modeling and analytics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/nishumehta/uber-rides-data-analysis
An in-depth analysis of Uber ride data for the year 2016, to uncover patterns in ride behavior, mileage trends, and frequent start locations to generate actionable insights for business decisions.
data-analysis jupyter-notebook matplotlib-pyplot pandas python tableau-dashboards
Last synced: 09 May 2026
https://github.com/anushkundu/churn-prediction
Telecom Customer Churn Prediction Using Machine Learning!
accuracy-score classification-algorithm classification-report data-analysis data-science deep-learning gradient-boosting-classifier keras-tensorflow logistic-regression machine-learning random-forest-classifier recall-precision roc-auc-score smote-sampling svm-classifier
Last synced: 21 Apr 2026
https://github.com/nikhilfuke1/a-b-testing-and-regression-analysis-python
A Python statistics project involving data analysis, visualization, A/B testing, and regression analysis to determine the best-performing platform.
ab-testing data-analysis hypothesis-testing libraries python regression-analysis statistics visualization
Last synced: 21 Apr 2026
https://github.com/jonnor/acm-2019-dbscan
clustering data-analysis data-science health machine-learning nhanes nutrition
Last synced: 03 Apr 2025
https://github.com/tmmvn/analytics-notebooks
A collection of data analytics notebooks created while testing out JetBrains Datalore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 22 Apr 2026
https://github.com/prgermux/yield-reporter
This Python application provides a graphical user interface (GUI) for analyzing and visualizing production data from various machines. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data.
automation data-analysis python reporting
Last synced: 22 Apr 2026
https://github.com/rajesh9943/sentiment-analysis-of-consumer-opinions-on-amazon-products
Developed a comprehensive Sentiment Analysis System aimed at classifying Amazon product reviews into positive, neutral, and negative sentiments. The project leveraged advanced Natural Language Processing (NLP) techniques alongside machine learning algorithms to deliver accurate and actionable insights from customer feedback
amazon data-analysis data-manipulation data-preprocessing data-presentation data-visualization machine-learning nlp nlp-library nltk product-reviews-analysis sentiment-analysis sklearn-library word-cloud-generator-in-python-3
Last synced: 05 Jun 2026
https://github.com/kgotsosm/epl-analysis
Preparing data for machine learning algorithms to predict English Premier League match winners.
data-analysis data-cleaning data-modeling
Last synced: 22 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-export-additional-captions-header-or-footer
This example illustrates how to add a custom header to the document exported to PDF in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 22 Apr 2026
https://github.com/ayushi-gajendra/restaurant-order-analysis-sql
End-to-end SQL analysis of 12,266 restaurant transactions to identify high-performing menu items, revenue concentration, bulk ordering behavior, and strategic growth opportunities.
analytics-portfolio business-intelligence case-study customer-segmentation data-analysis data-analytics database-analysis menu-engineering mysql revenue-analysis sql sql-project
Last synced: 05 Jun 2026
https://github.com/floffah/my-listening
Various ways to analyse your Spotify extended streaming history data
convex data-analysis listening-history spotify
Last synced: 23 Apr 2026
https://github.com/tranngoca5039/bigquery-a5y
📊 Streamline your data analysis with bigquery-a5y, a powerful tool for optimizing BigQuery performance and improving query efficiency.
analytics api big-data bigquery cloud-computing data-analysis data-integration data-management data-pipeline data-visualization data-warehouse google-cloud machine-learning serverless sql
Last synced: 05 Jun 2026
https://github.com/shudhanshurp/adidas-us-data-analysis
This Power BI project analyzes Adidas sales data across different regions, retailers, and product categories in the U.S. The dashboards provide insights into sales performance, operational metrics, and future forecasts to support data-driven decision-making.
data-analysis data-transformation data-visualization forecasting powerbi python retail-analytics
Last synced: 24 Apr 2026
https://github.com/misszeferino/cyclistic-bike-share-analysis
Data Analysis using R
data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 06 Jun 2026
https://github.com/strixion/demoversion_ai
The demo version of StrixionAI
ai csv data-analysis data-analytics json python txt
Last synced: 24 Apr 2026
https://github.com/datalopes1/bank_marketing
This project is based on the Bank Marketing dataset from the UC Irvine Machine Learning Repository, made available by S. Moro, R. Laureano, and P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 24 Apr 2026
https://github.com/codingvangogh/data-science
Data Science, Machine Learning, Data Exploration, Big Data etc
data-analysis datascience decision-tree-classifier decision-tree-regression heatmap jupyter-notebook machinelearning python python3 ridge-regression seaborn sklearn svm-classifier
Last synced: 11 May 2026
https://github.com/muthukumar0908/youtube-data-harvesting-and-warehousing-using-sql-mongodb-and-streamlit
A simple and intuitive Streamlit user interface that retrieves data from YouTube using an API key and stores it in a database.
data-analysis mongodb-atlas python sqldatabase streamlit-webapp youtube-api
Last synced: 24 Apr 2026
https://github.com/kathisnehith/realestate-sales-analysis
Investigating real estate sales trends to understand market dynamics and inform investment decisions.
data-analysis excel realestate sales sql stastical-analysis-tools tableau
Last synced: 12 Feb 2026
https://github.com/cyberoctane29/python-for-data-analysis
A repository dedicated to learning Python for data analysis, data science, and data analytics. This collection of Jupyter notebooks covers practical exercises and concepts from the Google Advanced Data Analytics Professional Certificate program.
data data-analysis data-analytics data-science python
Last synced: 24 Apr 2026
https://github.com/edwinrlambert/emomap-sentiment-analysis
To analyze public sentiment related to specific locations in a city (e.g., parks, transit stations, restaurants, neighborhoods) using geo-tagged social media posts, reviews, and comments. The goal is to visualize how people feel across different areas and times.
data-analysis jupyter-notebook python sentiment-analysis
Last synced: 24 Apr 2026
https://github.com/shridhar1504/titanic-survivor-prediction-datascience-classification-project
This project predicts whether a passenger on the Titanic survived, using machine learning algorithms on the given passenger data.
classification-algorithm data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning machine-learning-algorithms matplotlib naive-bayes-classifier predictive-modeling python3 scikit-learn seaborn supervised-learning
Last synced: 11 May 2026
https://github.com/pedrohdosanjos/economic-data-analysis
This project aims to analyze the export data from various states in the United States to Brazil over time. The data is sourced from the FRED (Federal Reserve Economic Data) API and processed to identify the top 5 exporting states for each year, as well as the states with the highest total export value across all years.
api data-analysis data-visualization jupyter-notebook python
Last synced: 24 Apr 2026
https://github.com/bkataru/math-ia
Data and analysis for IB Math IA
data-analysis data-science data-visualization matplotlib modeling plotting regression-analysis regression-models
Last synced: 09 Apr 2025
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Gallstone Disease (Gallstone-1) Dataset Analysis (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/rubix982/product-quality-classification
This is an implementation for the CIKM AnalytiCup 2017 on the topic of "Product Title Quality". The goal is to take SKUs and rank their titles' clarity and conciseness. Referenced papers are attached to this repository. The aim is to craft ensemble models that either replicate published results or find new methods for classification.
data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp
Last synced: 25 Apr 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Python 3 course from basic to advanced - with real projects
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 27 Jun 2026
https://github.com/k31ner/inmopipeline
Comprehensive project for the analysis and predictive modeling of real estate data, covering collection, transformation, visualization, and machine learning using Python and modern data engineering and data science tools.
data-analysis data-engineering data-science fastapi python streamlit
Last synced: 08 May 2026
https://github.com/ddihora1604/iit_patna
A multifaceted project involving applying ML models like Ridge Classifier, RNN, RIDOR, Rotation Forest and RUSBoost, integrating SMOTE for class balancing, and handling diverse datasets including those for seating arrangement tasks.
data-analysis data-visualization datamodelling machine-learning-algorithms python
Last synced: 25 Apr 2026
https://github.com/sarangs1621/weather-prediction
Weather Prediction Using Machine Learning is a project that leverages machine learning algorithms to predict weather conditions based on historical data. It evaluates three popular ML models (Decision Tree, KNN, and Logistic Regression) and provides performance insights through metrics and visualizations.
data-analysis decision-tree jupyter-notebook knn logistic-regression machine-learning predictive-modeling python scikit-learn weather-prediction
Last synced: 25 Apr 2026
https://github.com/aalkiyumi/predicting-hospital-readmission-risk
This project aims to create a predictive model that forecasts the likelihood of a patient being readmitted to the hospital within 30 days of discharge.
big-data-analytics cs5165 data-analysis data-cleaning data-engineering data-science introduction-to-cloud-computing jupyter-notebook machine-learning precipitation-analysis predictive-modeling pyspark statistical-analysis uc uc2026 university-of-cincinnati
Last synced: 11 Oct 2025
https://github.com/aastopher/mma_outcome
Simple exploratory analysis of UFC Fights and Vegas fight odds from 1993 to 2021
data-analysis data-visualization
Last synced: 06 Jun 2026
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 25 Apr 2026
https://github.com/chandansoren/customer-personality-analysis
Predict how different customer segments will respond to a particular product or service.
data-analysis data-visualization python
Last synced: 26 Apr 2026
https://github.com/parthds02/customer-segmentation-with-kmeans-clustering
Analyze customer behavior using Python and KMeans Clustering on transactional data. Features RFM analysis, data cleaning, clustering insights, and actionable visualizations to support business decision-making.
data-analysis data-visualization feature-engineering kmeans-clustering numpy pandas vscode
Last synced: 11 May 2026
https://github.com/rociobenitez/happiness-index-data-processing
Repository for Big Data Processing - Contains Jupyter Notebooks and Datasets for data analysis and processing tasks related to Big Data.
big-data big-data-processing data-analysis data-processing happiness-index happiness-report jupyter-notebook matplotlib pandas seaborn
Last synced: 15 May 2026
https://github.com/haseebn19/urban-housing-demand
A full-stack web application for visualizing housing and labour market data
data-analysis data-visualization docker full-stack gradle statistics web webapp
Last synced: 22 Jun 2026
https://github.com/prady2309/unemployement-analysis
Data Science Project
colab-notebook data-analysis data-science data-visualization eda jupyter-notebook machine-learning python3
Last synced: 26 Apr 2026
https://github.com/arush-codes/paris-olympic-de
Data engineering project on the Paris 2024 Olympics
azure data-analysis data-engineering microsoft-azure olympics2024 pipeline
Last synced: 27 Apr 2026
https://github.com/odinleepro/airbnbnewyorkcityanalysis
AirbnbNewYorkCityAnalysis is a comprehensive data analysis and visualization project exploring short-term Airbnb rental trends across New York City (2008–2022). Using open-source Airbnb data, the project combines data cleaning, statistical summaries, and Tableau dashboards to uncover pricing patterns, borough-level distribution, and related insights.
airbnb analytics-project data-analysis data-cleaning data-science data-visualization new-york-city real-estate-analytics tableau urban-analysis
Last synced: 27 Apr 2026
https://github.com/syarwinaaa09/analyzing-students-mental-health
Data-driven exploration into student mental health trends using survey data
csv-dataset data-analysis education jupyter-notebook mental-health-awareness pandas psychology student-mental-health visualization
Last synced: 29 Jun 2026
https://github.com/as16082023/project-portfolio
A guide to all my projects
dashboard data-analysis data-cleaning data-visualization excel mysql power-bi python sql tableau
Last synced: 27 Apr 2026
https://github.com/alkasaliss/nosql_opendata_nyc
NoSQL PROJECT - ENSAI
data-analysis mongodb nosql open-data
Last synced: 27 Apr 2026
https://github.com/garcane/exodus_analysis
This project analyses cryptocurrency transaction data exported from the Exodus wallet. The goal is to explore and visualize the inflows and outflows of assets, the types of transactions, and other key metrics over time.
bitcoin btc crypto cryptocurrencies cryptocurrency data-analysis data-visualization eth ethereum pandas seaborn
Last synced: 27 Apr 2026
https://github.com/elakkiya-u/digital-marketing-campaign-conversion-prediction
Predictive modelling of whether a customer will convert, based on digital marketing campaign data.
campaign-analytics churn-prediction data-analysis deployment digital-marketing-analytics machine-learning power-bi predictive-modelling presentation-slides python
Last synced: 27 Apr 2026
https://github.com/hfzdzakii/dicoding-airqualityanalysisdata
This repo is a master submission for my Dicoding Final Project. The Air Quality Dataset is used to fulfill the submission. Feel free to explore, and I hope my work gives you some insight!
data-analysis data-visualization streamlit
Last synced: 27 Apr 2026
https://github.com/hrosicka/czechpopulationestimation
This GitHub repository contains Python code for data analysis and population prediction in the Czech Republic up to the year 2050. The code is written in Python and utilizes the Pandas and Matplotlib libraries.
data-analysis data-visualization matplotlib matplotlib-figures matplotlib-pyplot pandas pandas-dataframe pandas-library pandas-python python python3
Last synced: 11 May 2026
https://github.com/tillscode/personal-finance-ml-analysis
Machine learning analysis of personal financial data with predictive modeling and interactive dashboard
dashboard data-analysis finance machine-learning python scikit-learn
Last synced: 28 Apr 2026
https://github.com/sweta2501/netflix_dataanalysis
With the help of Netflix Data, I have done some Data Analysis.
data-analysis data-science jupyter-notebook python
Last synced: 28 Apr 2026
https://github.com/sujata-adhikari/data-analysis
Data analysis of market sales data using Power BI, with a dashboard presenting the analysis.
data-analysis excel pandas powerbi
Last synced: 12 Jun 2026
https://github.com/jovicdev97/Financial-Loan-DataScience-Notebook
Using NumPy and pandas to analyze a synthetic loan dataset with Python
data-analysis matlabplot numpy pandas plotting python seaborn
Last synced: 12 Mar 2025
https://github.com/OdessaZ/Portfolio-Projects
This is a repository I have created to showcase skills, share projects and track my progress in Data Analytics and Data Science
applied-mathematics data-analysis data-science excel jupyter-notebook matplotlib-pyplot pandas portfolio python r r-studio seaborn sql statistics
Last synced: 12 May 2026
https://github.com/hadson0/chess-live-ratings-data
A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.
beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping
Last synced: 28 Apr 2026
https://github.com/rajivaleaakash/customer-churn-prediction
A machine learning project focused on predicting customer churn using various data analysis and modeling techniques. The repository includes data preprocessing, feature engineering, exploratory data analysis (EDA), model training, evaluation, and visualization to help businesses identify customers at risk of leaving.
churn-prediction classification customer-churn data-analysis data-science gridsearchcv imblearn machine-learning numpy pandas pyhton randomsearchcv scikit-learn
Last synced: 28 Apr 2026
https://github.com/abdeldjalilchafai/us-flight-delay-eda
Structured EDA on 2015 US flight delay data. Clean, reproducible notebook using a 6-step data analysis framework for real-world datasets.
data-analysis data-cleaning eda exploratory-data-analysis flight-delays kaggle matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/sufyan14/weather-data-analysis
A Streamlit dashboard that forecasts 30-day weather trends using uploaded CSV data and Facebook Prophet.
data-analysis python streamlit
Last synced: 28 Apr 2026