Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
https://github.com/priyanshubiswas-tech/deloitte-daikibo-forensic-analysis-task-2
Forensic pay equity analyzer for Deloitte. Processes compensation data to classify gender equality scores into Fair/Unfair/Discriminative tiers. Outputs modified Excel with 3-tier evaluation system.
data data-analysis deloitte excel forensic-analysis
Last synced: 06 Feb 2026
https://github.com/jmssnr/shuffle-kit
shuffle-kit: model and analyze playing card shuffles in Python
data-analysis playing-cards python shuffle statistics
Last synced: 19 Jun 2026
https://github.com/gdbecker/analyticsportfolio
Analytics Professional Project Work
data-analysis data-science decision-trees firebase google-looker-studio k-means-clustering k-nearest-neighbors kaggle linear-regression logistic-regression machine-learning microsoft-fabric powerbi principal-component-analysis python3 random-forest synapse-data-engineering
Last synced: 10 Apr 2026
https://github.com/kirkalyn13/open-signal-report-generator
Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 19 Jun 2026
https://github.com/bgr8/bokeh-ile-veri-gorsellestirme
Data visualization with Bokeh Library
bokeh color data-analysis data-visualization hbar html python vbar
Last synced: 30 Oct 2025
https://github.com/garcane/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 01 Mar 2026
https://github.com/arnikz/piqmie
Proteomics Identifications & Quantitations Data Management & Integration Service
data-analysis data-management data-visualisation mass-spectrometry peptide-identification protein-inference protein-quantification proteomics silac web-application
Last synced: 03 Feb 2026
https://github.com/antonio-f/big-data-analysis-with-scala-and-spark
Coding assignments from the course "Big Data Analysis with Scala and Spark" (Coursera).
big-data bigdata coursera data-analysis scala spark
Last synced: 27 Apr 2026
https://github.com/mengyaohuang/data-manipulation-and-analysis
Data processing implementation with tools in Python
data-analysis nlp-machine-learning pandas-dataframe python
Last synced: 27 Apr 2026
https://github.com/sarthak-0-sach/drivermasterdata_database_table
This code integrates data from multiple sources and provides a single source for all driver-related attributes. Designed for scalability and pipeline compatibility, the project supports clean data transformations, validations, and storage-ready outputs. Suited to quick analytics; built with Python and Airflow and automated with a cron job.
apache-airflow-etl-pipeline data-analysis data-visualization database-management python
Last synced: 27 Apr 2026
https://github.com/jongan69/potion-leaderboard
Start of Entry for potion leaderboard contest
data-analysis leaderboard potion trading
Last synced: 11 Jun 2026
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 26 Apr 2026
https://github.com/airdac/sim-ames_housing
Prediction of house prices with linear regression in R. Team project from UPC's Master's Degree in Data Science
data-analysis data-science linear-regression r statistical-models upc
Last synced: 07 Jun 2026
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
A responsive Streamlit COVID-19 dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 18 May 2026
https://github.com/carterlasalle/sportsarbfinder
Sports Betting Arbitrage Finder: Python tool for identifying profitable arbitrage opportunities across bookmakers. Features multi-region support, customizable profit margins, interactive calculator, and web interface. Uses real-time odds data from The Odds API. Ideal for betting enthusiasts, analysts, and educational purposes.
arbitrage-betting betting-strategy data-analysis finance gambling odds-api python sports-analytics sports-betting
Last synced: 31 Mar 2025
https://github.com/jofaval/mlai-portfolio
My portfolio about Data Analysis, Machine Learning and A.I.
computer-science data-analysis data-science machine-learning portfolio python
Last synced: 25 Apr 2026
https://github.com/billy-enrizky/yelpfusion
Finding all restaurants in the Maryland area using the Yelp Fusion API
data-analysis pandas yelp-api yelpfusion
Last synced: 28 Apr 2026
https://github.com/vansh-py04/data-extraction-and-text-analysis
The objective of this assignment is to extract textual data articles from the given URL and perform text analysis to compute variables that are explained
data-analysis data-extraction data-science nlp nlp-machine-learning python textanalysis webscraping
Last synced: 24 Apr 2026
https://github.com/manvendra747/customer-segmentation
Customer segmentation using Python and PowerBI
customer-segmentation dashboard data-analysis data-science data-visualization powerbi python rfm-analysis
Last synced: 28 Apr 2025
https://github.com/prernarohra/quakeguard
QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.
artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization
Last synced: 21 May 2026
https://github.com/chiemekaifemegbulem/useful_tools
Advanced Web Scraping
automation beautifulsoup captcha-solving data-analysis data-extraction data-science proxy-rotation python scraping-bots selenium tor-network web-scraping web-scraping-python webscraping
Last synced: 28 Apr 2026
https://github.com/athityakumar/btp
btech btp daru data-analysis networkx nlp project python ruby
Last synced: 24 Apr 2026
https://github.com/dsrodrigovieira/houserocketsales
This repository contains a project developed to practice data analysis skills using Python
data-analysis data-visualization heroku kaggle-dataset python
Last synced: 29 Apr 2026
https://github.com/bcko/ud-da-eda-whitewinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : White Wine Quality dataset
data-analysis exploratory-data-analysis rmarkdown rstudio udacity udacity-data-analyst-nanodegree
Last synced: 03 Jan 2026
https://github.com/mr-chang95/loan_data_visualization
Data Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook loans python udacity-data-analyst-nanodegree
Last synced: 24 Apr 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/datavil/framex
A lightweight dataset-fetching library for fast prototyping, tutorial creation, and experimentation.
data-analysis data-fetching data-science dataframe datasets visualization
Last synced: 06 Jun 2026
https://github.com/manwithacap/by-the-metric-match
🎲🃏 A game data tracker for your board/card/video games!
data-analysis data-visualization games jupyter-notebook python utility
Last synced: 29 Apr 2026
https://github.com/virajbhutada/movie-rental-store-analytics-sql-powerbi-excel
Dive into the DVD rental industry with my Capstone project, Movie Rental Analytics. Analyzing the Sakila DVD Rental Store Database, I extract insights through exploratory data analysis (EDA) and Power BI visualizations. Findings inform strategies for optimizing film inventory, enhancing business operations, and customer experiences.
business-intelligence capstone-project customer-behavior-analysis data-analysis data-science excel exploratory-data-analysis film-ratings mece movie-database movie-rental mysql powerbi powerbi-visuals revenue-analysis sql sql-database
Last synced: 05 Jun 2026
https://github.com/parmeetbhamrah/air-quality-india-analysis
Exploratory data analysis of real-time air quality data from Indian cities using Python, Pandas, Matplotlib, and Seaborn.
air-quality data-analysis eda exploratory-data-analysis government-data india matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/varshithdupati/yelp-business-analysis
Big Data analysis on Yelp reviews/businesses for Arizona. Using Hadoop, Spark, PySpark.
arizona-state-university big-data big-data-analytics data-analysis hadoop pyspark spark yelp
Last synced: 04 May 2026
https://github.com/r12habh/datacamp.com-micro_projects
data data-analysis data-science datascience python python3
Last synced: 23 May 2026
https://github.com/chelseammatta/nopd-cad-data-analysis
Analysis of 911 call data from New Orleans' 3rd & 4th police districts (2019-2022) using BigQuery
911-calls 911-data bigquery cad-data crime-analysis data-analysis emergency-response new-orleans public-safety sql
Last synced: 01 Jul 2025
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 01 Mar 2025
https://github.com/equicirco/cirquant
Code and data deliverables for quantifying circularity through open data and digital innovation.
circular-economy data-analysis database julialang official-statistics
Last synced: 13 Jan 2026
https://github.com/alemalvarez/data-analysis-web-project
Web-app providing a simple interface for data storage,
data-analysis data-science javascript react webapp
Last synced: 29 Apr 2026
https://github.com/as16082023/restaurant-order-analysis
Analyzing order data to identify the most and least popular menu items and types of cuisine
data-analysis maven-analytics mysql restaurant-order sql
Last synced: 10 Apr 2025
https://github.com/sejalmankar1012/yuvaco_data_analysis_assessment
This assignment involves writing a Python script to calculate the cost of package deliveries based on provided data and a cost grid. The script takes package details such as weight, distance, and delivery type, applies the cost calculation rules, and saves the results in an output file. You can also run the script in Google Colab for convenience.
csv-file-handling data-analysis google-colab package-delivery python python-scripting
Last synced: 29 Apr 2026
https://github.com/nafisalawalidris/hotel-reservation-analysis
This project analyses hotel reservation data from Resort Hotels and City Hotels to uncover booking trends and insights. Utilising Microsoft Excel for initial data cleaning, PostgreSQL for data analysis and Tableau for creating visualisations, the project aims to deliver a comprehensive dashboard that highlights key metrics such as booking status.
data-analysis data-cleaning data-visualisation hotel-reservations microsoft-excel postgresql sql tableau tableau-dashboards tableau-desktop tableau-public
Last synced: 06 Jul 2025
https://github.com/27ahmad/foreign-direct-investment-analytics
This repository contains an exploratory data analysis (EDA) and visualization project on a dataset of Foreign Direct Investment (FDI) by companies. The objective is to analyze FDI trends and present key insights through an interactive Tableau dashboard.
data-analysis eda matplotlib pandas python seaborn tableau
Last synced: 29 Apr 2026
https://github.com/pranavarora1895/proteintypeprediction
Data Analysis on Protein Type Prediction
bioinformatics data-analysis supervised-learning
Last synced: 19 Apr 2026
https://github.com/virajbhutada/google-stock-price-forecasting-lstm
Analyzing and predicting Google's stock prices through detailed data exploration and advanced LSTM models. This project involves data preprocessing, creating time-series sequences, constructing and training LSTM networks, and evaluating their performance to forecast future stock prices utilizing Python and Machine Learning libraries.
data-analysis data-science data-visualization future-prediction google-dataset google-stock-price-prediction google-stocks lstm-model lstm-neural-network machine-learning machine-learning-models matplotlib model-building model-training numpy python stock-forecasting
Last synced: 27 Feb 2025
https://github.com/jcaperella29/stock_evaluation_python
A Python script to classify companies based on financial metrics like Piotroski F-Score and Stock Valuation, using CSV financial data for analysis and output.
ai-in-finance artificial-intelligence classification csv-processing data-analysis expert-system finance financial-analysis financial-analysis-tools piotroski-f-score python quantitative-analysis rule-based-classifier stock-analysis stock-valuation
Last synced: 07 Sep 2025
https://github.com/markmusic27/data-statistics-calculator
💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.
data-analysis standard-deviation statistics statistics-calculator
Last synced: 20 Jun 2026
https://github.com/shibam120302/heart-disease-data-analysis-by-shibam
You can read more about heart disease statistics and causes for your own understanding. This project covers manual exploratory data analysis.
analysis data-analysis scraper
Last synced: 13 Aug 2025
https://github.com/phillbertnevinemmanuel/movieindustryanalysis-correlation
This project is a comprehensive data analysis endeavor within the Movie Industry, spanning from Data Cleaning to Exploratory Data Analysis, Correlation Analysis, and Temporal Analysis. The dataset was sourced from Kaggle, purportedly scraped using the IMDb API. Python was the primary tool utilized for analysis.
data-analysis data-cleaning python
Last synced: 30 Apr 2026
https://github.com/luminati-io/Airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 09 Apr 2025
https://github.com/alcestide/scianalytics
Playground for data analysis and visualization for research and scientific purposes with Pandas and Plotly.
csv data-analysis data-science data-visualization pandas plotly python science-research statistics
Last synced: 30 Apr 2026
https://github.com/ankit21111/filmilytics
This repository contains data and analysis on RSVP Movie House Production, focusing on past performance metrics and audience trends. Our goal is to derive actionable insights that can guide future productions for greater success. Explore the data, analysis scripts, and recommendations to understand how RSVP can thrive in the film industry.
data-analysis database database-design database-schema erdiagram sql
Last synced: 13 Jun 2025
https://github.com/alicankaya192/world-happiness-report-2025
Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).
correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report
Last synced: 21 Jun 2026
https://github.com/musbi8788/free_python_book_for_gambian_dev
Free, beginner-friendly Python books for Gambian learners and devs 📘🐍
algorithms automation data-analysis data-science django flask machine-learning oops-in-python programming-language python python27 python3 testing-automation web-development
Last synced: 14 Aug 2025
https://github.com/jeniljani-4444/end-to-end-world-cup-analysis-web-app
Our streamlined Streamlit web app fetches and processes ESPN CricInfo data delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.
aws-ec2 data-analysis plotly preprocessing streamlit-webapp
Last synced: 02 Apr 2025
https://github.com/hetuvpatel/research-chatgpt
Research and data analysis project evaluating the social, ethical, and educational impacts of ChatGPT using survey-driven insights and Python-powered data analysis. 📚🤖
data-analysis matplotlib pandas python seaborn
Last synced: 01 May 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition (1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/nmsby/pca-machine-learning-lab
Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.
data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization
Last synced: 01 May 2026
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/airscholar/data_analysis_with_ai
A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python
chatgpt data-analysis gpt4 openai pandas pandasai python
Last synced: 10 Apr 2026
https://github.com/aravindnathan02/sales-and-customer-analytics
This is a repository for sales and customer performance Tableau dashboard.
customer-dashboard dashboard data-analysis data-visualization sales-analysis sales-dashboard tableau
Last synced: 08 Jan 2026
https://github.com/riddhis2226/titanic-survival-data-analysis
Titanic-Survival-Data-Analysis : Analyze passenger data from the Titanic to predict survival based on features like age, gender, class, and fare.
data-analysis data-mining data-science data-visualization database jupyter-notebook machine-learning-models machinelearning-python plotlyjs python3
Last synced: 01 May 2026
https://github.com/misszeferino/sql-projects
bigquery data-analysis mysql queries sql sqlite3
Last synced: 29 Jan 2026
https://github.com/shridhar1504/power-bi-visualization-project
This repository contains visualization projects built in Power BI. The visualizations surface insights and strategies that help grow the business toward higher profit margins and help reduce damage from accidents and calamities.
dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides
Last synced: 21 Jan 2026
https://github.com/buchananja/dpyp
A convenience tool for small-scale data pipelines in Python
data data-analysis data-cleaning data-engineering data-pipeline data-preprocessing data-processing data-science pandas pipeline
Last synced: 18 Apr 2026
https://github.com/bretsw/beds
Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists
analytics data-analysis data-analytics data-science
Last synced: 31 Mar 2025
https://github.com/leticiamilan/santander-tech-data-science
This repository contains the projects developed during the Data Science course, a partnership between Ada Tech and Santander Open Academy. The course is divided into several modules, each focused on a fundamental aspect of data science.
ada-tech calculus data-analysis data-science letscode python santander
Last synced: 09 Jun 2026
https://github.com/benzerinsio/breastcancer-eda
📊 Exploratory Data Analysis (EDA) - Breast Cancer | Exploring clinical features to identify patterns and relationships in breast cancer diagnosis.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis data-visualization diagnosis eda exploratory-data-analysis health-care medical-data python seaborn
Last synced: 02 May 2026
https://github.com/sarah-marion/sovereign-osint-toolkit
Sovereign OSINT Toolkit - Advanced, self-hosted intelligence platform for security researchers and investigators. Ethical, private and production-ready.
correlation-engine cybersecurity data-analysis docker fastapi infosec intelligence investigation open-source osint privacy python3 security-research security-tools threat-intelligence
Last synced: 02 May 2026
https://github.com/badranalyst/movie-correlation-analysis-in-python
This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.
data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn
Last synced: 03 May 2026
https://github.com/stepankuzmin/machine-learning-data-analysis
My homework for the Coursera Machine Learning and Data Analysis specialization
coursera data-analysis jupiter machine-learning python
Last synced: 03 May 2026
https://github.com/zients/tw-lottery-recommandation
Taiwan lottery draw analyzer & number recommender with Transformer ML model. Supports 539, 649, 638, 3D, and 4D lotteries.
cli data-analysis lottery machine-learning python pytorch taiwan transformer
Last synced: 03 May 2026
https://github.com/rohitinu6/tesla-price-prediction
A machine learning project that predicts future stock price movements using Logistic Regression, SVC, and XGBoost with engineered financial features.
data-analysis data-visualization feature-engineering financial-analysis logistic-regression machine-learning matplotlib python scikit-learn seaborn stock-market stock-price-prediction support-vector-machine time-series xgboost
Last synced: 03 May 2026
https://github.com/emredemirbas/movie-ratings-analysis
A data analysis project investigating potential bias in movie ratings from 2015, comparing them with ratings from other platforms using Python, pandas, and visualization libraries.
data-analysis matplotlib pandas python seaborn
Last synced: 03 May 2026
https://github.com/vipulbunny/restaurant-insight-analysis
A comprehensive data analysis project exploring restaurant ratings, locations, and customer sentiments. This project includes data preprocessing, descriptive analysis, geospatial mapping, sentiment analysis, and price-rating correlations using Python and visualization tools.
data-analysis data-preprocessing data-visualization folium geospatial geospatial-analysis geospatial-visualization machine-learning nlp pandas python restaurant-insights seaborn sentiment-analysis
Last synced: 03 May 2026
https://github.com/nathadriele/diabetes-clinical-etl-pipeline
This public-health data engineering project implements a complete pipeline to collect, clean, standardize, validate, integrate, and visualize public SUS data related to diabetes mellitus in Brazil, filtering by ICD-10 codes E10 to E14.
cid data-analysis data-extraction data-pipeline data-science data-structures data-visualization datasus diabetes-detection diabetes-prediction epidemiology-analysis etl-pipeline healthcare-analytics ibge logger pytest sih streamlit sus
Last synced: 09 Jun 2026
https://github.com/salma-mamdoh/project-writing-functions-for-product-analysis
My Project to learn the Basics of Analysis on DataCamp
data-analysis data-camp pandas python
Last synced: 03 May 2026
https://github.com/syed-m-nofel/python-data-science-fundamentals
Python notebooks for data manipulation (Pandas/NumPy) and API workflows – from basics to practical examples.
api beginner-friendly data-analysis data-science http-requests jupyter-notebook numpy pandas pandas-dataframe python tutorial
Last synced: 03 May 2026
https://github.com/ljadhav25/swiggy-restaurant-analysis
This repository contains data and analysis related to restaurants listed on Swiggy, one of India's largest online food ordering and delivery platforms. The objective is to explore restaurant trends, customer reviews, pricing strategies, and delivery metrics to gain insights into the food delivery industry.
data-analysis data-visualization matplotlib-pyplot numpy-library pandas-library python seaborn-plots
Last synced: 03 May 2026
https://github.com/syarwinaaa09/analyzing-crime-in-los-angeles
Exploratory data analysis of Los Angeles crime data with insights on temporal patterns, locations, and age demographics.
crime-data data-analysis eda los-angeles pandas public-safety python visualization
Last synced: 03 May 2026
https://github.com/r13i/cheapest-phone-call
Small challenge to find the best phone operator to use based on call price
big-data big-data-analytics cheapest data-analysis data-cruncher pandas phone-number pricelist
Last synced: 04 May 2026
https://github.com/fatihilhan42/the-office-eda
Data analysis study of my favorite sitcom, The Office (US).
data-analysis data-science data-visualization fatihilhan office python sitcom
Last synced: 04 May 2026
https://github.com/damisparks/become_data_analyst
Are you new to data analysis? Here you will find simple notebooks that will help you through your journey. These are personal projects that I am still working on.
data data-analysis data-visualization matplotlib numpy pandas-tutorial
Last synced: 04 May 2026
https://github.com/mchenryspagg/investigate_a_dataset
This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.
data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization
Last synced: 04 May 2026
https://github.com/marionchaff/real-estate-price-prediction-france
Real estate price prediction using French public database DVF
data-analysis dvf-data machine-learning price-prediction python real-estate scikit-learn
Last synced: 04 May 2026
https://github.com/sagarprajapat2004/data-analysis-visualization
Downloaded and analyzed a dataset from Kaggle using NumPy and Pandas, created visualizations with Matplotlib and Seaborn, and developed a Flask web application to showcase data insights and conclusions.
data-analysis data-modeling data-visualization exploratory-data-analysis flask python statical-analysis
Last synced: 04 May 2026
https://github.com/halyusa16/e-commerce-analysis
This project analyzes a public e-commerce dataset to uncover valuable insights and answer critical business questions. The dataset contains customer, product, order, and transaction details, providing a comprehensive view of the e-commerce platform's operations.
data-analysis data-cleaning data-exploration data-visualization self-project
Last synced: 09 Jun 2026
https://github.com/jatin-mehra119/flight-price-prediction
This study analyzes flight booking data from the "Ease My Trip" website, using statistical tests and linear regression to extract insights that can benefit passengers using the platform.
data-analysis datacleaning datavisualization machine-learning preprocessing-data python sklearn-pipeline sklearn-regression-algorithm streamlit-webapp
Last synced: 04 May 2026
https://github.com/mugilan1309/csv_analyzer
📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.
csv-processing data-analysis data-visualization machine-learning python streamlit
Last synced: 04 May 2026
https://github.com/youssefyaser/scrape-the-imdb-site-for-the-top-250-movies
Web scraping the top 250 movies on the IMDb site.
data-analysis numpy pandas python
Last synced: 04 May 2026
https://github.com/surayasumona/test_bowlers_analysis
Data Analysis with Python
data-analysis data-manipulation data-preprocessing numpy pandas
Last synced: 04 May 2026
https://github.com/vara-co/crowdfunding_etl
ETL Mini Project based on a Crowdfunding Database, using CRUD operations. SQL, Postgres, and an ERD.
data-analysis database datacleaning erd erdiagram etl jupyter-notebook postgres postgresql regex schema sql
Last synced: 04 May 2026
https://github.com/jcm-ai/personal-data-science-projects
This page contains all of my personal data science projects. 📊📈📉👨💻
data-analysis data-visualization exploratory-data-analysis jupyter-notebooks machine-learning-algorithms matplotlib-pyplot numpy-library pandas-python personal-project predictive-modeling programming python3 scikit-learn scipy seaborn statistical-analysis
Last synced: 05 May 2026
https://github.com/jacktheprogrammer/time-series-forecasting-and-analysis
A personal project consisting of my own notebooks for time series forecasting and analysis. These projects use deep learning with TensorFlow, along with the XGBoost, statsmodels, and SciPy libraries for Python. The series cover weather, energy consumption, and stocks.
data-analysis data-science deep-neural-networks energy-consumption machine-learning portfolio prophet-facebook prophet-model python python3 scipy statsmodels stocks tensorflow time-series time-series-analysis timeseries-forecasting weather xgboost
Last synced: 05 May 2026
https://github.com/codewithmayank-py/box-office-analysis-with-seaborn-and-python
This repository contains Python code and datasets for analyzing box office data. Explore trends, patterns, and factors influencing movie performance.
analysis box-office-data-analysis data-analysis data-visualization dataset jupyter-notebook matplotlib pandas python3 seaborn
Last synced: 05 May 2026
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction: a machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 May 2026
https://github.com/kammarah/data-sample
I designed a database website 🌐 that can be uploaded easily for use 📤. You can check my website 👀.
data-analysis data-visualization database deploy deployment library-management-system panaversity streamlit webapp
Last synced: 05 May 2026
https://github.com/nkamilla/titanic-eda
Exploratory Data Analysis of the Titanic dataset using Python (Pandas, NumPy, Matplotlib). Includes data cleaning, visualizations, correlations, and key business insights.
data-analysis eda jupyter-notebook matplotlib numpy pandas python titanic-dataset
Last synced: 05 May 2026
https://github.com/caesaredia/ymusic-project
Exploratory data analysis (EDA) of music streaming behavior in two fictional cities using Python, Pandas, and Jupyter Notebook. It explores user behavior, genre preferences, and listening patterns throughout the week.
data-analysis eda pandas python
Last synced: 05 May 2026
https://github.com/meinhere/dicoding-analisis-data
Data Analysis submission with an e-commerce theme (Streamlit app)
data-analysis data-mining e-commerce python streamlit
Last synced: 05 May 2026