Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/syarwinaaa09/exploring-nyc-public-school-test-result-scores
📊 analyzing NYC school test scores with python 🐍 to spot top performers 🏆 & trends 📈
data-analysis education pandas python visualization
Last synced: 06 May 2026
https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda
Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.
data-analysis data-science data-visualization eda matplolib numpy pandas python
Last synced: 06 May 2026
https://github.com/erick957/saleprice-prediction-dataset-analysis-and-cleaning-advance-regression
🏠 Predict house prices using advanced regression techniques with this comprehensive analysis and cleaning project, from data loading to model deployment.
data-analysis data-science eda google-colab machine-learning numpy pandas python scikit-learn scikit-learn-python
Last synced: 06 May 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/syarwinaaa09/visualizing-the-history-of-nobel-prize-winners
analysis and visualization of Nobel Prize winners
data-analysis data-visualization jupyter-notebook machine-learning matplotlib nobel-prize pandas python
Last synced: 06 May 2026
https://github.com/deanlogan/data-analysis-course
Code created when completing the Data Analysis with Python Course on freecodecamp.org
course data-analysis numpy pandas python python3
Last synced: 06 May 2026
https://github.com/marknature/oibsip
AICTE Oasis Infobyte Data Science Internship
data-analysis data-science data-visualization github google-sheets jupyter-notebook linkedin machine-learning project-management python
Last synced: 06 May 2026
https://github.com/edanur-y/variable-analysis-of-banks-ratio-data
Testing variables for multicollinearity, multivariate normality and analyzing outliers and missing values. ⭕SPSS 🔵R
data-analysis log-transformation missing-values-analysis multicollinearity normality-test r spss
Last synced: 10 Jun 2026
https://github.com/rlalpha49/anisearch-model
AniSearchModel leverages Sentence-BERT (SBERT) models to generate embeddings for synopses, enabling the calculation of semantic similarities between descriptions. This allows users to find the most similar anime or manga based on a given description.
anime api data-analysis data-merging embeddings flask hugging-face-datasets kaggle-datasets machine-learning manga natural-language-processing nlp python sentence-bert similarity-search
Last synced: 06 May 2026
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/aicorsair/python-case-study-ab-testing-for-lunartech-homepage-cta-button
This repository contains a detailed case study on an A/B test of LunarTech's homepage CTA button, using proxy data structured similarly to the company's real data.
ab-testing click-through-rate confidence-intervals data-analysis data-analytics data-exploration data-science data-visualization hypothesis-testing matplotlib normal-distribution numpy pandas practical-significance python statistical-analysis statistical-significance z-critical z-statistic z-test
Last synced: 07 May 2026
https://github.com/helosantosdesousa/analise-dados-titanic
Análise de dados com o dataset 'Titanic - Machine Learning from disaster'
analise-de-dados analise-exploratoria bootcamp bootcamp-project data-analysis data-girls data-science matplotlib numpy pandas python
Last synced: 07 May 2026
https://github.com/mahmoudnamnam/fc-barcelona-reports
FC Barcelona Reports: An interactive web application to analyze and visualize FC Barcelona's match data. Built with Streamlit, it scrapes match data from WhoScored, stores it in MongoDB, and presents insights through interactive visualizations like pass networks, shot maps, and player statistics.
data-analysis data-visualization football-analytics mplsoccer pandas streamlit web-scraping
Last synced: 07 May 2026
https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting
This project is to develop a machine learning model and deploy it as a user-friendly web application that predicts the resale prices of flats in Singapore.
data-analysis data-visualization mechine-learing plotly python streamlit
Last synced: 07 May 2026
https://github.com/anshulkansal121/music_store_analysis_sql
SQL project to analyze online music store data
beginner-friendly contributions-welcome data-analysis database mysql sql
Last synced: 07 May 2026
https://github.com/bnvulpe/regression-and-time-series
This work centers on assessing and comparing predictive models for regression and time series prediction using specific datasets, with the goal of selecting the most effective methodology for unseen test data.
colab data-analysis data-analysis-python data-science data-visualization forecasting jupyter-notebook machine-learning model-evaluation predictive-modeling python regression sarima sarimax time-series-analysis time-series-analysis-and-forecasting
Last synced: 08 May 2026
https://github.com/fahamidur/cuisine-analysis
This project analyzes recipes from AllRecipes.com to reveal global cooking patterns, nutritional trends, and cultural food differences, offering data-driven insights for food enthusiasts and researchers.
beautifulsoup data-analysis datavisualization pandas selenium tableau-public webscraping
Last synced: 08 May 2026
https://github.com/samjoesilvano/password_strength_prediction_using_nlp
Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.
data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf
Last synced: 08 May 2026
https://github.com/dhruwsunita/zomato-data-analysis-project
Zomato data analysis project using Python.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas-dataframe python
Last synced: 08 May 2026
https://github.com/pedrosfaria2/analisandopostshn
Projeto para analisar as postagens da comunidade HackerNews
analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3
Last synced: 08 May 2026
https://github.com/alejandrolara11/data-preprocessing
Data preprocessing through the use of the libraries NumPy and pandas.
data-analysis data-cleaning data-preprocessing numpy pandas python
Last synced: 09 May 2026
https://github.com/ranagaballah/true-fake-news
True Fake News Detector NLP model
data-analysis data-science data-visualization deployment machine-learning matplotlib nlp numpy pandas python
Last synced: 09 May 2026
https://github.com/isaqueiros/motorpremium-predictions-mlpclassifier
This Jupyter Notebooks is an initial study of the application of sklearn neural network MLP Classifier model. The model is applied to dataset MotorPremiums, which is supplied separately in .csv format.
data-analysis data-science machine-learning neural-network python sklearn-library
Last synced: 02 May 2026
https://github.com/chuxinh/our-data-manual
All in one place for our data science learning journey by Chuxin and Melody
data-analysis data-science machine-learning python
Last synced: 09 Jun 2026
https://github.com/fatihilhan42/spotify-songs-recommendations-system_with_python
We developed a song recommendation system for the user with the data we received from our Spotify song dataset. Data set and other applications are given in the description. Have a nice day.
data-analysis data-science data-visualization jupyter-notebook python recommendation-engine recommendation-system
Last synced: 02 May 2026
https://github.com/leticiamilan/santander-tech-data-science
Este repositório contém os projetos desenvolvidos durante o curso de Data Science, uma parceria entre a Ada Tech e o Santander Open Academy. O curso é dividido em vários módulos, cada um focado em um aspecto fundamental da ciência de dados.
ada-tech calculus data-analysis data-science letscode python santander
Last synced: 09 Jun 2026
https://github.com/bhaveshbhakta/gold-price-prediction-using-ml
Gold Price Prediction
data-analysis data-visualization gold-price-prediction machine-learning python
Last synced: 02 May 2026
https://github.com/dissorial/prx21_erikz
Analysis of self-tracked data: interactive visualizations & predictive algorithms
analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization
Last synced: 02 May 2026
https://github.com/mehanix/dhrw
🎢 IaaS visual editor to create & deploy data processing pipelines - python, rmq, react, meteorjs
computational-graph computational-graphs data-analysis data-engineering data-pipeline data-pipelines data-processing data-processing-and-analysis data-processing-pipelines data-processing-system data-science data-visualization docker-compose good-first-issue help-wanted meteorjs-application rabbitmq react-flow
Last synced: 02 May 2026
https://github.com/benzerinsio/breastcancer-eda
📊 Análise Exploratória de Dados (EDA) - Câncer de Mama | Exploração de características clínicas para identificar padrões e relações no diagnóstico de câncer de mama.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis data-visualization diagnosis eda exploratory-data-analysis health-care medical-data python seaborn
Last synced: 02 May 2026
https://github.com/aravindnathan02/data-science-capstone
IBM Data Science Certificate capstone project on Coursera.
data-analysis data-science data-visualization machine-learning predictive-modeling python sql
Last synced: 03 May 2026
https://github.com/inevolin/multivariate-data-analysis
Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.
analytics data-analysis data-science data-visualization javascript
Last synced: 09 Jun 2026
https://github.com/faiyaz-zaman/used-car-market-trends-on-bikroy.com
Used Car Market Trends on Bikroy.com
data-analysis python scraping-websites selenium tableau
Last synced: 02 May 2026
https://github.com/maddieemihle/pandas-challenge
Python analysis to create and manipulate school and standardized test data. Scores are calculated, grouped, aggregated, summarized, and organized using pandas.
Last synced: 09 Jun 2026
https://github.com/robertpaulp/expenseadvisor
HackITall 2023- Hackathon
chatgpt-api data-analysis data-processing python scrapping-python
Last synced: 03 May 2026
https://github.com/jofaval/red-wine-quality
Data Analysis of the Red Portuguese's Wine's Quality in 2009
classification data-analysis data-science data-visualization google-colab kaggle logistic-regression machine-learning python scikit-learn wine-quality xgboost
Last synced: 03 May 2026
https://github.com/helenaden/data-science-fundamentals
This project delves into fundamental data science concepts using Python libraries like NumPy and Pandas
data-analysis datascience datasets datavisualization datawrangling heatmap numpy pandas patterns python
Last synced: 03 May 2026
https://github.com/bhavna-kale/cars-eda-project
Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.
data-analysis excel matplotlib numpy pandas python3 searborn streamlit
Last synced: 03 May 2026
https://github.com/ahmedhosssam/lesser_pandas
Pandas-like Data Analysis library in C++
cpp data-analysis data-science pandas
Last synced: 03 May 2026
https://github.com/stepankuzmin/machine-learning-data-analysis
My homeworks on Coursera Machine Learning and Data Analysis specialization
coursera data-analysis jupiter machine-learning python
Last synced: 03 May 2026
https://github.com/fatihilhan42/tourist_analysis_in_turkey_with_python
In this project, the number of tourists coming to Turkey between 2008-2021 was analyzed. The data from the data set you can find in the warehouse was first organized using data cleaning algorithms. These cleaned data were then output graphically using data visualization algorithms.
data-analysis data-cleaning data-science data-visualization jupyter-notebook python
Last synced: 03 May 2026
https://github.com/aicorsair/python-case-study-imdb-movie-reviews-sentiment-analysis-with-nlp
This repository contains a comprehensive case study on sentiment analysis using the IMDb dataset of movie reviews.
ada-boost artificial-intelligence classification data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-extraction hyperparameter-tuning logistic-regression machine-learning naive-bayes natural-language-processing nltk python random-forest shap
Last synced: 03 May 2026
https://github.com/chaedoll/analysis-python-foreignerinfra
국내 외국인 대상 인프라 개선을 위한 보고서 (Report on improving infrastructure for foreigners)
data-analysis python team-project
Last synced: 03 May 2026
https://github.com/cassiofb-dev/projetos-intensivao-python
Projetos do evento intensivão de Python da Hashtag treinamentos.
automation data-analysis data-science data-visualization jupyter-notebook machine-learning python webscraping
Last synced: 03 May 2026
https://github.com/emredemirbas/movie-ratings-analysis
A data analysis project investigating potential bias in movie ratings from 2015, comparing them with ratings from other platforms using Python, pandas, and visualization libraries.
data-analysis matplotlib pandas python seaborn
Last synced: 03 May 2026
https://github.com/maddieemihle/python-challenge
Creating a Python script that analyzes financial records and election results
Last synced: 09 Jun 2026
https://github.com/devlucho/modelos-predictivos
Modelos predictivos utilizando los algoritmos de Regresión Lineal, Regresión Logística y Árboles de Decisión.
data-analysis jupyter-notebook python3
Last synced: 03 May 2026
https://github.com/ababic/dumpling
Fast, flexibile, powerful static data anonymisation for SQL dumps
anonymisation cli data-analysis data-science pii pii-redaction postgres privacy rust rust-lang scrubber scrubbing security tooling
Last synced: 03 May 2026
https://github.com/nathadriele/diabetes-clinical-etl-pipeline
Este projeto de Engenharia de Dados em Saúde Pública implementa um pipeline completo para coletar, tratar, padronizar, validar, integrar e visualizar dados públicos do SUS relacionados ao Diabetes Mellitus no Brasil, filtrando pelos códigos CID-10 E10 a E14.
cid data-analysis data-extraction data-pipeline data-science data-structures data-visualization datasus diabetes-detection diabetes-prediction epidemiology-analysis etl-pipeline healthcare-analytics ibge logger pytest sih streamlit sus
Last synced: 09 Jun 2026
https://github.com/muskanmi/data_analysis_python
Data analysis on students result dataset using python libraries.
boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn
Last synced: 03 May 2026
https://github.com/ankitgmishra/machinelearning
Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 03 May 2026
https://github.com/obinnaokoye89/fraud-detection-monitoring
ML model monitoring for fraud detection using NannyML
analytics automation data-analysis fraud-detection jupyter-notebook machine-learning monitoring nannyml pandas python sciki-learn
Last synced: 03 May 2026
https://github.com/joelfaldin/data-analysis
A collection of data-analysis projects I've built over time! ✨⛏️
Last synced: 03 May 2026
https://github.com/matteospanio/speed-analysis
A project to analyze the internet speed
Last synced: 03 May 2026
https://github.com/devesh8423/machine_learning
Machine Learning practice projects, Jupyter notebooks, and datasets for learning regression, classification, and data analysis.
classification data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib ml-project numpy-library pandas python regression sckit-learn seaborn
Last synced: 03 May 2026
https://github.com/donmaruko/flask-data-analysis
Flask API for statistical calculations. Data analysis, cleansing, visualization, and manipulation. Documented by Swagger.
api api-rest data-analysis data-science data-visualization datascience flasgger matplotlib pandas seaborn sqlite wordcloud
Last synced: 03 May 2026
https://github.com/nathadriele/world-marathon-run-majors-analytics-challenge
This project presents a complete data engineering, analytics, machine learning, and Streamlit dashboard pipeline focused on the Abbott World Marathon Majors: Tokyo, Boston, London, Berlin, Chicago, and New York City. Covering the 2018 to 2025 seasons, it analyzes more than 628,000 runner records and 86 verified winner entries.
challenge data-analysis data-pipeline gradient-boosting lasso-regression linear-regression machine-learning models predictive-modeling python random-forest ridge-regression run-analytics world-marathon
Last synced: 09 Jun 2026
https://github.com/bpkaur/whats-in-a-name
Exploring dataset of first names of babies born in the US in order to uncover interesting stories
data-analysis datacamp numpy pandas python3
Last synced: 04 May 2026
https://github.com/mindlessmuse666/titanic-data-visualization
Проект по визуализации данных о пассажирах Титаника с использованием библиотек Python Matplotlib, Seaborn и Plotly.
data-analysis data-visualization matplotlib pandas plotly python seaborn titanic
Last synced: 04 May 2026
https://github.com/nickenshidqia/uber-new-york-data-analysis
Analyze Uber pickups on New York to get insight from this data
data-analysis data-analyst exploratory-data-analysis python
Last synced: 04 May 2026
https://github.com/xiaohan2012/myunisport
Visualize your Unisport annual training records
data-analysis data-visualization pandas pygal sports-stats tikzposter
Last synced: 04 May 2026
https://github.com/fatihilhan42/the-office-eda
Data analysis study of my favorite sitcom, The Office (US).
data-analysis data-science data-visualization fatihilhan office python sitcom
Last synced: 04 May 2026
https://github.com/arv-anshul/ipl-api
IPL API using Flask framework and ipl dataset.
api data-analysis fast-api flask flask-api ipl ipl-api python3
Last synced: 04 May 2026
https://github.com/damisparks/become_data_analyst
Are you new to Data Analysis ? Here you will find simple notebook that will help through your journey. These are personal projects I work on and still working.
data data-analysis data-visualization matplotlib numpy pandas-tutorial
Last synced: 04 May 2026
https://github.com/aaaa-source/us-stock-market-analysis-and-prediction
US Stock Market Analysis and Prediction
artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks classification clustering data-analysis finance financial-analysis python
Last synced: 09 Jun 2026
https://github.com/mr-chang95/sf_data_visualization
In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.
business data-analysis data-visualization jupyter-notebook pandas python san-francisco
Last synced: 04 May 2026
https://github.com/marionchaff/real-estate-price-prediction-france
Real estate price prediction using French public database DVF
data-analysis dvf-data machine-learning price-prediction python real-estate scikit-learn
Last synced: 04 May 2026
https://github.com/analitico-771/etf_analyzer
This is an An application that pulls and analyzes ETF data from a database
conda-environment data-analysis data-structures data-visualization database etf-investments fintech hvplot pandas-dataframe python quantitative-finance sqlalchemy
Last synced: 04 May 2026
https://github.com/fatihilhan42/book-recommendation-system-with-python
In this project, we are making a book recommendation system that recommends similar books according to the genres or ratings that the user enters, using a large book dataset. The link of the dataset is given below. Happy reading...
books data-analysis data-science data-visualization kaggle python recommendation-engine recommendation-system
Last synced: 04 May 2026
https://github.com/hyperplasma/olympic-visualization-analysis
Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.
data-analysis data-visualization matplotlib numpy pandas python wordcloud
Last synced: 04 May 2026
https://github.com/abhinav330/911-emergency-calls-analysis
This Python Notebook analyzes emergency call data from the '911.csv' dataset. It uses various data visualization techniques to explore and gain insights into the emergency call data, including the types of calls, reasons for calls, and call patterns over time.
data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations numpy pandas python
Last synced: 09 Jun 2026
https://github.com/ljadhav25/logistic-regression-data-science-
Logistic regression estimates the probability of an event occurring, such as voted or didn’t vote, based on a given data set of independent variables.
data-analysis data-science data-visualization logestic-regression machine-learning
Last synced: 04 May 2026
https://github.com/mugilan1309/csv_analyzer
📊 A simple Streamlit-based CSV Analysis & Preprocessing Tool for quick data insights.
csv-processing data-analysis data-visualization machine-learning python streamlit
Last synced: 04 May 2026
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 04 May 2026
https://github.com/bishopce16/pyber_analysis
The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.
data-analysis data-visualization jupyter-notebook matplotlib pandas python
Last synced: 04 May 2026
https://github.com/drod75/nyc-arrests-analysis
This is a simple Data Science Project made to analyze and display data and trends found within the NYC Arrests Year to Date Dataset.
data-analysis data-visualization folium jupyter-notebook matplotlib-pyplot nyc-opendata nypd python scikit-learn seaborn
Last synced: 04 May 2026
https://github.com/georgehanymilad/plantycare-app
Graduation Project - Fayoum Center
ai backend cnn-classification colab-notebook data-analysis deep-learning diagrams front-end java kaggle machine-learning native ui-design
Last synced: 04 May 2026
https://github.com/flytomarsz/bike-sharing-system-analysis
This analysis project aim to identify bike rental's behavior in 2012 from Capital Bikeshare system, Washington D.C., USA. This project is part of my Data Analysis study at Dicoding.
data-analysis data-visualization jupyter-notebook python streamlit
Last synced: 04 May 2026
https://github.com/yokawaiik/data_science
Time series forecasting with future predict.
data-analysis keras lstm neural-network predict-future python python-3 rnn time-series-forecast visualization
Last synced: 05 May 2026
https://github.com/zobayerakib/credit-card-fraud-analysis__data-analysis-project
credit-card data-analysis decision-trees fraud-detection gradient-descent knn-classification logistic-regression machine-learning machine-learning-algorithms naive-bayes-classifier random-forest-classifier
Last synced: 05 May 2026
https://github.com/tasosfotiadis/time-series-analysis-and-forecasting-of-cryptocurrency-prices
Forecasted Cardano (ADA) cryptocurrency prices using time series analysis. The project involved data preprocessing, trend and seasonality analysis, and model building with ARIMA, SARIMA, and LSTM. Models were evaluated using metrics like MAE and MAPE, providing insights for financial decision-making.
applied-st classical-statistical-models data-analysis deep-learning lstm machine-learning neural-network python r time-series
Last synced: 05 May 2026
https://github.com/shishirshekhar/census-web-app
This web app allows a user to explore and visualise census data
data-analysis data-science data-visualization machine-learning python python3 streamlit streamlit-application streamlit-dashboard streamlit-web streamlit-webapp visualization
Last synced: 05 May 2026
https://github.com/dhruvsrikanth/basic-data-science
A short Data Science Project I took up for fun! This is a data analysis based on a dataset I created to predict the distribution of wealth within an economy as well as several characteristics of each class within society!
analysis data-analysis data-pipeline data-science data-visualization machine-learning matplotlib pandas python seaborn sklearn
Last synced: 05 May 2026
https://github.com/13anush/python-libraries-
A collection of essential Python libraries—NumPy, Pandas, Matplotlib, and Seaborn—perfect for anyone starting out in data analysis.
data-analysis matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/jacktheprogrammer/time-series-forecasting-and-analysis
My personal project consisting of my personally created notebooks to work with time series forecasting and analysis. In these projects, I've used deep learning using tensorflow, xgboost, statsmodels and scipy libraries of python. The series were of weather, energy consumption and that of stocks.
data-analysis data-science deep-neural-networks energy-consumption machine-learning portfolio prophet-facebook prophet-model python python3 scipy statsmodels stocks tensorflow time-series time-series-analysis timeseries-forecasting weather xgboost
Last synced: 05 May 2026
https://github.com/sajjad425/edaipl
The dataset covers the Indian Premier League (IPL) with details on matches (date, teams, venue, results), player stats (runs, wickets), team stats (wins, losses), season summaries, and umpire info. The EDA reveals patterns and insights, highlighting dominant teams, star players, and trends across seasons.
data-analysis eda exploratory-data-analysis ipl python
Last synced: 05 May 2026
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 05 May 2026
https://github.com/monish-nallagondalla/universal-bank
Credit Card Ownership Prediction A machine learning project that predicts credit card ownership using features like age and income, balancing class distributions for improved accuracy.
classification-models credit-card-prediction data-analysis data-classification decision-tree-classifier imbalanced-datasets machine-learning model-evaluation python scikit-learn
Last synced: 05 May 2026
https://github.com/akotronis/qualitycontrol
HRH Quality Control app
data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3
Last synced: 05 May 2026
https://github.com/ayaatmohammed/amazon-sales-analysis-pyspark
In-depth analysis of the Olist E-commerce dataset from Kaggle using PySpark for customer segmentation (RFM) and market basket analysis.
big-data big-data-analytics customer-segmentation data-analysis data-science ecommerce jupyter-notebook kaggle pyspark python rfm-analysis
Last synced: 05 May 2026
https://github.com/kammarah/data-sample
I designed a database website 🌐 that can be uploaded easily for use 📤. You can check my website 👀.
data-analysis data-visualization database deploy deployment library-management-system panaversity streamlit webapp
Last synced: 05 May 2026
https://github.com/zyna-b/insurance-cost-analysis-and-prediction
Medical insurance EDA and prediction: feature engineering, correlation analysis & Chi-square tests
adjusted-r-squared chisquare-test data-analysis data-science data-visualization eda exploratory-data-analysis linear-regression pandas r2-score sklearn statistical-analysis
Last synced: 05 May 2026
https://github.com/nimbostratos/titanic-survival-prediction
Machine learning project predicting Titanic survival using AdaBoost with feature engineering and hyperparameter optimization
data-analysis data-science data-science-projects kaggle machine-learning machine-learning-models python scikit-learn
Last synced: 05 May 2026
https://github.com/caesaredia/ymusic-project
Exploratory data analysis (EDA) of music streaming behavior in two fictional cities using Python, Pandas, and Jupyter Notebook. It explores user behavior, genre preferences, and listening patterns throughout the week.
data-analysis eda pandas python
Last synced: 05 May 2026
https://github.com/hms75/movie_rating_analysis
A movie rating analysis which identifies trends amongst a dataset of 5000 movies.
data-analysis data-visualization matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/benjaminrose/data-analysis-book
A Jupyter Book for my Spring 2025 PHY 5381 class on Data Analysis
book data-analysis data-science data-visualization jupyter-book open-book python r statistics-course
Last synced: 06 May 2026
https://github.com/iamrajmani/sentimental-analysis
Sentimental Analysis - Final Year College Project
data-analysis data-visualization machine-learning python pytorch
Last synced: 06 May 2026
https://github.com/ryuzen6/bangalore-real-estate-price-prediction
This is a Data Science Project which predicts the cost of Real Estate in Bangalore. Requirements: Jupyter Notebook (for Data Cleaning and creating the Linear Regression using various python libraries) , Pycharm (python IDE for creating Python Flask Server), Visual Studio Code (to create the UI with HTML, CSS and Javascript).
css3 data-analysis data-science html5 javascript jupyter-notebook machine-learning python3
Last synced: 06 May 2026