Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with eda
A curated list of projects in awesome lists tagged with eda .
https://github.com/raulmaulidhino-dev/ml_modelling_regression
There are many factors that influence the grades/scores of students. One of the factors is study hours. In this mini analysis project, there are 3 models that will learn and predict the relation between study hours of students and their scores in an exam/test. This project will result the best ML model to solve the problem.
data data-analysis-python data-science eda machine-learning scikit-learn
Last synced: 21 Dec 2024
https://github.com/jt2m0l3y/quantified-self
The final project for an introduction to data science, this project is a practice in supervised machine learning with kNN and Decision Tree Classifiers.
data-visualization decision-trees eda jupyter-notebooks knn-classification latex markdown numpy pandas python scikit-learn statistical-analysis supervised-learning
Last synced: 27 Dec 2024
https://github.com/chandkund/sentiment-analysis-on-movie-reviews
Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.
data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn
Last synced: 21 Dec 2024
https://github.com/prady2309/unemployement-analysis
Data Science Project
colab-notebook data-analysis data-science data-visualization eda jupyter-notebook machine-learning python3
Last synced: 22 Dec 2024
https://github.com/alchemine/computer-vision-anomaly-detection
Computer Vision 이상치 탐지 알고리즘 경진대회
Last synced: 16 Jan 2025
https://github.com/alchemine/diabetes-prediction
Diabetes Prediction and Analysis (NHIS-2018)
eda jupyter python scikit-learn streamlit
Last synced: 16 Jan 2025
https://github.com/abhash-rai/regression-car-price-prediction
This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.
data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression
Last synced: 16 Jan 2025
https://github.com/samridhisainii/airbnb-data-analysis
Data analysis of airbnb dataset
analysis data data-visualization eda models
Last synced: 29 Nov 2024
https://github.com/bastianlq/analisis-instacart
Proyecto de análisis a la plataforma de delivery Instacart, análisis exploratorio y visualización de datos.
analisis-de-datos eda matplotlib
Last synced: 06 Dec 2024
https://github.com/neerajcodes888/ipl-victory-analysis-with-prediction
This repository contains code for analyzing and predicting outcomes in the Indian Premier League (IPL) cricket matches from 2008 to 2022. It includes data analysis notebooks, a prediction model, and a Flask-based web application for interactive predictions. Explore historical match data, gain insights, and make predictions on upcoming matches .
css3 csv-datasets eda feature-extraction flask-application github html5 notebooks pandas-library preprocessing python3 render-deployment sckiit-learn
Last synced: 04 Dec 2024
https://github.com/neerajcodes888/data-science
This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science
data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit
Last synced: 04 Dec 2024
https://github.com/lu-sketch/chocolate-imports-dataset
Chocolate Imports for South Africa
Last synced: 29 Nov 2024
https://github.com/shreshth-112/zomato-data-analysis
Helping Zomato segregate restaurants according to the data collected from the masses.
data-science eda sentiment-analysis
Last synced: 29 Dec 2024
https://github.com/daviddavo/18eda
This is a mirror from https://gitlab.com/daviddavo/18eda
Last synced: 23 Nov 2024
https://github.com/albertofaraujo/ml_ocupacao_casa
Criar um modelo de Rede Neural Artificial, capaz de prever o valor da mediana de ocupação das casas, utilizando dados locais.
data-science eda machine-learning redes-neurais-artificiais rstudio
Last synced: 06 Dec 2024
https://github.com/vikram-bhati/kmeans-for-university
Cluster Universities into to two groups, Private and Public
clustering-algorithm dataanalysis eda kmeans kmeans-clustering machine-learning ml python python3 sklearn
Last synced: 31 Dec 2024
https://github.com/sehgal-vishal/sql-nyc-collision-analysis
this analysis is based on the Collisions(Accidents) happend in New York City. I have used Sql Server For EDA(Exploratory Data Analysis
data-analysis database eda sql-server
Last synced: 18 Jan 2025
https://github.com/ashish-kr-srivastava/social-media-database-analysis-sql-project
I have recently worked on a project of analyzing a social media platform data on MS SQL SERVER. In this project I have used advanced SQL functions and keywords like Views, Indexes, CTE, Windows Functions and many more.
eda joins mssqlserver schema views windowsfunction
Last synced: 18 Jan 2025
https://github.com/abinashsahoo007/project-bankruptcy-prevention
The project is to create a classification model that predicts the chances of a business facing bankruptcy based on the key feature like Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, Operating Risk.
data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit
Last synced: 09 Jan 2025
https://github.com/abinashsahoo007/project-song-recommendation-system
This Project is a Simple Content-Based Song Recommendation System. It suggest similar item to the user based on the content the user provide.
correlation cosine-similarity data-mining dbscan-clustering deployment eda heirarchical-clustering k-means-clustering pandas-profiling pca pickle recommender-system statistics streamlit visualization
Last synced: 09 Jan 2025
https://github.com/shibasishb2/ensemble-techniques
This project is based on the case study of a telecommunication company, which is facing a customer churn issue. The project aims at understanding the pattern of the data and predicting customers who are going to churn based on multiple variables to help the company in retaining their existing customers.
adaboost decision-trees eda logistic-regression ml-workflow python random-forest
Last synced: 24 Nov 2024
https://github.com/albertofaraujo/ml_notas_de_alunos
Prever as notas dos alunos com base em diversas métricas.
data-science eda machine-learning regressao-linear rstudio
Last synced: 06 Dec 2024
https://github.com/albertofaraujo/sql_people_analytics
O objetivo desta exploração de dados é responder algumas perguntas de negócios para posteriormente criar um painel de visualização com as métricas estratégicas para tomada de decisão do gestor da área, facilitando a compreensão e análise dos indicadores de forma visual, atrativa e eficiente.
azure-data-studio data-science eda sql
Last synced: 06 Dec 2024
https://github.com/albertofaraujo/sql_eda_comercio_exterior
A área de comércio exterior de uma empresa automotiva, busca melhorar o monitoramento dos embarques de importação, implementando uma torre de controle eficiente
analise-exploratoria azure-data-studio eda sql
Last synced: 06 Dec 2024
https://github.com/syedzaheerabbas/risk-analytics-with-python
This project focuses on developing a basic understanding of risk analytics in banking and financial services and understand how data is used to minimize the risk of losing money while lending to customers.
eda hypothesis-testing numpy pandas python risk-analysis seaborn
Last synced: 07 Dec 2024
https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle
Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.
chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis
Last synced: 23 Dec 2024
https://github.com/mdanwarulkarim/netflix-data-analysis-excel-project
This project analyzes Netflix's content data, emphasizing trends in production and distribution. It addresses business questions through an interactive dashboard, exploring movie and TV show distribution, key contributors, genre trends, and geographic diversity. The analysis provides insights into Netflix's expanding library.
Last synced: 30 Nov 2024
https://github.com/albertofaraujo/sql_eda_data_travels
A Data Travels é uma empresa que vende pacotes de viagens e tem o objetivo de melhorar a compreensão de seus dados de vendas, identificar oportunidades de crescimento e otimizar suas estratégias de marketing.
analise-exploratoria azure-data-studio eda sql vendas
Last synced: 06 Dec 2024
https://github.com/sahiltiwariiii/email-spam-classifier
This model will tell you weather mail is spam or not
dataanalysis datacleaning datascience eda machine-learning nlp-machine-learning nltk numpy pandas python scikit-learn streamlit streamlit-webapp tfidf-vectorizer wordcloud-visualization wordtovec
Last synced: 31 Dec 2024
https://github.com/albertofaraujo/sql_eda_hospitalar
Análise Exploratória de dados da área hospitalar para resposta de perguntas de negócios
Last synced: 06 Dec 2024
https://github.com/msikorski93/predicting-prices-on-king-county-housing-dataset
Predicting house prices using different regression analysis models.
catboost eda gradient-boosting king-county lightgbm linear-regression machine-learning neural-network polynomial-regression real-estate regression-models scikit-learn tensorflow xgboost
Last synced: 09 Jan 2025
https://github.com/albertofaraujo/sql_eda_financeiro
A empresa Logit do ramo de Logística, busca otimizar a gestão de seu fluxo de caixa e obter uma visão abrangente de suas receitas e despesas para um planejamento financeiro mais eficaz.
analise-exploratoria azure-data-studio eda financeiro sql
Last synced: 06 Dec 2024
https://github.com/kaushikrohida/bank-customer-data-prep
Cleaning and Exploring the bank customer data to prepare it for machine learning models
business eda finance geospatial python
Last synced: 19 Jan 2025
https://github.com/vinicius999/eda-imdb-top1000-films
Análise exploratória dos Top 1000 filmes no IMDB até 2020
Last synced: 12 Jan 2025
https://github.com/albertofaraujo/r_qui.quad_imoveis
O objetivo deste projeto rápido é verificar se há uma relação entre os tipos de imóveis e seus respectivos status, tratando de variáveis categóricas, utilizando o teste Qui.Quadrado para alcançar o objetivo da pergunta de Negócio.
data-science eda imoveis rstudio
Last synced: 06 Dec 2024
https://github.com/anuraganalog/twitter-data-analysis
My internship work during the 2020 summer
analysis data eda exploratory-data-analysis jupyter-notebook nlp spotle textblob twitter wordcloud
Last synced: 19 Jan 2025
https://github.com/dhananjayporwal/whatsapp-chat-analyser
A Streamlit web app for analyzing WhatsApp chat data, offering insights into communication patterns and user behavior.
chat-analysis dhananjayporwal eda whatsapp whatsapp-chat-analysis whatsapp-chat-analysis-using-python whatsapp-chat-analyzer whatsapp-chat-analyzer-github whatsapp-chat-analyzer-streamlit
Last synced: 12 Jan 2025
https://github.com/albertofaraujo/r_eda_socioeconomico
Realizar uma exploração de dados em uma base socioeconômica e responder as perguntas de negócios
data-science eda rstudio socioeconomic
Last synced: 06 Dec 2024
https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy
In this project, we analyze the space flight data of Spacex space research company Falcon 9 rocket.
data-analysis data-science data-visualization eda elonmusk spacex
Last synced: 01 Dec 2024
https://github.com/dragonman225/ngrp
A Ngspice ASCII rawfile parser written in Javascript.
eda eletronics ic-design ngspice spice
Last synced: 19 Jan 2025
https://github.com/albertofaraujo/ml_airbnb_rj
Este projeto tem por finalidade prever os preços do Airbnb do RJ, baseado em dados disponibilizados pelo Kaggle
data-science eda machine-learning python
Last synced: 06 Dec 2024
https://github.com/assem-elqersh/creativa-data-science-bootcamp
Jupyter notebooks from the Creativa Data Science Bootcamp, covering key data science concepts and practices across multiple sessions, from data preprocessing to model building and time series analysis.
data data-science eda exploratory-data-analysis machine-learning pandas time-series-analysis xgboost xgboost-classifier
Last synced: 19 Jan 2025
https://github.com/tanyagarg25/local_store_performance_analysis
Analyzing local store performance using sales data to identify trends, inefficiencies, and opportunities for growth. This project includes data cleaning, descriptive statistics, and interactive visualizations using Tableau and Excel
analytics cleaning-data eda excel tableau visualization
Last synced: 29 Dec 2024
https://github.com/debjyotisaha/data-analytics-projects-phase-2
Developed and showcased various data analytics projects, including data preprocessing, exploratory data analysis, and visualization. Utilized tools such as Python, Pandas, NumPy, and Matplotlib to derive actionable insights and demonstrate problem-solving capabilities.
data-analysis data-preprocessing eda matplotlib numpy pandas python seaborn
Last synced: 07 Dec 2024
https://github.com/hiakshatjain/mobilepriceprediction
Welcome to the Mobile Price Prediction project! This repository contains code for predicting mobile phone prices using various machine learning models. It includes data preprocessing, model training, and evaluation for both regression and classification tasks.
classification eda knn-classification knn-regression linear-regression logistic-regression machine-learning regression
Last synced: 12 Jan 2025
https://github.com/nicklasbekkevold/anonymized-dataset-classification
Classifying an unknown data set using ensemble machine learning methods with a focus on exploratory data analysis. This was a part of the course TDT05 - Modern Machine Learning in Practice at NTNU autumn 2021.
eda ensemble-learning machine-learning ntnu
Last synced: 22 Dec 2024
https://github.com/karthikarajagopal44/pandas-beginner-to-advanced
This repository is designed to be a comprehensive guide to mastering pandas, the powerful data manipulation and analysis library in Python.
data-manipulation datascience eda pandas pandas-dataframe python
Last synced: 25 Nov 2024
https://github.com/shubhamsoni98/project_using_knn
This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.
anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau
Last synced: 22 Jan 2025
https://github.com/jeffandyalltogether/mlrecommendationsystem
project code for a recommendation system for Amazon using collaborative filtering, ranking, and matrix factorization to enhance customer satisfaction and product discovery.
eda matplotlib pandas python scikit-learn seaborn tensorflow
Last synced: 07 Dec 2024
https://github.com/oshinrathor/datsci
Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.
dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics
Last synced: 06 Jan 2025
https://github.com/nishanthmuruganantham/football-player-wages-eda
This repository uses Python for analyzing football player data, focusing on various aspects such as player positions, league distributions, wages, and the relationship between player age and appearances. It includes visualizations generated using Plotly to provide insights into the dynamics of football player demographics and performance.
data-analysis data-science data-visualization eda football football-analytics football-data kaggle kaggle-dataset pandas plotly python
Last synced: 25 Nov 2024
https://github.com/raufjatoi/heart
model implementation on heart disease dataset
data-visualization eda machine-learning machine-learning-algorithms
Last synced: 25 Nov 2024
https://github.com/raufjatoi/diabetes
EDA and model implementation on diabetes dataset
data-visualization eda machine-learning
Last synced: 25 Nov 2024
https://github.com/gregoritsch3/ml_eda_classification_loanapprovalprediction
An EDA and Machine Learning Classification exercise on the Loan Approval dataset demonstrating EDA, feature engineering, StratifiedKFold and the use of Tensorflow NN, SVC, LinearSVC, XGBoost, Naive-Bayes, Bagging, Random Forest and Decision Tree algorithms.etc. The modela are optimized using hyperparameter tuning through GridSearchCV.
eda feature-engineering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn tensorflow
Last synced: 07 Dec 2024
https://github.com/ashwin331133/sql-project--sales-data-analysis--walmart
This SQL-based Walmart data analysis project aims to identify top-performing branches and products, optimize sales strategies using Kaggle's Walmart Sales Forecasting Competition dataset.
Last synced: 22 Jan 2025
https://github.com/filmil/bazel_rules_fusesoc_2
Yet another attempt at bazel rules for fusesoc. This one relies on a hermetic installation of fusesoc and edalize, and not a containerized build. See https://github.com/filmil/bazel_rules_fusesoc for that other bit.
Last synced: 25 Nov 2024
https://github.com/karlyndiary/bellabeat-eda
Bellabeat Case Study - Google Data Analytics Capstone using Python.
bellabeat bellabeat-case-study bellabeat-eda bellebeat-data-analysis case-study case-study-analysis data-analysis data-visualization eda python reports
Last synced: 29 Nov 2024
https://github.com/jsinkx/bigdata-player-stats-coursework
Analysis of sportsmen parameters using Big Data methods
analytics big-data coursework eda
Last synced: 09 Jan 2025
https://github.com/karlyndiary/mavens-pizza-sales-insight
Analyzing Maven Pizza's sales performance and business insights by exploring key metrics, product trends, customer behavior, and peak sales periods, utilizing SQL for querying and Excel for dashboard visualizations.
analysis data-exploratory data-pipeline data-visualization eda etl excel-dashboard pizza-dataset sql
Last synced: 29 Nov 2024
https://github.com/iris-events/iris-docs
Iris Event drivent arhitecture documentation
eda event event-driven event-driven-microservices event-driven-programming quarkus
Last synced: 12 Oct 2024
https://github.com/datalopes1/manufacturing_defects
Projeto de EDA utilizando o Manufacturing Defects que pode ser encontrado no Kaggle
data-analysis data-visualization eda exploratory-data-analysis python
Last synced: 07 Dec 2024
https://github.com/datalopes1/bankabc_churn
Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) com foco na análise de Churn a partir do datas ser Bank Customer Churn Dataset, que pode ser encontrado no Kaggle e disponibilizado por Gaurav Topre.
churn-analysis data-analysis data-science eda python
Last synced: 07 Dec 2024
https://github.com/datalopes1/bank_marketing
Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez
data-analysis data-science data-visualization eda python
Last synced: 07 Dec 2024
https://github.com/datalopes1/ds_salaries2024_eda
Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) a partir do dataset Data Science Salaries 2024, que pode ser encontrado no Kaggle, com licensa Database: Open Database e enviado por Sazidul Islam.
data-analysis data-visualization eda exploratory-data-analysis jupyter-notebook python
Last synced: 07 Dec 2024
https://github.com/cgkantidis/gen_design
Generate a verilog file and an acompanying SPEF file, for a hierarchical design
Last synced: 25 Nov 2024
https://github.com/aksharabhavitha/covid19-analysis
This repository contains the analysis of COVID19 and Visualisations including the CSV file used.
cleaning-data eda jupyter-notebook python visualisations
Last synced: 21 Jan 2025
https://github.com/sparab16/creditcardprediction
To build a classification methodology to determine whether a person defaults the credit card payment for the next month.
eda flask machine-learning naive-bayes python sqllite3 xgboost
Last synced: 01 Dec 2024
https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics
Cleaned, performed EDA and stored data in MySQL. Queried, and analyzed data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.
data-analysis data-visualization eda powerbi python sql
Last synced: 19 Jan 2025
https://github.com/izam-mohammed/ml-notebooks
This repository contain the my main notebooks📊 that contains practical ML notebooks, showcasing code, insights 🔍, and experiments in python. 🚀
eda machine-learning ml notebooks-jupyter
Last synced: 09 Jan 2025
https://github.com/mayhixza/insurance-dataset-analysis
Medical cost insurance EDA project
data-science data-visualization eda linear-regression matplotlib scikit-learn seaborn
Last synced: 08 Dec 2024
https://github.com/medss19/dps-ai-model
A Flask-based API for time-series forecasting using SARIMA, Prophet, and a Hybrid model, with traffic accident data analysis and visualization.
eda fbprophet flask-api hybrid-model postman prophet sarima
Last synced: 20 Dec 2024
https://github.com/shreeparab1890/india-gdp-rate-1960-to-2021-data-analysis
This ipython notebook is the Exploratory data analysis (EDA) of the India GDP Rate 1960 to 2021.
analysis data-analysis eda exploratory-data-analysis ipython-notebook jyputer-notebook matplotlib matplotlib-pyplot pandas python
Last synced: 12 Jan 2025
https://github.com/mohammad95labbaf/umap_breast_cancer
This repository explores the interplay between dimensionality reduction techniques and classification algorithms in the realm of breast cancer diagnosis. Leveraging the Breast Cancer Wisconsin dataset, it assesses the impact of various methods, including PCA, Kernel PCA, LLE, UMAP, and Supervised UMAP, on the performance of a Decision Tree.
breast-cancer-wisconsin decision-tree decision-tree-classifier dimensionality-reduction dimensionality-reduction-technique eda expolatory-data-analysis kernel-pca lle locally-linear-embedding principal-component-analysis umap
Last synced: 13 Jan 2025
https://github.com/0suphan0/heart-attack-analysis-prediction
Analysis and prediction study of heart attack data.
eda machine-learning-algorithms
Last synced: 13 Jan 2025
https://github.com/sarahloree/project-2--bank-loan-marketing-model
This is the second project I completed as part of the Machine Learning Module from my post-graduate certification in AI/ Machine Learning from University of Texas' McCombs School of Business.
business-analytics data-engineering decision-tree-classifier decision-trees eda modelbuilding modelevaluation performance-analysis performance-metrics performancemonitoring preprocessing-data
Last synced: 06 Jan 2025
https://github.com/muzhi1920/take-home_challenge_summary
Exploratory Data Analysis
data-science eda machine-learning
Last synced: 27 Nov 2024
https://github.com/zofiaqlt/professional_inequalities_knime
🎯 Gender inequality at work - use of KNIME (Background research, GDPR, Data governance, ETL, EDA, Data cleaning and validation, Statistical tests with R, and Data Visualization)
datagovernance datavalidation datavisualization eda etl-pipeline knime r rgpd statistical-tests
Last synced: 12 Jan 2025
https://github.com/sarahloree/project-3--credit-card-user-churn-prediction
This is the third project I completed as part of the Advanced Machine Learning module from my post-graduate certification in AI/ Machine Learning from University of Texas' McCombs School of Business.
bagging bagging-classifier boosting boosting-classifier cross-validation datapreprocessing eda exploratory-data-analysis hyperparameter-optimization hyperparameter-tuning random-forest random-forest-classifier sampling smote
Last synced: 06 Jan 2025
https://github.com/nero103/spotify-analysis
This is an analysis of the 2018 Spotify data from Maven Analytics, using T-SQL to explore the data along with Tableau to make visualizations and uncover insights on the top 100 artists and their songs.
dashboard eda nested-queries queries sql t-sql table-join-query tableau
Last synced: 27 Nov 2024
https://github.com/sc0v0ne/exploratorydataanalysis
Exploratory Data Analysis
datasets eda exploredataanalysis kaggle matplotlib notebooks plot python r
Last synced: 31 Dec 2024
https://github.com/alexandramartinez/asyncapi-mule-sfpe
MuleSoft EDA-based integration using AsyncAPI specifications, Anypoint Code Builder, and Salesforce Platform Events.
acb anypoint-code-builder api asyncapi asyncapi-specification eda mule mule-app mule4 mulesoft mulesoft-application platform-event platform-event-bus platform-events salesforce salesforce-api salesforce-developers
Last synced: 28 Nov 2024
https://github.com/drisskhattabi6/exploratory-data-analysis-projects
This Repo contains My Exploratory Data Analysis Projects for many datasets
data-analysis data-preprocessing data-visualization datasets diabetes-prediction eda exploratory-data-analysis iris-dataset
Last synced: 28 Nov 2024
https://github.com/zofiaqlt/credit_risk_pyspark
🎯 Credit risk detection - use of PySpark, Python and JupyterLab (Data collection, Cleaning, EDA, Regression, Classification, Statistical tests, and Data Visualization)
classification eda machinelearning pyspark regression
Last synced: 12 Jan 2025
https://github.com/nikhilsree5/aerofitcasestudy
Customer Profiling and Market Segmentation for AeroFit Treadmills: A Data-Driven Approach
customerprofile eda numpy pandas python visualization
Last synced: 28 Nov 2024
https://github.com/nikhilsree5/netflixcasestudy
Global Content Strategy: Leveraging Data Insights to Optimize Netflix's International Growth
eda numpy pandas python visualization
Last synced: 28 Nov 2024
https://github.com/nikhilsree5/walmartcasestudy
Analysis of Customer Spending Habits at Walmart Inc
clt eda numpy pandas python3 statistics visualization
Last synced: 28 Nov 2024
https://github.com/dhruvil-26/python-projects
This repository contains Python projects showcasing data analysis and visualization. 1. IMDB Movie Analysis: Analyzing movie trends, genres, and ratings. 2. Loan Default Analysis EDA: Exploring factors contributing to loan defaults.
eda imdb-dataset loan-default-analysis matplotlib numpy pandas python seaborn visualization
Last synced: 28 Nov 2024
https://github.com/sunsided/coding-stats
Explorative data analysis of my coding stats
c coding-stats cpp csharp eda python rust
Last synced: 20 Dec 2024
https://github.com/amitbisht99/ydata-profiling
This repository showcases my learning process of automating EDA using 'ydata-profiling'
data-analytics data-profiling eda pandas python3 ydata-profiling
Last synced: 08 Dec 2024
https://github.com/zofiaqlt/hunger_study
🎯 Global study to tackle hunger worldwide and support FAO's mission - use of Python and JupyterLab (Background research, Data collection, Cleaning, EDA, and Data Visualization)
Last synced: 12 Jan 2025
https://github.com/saravanansuriya/industrial-copper-modeling
In this project will equip with practical skills and experience in data analysis, machine learning modeling, and creating interactive web applications using Streamlit, and provide you with a solid foundation to tackle real-world problems in the manufacturing domain.
data-wrangling eda machine-learning-algorithms pandas python streamlit-webapp
Last synced: 28 Nov 2024
https://github.com/sunsided/gun-violence-eda
Exploratory Data Analysis on the Gun Violence Dataset
Last synced: 20 Dec 2024
https://github.com/luminousmen/data_explorer
Streamlit sample application for Exploratory Data Analysis
Last synced: 11 Jan 2025
https://github.com/mayhixza/breast-cancer-classification
Classified tumors as malignant or benign using various supervised ML models
classification eda evaluation-metrics machine-learning supervised-machine-learning
Last synced: 20 Jan 2025
https://github.com/devbluecomet/data-eda
(Exploratory Data Analysis) - Start with understanding the data I was able to be working with. This is crucial as it helps me grasp the characteristics of your dataset, identify patterns, and potential challenges.
artificial-intelligence eda jupyter-notebook
Last synced: 02 Dec 2024
https://github.com/moonmoonsamal/customer_purchase_behavior_analysis
Customer purchase analysis with SQL, Python, and PowerBI
cleaning-data cte eda manipulate-data normalization visualization
Last synced: 03 Dec 2024