Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with eda
A curated list of projects in awesome lists tagged with eda.
https://github.com/georgiosioannoucoder/2023-fall-data-science-ta
These are my code examples for the 2023-fall-data-science-ta as a Data Science Teaching Assistant at CUNY Tech Prep (CTP) Cohort 9. 📊
dashboard data-visualization decision-tree eda huggingface image-classification machine-learning ml neural-network nlp pandas random-forest regression teaching-assistant transformer
Last synced: 02 Jan 2025
https://github.com/tanyagarg25/lendingclub_loan_analysis
Data exploration and analysis of Lending Club loan data to predict loan default risk. Includes data cleaning, descriptive statistics, and visualizations using Tableau and Excel
analysis eda tableau visualization
Last synced: 29 Dec 2024
https://github.com/vinicius999/eda-imdb-top1000-films
Exploratory analysis of the Top 1000 films on IMDB through 2020
Last synced: 13 Nov 2024
https://github.com/badranalyst/titanic-survival-prediction-full-data-science-project-classification
This project predicts Titanic survivors using classification models. It includes data cleaning, pre-processing, exploratory data analysis (EDA), categorical feature conversion, model building, and evaluation. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used to analyze and predict survival outcomes.
classification data-analysis data-science eda exploratory-data-analysis machine-learning matplo matplotlib-pyplot ml model numpy pandas predictive-modeling python seaborn
Last synced: 30 Dec 2024
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 13 Nov 2024
https://github.com/harmanveer2546/nasa-asteroid-classification
Classifying whether an asteroid is hazardous or not.
eda matplotlib numpy pandas python seaborn visualization xbgoost
Last synced: 13 Nov 2024
https://github.com/badranalyst/restaurant-reviews-sentiment-analysis-nlp-case-study
This project analyzes restaurant reviews using Natural Language Processing (NLP) for sentiment analysis. It covers data exploration, pre-processing (NLTK text cleaning), model building, prediction, and deployment. The goal is to predict sentiment from reviews using Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.
data-analysis data-science eda exploratory-data-analysis matplotlib-pyplot model model-building numpy pandas pre-processing predictive-modeling python seaborn
Last synced: 30 Dec 2024
https://github.com/badranalyst/e-commerce-customer-analysis-data-science-foundations-case-study
This case study explores e-commerce customer data through data exploration, pre-processing, and splitting. It includes model building and training to analyze customer behavior. Python libraries like Pandas, NumPy, Matplotlib, and Seaborn are used for the analysis and model development.
data-analysis data-science dataset eda exploratory-data-analysis machine-learning matplotlib ml model-building model-training numpy pandas pre-processing python seaborn
Last synced: 30 Dec 2024
https://github.com/jesly-joji/laptop-price-prediction
Laptop Price Prediction
Last synced: 13 Nov 2024
https://github.com/syedzaheerabbas/jamboree-education-linear-regression
Using data from Jamboree, this project explores the relationship between applicant profiles (GRE, TOEFL, GPA, etc.) and their chances of admission to Ivy League graduate programs. Linear regression, Ridge, and Lasso regression are employed to build predictive models and identify key factors.
data eda linear-regression python visualization
Last synced: 25 Dec 2024
https://github.com/riya2624/heart-disease-diagnostic-analysis
Analyzed heart disease diagnostic data through an ETL process, ensuring data accuracy and readiness for analysis using Python (Pandas, NumPy, Matplotlib, Seaborn). Conducted comprehensive exploratory data analysis (EDA) to uncover trends related to heart disease rates by gender and age. Developed interactive dashboards with Power BI and Tableau
dashboard dax-query eda machine-learning matplotlib numpy pandas python seaborn
Last synced: 30 Dec 2024
https://github.com/rohra-mehak/x-plora
backend django django-rest-framework eda python3 reactjs redux rest-api sqlite3
Last synced: 28 Dec 2024
https://github.com/ksharma67/intel-stock-predication-wiith-eda
We are trying to design a model that can predict stock prices using different methods and algorithms.
eda linear-regression machine-learning-algorithms matplotlib numpy pandas prediction python scaler seaborn skit-learn
Last synced: 25 Dec 2024
https://github.com/tomfreudenberg/cedra
Harnessing the strengths of Cement for fast CLI apps, Dramatiq for reliable task processing, and gRPC for external access, CEDRA redefines efficiency in event-driven architecture.
cement dramatiq eda event-driven grpc message-queue python rabbitmq trpc
Last synced: 15 Dec 2024
https://github.com/mauriciovazquezm/caso_cas_fall2024
Repository for the CAS (Casualty Actuarial Society) Case Competition, fall 2024.
data-visualization eda exploratory-data-analysis python
Last synced: 13 Nov 2024
https://github.com/hariprasath-v/job-a-thon---may-2021
Exploratory Data Analysis
eda exploratory-data-analysis matplotlib missingno numpy pandas pandas-python plotnine
Last synced: 13 Nov 2024
https://github.com/hariprasath-v/amazon-ml-hiring-challenge
Online machine learning hackathon to classify the customer based on various activity scores on the e-commerce website.
dataanalysis eda exploratory-data-analysis ggplot2 r
Last synced: 13 Nov 2024
https://github.com/ksharma67/anomaly-detection-on-temperature-device-failure
A typical anomaly detection task, performed with KMeans, PCA, Gaussian distribution, and Isolation Forest.
eda ellipticenvelope feature-engineering gaussian-distribution isolation-forest kmeans-clustering numpy pca python sklearn
Last synced: 25 Dec 2024
https://github.com/ksharma67/eda-on-ipl
This Python notebook analyzes IPL matches from 2008 to 2020 using Python packages such as pandas, matplotlib, and seaborn.
data-analysis data-science eda matplotlib numpy pandas python seaborn
Last synced: 25 Dec 2024
https://github.com/celestialprogrammer/machinelearningalgorithms
Classification techniques and analysis using Hugging Face and Google Colab
classification-algorithm eda hacktoberfest hacktoberfest-accepted huggingface huggingface-transformers logistic-regression machine-learning sentiment-analysis zeroshot-classification
Last synced: 06 Nov 2024
https://github.com/arxiver/airbnb-eda-and-regression
Big data exploration and analysis on Airbnb dataset as well as regression model for price prediction of entities
airbnb analysis big-data big-data-analytics bigdata eda python regression regression-models visualization xgboost
Last synced: 15 Nov 2024
https://github.com/thatfiredev/eda2trabalhopratico
EDA2 practical assignment. Completed in the 3rd year (2018) of Computer Engineering at ISCTEM.
eda graph-theory graphs student-project
Last synced: 01 Jan 2025
https://github.com/shwetapardhi/assignment-04-simple-linear-regression-1
Q1) Delivery_time -> Predict delivery time using sorting time. Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization, Feature Engineering, Correlation Analysis, Model Building, Model Testing and Model Predictions using simple linear regressi
correlation-analysis data-visualization distplot eda feature-engineering model-building model-prediction model-testing numpy ols-regression p-value pandas python regression-plot rsquare-values seaborn simple-linear-regression smf statsmodel t-score
Last synced: 11 Nov 2024
https://github.com/shwetapardhi/assignment-05-multiple-linear-regression-2
Prepare a prediction model for profit of 50_startups data. Do transformations for getting better predictions of profit and make a table containing R^2 value for each prepared model. R&D Spend -- Research and development spend in the past few years Administration -- spend on administration in the past few years Marketing Spend -- spend on Marketing in t
collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score multi-linear-regression numpy ols-regression p-value pair-plot python r-square-values regress-exog residual-analysis smf statsmodels vif
Last synced: 11 Nov 2024
https://github.com/ksharma67/partial-dependent-plots-and-individual-conditional-expectation-plots
Individual Conditional Expectation (ICE) plots display one line per instance that shows how the instance's prediction changes when a feature changes. The Partial Dependence Plot (PDP) for the average effect of a feature is a global method because it does not focus on specific instances, but on an overall average.
eda gradient-boosting individual-conditional-expectation linear-regression matplotlib numpy pandas partial-dependence-plot python seaborn sklearn xgboost
Last synced: 25 Dec 2024
https://github.com/imumi17/credit_eda_case_study
Exploratory Data Analysis on Banking Data
banking credit credit-risk eda fraud-prevention python
Last synced: 30 Dec 2024
https://github.com/ammahmoudi/bike-sharing-trends
Predicting bike sharing trends using classic machine learning methods (linear regression, decision tree)
bike-sharing decision-trees eda linear-regression machine-learning ml
Last synced: 15 Nov 2024
https://github.com/ammahmoudi/water-treatment-plant
Categorizing the plant's operation state from sensor data using SVMs.
eda knn machine-learning ml svm water-treatment
Last synced: 15 Nov 2024
https://github.com/darshan12345678910/loan_predict-logisticregression-ml-algorithms
Implementation of LogisticRegression
eda logistic-regression ml-algorithms numpy pandas
Last synced: 12 Nov 2024
https://github.com/shreeparab1890/heart-attack-analysis-prediction-accuracy-85.24
This IPython notebook implements analysis and prediction for heart attacks. The analysis is done on the dataset to identify the key features that have an impact on heart attacks. After the analysis and data cleaning, the dataset is used to train the model using different ML algorithms.
eda exploratory-data-analysis matplotlib numpy pandas plotly prediction python scikit sklearn
Last synced: 01 Jan 2025
https://github.com/yuting1214/-kobe-bryant-career-stats-analysis
Explore the record of the legendary NBA player - Kobe Bryant
eda exploratory-data-analysis kobe
Last synced: 30 Dec 2024
https://github.com/ksharma67/k-means-algorithm-on-the-iris-dataset
Applied the K-Means algorithm on the Iris dataset, and utilized the Silhouette Score method to find the best value of K
eda elbow-method iris-dataset kmeans-clustering matplotlib numpy pandas python seaborn silhouette-score
Last synced: 25 Dec 2024
https://github.com/virajbhutada/diamond-price-estimator
This project develops a predictive model to estimate diamond prices based on characteristics like carat, cut, color, and clarity. It covers data preprocessing, feature engineering, model selection, training, and evaluation. The final product is a web app where users can input diamond attributes to get accurate and instant price predictions.
cross-validation css data-analysis data-science-projects data-visualization eda feature-engineering html hyperparameter-tuning jupyter-notebooks machine-learning ml-algorithms model-deployment model-selection performance-optimization predictive-modeling python python-app user-interface
Last synced: 11 Nov 2024
https://github.com/ayushtiwari134/stock_price_predictor_dl
This is a full stack end to end project with the model trained in jupyter notebook, the backend file written in python, and for simplicity, the frontend created using streamlit.
deep-learning eda keras lstm-model machine-learning matplotlib numpy pandas python rnn streamlit tensorflow yfinance-library
Last synced: 24 Dec 2024
https://github.com/shubhamdeepkeshav/visualization-on-tips
📊 Data visualization project analyzing tipping behavior in restaurants using Python. 🍽️ Explores insights based on ⏰ time, 👥 party size, 🧑🤝🧑 gender, and 🚬 smoker status with Matplotlib and Seaborn.
data-visualization dataanalysis eda matplotlib python seaborn
Last synced: 21 Dec 2024
https://github.com/omarsaad21/rfm-clustering-
A full data science and deployment project focusing on data analysis and ML (creating a customer segmentation model to recommend the best merchants for each user as targeted offers)
business-solutions data-science eda numpy pandas plotly python sickit-learn streamlit
Last synced: 21 Dec 2024
https://github.com/omarsaad21/shopping-cart-eda
An EDA Python project focusing on getting the most out of a movies dataset: (1) combine the data together, (2) check missing values, (3) show summary statistics, (4) deal with date-time values and extract features from dates, (5) answer at least three questions from the data.
eda jupyter-notebook numpy pandas plotly python
Last synced: 21 Dec 2024
https://github.com/ibm-cloud-architecture/store-mq-gitops
Bootstrapped GitOps Repository
Last synced: 17 Nov 2024
https://github.com/sanju-srivatsa/vizcraft-data-science-app
Please find the Demo Link for the DS App
data-science data-visualization eda pandas plotly python streamlit
Last synced: 21 Dec 2024
https://github.com/kamaljangir1/s-and-p-500_stock-market
Analyzed the stock market, focusing on the S&P 500 index and individual stocks using Python, EDA, and Excel
Last synced: 04 Jan 2025
https://github.com/suzukisakae/gui_eda_chesswomen
Lê Thành Vinh (21110940): Python programming for exploratory data analysis (EDA) of "Women Chess Grandmasters (8/2020)"
chess custom-tkinter eda gui-application hcmute tkinter
Last synced: 21 Dec 2024
https://github.com/ibm-cloud-architecture/refarch-eda-item-inventory-sql-flink
A SQL Flink implementation to compute the item inventory and store inventory
Last synced: 17 Nov 2024
https://github.com/saadarazzaq/imtiaz-remastered
Data Science for Supermarket Customer Retention
data-acquisition data-normalization data-science data-transformation dbscan-clustering eda elbow-plot kmeans-clustering kmeans-plus-plus matplotlib numpy pandas sklearn
Last synced: 21 Dec 2024
https://github.com/computingvictor/thread_app_dataset
Analyses and models based on the "Thread app dataset: 37000 entities" Kaggle dataset
data-science dataset eda kaggle
Last synced: 25 Dec 2024
https://github.com/computingvictor/insurance-company-benchmark-practice
1st Practice for the subject of Machine Learning
cunef data-science eda insurance-company jupyter-notebook machine-learning python
Last synced: 25 Dec 2024
https://github.com/ibm-cloud-architecture/eda-kstreams-labs
Kafka Streams Examples / Labs to support EDA enablements
Last synced: 17 Nov 2024
https://github.com/udipta14/historical-olympic-games-eda-python
Exploratory Data Analysis of a Historical Olympic Games Dataset, including all the games from Athens 1896 to Rio 2016.
data-cleaning data-visualization eda matplotlib numpy pandas python3 seaborn
Last synced: 29 Dec 2024
https://github.com/robinmillford/air-flight
This repository is used to forecast aircraft delays and ticket prices.
big data data-science delay eda flight jupyter-notebook linear-regression logistic-regression machine-learning price-prediction pythin3 random-forest
Last synced: 17 Nov 2024
https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier
Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.
data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn
Last synced: 15 Nov 2024
https://github.com/aniruddha-10/data-201-group-project
Simple and basic Exploratory data analysis
Last synced: 15 Nov 2024
https://github.com/faizantkhan/automated-eda
This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.
automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz
Last synced: 15 Nov 2024
https://github.com/computingvictor/payments_fraud_practice
2nd Practice for the subject of Machine Learning
cunef data-science eda fraud-detection interpretability machine-learning models preprocessing python
Last synced: 25 Dec 2024
https://github.com/faizantkhan/machine-learning
Machine Learning Practice and Exercises Welcome to our repository dedicated to the practice and mastery of machine learning (ML) concepts and techniques. This repository serves as a comprehensive resource for learners and enthusiasts looking to enhance their ML skills through hands-on exercises and practical applications.
classification-algorithm clustering-algorithm data-science datavisualization decision-trees eda linear-regression logistic-regression machine-learning machine-learning-algorithms machine-learning-library math matplotlib-pyplot model-selection pandas python sklearn-library testing-data training-data
Last synced: 15 Nov 2024
https://github.com/shwetapardhi/assignment-04-simple-linear-regression-2
Q2) Salary_hike -> Build a prediction model for Salary_hike Build a simple linear regression model by performing EDA and do necessary transformations and select the best model using R or Python. EDA and Data Visualization. Correlation Analysis. Model Building. Model Testing. Model Predictions.
correlation-analysis data-visualization distplot eda feature-engineering model-building model-predictions model-template numpy ols-regression p-value pandas python r-square-values regression-plot seaborn simple-linear-regression smf statsmodels t-score
Last synced: 11 Nov 2024
https://github.com/katebea/fib-eda-estructura-de-datos-y-algoritmos
Repository for the Data Structures and Algorithms course in the Computer Engineering degree at FIB UPC
Last synced: 17 Nov 2024
https://github.com/djdhairya/rooftop-solar-detection
data-processing data-science deep-learning eda machine-learning pandas scikit-learn tif
Last synced: 10 Oct 2024
https://github.com/snowkylin/npn
A boolean matcher that computes the NPN canonical representative for a given boolean function.
boolean-matcher cpp eda logic-synthesis npn pypi-package python
Last synced: 15 Nov 2024
https://github.com/omarsaad21/credit-train-data-science-project
This is a full web application that predicts clients' credit scores, along with many visualizations that express insights in charts.
eda matplotlib ml numpy pandas python sklearn streamlit-webapp
Last synced: 15 Nov 2024
https://github.com/karlyndiary/coffee-shop-sales-analysis
Analyzing coffee shop sales using Pandas for data cleaning and exploratory data analysis (EDA), and Streamlit for data visualization dashboards.
data-analysis data-cleaning data-preprocessing data-visualization eda pandas streamlit streamlit-dashboard
Last synced: 21 Dec 2024
https://github.com/shivamsharma32/haberman-data-analysis
The dataset contains cases from a study that was conducted between 1958 and 1970 at the University of Chicago’s Billings Hospital on the survival of patients who had undergone surgery for breast cancer.
Last synced: 14 Dec 2024
https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Unicorns of India till Sep 2022.
analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 01 Jan 2025
https://github.com/rmrt1n/chess_analysis_project
Webscraping and analysing games of Hikaru Nakamura
chess data-analytics data-visualization eda rvest tidyverse web-scraping
Last synced: 14 Dec 2024
https://github.com/renatodts/payments-technical-interview
Payments API project, developed for a technical interview.
api api-gateway api-rest backend cqrs ddd domain-driven-design eda event-driven-architecture event-sourcing express framework payments rest-api technical-interview typescript
Last synced: 06 Dec 2024
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 01 Jan 2025
https://github.com/anuuragg/human-microbiome---eda
Fundamentals of Data Science - End Semester Project 1
data-science data-visualization eda fds microbiome
Last synced: 20 Nov 2024
https://github.com/seekai-786/stock-price-predictor
"Developed a machine learning model to predict stock prices using historical data and market indicators. Leveraged techniques like time series analysis and regression. Utilized Python libraries for data preprocessing, model training, and evaluation, achieving reliable accuracy for informed investment decisions."
dataanalysis eda jupyter-notebook machine-learning machine-learning-algorithms python stock-price-prediction streamlit
Last synced: 21 Dec 2024
https://github.com/noodleslove/house-of-representative-analysis-i
This project uses public data about the stock trades made by members of the US House of Representatives.
data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading
Last synced: 30 Nov 2024
https://github.com/ahmad-ali-rafique/adult-income-dataset
This repository contains a Jupyter Notebook exploring the adult income dataset. The notebook performs Exploratory Data Analysis (EDA), including visualizations with charts and graphs. Additionally, it implements various classification models to predict income and analyzes their accuracy.
accuracy classification dataanalytics datavisualization-project decision-tree-classifier eda evaluation evaluation-metrics exploratory-data-analysis logistic-regression machine-learning random-forest-classifier
Last synced: 15 Nov 2024
https://github.com/ribin-baby/the-sparks-foundation-data-science-internship
This repository contains tasks and solutions assigned as part of internship program. This repository contains workbooks on data analysis and model building parts.
Last synced: 15 Nov 2024
https://github.com/raulmaulidhino-dev/ml_modelling_regression
Many factors influence students' grades and scores. One of them is study hours. In this mini analysis project, 3 models learn and predict the relation between students' study hours and their scores in an exam or test. The project identifies the best ML model for the problem.
data data-analysis-python data-science eda machine-learning scikit-learn
Last synced: 21 Dec 2024
https://github.com/chandkund/sentiment-analysis-on-movie-reviews
Project on Sentiment Analysis of Movie Reviews using K-means clustering. This project clusters movie reviews based on their sentiment scores to understand audience reactions. Includes data preprocessing, feature selection, model development, and visualization.
data-science eda jupyter-notebook machine-learning matplotlib numpy pandas preprocessing python sklearn
Last synced: 21 Dec 2024
https://github.com/samridhisainii/airbnb-data-analysis
Data analysis of airbnb dataset
analysis data data-visualization eda models
Last synced: 29 Nov 2024
https://github.com/spacebakery/analyze-data-with-python-portfolio-project
Analyze Data with Python
barplot categories chi-square-test conservation contingency-table crosstab data-analysis data-cleaning-and-preprocessing eda endangered-species matplotlib national-parks pandas-dataframe species species-conservation
Last synced: 03 Jan 2025
https://github.com/jt2m0l3y/quantified-self
The final project for an introduction to data science, this project is a practice in supervised machine learning with kNN and Decision Tree Classifiers.
data-visualization decision-trees eda jupyter-notebooks knn-classification latex markdown numpy pandas python scikit-learn statistical-analysis supervised-learning
Last synced: 27 Dec 2024
https://github.com/prady2309/unemployement-analysis
Data Science Project
colab-notebook data-analysis data-science data-visualization eda jupyter-notebook machine-learning python3
Last synced: 22 Dec 2024
https://github.com/alchemine/computer-vision-anomaly-detection
Computer Vision anomaly detection algorithm competition
Last synced: 15 Nov 2024
https://github.com/alchemine/diabetes-prediction
Diabetes Prediction and Analysis (NHIS-2018)
eda jupyter python scikit-learn streamlit
Last synced: 15 Nov 2024
https://github.com/abhash-rai/regression-car-price-prediction
This repository contains my first complete data science project, from web scraping for data to data preprocessing, cleaning, exploratory data analysis, model training, and deployment.
data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression
Last synced: 15 Nov 2024
https://github.com/bastianlq/analisis-instacart
Analysis project on the delivery platform Instacart: exploratory analysis and data visualization.
analisis-de-datos eda matplotlib
Last synced: 06 Dec 2024
https://github.com/neerajcodes888/ipl-victory-analysis-with-prediction
This repository contains code for analyzing and predicting outcomes in the Indian Premier League (IPL) cricket matches from 2008 to 2022. It includes data analysis notebooks, a prediction model, and a Flask-based web application for interactive predictions. Explore historical match data, gain insights, and make predictions on upcoming matches.
css3 csv-datasets eda feature-extraction flask-application github html5 notebooks pandas-library preprocessing python3 render-deployment sckiit-learn
Last synced: 04 Dec 2024
https://github.com/neerajcodes888/data-science
This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science
data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit
Last synced: 04 Dec 2024
https://github.com/lu-sketch/chocolate-imports-dataset
Chocolate Imports for South Africa
Last synced: 29 Nov 2024
https://github.com/jimmymugendi/luxdev-week-2-boot-camp
correlation-analysis eda matplotlib numpy pandas seaborn-plots visualizations
Last synced: 16 Nov 2024
https://github.com/shreshth-112/zomato-data-analysis
Helping Zomato segregate restaurants according to the data collected from the masses.
data-science eda sentiment-analysis
Last synced: 29 Dec 2024
https://github.com/daviddavo/18eda
This is a mirror from https://gitlab.com/daviddavo/18eda
Last synced: 23 Nov 2024
https://github.com/somjit101/data_science-eda
A collection of useful implementations to perform EDA on a new dataset in order to understand preliminary patterns in the dataset and gain a high-level grasp of the dataset using plots and visualizations.
boxplots contour-plots distribution eda histogram iris-dataset plots qqplot seaborn-plots statistical-analysis violin-plots
Last synced: 16 Nov 2024
https://github.com/albertofaraujo/ml_ocupacao_casa
Build an Artificial Neural Network model capable of predicting the median house occupancy value using local data.
data-science eda machine-learning redes-neurais-artificiais rstudio
Last synced: 06 Dec 2024
https://github.com/albertofaraujo/ml_notas_de_alunos
Predict students' grades based on various metrics.
data-science eda machine-learning regressao-linear rstudio
Last synced: 06 Dec 2024
https://github.com/sajjad425/olympic-analysis
Perfect for analyzing Olympic history, athlete trends, and country-level participation.
eda jupyter-notebook olympics olympics-dataset python3
Last synced: 17 Nov 2024
https://github.com/sajjad425/bankchurn
EDA on bank customer data to predict churn using features.
banking banking-application eda powerbi-desktop powerbi-visuals visualization
Last synced: 17 Nov 2024
https://github.com/sajjad425/edaipl
The dataset covers the Indian Premier League (IPL) with details on matches (date, teams, venue, results), player stats (runs, wickets), team stats (wins, losses), season summaries, and umpire info. The EDA reveals patterns and insights, highlighting dominant teams, star players, and trends across seasons.
data-analysis eda exploratory-data-analysis ipl python
Last synced: 17 Nov 2024
https://github.com/sajjad425/retail-eda
Perform exploratory data analysis on the Sample Superstore dataset. As a business manager, try to find the weak areas where you can work to make more profit. What business problems can you derive by exploring the data?
Last synced: 17 Nov 2024
https://github.com/vikram-bhati/kmeans-for-university
Cluster universities into two groups, Private and Public
clustering-algorithm dataanalysis eda kmeans kmeans-clustering machine-learning ml python python3 sklearn
Last synced: 31 Dec 2024
https://github.com/sehgal-vishal/sql-nyc-collision-analysis
This analysis is based on the collisions (accidents) that happened in New York City. I used SQL Server for EDA (exploratory data analysis).
data-analysis database eda sql-server
Last synced: 17 Nov 2024
https://github.com/stataziz/global-layoffs-an-eda-with-sql-and-tableau
This project is about exploring global layoff trends from 2020 to 2023, analyzing the impact across companies, industries, and geographies to uncover key insights into job market disruptions.
eda exploratory-data-analysis mysql sql sql-server tableau
Last synced: 17 Nov 2024
https://github.com/ashish-kr-srivastava/social-media-database-analysis-sql-project
I recently worked on a project analyzing a social media platform's data on MS SQL Server. In this project I used advanced SQL functions and keywords such as Views, Indexes, CTEs, Window Functions, and many more.
eda joins mssqlserver schema views windowsfunction
Last synced: 17 Nov 2024
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem in that scenario's context: employee turnover. Essential techniques such as EDA and data modeling are used to analyze and predict employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 17 Nov 2024