Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with eda
A curated list of projects in awesome lists tagged with eda .
https://github.com/lorenzoranucci/eda-semi-async-http-handler
Handle HTTP requests asynchronously using Kafka events while maintaining a synchronous HTTP interface.
async docker dualwrite eda event-driven event-driven-architecture eventual-consistency go golang kafka
Last synced: 20 Dec 2024
https://github.com/csengupta1101/zomato-kolkata-eda
The Zomato dataset (Kolkata) contains 7388 rows and 7 columns. The repository is an EDA(exploratory data analysis) on the given Dataset.
eda jupyter-notebook kolkata python3 restaurant vscode zomato
Last synced: 29 Dec 2024
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Dec 2024
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Dec 2024
https://github.com/hit07/data_science
Data [ Exploration, Cleaning, Manipulation, Visualisation ]
data-analysis data-cleaning data-exploration data-manipulation data-visualization eda jupyter-notebook matplotlib numpy pandas-dataframe scipy
Last synced: 06 Dec 2024
https://github.com/cosmoduende/r-uber-trips-analyisis
Explore your activity on Uber with R: How to analyze and visualize your personal data history. Find out how you consume the Uber App using a copy of your data.
analisis-de-data data-analysis data-analytics data-science data-visualisation data-visualization data-viz eda flexdashboard ggmap ggplot2 mobility-as-a-service qmplot r-language r-programming ridesharing uber uber-data visualizacion-de-datos
Last synced: 27 Dec 2024
https://github.com/39services/ansible-http-server
This is a web server written in Ansible. Yes, WRITTEN IN Ansible. Not using an external web server.
ansible ansible-eda ansible-playbook eda event-driven-ansible http
Last synced: 14 Dec 2024
https://github.com/kishlayjeet/zomato-data-exploration
In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.
data-analysis eda matplotlib numpy pandas zomato-data-exploration
Last synced: 24 Dec 2024
https://github.com/alexandramartinez/asyncapis-accounts-email
All the resources you need to implement a functional simple architecture with an Accounts and an Email services using AsyncAPI, Anypoint Code Builder, and the available message brokers: Anypoint MQ, Kafka, Solace PubSub+, and Salesforce CDC/Platform Events
acb anypoint-code-builder anypointmq async async-api asyncapi asyncapi-specification eda kafka kafka-consumer kafka-producer kafka-topic mule mule4 mulesoft queue queues topic
Last synced: 29 Dec 2024
https://github.com/x86-39/ansible-http-server
This is a web server written in Ansible. Yes, WRITTEN IN Ansible. Not using an external web server.
ansible ansible-eda ansible-playbook eda event-driven-ansible http
Last synced: 19 Nov 2024
https://github.com/sksubhadeep/world-population-exploratory-data-analysis-using-python
World-Population-Exploratory-Data-Analysis-using-Python
Last synced: 30 Dec 2024
https://github.com/alexandramartinez/asyncapi-example
Example resources to get started with AsyncAPI specifications in MuleSoft.
asyncapi asyncapi-specification eda event-driven-architecture mulesoft specification
Last synced: 29 Dec 2024
https://github.com/gajendrasharma-github/exploratory-data-analysis
This Repository contains all EDA projects
data-visualization eda exploratory-data-analysis
Last synced: 16 Nov 2024
https://github.com/anoopgeorge418/student-performance-indicator
Developing a model that helps to predict students performance
datascience-machinelearning deployment eda end-to-end flask python
Last synced: 10 Nov 2024
https://github.com/drxwat/rlms-stats
RLMS data investigation and visualization
Last synced: 10 Nov 2024
https://github.com/bhavik-jikadara/house-price-prediction
House Price Prediction
data-science dataprocessing eda jupyter-notebook machine-learning matplotlib model numpy pandas python seaborn test-train-dataset
Last synced: 03 Jan 2025
https://github.com/1ayanabil1/healthcare-machine-learning
Explore our open-source repository focused on healthcare machine learning. We've developed predictive models for cardiovascular disease, diabetes, breast cancer, and more. Our projects employ diverse machine learning algorithms and data science techniques, enhancing early detection, diagnosis, and patient outcomes.
data-analysis data-science deep-learning disease disease-detection disease-modeling disease-prediction eda healthcare-application heathcare jupyter-notebook machine-learning machine-learning-algorithms machinelearning-python python
Last synced: 10 Nov 2024
https://github.com/serhatderya/medical_examination_research
This repository contains a research about medical examinations (such as body measurements, results from various blood tests, and lifestyle choices).
catplot data-analysis data-analytics data-cleaning data-preparation data-preprocessing data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations heatmap jupyter-notebook medical preprocessing python research seaborn
Last synced: 03 Jan 2025
https://github.com/yessasvini23/ibm_data_science_-capstone-project-wining-space-race.ipnyb
IBM Data Science Capstone Project from Coursera
apicollection dashboards data-science data-visualization datawrangling descion-tree eda knn-classification logestic-regression machine-learning-algorithms predictive-analytics presentation sql svm-model webscraping
Last synced: 04 Jan 2025
https://github.com/pedro-manoel/eda
🎲 Repositório com questões resolvidas do tst-eda da disciplina de estrutura de dados da UFCG
Last synced: 30 Nov 2024
https://github.com/easonlai/eda_for_prudential_life_insurance_sample_data
Notebook sample of Exploratory Data Analysis (EDA) for Prudential Life Insurance Sample Data
azure-databricks azuredatabricks data-analysis data-analysis-python data-analytics databricks databricks-notebooks eda exploratory-data-analysis insurance insurance-sample-data jupyter-notebook python python3
Last synced: 10 Nov 2024
https://github.com/arjunan-k/whatsapp_digger
WhatsApp Digger is a project to analyze your Chat history in a single click.
analysis eda streamlit whtasapp
Last synced: 11 Nov 2024
https://github.com/dharmendradiwaker/eda-insurance-policies-by-tableau
Exploratory-Data-Analysis-of-Insurance-Policies-using-Tableau Loaded, cleaned & analyzed Insurance data for 37.5K+ policy containing 16 columns Created visualizations such as Area chart, stacked bar charts, lines charts etc. for insights
data-visualization eda tableau
Last synced: 07 Dec 2024
https://github.com/dharmendradiwaker/exploratory-data-analysis-of-los-angeles-crime-data
This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000+ crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more.
eda exploratory-data-analysis matplotlib plotly seaborn
Last synced: 07 Dec 2024
https://github.com/shliakhovai/sales-analysis-project
This project involves analyzing sales data to gain insights into sales trends, performance metrics, and product categories. The analysis includes data cleaning, exploratory data analysis (EDA), sales trend analysis, profit dependency analysis, and ABC analysis.
abc-analysis data-analysis data-science data-visualization eda exploratory-data-analysis jupyter-notebook python
Last synced: 12 Nov 2024
https://github.com/aarryasutar/prodigy_ds_internship
Prodigy InfoTech Data Science Internship Tasks
bank-marketing-analysis barchart data-science eda exploratory-data-analysis heatmap histogram internships matplotlib pandas prodigy-infotech pyplot python scikit-learn seaborn sentiment-analysis
Last synced: 13 Nov 2024
https://github.com/rishitabansal9/loan-approval-predictor
The project aims to predict loan approvals based on various factors, leveraging machine learning models and data pipelines.
deployment eda end-to-end ipython-notebook loan-approval-prediction mlops pipelines python
Last synced: 13 Nov 2024
https://github.com/nialled69/sugarcane-production-project
Exploratory Data Analysis (Univariate and Bivariate analysis) on World-wide Sugarcane Production dataset.
csv data-visualization dataanalysis-projects eda jupyter-notebook matplotlib-pyplot pandas-dataframe python3 seaborn
Last synced: 13 Nov 2024
https://github.com/tynab/eda-basic
CyberSoft Data Analyst 08 - EDA Basic
anaconda cybersoft cybersoft-academy cybersoft-academy-da cybersoft-academy-da-08 cybersoft-academy-data-analyst cybersoft-academy-data-analyst-08 cybersoft-da cybersoft-da-08 cybersoft-data-analyst cybersoft-data-analyst-08 da data-analyst eda jupyter-notebook matplotlib pandas python seaborn yan
Last synced: 07 Dec 2024
https://github.com/jesly-joji/money-laundering-classification
Money Laundering Classification on IBM Transactions
Last synced: 13 Nov 2024
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 15 Nov 2024
https://github.com/ahmednasef3/udemy-courses-full-eda
Exploratory Data Analysis on the factors that can affect the promotions and earnings in Udemy Courses and the perfect way to make a good saled course in Udemy.
data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project
Last synced: 15 Nov 2024
https://github.com/ahmednasef3/store-sales-full-eda
Simple EDA for Store Sales.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas plotly seaborn store
Last synced: 15 Nov 2024
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 15 Nov 2024
https://github.com/jameswmiller/atp_project
Data Science Project analysing Association of Tennis Professional's (ATP) data.
atp data-science eda machine-learning sports sports-analytics sports-stats tennis tennis-analytics
Last synced: 16 Nov 2024
https://github.com/ibm-cloud-architecture/refarch-eda-item-inventory
This project illustrates combining Kafka streams with reactive programming, reactive messaging with Quarkus. The use case is around getting item sold events and build a real time inventory in kafka streams exposed via REST on top of sequential queries.
eda kafka kafka-streams stream-processing
Last synced: 17 Nov 2024
https://github.com/ibm-cloud-architecture/eda-kc-order-mq-ui
A simple User Interface to demonstrate the saga pattern with MQ
Last synced: 17 Nov 2024
https://github.com/ibm-cloud-architecture/eda-rt-inventory-gitops
eda kafka kafka-connect kafka-streams stream-processing
Last synced: 17 Nov 2024
https://github.com/faizantkhan/kaggle-eda-ml-feature-engineering
Explore the Kaggle Codes Repository for concise and powerful code snippets covering the essentials of data science and machine learning. From Kaggle competitions to real-world projects, discover insights into exploratory data analysis, machine learning models, feature engineering, and data science mathematics.
data-science eda jupyter-notebook kaggle kaggle-competition kaggle-dataset kaggle-scripts kaggle-solution machine-learning machine-learning-algorithms mathematics python python-library
Last synced: 15 Nov 2024
https://github.com/faizantkhan/regression-project-bangalore-property-price-prediction
🏠 Bangalore Property Price Prediction is a comprehensive project designed to accurately predict property prices in Bangalore. Leveraging advanced regression techniques and a dataset sourced from Kaggle, the model undergoes meticulous feature engineering, data cleaning, and parameter tuning to ensure high accuracy.
backend-api css data-cleaning data-science data-visualization eda flask html javascript machine-learning-algorithms numpy pandas project project-repository property python regression-models server
Last synced: 15 Nov 2024
https://github.com/adi-200/adult_census_income_prediction_ml_project
Decoding Dollars: Adult Census Income Prediction
data-science datavisualization eda machine-learning machine-learning-algorithms prediction
Last synced: 17 Nov 2024
https://github.com/sralter/happy_customers
Predicting whether a customer is happy based on the results from a survey.
eda ensemble-classifier hyperopt lazypredict ml scikit-learn
Last synced: 17 Nov 2024
https://github.com/kingabzpro/digital-learning-during-covid19-eda
In this project, we will be using data analysis tools to figure out trends in digital learning and how it is effective towards improvised communities. We will be comparing districts and states on factors like demography, internet access, learning product access, and finance.
covid-19 data-science dataanalysis eda education learnplatform usa
Last synced: 17 Nov 2024
https://github.com/sap-samples/event-driven-integrations-e-bite
The examples included here accompany the Developing Event-Driven Integrations with SAP BTP E-Bite published by SAP Press.
cap eda event event-driven integrations mesh node-js
Last synced: 15 Nov 2024
https://github.com/moindalvs/learn_eda_on_zomato_dataset
Zomato Dataset What is the top 10 most preferred Cuisines?
Last synced: 17 Nov 2024
https://github.com/mdanwarulkarim/supershop_sales_analysis_sql
This project examines customer demographics, sales trends, and high-value transactions, uncovering patterns, top customers, and category performance. The insights provide actionable perspectives to understand sales, customer behavior, and product performance for informed decisions.
Last synced: 30 Nov 2024
https://github.com/pavankethavath/car_dekho_car_price_prediction
A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.
dataanalysis datacleaning datapreprocessing eda encoding feature-extraction feature-selection featureimportance fine-tuning machine-learning minmaxscaling normalization pandas pickle prediction-model python random-forest randomsearch-cv regression streamlit
Last synced: 10 Dec 2024
https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard
A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot
analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics
Last synced: 18 Nov 2024
https://github.com/muhammadibrahim313/start-your-data-science-journey
In this Repo i will be Sharing all Resources that we will be Learning during December Data Science Workhops on iCode Guru
btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python
Last synced: 22 Dec 2024
https://github.com/raghulrajn/kaggle-notebooks
This repo contains notebooks of competitions in Kaggle
eda kaggle-competition pandas python
Last synced: 22 Dec 2024
https://github.com/perezrd5/data-science
Entry-level looks at Exploratory Data Analysis (in R & Python) and Regression Models (in R)
analysis data-science eda multinomial-regression poisson-regression python r regression
Last synced: 25 Nov 2024
https://github.com/jungi21cc/taxi
Kaggle : New York City Taxi Trip Duration
eda kaggle ols regression taxi
Last synced: 27 Nov 2024
https://github.com/vidhi1290/disaster-tweet-classification-using-lstm
Developed a deep learning model using LSTM networks to classify tweets as disaster-related or non-disaster-related, vital for emergency response. Explored advanced EDA techniques, visualized tweet data insights, and achieved a high F1 score of 0.74. Check out the code and results!
classification deep-learning disaster-tweet-prediction eda kaggle-competition lstm lstm-neural-networks nlp nlp-machine-learning visualizations
Last synced: 08 Dec 2024
https://github.com/pavankethavath/dominos---predictive-purchase-order-system
This advanced forecasting tool leverages Prophet, ARIMA, SARIMA, and LSTM models to predict daily sales for 32 pizzas and 64 ingredients. With Prophet achieving the lowest MAPE, it ensures accurate demand forecasts, optimized inventory, and efficient purchase planning, reducing waste, preventing stockouts, and enhancing supply chain efficiency.
arima deep-learning eda exploratory-data-analysis forecasting lstm machine-learning mape matplotlib minmaxscaling numpy pandas prediction python sarimax seaborn seasonality tensorflow time-series
Last synced: 28 Nov 2024
https://github.com/saravanansuriya/final-retail-sales-forecasting
In this project Utilizing advanced time series forecasting models, successfully predicted department-wide sales for each store for the upcoming year and Visualizing the data in streamlit GUI.
data-wrangling eda model-building pandas python-script streamlit-webapp
Last synced: 28 Nov 2024
https://github.com/abdelhakim-gh/pfa-process-mining-fraud-detection
New Frontiers in the Fight against Fraud : The Contribution of Process Mining
celonis data-processing data-transformation eda jupyter-notebook machine-learning python
Last synced: 03 Dec 2024
https://github.com/superchordate/data-viz-talk
Resources from the "Data Visualization in Business Communication" presentation at the 2023 Gamma Iota Sigma Regional Conference in Fort Worth, TX.
data-visualization eda exploratory-data-analysis public-data
Last synced: 04 Dec 2024
https://github.com/artuk009/case-studies
This is my library of case studies that includes personal projects and capstone projects from various certifications.
analytics data-science eda matplotlib pandas plotly python seaborn sql tableau
Last synced: 09 Dec 2024
https://github.com/chen0040/cs-estimation-of-distribution-algorithms
Estimation of Distribution Algorithms implemented in C#
eda estimation-of-distribution local-search numerical-optimization
Last synced: 16 Dec 2024
https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 23 Dec 2024
https://github.com/shridhar1504/loan-clustering-datascience-project
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning
Last synced: 23 Dec 2024
https://github.com/dmarks84/ind_project_california-housing-data--kaggle
Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.
cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml
Last synced: 23 Dec 2024
https://github.com/mxagar/airbnb_data_analysis
An analysis of the AirBnB dataset from Euskadi / the Basque Country.
airbnb data-analysis data-science eda feature-engineering modeling pandas regression
Last synced: 23 Dec 2024
https://github.com/ssiarhei115/taxi-trip-duration-prediction
Taxi trip duration prediction
big-data data-science data-visualization eda
Last synced: 23 Dec 2024
https://github.com/lgibson7/gender-wage-inequality-in-stem
Final Project for STAT 632 Linear and Logistic Regression, Cal State East Bay Spring 2022
Last synced: 18 Dec 2024
https://github.com/luisfelipepoma/datascience_project_with_r
EXPLORATORY ANALYSIS OF A DATA SET IN R
Last synced: 19 Dec 2024
https://github.com/luisfelipepoma/crypto_app
An application to predict which cryptocurrencies are likely to rise or fall.
api-source cryptocurrency dnn eda exchange flask-application lstm-neural-networks machine-learning
Last synced: 19 Dec 2024
https://github.com/ksharma67/ibm-stock-predication-wiith-eda
In this i tried to design a model that can predict the price of stock using different methods and algorithms.
bullish-signal death-cross decision-tree eda golden-cross gradient-boosting knn machine-learning-algorithms matplotlib numpy pandas prediction python random-forest scalar seaborn skit-learn svm xgboost
Last synced: 25 Dec 2024
https://github.com/shreeparab1890/flipkart-laptops-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Laptops listed on Flipkart.
data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 01 Jan 2025
https://github.com/arv-anshul/easy-analysis
A python package to perform Data Analysis easily. (Not Recommended)
arv-dumped data-analysis data-science easy-analysis eda pypi pypi-package python3
Last synced: 25 Dec 2024
https://github.com/rakibhhridoy/exploratorydataanalysis-python
Exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.
ab-testing chitest data-science eda exploratory-data-analysis ftest hypotheses hypothesis-testing inferential-statistics numpy pandas python statistical-analysis statistics statsmodels ttest
Last synced: 25 Dec 2024
https://github.com/dsrichard97/medicare
Focus: Quick overview of DUAL enrollments and SQL manipulations.
communication eda googlebigquery medicare python python-3 reporting snooping sql tableau tableau-dashboards team-integration
Last synced: 26 Dec 2024
https://github.com/sahaavi/uber-vs-lyft
Advance Predictive Modeling in R
data-analysis data-science eda machine-learning predictive-modeling r
Last synced: 26 Dec 2024
https://github.com/md-emon-hasan/ml-project-intrusion-detection-systems-anomaly-detection-with-ml-dl
🔒 Everages Machine Learning and Deep Learning models to identify malicious activities in network traffic, enhancing cybersecurity.
anomaly anomaly-detection anomaly-detection-algorithm anomalydetection artificial-intelligence cnn cyber-security data-science deep-learning eda ensemble flask hybrid-model lstm lstm-neural-networks machine-learning ml ml-engineering network-security neural-network
Last synced: 27 Dec 2024
https://github.com/shivam5992/bokeh-vis
Visualising the acquisitions made by Google using python - Bokeh
bokeh bokeh-server data-visualization eda exploratory-data-analysis python
Last synced: 24 Dec 2024
https://github.com/sathviknayak123/eda
Exploartory Data Analysis
computer-vision deep-learning eda machine-learning natural-language-processing
Last synced: 16 Nov 2024
https://github.com/sonwaneshivani/fire-weather-index-prediction
ML Regression application built using Flask
css datacleaning eda elasticnet-regression feature-engineering flask html lasso-regression linear-regression regression ridge-regression
Last synced: 14 Nov 2024
https://github.com/balajimohan18/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
classification-model data-analysis data-analytics data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning jupyter-notebook maachine-learning machine-learning-algorithms predictive-modeling python supervised-learning
Last synced: 14 Nov 2024
https://github.com/balajimohan18/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 14 Nov 2024
https://github.com/balajimohan18/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
acf adf data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis machine-learning pacf seasonality time-series trends
Last synced: 14 Nov 2024
https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 14 Nov 2024
https://github.com/balajimohan18/loan-clustering-datascience-project
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataeset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning
Last synced: 14 Nov 2024
https://github.com/venkyiyer/project-deployment
A project about creating a model in the research environment, and then transform the research code into production code, package the code and deploy to an API, and add continuous integration and continuous delivery.
cicd eda jupyter-notebooks ml python3
Last synced: 30 Nov 2024
https://github.com/zofiaqlt/smart_device_consumers
🎯 Customer behaviour analysis for a high-tech manufacturer willing to enhance their marketing strategy - use of R and JupyterLab (Business insights, Data collection, Cleaning, EDA, and Data Visualization)
Last synced: 13 Nov 2024
https://github.com/zofiaqlt/professional_inequalities_knime
🎯 Gender inequality at work - use of KNIME (Background research, GDPR, Data governance, ETL, EDA, Data cleaning and validation, Statistical tests with R, and Data Visualization)
datagovernance datavalidation datavisualization eda etl-pipeline knime r rgpd statistical-tests
Last synced: 13 Nov 2024
https://github.com/zofiaqlt/credit_risk_pyspark
🎯 Credit risk detection - use of PySpark, Python and JupyterLab (Data collection, Cleaning, EDA, Regression, Classification, Statistical tests, and Data Visualization)
classification eda machinelearning pyspark regression
Last synced: 13 Nov 2024
https://github.com/zofiaqlt/hunger_study
🎯 Global study to tackle hunger worldwide and support FAO's mission - use of Python and JupyterLab (Background research, Data collection, Cleaning, EDA, and Data Visualization)
Last synced: 13 Nov 2024
https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing
Examined factors influencing demand for micro-mobility shared electric cycles Performed exploratory analysis and hypothesis testing, revealing the distinct influence of weather-season association on hourly counts
colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn
Last synced: 14 Nov 2024
https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau
I sourced some Orders data through Kaggle; did EDA using Python and then fetched some insights out of cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!
analytics data eda jupyter-notebook python sql tableau
Last synced: 22 Dec 2024
https://github.com/coder5omkar/linear-regression-bike-sharing-assignment
This project aims to model the demand for shared bikes using various independent variables. The goal is to provide insights to the management team, helping them understand how demand fluctuates with different features
data-science eda matplotlib ml mlr pandas python3 seaborn
Last synced: 03 Jan 2025
https://github.com/raghavendranhp/attrition-alchemy
This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.
data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm
Last synced: 10 Nov 2024
https://github.com/k-forghani/rayan-ai-imldl
Introduction to Machine Learning and Deep Learning | Rayan AI Contest
clustering cnn contest course deep-learning diffusion eda homework knn linear-regression logistic-regression machine-learning neural-network pytorch rayan segmentation sklearn svm vae
Last synced: 10 Oct 2024
https://github.com/raghavendranhp/youtube_data_harvesting
The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions
apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api
Last synced: 10 Nov 2024
https://github.com/raghavendranhp/waiter_tip_prediction
The "Waiter Tips Prediction Model" is a machine learning tool that forecasts waiter tips based on factors like the total bill, customer demographics, and dining specifics. It assists waitstaff and restaurants in understanding and estimating tipping patterns
eda numpy pandas plotly-express plotly-graph-objects python sklearn
Last synced: 10 Nov 2024
https://github.com/vinicius999/eda-imdb-top1000-films
Análise exploratória dos Top 1000 filmes no IMDB até 2020
Last synced: 13 Nov 2024
https://github.com/raghavendranhp/airbnb-data-analysis
The Airbnb Data Analysis project focuses on analyzing Airbnb data using MongoDB Atlas, Python scripting, data preprocessing, visualization, and interactive geospatial insights. We delve into the world of property management and tourism to uncover trends, pricing variations, and location-based analysis.
eda jupyter-notebook mongodb numpy pandas powerbi preprocessing
Last synced: 10 Nov 2024
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 13 Nov 2024