Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with eda

A curated list of projects in awesome lists tagged with eda.

https://github.com/lorenzoranucci/eda-semi-async-http-handler

Handle HTTP requests asynchronously using Kafka events while maintaining a synchronous HTTP interface.

async docker dualwrite eda event-driven event-driven-architecture eventual-consistency go golang kafka

Last synced: 20 Dec 2024

https://github.com/csengupta1101/zomato-kolkata-eda

The Zomato dataset (Kolkata) contains 7388 rows and 7 columns. The repository is an EDA (exploratory data analysis) of the given dataset.

eda jupyter-notebook kolkata python3 restaurant vscode zomato

Last synced: 29 Dec 2024

https://github.com/bookshiyi/kicad_plugins

A collection of KiCAD plugins

eda embedded kicad plugins scripts

Last synced: 04 Dec 2024

https://github.com/cosmoduende/r-uber-trips-analyisis

Explore your activity on Uber with R: How to analyze and visualize your personal data history. Find out how you consume the Uber App using a copy of your data.

analisis-de-data data-analysis data-analytics data-science data-visualisation data-visualization data-viz eda flexdashboard ggmap ggplot2 mobility-as-a-service qmplot r-language r-programming ridesharing uber uber-data visualizacion-de-datos

Last synced: 27 Dec 2024

https://github.com/39services/ansible-http-server

This is a web server written in Ansible. Yes, WRITTEN IN Ansible. Not using an external web server.

ansible ansible-eda ansible-playbook eda event-driven-ansible http

Last synced: 14 Dec 2024

https://github.com/kishlayjeet/zomato-data-exploration

In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.

data-analysis eda matplotlib numpy pandas zomato-data-exploration

Last synced: 24 Dec 2024

https://github.com/alexandramartinez/asyncapis-accounts-email

All the resources you need to implement a simple, functional architecture with an Accounts service and an Email service using AsyncAPI, Anypoint Code Builder, and the available message brokers: Anypoint MQ, Kafka, Solace PubSub+, and Salesforce CDC/Platform Events.

acb anypoint-code-builder anypointmq async async-api asyncapi asyncapi-specification eda kafka kafka-consumer kafka-producer kafka-topic mule mule4 mulesoft queue queues topic

Last synced: 29 Dec 2024

https://github.com/x86-39/ansible-http-server

This is a web server written in Ansible. Yes, WRITTEN IN Ansible. Not using an external web server.

ansible ansible-eda ansible-playbook eda event-driven-ansible http

Last synced: 19 Nov 2024

https://github.com/sksubhadeep/world-population-exploratory-data-analysis-using-python

World-Population-Exploratory-Data-Analysis-using-Python

eda python

Last synced: 30 Dec 2024

https://github.com/alexandramartinez/asyncapi-example

Example resources to get started with AsyncAPI specifications in MuleSoft.

asyncapi asyncapi-specification eda event-driven-architecture mulesoft specification

Last synced: 29 Dec 2024

https://github.com/anoopgeorge418/student-performance-indicator

Developing a model that helps to predict students' performance

datascience-machinelearning deployment eda end-to-end flask python

Last synced: 10 Nov 2024

https://github.com/drxwat/rlms-stats

RLMS data investigation and visualization

economics eda rlms statistics

Last synced: 10 Nov 2024

https://github.com/1ayanabil1/healthcare-machine-learning

Explore our open-source repository focused on healthcare machine learning. We've developed predictive models for cardiovascular disease, diabetes, breast cancer, and more. Our projects employ diverse machine learning algorithms and data science techniques, enhancing early detection, diagnosis, and patient outcomes.

data-analysis data-science deep-learning disease disease-detection disease-modeling disease-prediction eda healthcare-application heathcare jupyter-notebook machine-learning machine-learning-algorithms machinelearning-python python

Last synced: 10 Nov 2024

https://github.com/pedro-manoel/eda

🎲 Repository with solved problems from tst-eda for the Data Structures course at UFCG

eda java

Last synced: 30 Nov 2024

https://github.com/arjunan-k/whatsapp_digger

WhatsApp Digger is a project to analyze your Chat history in a single click.

analysis eda streamlit whtasapp

Last synced: 11 Nov 2024

https://github.com/dharmendradiwaker/eda-insurance-policies-by-tableau

Exploratory data analysis of insurance policies using Tableau. Loaded, cleaned, and analyzed insurance data for 37.5K+ policies with 16 columns. Created visualizations such as area charts, stacked bar charts, and line charts for insights.

data-visualization eda tableau

Last synced: 07 Dec 2024

https://github.com/dharmendradiwaker/exploratory-data-analysis-of-los-angeles-crime-data

This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000 crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more.

eda exploratory-data-analysis matplotlib plotly seaborn

Last synced: 07 Dec 2024

https://github.com/shliakhovai/sales-analysis-project

This project involves analyzing sales data to gain insights into sales trends, performance metrics, and product categories. The analysis includes data cleaning, exploratory data analysis (EDA), sales trend analysis, profit dependency analysis, and ABC analysis.

abc-analysis data-analysis data-science data-visualization eda exploratory-data-analysis jupyter-notebook python

Last synced: 12 Nov 2024

https://github.com/rishitabansal9/loan-approval-predictor

The project aims to predict loan approvals based on various factors, leveraging machine learning models and data pipelines.

deployment eda end-to-end ipython-notebook loan-approval-prediction mlops pipelines python

Last synced: 13 Nov 2024

https://github.com/nialled69/sugarcane-production-project

Exploratory Data Analysis (Univariate and Bivariate analysis) on World-wide Sugarcane Production dataset.

csv data-visualization dataanalysis-projects eda jupyter-notebook matplotlib-pyplot pandas-dataframe python3 seaborn

Last synced: 13 Nov 2024

https://github.com/jesly-joji/money-laundering-classification

Money Laundering Classification on IBM Transactions

binary-classificaiton eda

Last synced: 13 Nov 2024

https://github.com/ahmednasef3/udemy-courses-full-eda

Exploratory data analysis of the factors that can affect promotions and earnings in Udemy courses, and of what makes a course sell well on Udemy.

data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project

Last synced: 15 Nov 2024

https://github.com/jameswmiller/atp_project

Data science project analysing Association of Tennis Professionals (ATP) data.

atp data-science eda machine-learning sports sports-analytics sports-stats tennis tennis-analytics

Last synced: 16 Nov 2024

https://github.com/ibm-cloud-architecture/refarch-eda-item-inventory

This project illustrates combining Kafka Streams with reactive programming and reactive messaging with Quarkus. The use case is around receiving item-sold events and building a real-time inventory in Kafka Streams, exposed via REST on top of sequential queries.

eda kafka kafka-streams stream-processing

Last synced: 17 Nov 2024

https://github.com/ibm-cloud-architecture/eda-kc-order-mq-ui

A simple User Interface to demonstrate the saga pattern with MQ

eda saga

Last synced: 17 Nov 2024

https://github.com/faizantkhan/kaggle-eda-ml-feature-engineering

Explore the Kaggle Codes Repository for concise and powerful code snippets covering the essentials of data science and machine learning. From Kaggle competitions to real-world projects, discover insights into exploratory data analysis, machine learning models, feature engineering, and data science mathematics.

data-science eda jupyter-notebook kaggle kaggle-competition kaggle-dataset kaggle-scripts kaggle-solution machine-learning machine-learning-algorithms mathematics python python-library

Last synced: 15 Nov 2024

https://github.com/faizantkhan/regression-project-bangalore-property-price-prediction

🏠 Bangalore Property Price Prediction is a comprehensive project designed to accurately predict property prices in Bangalore. Leveraging advanced regression techniques and a dataset sourced from Kaggle, the model undergoes meticulous feature engineering, data cleaning, and parameter tuning to ensure high accuracy.

backend-api css data-cleaning data-science data-visualization eda flask html javascript machine-learning-algorithms numpy pandas project project-repository property python regression-models server

Last synced: 15 Nov 2024

https://github.com/sralter/happy_customers

Predicting whether a customer is happy based on the results from a survey.

eda ensemble-classifier hyperopt lazypredict ml scikit-learn

Last synced: 17 Nov 2024

https://github.com/kingabzpro/digital-learning-during-covid19-eda

In this project, we will be using data analysis tools to identify trends in digital learning and how effective it is for impoverished communities. We will be comparing districts and states on factors like demography, internet access, learning product access, and finance.

covid-19 data-science dataanalysis eda education learnplatform usa

Last synced: 17 Nov 2024

https://github.com/sap-samples/event-driven-integrations-e-bite

The examples included here accompany the Developing Event-Driven Integrations with SAP BTP E-Bite published by SAP Press.

cap eda event event-driven integrations mesh node-js

Last synced: 15 Nov 2024

https://github.com/moindalvs/learn_eda_on_zomato_dataset

Zomato dataset: what are the top 10 most preferred cuisines?

eda exploratory-data-analysis

Last synced: 17 Nov 2024

https://github.com/mdanwarulkarim/supershop_sales_analysis_sql

This project examines customer demographics, sales trends, and high-value transactions, uncovering patterns, top customers, and category performance. The insights provide actionable perspectives to understand sales, customer behavior, and product performance for informed decisions.

eda etl sql

Last synced: 30 Nov 2024

https://github.com/pavankethavath/car_dekho_car_price_prediction

A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.

dataanalysis datacleaning datapreprocessing eda encoding feature-extraction feature-selection featureimportance fine-tuning machine-learning minmaxscaling normalization pandas pickle prediction-model python random-forest randomsearch-cv regression streamlit

Last synced: 10 Dec 2024

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 18 Nov 2024

https://github.com/muhammadibrahim313/start-your-data-science-journey

In this repo I will be sharing all the resources that we will be learning from during the December Data Science Workshops on iCode Guru.

btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python

Last synced: 22 Dec 2024

https://github.com/raghulrajn/kaggle-notebooks

This repo contains notebooks of competitions in Kaggle

eda kaggle-competition pandas python

Last synced: 22 Dec 2024

https://github.com/perezrd5/data-science

Entry-level looks at Exploratory Data Analysis (in R & Python) and Regression Models (in R)

analysis data-science eda multinomial-regression poisson-regression python r regression

Last synced: 25 Nov 2024

https://github.com/jungi21cc/taxi

Kaggle : New York City Taxi Trip Duration

eda kaggle ols regression taxi

Last synced: 27 Nov 2024

https://github.com/vidhi1290/disaster-tweet-classification-using-lstm

Developed a deep learning model using LSTM networks to classify tweets as disaster-related or non-disaster-related, vital for emergency response. Explored advanced EDA techniques, visualized tweet data insights, and achieved a high F1 score of 0.74. Check out the code and results!

classification deep-learning disaster-tweet-prediction eda kaggle-competition lstm lstm-neural-networks nlp nlp-machine-learning visualizations

Last synced: 08 Dec 2024

https://github.com/pavankethavath/dominos---predictive-purchase-order-system

This advanced forecasting tool leverages Prophet, ARIMA, SARIMA, and LSTM models to predict daily sales for 32 pizzas and 64 ingredients. With Prophet achieving the lowest MAPE, it ensures accurate demand forecasts, optimized inventory, and efficient purchase planning, reducing waste, preventing stockouts, and enhancing supply chain efficiency.

arima deep-learning eda exploratory-data-analysis forecasting lstm machine-learning mape matplotlib minmaxscaling numpy pandas prediction python sarimax seaborn seasonality tensorflow time-series

Last synced: 28 Nov 2024

https://github.com/saravanansuriya/final-retail-sales-forecasting

This project uses advanced time series forecasting models to predict department-wide sales for each store for the upcoming year and visualizes the data in a Streamlit GUI.

data-wrangling eda model-building pandas python-script streamlit-webapp

Last synced: 28 Nov 2024

https://github.com/abdelhakim-gh/pfa-process-mining-fraud-detection

New Frontiers in the Fight against Fraud : The Contribution of Process Mining

celonis data-processing data-transformation eda jupyter-notebook machine-learning python

Last synced: 03 Dec 2024

https://github.com/superchordate/data-viz-talk

Resources from the "Data Visualization in Business Communication" presentation at the 2023 Gamma Iota Sigma Regional Conference in Fort Worth, TX.

data-visualization eda exploratory-data-analysis public-data

Last synced: 04 Dec 2024

https://github.com/artuk009/case-studies

This is my library of case studies that includes personal projects and capstone projects from various certifications.

analytics data-science eda matplotlib pandas plotly python seaborn sql tableau

Last synced: 09 Dec 2024

https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA, to model the data and forecast the exchange rate.

data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 23 Dec 2024

https://github.com/shridhar1504/loan-clustering-datascience-project

This project uses machine learning to cluster loans together based on their similarities. The project uses a dataset of loan applications which includes information about the loan amount and balance. The project then uses the clustering algorithm to group the loans together based on their similarities.

clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning

Last synced: 23 Dec 2024

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent project (Kaggle dataset): I worked on the California Housing dataset, performing data cleaning and preparation, exploratory data analysis, feature engineering, regression model building, and model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 23 Dec 2024

https://github.com/mxagar/airbnb_data_analysis

An analysis of the AirBnB dataset from Euskadi / the Basque Country.

airbnb data-analysis data-science eda feature-engineering modeling pandas regression

Last synced: 23 Dec 2024

https://github.com/lgibson7/gender-wage-inequality-in-stem

Final Project for STAT 632 Linear and Logistic Regression, Cal State East Bay Spring 2022

eda regression statistics

Last synced: 18 Dec 2024

https://github.com/luisfelipepoma/datascience_project_with_r

EXPLORATORY ANALYSIS OF A DATA SET IN R

data-analytics eda r r-studio

Last synced: 19 Dec 2024

https://github.com/luisfelipepoma/crypto_app

An application to predict which cryptocurrencies are likely to rise or fall.

api-source cryptocurrency dnn eda exchange flask-application lstm-neural-networks machine-learning

Last synced: 19 Dec 2024

https://github.com/shreeparab1890/flipkart-laptops-analysis-eda

This IPython notebook is an exploratory data analysis (EDA) of the laptops listed on Flipkart.

data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 01 Jan 2025

https://github.com/arv-anshul/easy-analysis

A python package to perform Data Analysis easily. (Not Recommended)

arv-dumped data-analysis data-science easy-analysis eda pypi pypi-package python3

Last synced: 25 Dec 2024

https://github.com/rakibhhridoy/exploratorydataanalysis-python

Exploratory data analysis is an approach to analyzing data sets to summarize their main characteristics, often with visual methods. A statistical model can be used or not, but primarily EDA is for seeing what the data can tell us beyond the formal modeling or hypothesis testing task.

ab-testing chitest data-science eda exploratory-data-analysis ftest hypotheses hypothesis-testing inferential-statistics numpy pandas python statistical-analysis statistics statsmodels ttest

Last synced: 25 Dec 2024

https://github.com/dsrichard97/medicare

Focus: Quick overview of DUAL enrollments and SQL manipulations.

communication eda googlebigquery medicare python python-3 reporting snooping sql tableau tableau-dashboards team-integration

Last synced: 26 Dec 2024

https://github.com/shivam5992/bokeh-vis

Visualising the acquisitions made by Google using python - Bokeh

bokeh bokeh-server data-visualization eda exploratory-data-analysis python

Last synced: 24 Dec 2024

https://github.com/balajimohan18/rafik-s-kitchen-data-analysis

This project analyzes the sales and expenses data of a famous fast-food restaurant. It mainly focuses on gaining insights that will boost future sales and on business strategies to improve profit margins. Tools used: SQL, Python, Power BI, and MS Office tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 14 Nov 2024

https://github.com/balajimohan18/milk-production-time-series-forecasting-datascience-project

This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast milk production. The model is evaluated using various metrics.

acf adf data-analysis data-cleaning data-science data-visualization eda exploratory-data-analysis machine-learning pacf seasonality time-series trends

Last synced: 14 Nov 2024

https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA, to model the data and forecast the exchange rate.

data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 14 Nov 2024

https://github.com/balajimohan18/loan-clustering-datascience-project

This project uses machine learning to cluster loans together based on their similarities. The project uses a dataset of loan applications which includes information about the loan amount and balance. The project then uses the clustering algorithm to group the loans together based on their similarities.

clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning

Last synced: 14 Nov 2024

https://github.com/venkyiyer/project-deployment

A project about creating a model in the research environment, then transforming the research code into production code, packaging the code and deploying it to an API, and adding continuous integration and continuous delivery.

cicd eda jupyter-notebooks ml python3

Last synced: 30 Nov 2024

https://github.com/zofiaqlt/smart_device_consumers

🎯 Customer behaviour analysis for a high-tech manufacturer willing to enhance their marketing strategy - use of R and JupyterLab (Business insights, Data collection, Cleaning, EDA, and Data Visualization)

eda r

Last synced: 13 Nov 2024

https://github.com/zofiaqlt/professional_inequalities_knime

🎯 Gender inequality at work - use of KNIME (Background research, GDPR, Data governance, ETL, EDA, Data cleaning and validation, Statistical tests with R, and Data Visualization)

datagovernance datavalidation datavisualization eda etl-pipeline knime r rgpd statistical-tests

Last synced: 13 Nov 2024

https://github.com/zofiaqlt/credit_risk_pyspark

🎯 Credit risk detection - use of PySpark, Python and JupyterLab (Data collection, Cleaning, EDA, Regression, Classification, Statistical tests, and Data Visualization)

classification eda machinelearning pyspark regression

Last synced: 13 Nov 2024

https://github.com/zofiaqlt/hunger_study

🎯 Global study to tackle hunger worldwide and support FAO's mission - use of Python and JupyterLab (Background research, Data collection, Cleaning, EDA, and Data Visualization)

eda python

Last synced: 13 Nov 2024

https://github.com/yashika-malhotra/micromobility-service-provider---hypothesis-testing

Examined factors influencing demand for micro-mobility shared electric cycles. Performed exploratory analysis and hypothesis testing, revealing the distinct influence of weather-season association on hourly counts.

colab-notebook data-visualization eda exploratory-data-analysis hypothesis-testing jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python scipy-library scipy-stats seaborn skit-learn

Last synced: 14 Nov 2024

https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau

I sourced some Orders data through Kaggle; did EDA using Python and then fetched some insights out of cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!

analytics data eda jupyter-notebook python sql tableau

Last synced: 22 Dec 2024

https://github.com/coder5omkar/linear-regression-bike-sharing-assignment

This project aims to model the demand for shared bikes using various independent variables. The goal is to provide insights to the management team, helping them understand how demand fluctuates with different features

data-science eda matplotlib ml mlr pandas python3 seaborn

Last synced: 03 Jan 2025

https://github.com/raghavendranhp/attrition-alchemy

This project uses machine learning to predict and analyze employee attrition in a company. By developing three predictive models, it identifies key factors influencing turnover, providing actionable insights to mitigate attrition challenges. The analysis focuses on enhancing job satisfaction, work-life balance, and career growth opportunities.

data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm

Last synced: 10 Nov 2024

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 10 Nov 2024

https://github.com/raghavendranhp/waiter_tip_prediction

The "Waiter Tips Prediction Model" is a machine learning tool that forecasts waiter tips based on factors like the total bill, customer demographics, and dining specifics. It assists waitstaff and restaurants in understanding and estimating tipping patterns

eda numpy pandas plotly-express plotly-graph-objects python sklearn

Last synced: 10 Nov 2024

https://github.com/vinicius999/eda-imdb-top1000-films

Exploratory analysis of the top 1000 films on IMDB up to 2020

eda numpy pandas python

Last synced: 13 Nov 2024

https://github.com/raghavendranhp/airbnb-data-analysis

The Airbnb Data Analysis project focuses on analyzing Airbnb data using MongoDB Atlas, Python scripting, data preprocessing, visualization, and interactive geospatial insights. We delve into the world of property management and tourism to uncover trends, pricing variations, and location-based analysis.

eda jupyter-notebook mongodb numpy pandas powerbi preprocessing

Last synced: 10 Nov 2024

https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql

This project involves cleaning a dataset containing information about layoffs from companies around the world.

data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql

Last synced: 13 Nov 2024