Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with eda

A curated list of projects in awesome lists tagged with eda.

https://github.com/spidy20/kaggle_kernels

Contains data science, machine learning, and data visualization code and datasets.

clustering data-science data-visualization eda kaggle-competition kaggle-dataset kaggle-scripts kmeans-clustering

Last synced: 15 Nov 2024

https://github.com/labrijisaad/exploratory-data-analysis-in-python

In this project, a hands-on Jupyter notebook shows how to effectively diagnose and deal with missing data in Python.

airbnb-dataset cleaning-data-in-python eda exploratory-data-analysis jupyter-notebook python

Last synced: 06 Nov 2024

https://github.com/sandravizz/natural-disasters

Web-based data visualisation project about natural disasters using D3.js

arquero climate-change d3js data-visualization eda natural-disasters observable plotjs sankey

Last synced: 07 Nov 2024

https://github.com/the-openroad-project/orassistant

OpenROAD's Chatbot Assistant

eda llm openroad python3

Last synced: 09 Nov 2024

https://github.com/lethalbit/yosys-vscode

Syntax Highlighting for Yosys Scripts and RTLIL

eda fpga rtlil syntax-highlighting vscode-extension yosys

Last synced: 06 Nov 2024

https://github.com/adirthaborgohain/bert-text-analysis

Text Analysis done on a business text dataset using KeyBERT and BERTopic

bert eda keybert lda nlp transformers

Last synced: 23 Oct 2024

https://github.com/01xz/w4e

WSL for commercial EDA tools. The author now recommends using https://github.com/01xz/c4e instead.

eda wsl2

Last synced: 10 Nov 2024

https://github.com/abhinav-ark/mal_lyrics_analysis

Preprocessing and EDA on a Dataset of Malayalam Songs and Lyrics

data-science eda jupyter-notebook python

Last synced: 15 Dec 2024

https://github.com/leonism/sample-superstore

A Python analysis of the legendary Sample Superstore dataset using pandas

data-analysis datamining datascience dataset eda jupyter-notebook machine-learning python

Last synced: 08 Dec 2024

https://github.com/dataspieler12345/ds-with-python

"This repository showcases various data analysis projects implemented in Python, aimed at providing hands-on experience in the field."

datascience eda jupyter-notebook numpy pandas python

Last synced: 22 Nov 2024

https://github.com/AlbertSuarez/Jutge-EDA

🎓 All Jutge problems of EDA (FIB).

backtracking barcelona bfs cpp eda fib jutge jutge-eda upc

Last synced: 26 Oct 2024

https://github.com/kubealex/kubealex.eda

Repository for the kubealex.eda galaxy collection.

ansible ansible-galaxy ansible-galaxy-collections automation eda event-driven-automation

Last synced: 27 Oct 2024

https://github.com/adityashrm21/adult-income-prediction

An end-to-end data analysis pipeline including model deployment

data-science eda flask heroku logistic-regression r scikit-learn tidyverse

Last synced: 11 Nov 2024

https://github.com/event-catalog/create-eventcatalog

CLI tool that is used to create new catalogs

cli ddd eda eventcatalog

Last synced: 17 Nov 2024

https://github.com/dragonman225/opamp-generator

Generate folded-cascode opamp parameters with interactive CLI.

analog-ic-design eda ic-design integrated-circuits

Last synced: 12 Oct 2024

https://github.com/praveendecode/youtube-data-harvesting-warehousing

Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies

api apiintegration dataanalysis dataharvesting datawarehousing eda mongodb postgres python sql

Last synced: 17 Dec 2024

https://github.com/carlotacb/starwar

Beta version of a game implemented during the Data Structures and Algorithms course

algorithm battle beta bfs dfs eda game ship starwars university upc

Last synced: 31 Dec 2024

https://github.com/aianytime/machine-learning-models-implementation

Implementation of several ML models on real-world datasets with detailed explanation in notebooks.

eda machine-learning machine-learning-algorithms ml numpy pandas pycaret python scikit-learn scikitlearn-machine-learning sklearn

Last synced: 07 Nov 2024

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 11 Nov 2024

https://github.com/superchordate/storyteller

AutoML R framework functions for quickly finding stories from data.

automl eda exploratory-data-analysis r

Last synced: 04 Dec 2024

https://github.com/labrijisaad/data-warehousing-in-azure-postgresql

In this notebook, I handle missing values in the Prosper dataset, then upload it to an Azure PostgreSQL database (pre-configured via the Azure portal) from a Jupyter notebook.

azure eda postgresql postgresql-database

Last synced: 06 Nov 2024

https://github.com/olivroy/reuseme

Collections of Utility Functions to Work Across Projects

dplyr eda r rstudio-project

Last synced: 09 Nov 2024

https://github.com/infoslack/eda-wine-review

Exploratory data analysis for wine reviews

eda exploratory-data-analysis matplotlib numpy pandas python seaborn

Last synced: 22 Nov 2024

https://github.com/md-emon-hasan/ml-project-email-sms-spam-classifier-with-mlops

📧 ML project focused on email spam classification, demonstrating data preprocessing, model training, and evaluation using Python and scikit-learn.

data-science eda email-classification machine-learning-projects nlp spam-classification text-classification

Last synced: 13 Nov 2024

https://github.com/yjg30737/pyqt-dataset-eda-helper

A PyQt GUI for performing Exploratory Data Analysis (EDA) on CSV datasets used for binary or softmax text classification

eda matplotlib pandas pyqt pyqt-matplotlib pyqt-seaborn pyqt5 pyqt5-desktop-application python seaborn

Last synced: 03 Jan 2025

https://github.com/devanshi-bavaria/predictive-modeling-for-stock-market-trends

📈 Comprehensive stock price analysis, including preprocessing, clustering, correlation, and predictive modeling, to enhance investment insights and accuracy. 💡

clustering-analysis correlation-analysis eda ml permutation-test

Last synced: 29 Nov 2024

https://github.com/ibm-cloud-architecture/refarch-kc-ui

Container shipment user interface for demonstration purposes of the Event-Driven Architecture reference implementation.

eda event-driven event-driven-architecture microservices nodejs

Last synced: 17 Nov 2024

https://github.com/amrrs/introduction-to-eda-with-python

Introduction to EDA with Python Session Files

data-analysis eda python

Last synced: 15 Nov 2024

https://github.com/harmanveer2546/bird-species-prediction-using-deep-learning

Uses convolutional neural networks to build and train a bird species classifier on bird images with corresponding species labels, and also builds a GUI for it.

callback deep-learning eda gui image-classification imagegenerator keras maxpooling mobilenetv2 opencv pillow plotly python tensorflow

Last synced: 13 Nov 2024

https://github.com/abonaplata/house-prices-eda-python

House Prices: Exploratory data analysis with Python

eda ipynb-jupyter-notebook python

Last synced: 13 Nov 2024

https://github.com/nasdin/drone-strike-visualization

Connects to the Drone Strike API and analyzes the frequency, trends, and patterns of drone strikes

analysis api basemap data-analysis data-science data-visualization drone drone-strikes eda geopy jupyter-notebook visualization

Last synced: 28 Dec 2024

https://github.com/shibam120302/black-friday-sales-data-analysis

This repository contains data analysis of Black Friday sales data using various regression ML algorithms

data-analysis eda machine-learning python random-forest regression

Last synced: 20 Nov 2024

https://github.com/udipta14/social-media-database-project

I recently worked on a project analyzing a social media platform's data on MS SQL Server. In this project I used advanced SQL features such as views, indexes, CTEs, window functions, and more.

cte eda joins mssqlserver schema views windowsfunction

Last synced: 17 Nov 2024

https://github.com/gbowne1/kicadparser

A parser for KiCAD project files; it will eventually include a generator.

csharp dotnet eda generator kicad parser parser-generator project

Last synced: 28 Dec 2024

https://github.com/andreaschandra/who-suicides-statistics

Exploratory Data Analysis for Suicides using Python

data-analysis data-science eda python

Last synced: 19 Dec 2024

https://github.com/cosmoduende/r-marvel-vs-dc

DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R

comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros

Last synced: 07 Nov 2024

https://github.com/ksharma67/partial-dependent-plots-individual-conditional-expectation-plots-with-shap

The goal of SHAP is to explain the prediction of an instance x by computing the contribution of each feature to the prediction. The SHAP explanation method computes Shapley values from coalitional game theory. The feature values of a data instance act as players in a coalition.

eda individual-conditional-expectation matplotlib numpy pandas partial-dependence-plot python seaborn shap shapley-additive-explanations sklearn xgboost

Last synced: 25 Dec 2024
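
The Shapley-value idea in the description above can be sketched in a few lines. This is a minimal illustration only, not the repository's code and not the `shap` library's API: it computes exact Shapley values for a toy model by averaging each feature's marginal contribution over every ordering of the features. The model `toy_model`, the instance `x`, and the `baseline` are hypothetical, and "absent" features are assumed to take their baseline values.

```python
from itertools import permutations


def toy_model(features):
    # Hypothetical model: a linear part plus one interaction term.
    return 2.0 * features[0] + 1.0 * features[1] + 0.5 * features[0] * features[2]


def shapley_values(model, x, baseline):
    """Exact Shapley values by enumerating every ordering of the features.

    Features not yet added to the coalition keep their baseline value
    (an assumption made for this sketch).
    """
    n = len(x)
    contributions = [0.0] * n
    orderings = list(permutations(range(n)))
    for order in orderings:
        current = list(baseline)
        previous_output = model(current)
        for i in order:
            # Add feature i to the coalition and record its marginal effect.
            current[i] = x[i]
            output = model(current)
            contributions[i] += output - previous_output
            previous_output = output
    # Average the marginal effects over all orderings.
    return [c / len(orderings) for c in contributions]


x = [1.0, 2.0, 4.0]         # hypothetical instance to explain
baseline = [0.0, 0.0, 0.0]  # hypothetical reference point
phi = shapley_values(toy_model, x, baseline)
print(phi)
```

The values sum to the difference between the prediction for `x` and the prediction for `baseline`, and the interaction term is split evenly between the two features involved. Enumerating all orderings is only feasible for a handful of features, which is why practical tools rely on approximations.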

https://github.com/jlehrer1/instanteda

Instantly generate common exploratory data plots without worrying about cleaning your DataFrame.

eda pandas python visualization

Last synced: 02 Jan 2025

https://github.com/ibm-cloud-architecture/refarch-kc-gitops

Event-driven Architecture reference implementation GitOps repository

eda

Last synced: 17 Nov 2024

https://github.com/sandravizz/data-breach-analysis

What Data Breaches Tell Us: An Analysis of 17,000 U.S. Data Breaches using D3.js

cybersecurity d3 d3js data-visualization eda ransomware

Last synced: 07 Nov 2024

https://github.com/ibm-cloud-architecture/refarch-eda-store-inventory

Aggregate store inventory using Kafka streams

eda

Last synced: 17 Nov 2024

https://github.com/statsim/select

All relevant feature selection with Boruta.js

eda feature-selection webassembly webworker

Last synced: 09 Nov 2024

https://github.com/ibm-cloud-architecture/eda-lab-mq-to-kafka

A hands-on lab that sends sold items from a store to MQ and then to Kafka (Confluent or Strimzi) using the MQ Kafka connector

eda kafka kafka-connect

Last synced: 17 Nov 2024

https://github.com/jimbrig/eda

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 13 Nov 2024

https://github.com/geekquad/titanic-survival-exploration

Very basic data exploration of the Titanic Dataset.

basic-learning eda titanic-survival-prediction

Last synced: 10 Nov 2024

https://github.com/harmanveer2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem includes modeling past credit card transactions with the knowledge of the ones that turned out to be a fraud. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xgboost

Last synced: 13 Nov 2024
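
The aim stated in the description above, catching every fraudulent transaction while limiting false alarms, corresponds to maximizing recall on the fraud class while keeping precision high. A minimal sketch of the two metrics follows; the labels are made up for illustration (1 = fraud, 0 = legitimate) and are not taken from the repository.

```python
def precision_recall(y_true, y_pred):
    """Precision and recall for the positive (fraud) class."""
    pairs = list(zip(y_true, y_pred))
    true_pos = sum(1 for t, p in pairs if t == 1 and p == 1)
    false_pos = sum(1 for t, p in pairs if t == 0 and p == 1)
    false_neg = sum(1 for t, p in pairs if t == 1 and p == 0)
    # Guard against division by zero when a class is never predicted or present.
    precision = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    recall = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0
    return precision, recall


# Hypothetical labels: three real frauds, all caught, plus one false alarm.
y_true = [1, 0, 0, 1, 0, 1, 0, 0]
y_pred = [1, 0, 1, 1, 0, 1, 0, 0]
precision, recall = precision_recall(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Here recall is 1.0 because no fraud is missed, while precision is 0.75 because one of the four flagged transactions is legitimate.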

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and an NER model to classify and extract relevant entities from resumes, such as person name, college name, academic information, relevant experience, and skill set.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 16 Dec 2024

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 16 Dec 2024

https://github.com/facilebio/FacileShine

Shiny modules for FacileData

bioinformatics eda

Last synced: 04 Dec 2024

https://github.com/henrylin03/video-games

Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.

analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games

Last synced: 14 Nov 2024

https://github.com/jimbrig/EDA

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 04 Dec 2024

https://github.com/elysian01/ml-eda-and-modelling-using-streamlit

Beautiful Web interface made using Streamlit for quick Exploratory Data Analysis and building classification models which are implemented from scratch.

data-analysis data-visualization eda exploratory-data-analysis knn-classification logistic-regression matplotlib ml-model-on-web ml-models naive-bayes-classifier pandas seaborn streamlit streamlit-webapp

Last synced: 07 Nov 2024

https://github.com/ksharma67/heart-failure-prediction

A typical classification machine learning task. Various classifiers are built using the following models: Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), XGBoost (XGB), LightGBM, and Support Vector Machines with an RBF kernel.

auc-roc-curve auc-roc-score decision-trees eda eli5 gridsearchcv lightgbm lime logistic-regression numpy pandas python random-forest seaborn shap skit-learn sklearn svm xgboost

Last synced: 25 Dec 2024

https://github.com/drearondov/nlp-newspapersanalysis

End-to-end NLP Project of news headlines for the main newspapers of Perú

eda kedro nlp python sentiment-analysis topic-modeling

Last synced: 10 Nov 2024

https://github.com/nikoshet/exploratory-data-analysis-using-r

Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA

data data-analysis data-science eda exploratory-data-analysis ggplot2 r

Last synced: 09 Nov 2024

https://github.com/olgaele/kaggle-projects

Various Kaggle projects in Python!

analysis datasets eda machine-learning python

Last synced: 29 Nov 2024

https://github.com/x86-39/ansible-http-server

This is a web server written in Ansible. Yes, WRITTEN IN Ansible. Not using an external web server.

ansible ansible-eda ansible-playbook eda event-driven-ansible http

Last synced: 19 Nov 2024

https://github.com/perezrd5/data-science

Entry-level looks at Exploratory Data Analysis (in R & Python) and Regression Models (in R)

analysis data-science eda multinomial-regression poisson-regression python r regression

Last synced: 25 Nov 2024

https://github.com/muhammadibrahim313/start-your-data-science-journey

In this repo I will be sharing all the resources we will be learning from during the December Data Science Workshops on iCode Guru

btajicrew data data-science eda icodeguru machine-learning matplotlib pandas python

Last synced: 22 Dec 2024

https://github.com/raghulrajn/kaggle-notebooks

This repo contains notebooks for Kaggle competitions

eda kaggle-competition pandas python

Last synced: 22 Dec 2024

https://github.com/mdanwarulkarim/supershop_sales_analysis_sql

This project examines customer demographics, sales trends, and high-value transactions, uncovering patterns, top customers, and category performance. The insights provide actionable perspectives to understand sales, customer behavior, and product performance for informed decisions.

eda etl sql

Last synced: 30 Nov 2024

https://github.com/pavankethavath/car_dekho_car_price_prediction

A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.

dataanalysis datacleaning datapreprocessing eda encoding feature-extraction feature-selection featureimportance fine-tuning machine-learning minmaxscaling normalization pandas pickle prediction-model python random-forest randomsearch-cv regression streamlit

Last synced: 10 Dec 2024

https://github.com/alexandramartinez/asyncapis-accounts-email

All the resources you need to implement a simple, functional architecture with Accounts and Email services using AsyncAPI, Anypoint Code Builder, and the available message brokers: Anypoint MQ, Kafka, Solace PubSub+, and Salesforce CDC/Platform Events

acb anypoint-code-builder anypointmq async async-api asyncapi asyncapi-specification eda kafka kafka-consumer kafka-producer kafka-topic mule mule4 mulesoft queue queues topic

Last synced: 29 Dec 2024

https://github.com/md-emon-hasan/4-eda-football-ml-app

An ML application focused on exploratory data analysis and football analytics, featuring data visualization and insights using Python and relevant libraries.

data-science-projects data-visualization eda exploratory-data-analysis football-analytics sports-analytics webapp

Last synced: 13 Nov 2024

https://github.com/md-emon-hasan/3-eda-basketball-ml-app

An ML application focused on EDA and basketball analytics, showcasing data visualization and insights using Python and relevant libraries.

basketball-analysis csv data-visualization eda exploratory-data-analysis exploratory-data-analysis-eda ml-app

Last synced: 13 Nov 2024

https://github.com/kishlayjeet/zomato-data-exploration

In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.

data-analysis eda matplotlib numpy pandas zomato-data-exploration

Last synced: 24 Dec 2024

https://github.com/md-emon-hasan/1-simple-stock-price-ml-app

A simple machine learning application for stock prices, demonstrating data preprocessing, model training, and deployment using scikit-learn.

data-analysis data-science eda ml-app streamlit-webapp time-series time-series-analysis webapp

Last synced: 13 Nov 2024

https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard

A comprehensive sales performance analysis dashboard built using Python and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn and Matplot

analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics

Last synced: 18 Nov 2024

https://github.com/39services/ansible-http-server

This is a web server written in Ansible. Yes, WRITTEN IN Ansible. Not using an external web server.

ansible ansible-eda ansible-playbook eda event-driven-ansible http

Last synced: 14 Dec 2024

https://github.com/shimaa83/data_analysis_thanwayaama

EDA for the Egypt thanwaiaama dataset (Omdena challenge)

data-science data-visualization eda

Last synced: 19 Nov 2024

https://github.com/rayyan9477/diamond-price-forecasting

This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree

data-analysis data-science decision-trees eda linear-regression machine-learning

Last synced: 11 Nov 2024

https://github.com/cosmoduende/r-uber-trips-analyisis

Explore your activity on Uber with R: How to analyze and visualize your personal data history. Find out how you consume the Uber App using a copy of your data.

analisis-de-data data-analysis data-analytics data-science data-visualisation data-visualization data-viz eda flexdashboard ggmap ggplot2 mobility-as-a-service qmplot r-language r-programming ridesharing uber uber-data visualizacion-de-datos

Last synced: 27 Dec 2024

https://github.com/bookshiyi/kicad_plugins

KiCAD plugin collection

eda embedded kicad plugins scripts

Last synced: 04 Dec 2024

https://github.com/aakashsyadav1999/crime-data-from-2020-2023

This dataset reflects incidents of crime in the City of Los Angeles dating back to 2020. This data is transcribed from original crime reports that are typed on paper and therefore there may be some inaccuracies within the data.

deep-learning eda machine-learning machine-learning-algorithms

Last synced: 13 Nov 2024

https://github.com/sralter/sustainability_insights

A data analysis project that derived insights from an emissions dataset sourced from Climate TRACE.

duckdb eda matplotlib numpy pandas tableau

Last synced: 19 Dec 2024