Projects in Awesome Lists tagged with label-encoding
A curated list of projects in awesome lists tagged with label-encoding .
https://github.com/imharshag/nids-using-ml
This project showcases a Network Intrusion Detection System (NIDS) designed to bolster cybersecurity defenses against evolving threats
datamining ensemble-learning gaussian-naive-bayes knn label-encoding matplotlib network-intrusion-detection nsl-kdd one-hot-encoding principal-component-analysis python random-forest recursive-feature-elimination sklearn svm voting-classifier xgboost
Last synced: 23 Apr 2025
https://github.com/moindalvs/learn_feature_engineering
Data Set: House Prices: Advanced Regression Techniques Feature Engineering with 80+ Features
data-science data-transformation handling-missing-value label-encoding log-transformation minmaxscaling missing-values
Last synced: 16 Oct 2025
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 28 Jun 2025
https://github.com/rubyyy1118/machine_learning_optimization_study
The Learning From Data - Assignment in my MSc Business Analytics course
data-classification data-cleaning data-science data-visualization hyperparameter-tuning label-encoding neural-network python support-vector-machine tensorflow
Last synced: 29 Jun 2025
https://github.com/saifalibaig/multi-label-emotion-recognition
This project focuses on detecting multiple emotions from English text using a fine-tuned **BERT** model. It leverages the [GoEmotions](https://huggingface.co/datasets/go_emotions) dataset — a large-scale human-annotated dataset of Reddit comments labeled with 27 emotions + neutral.
artificial-intelligence bert-model feature-engineering huggingface jupyter-notebook label-encoding machine-learning-algorithms preprocessing python3 sigmoid-function transformation
Last synced: 15 Apr 2025
https://github.com/aneeshmurali-n/project-ml-data-preprocessing
The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.
data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler
Last synced: 20 Nov 2025
https://github.com/sunnyrao07/water-quality-analysis
A machine learning project that predicts water potability based on chemical and physical attributes, using models like Logistic Regression, Random Forest, and XGBoost.
data-cleaning label-encoding logistic-regression matplotlib model-evaluation numpy pandas pyhton random-forest sckiit-learn seaborn smote standard-scaler xgboost
Last synced: 16 Apr 2025
https://github.com/kumpatlapavankumar/medical-insurance-cost-estimation-using-machine-learning
Using python,numpy,pandas,seaborn,matplotlib and machine learning techniques
accuracy-score data-science data-visualization decision-trees exploratory-data-analysis gradient-boosting-classifier label-encoding linear-regression machine-learning matplotlib model-selection-and-evaluation numpy pandas prediction preprocessing python random-forest train-test-split xgboost-regression
Last synced: 23 Jun 2025
https://github.com/lkethridge/supervised_learning
Supervised Learning project from TripleTen
class-imbalance-handling confusion-matrix data-upload downsampling f1-score feature-prep feature-scaling fpr imbalanced-classification label-encoding one-hot-encoding ordinal-encoding pr-curve precision recall regression-metrics roc-curve supervised-learning tpr upsampling
Last synced: 28 Mar 2025
https://github.com/csengupta1101/data-is-good-exam---september
This Repository Consists the exam Problems and solutions conducted on September - 2021
central-limit-theorem data-is-good exam feature-scaling github label-encoding missing-value one-hot-encoding outliers python statistics
Last synced: 09 Oct 2025
https://github.com/jaspreetsingh-exe/vehicle-price-prediction
Vehicle Price Prediction is a machine learning project that estimates vehicle prices using features like make, model, year, mileage, and more. It employs multiple regression models, including Linear Regression, Random Forest, Gradient Boosting, CatBoost, and Stacking Regressor, with GridSearchCV for tuning.
catboost data-preprocessing exploratory-data-analysis gradient-boosting label-encoding linear-regression machine-learning price-prediction python random-forest regression regression-models stackingregressor standard-scaler
Last synced: 17 Oct 2025
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 02 Feb 2026
https://github.com/navindafernando/feature-extraction
Heart Risk Level Predicting Regression Model & Web using Feature Engineering and Data Preprocessing :baby_chick:
categorical-encoding feature-engineering flask handling-outlier html5 joblib label-encoding machine-learning numpy pandas polynomial-features quantile-transformer scaling
Last synced: 15 Mar 2026
https://github.com/sunnyrao07/stroke-risk-prediction
Predicting stroke risk using machine learning models based on healthcare and demographic data.
data-cleaning data-visualization decision-trees feature-engineering label-encoding matplotlib model-evaluation numpy outlier-detection pandas python random-forest scikit-learn seaborn standard-scaler
Last synced: 30 Dec 2025