Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/davityak03/ensemble-modelling
This repository contains a comprehensive guide and implementation of ensemble modeling techniques, specifically focusing on Boosting, Bagging, and Voting. Ensemble methods are powerful techniques in machine learning that combine the predictions from multiple models to improve overall performance and robustness.
https://github.com/davityak03/ensemble-modelling
adaboost bagging boosting desiciontree ensemble-model logistic-regression machine-learning-algorithms python svm voting
Last synced: about 2 months ago
JSON representation
This repository contains a comprehensive guide and implementation of ensemble modeling techniques, specifically focusing on Boosting, Bagging, and Voting. Ensemble methods are powerful techniques in machine learning that combine the predictions from multiple models to improve overall performance and robustness.
- Host: GitHub
- URL: https://github.com/davityak03/ensemble-modelling
- Owner: Davityak03
- Created: 2024-06-03T08:56:25.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-06-03T09:04:00.000Z (7 months ago)
- Last Synced: 2024-06-03T10:56:37.754Z (7 months ago)
- Topics: adaboost, bagging, boosting, desiciontree, ensemble-model, logistic-regression, machine-learning-algorithms, python, svm, voting
- Homepage:
- Size: 12.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Ensemble Modeling on Pima Indians Diabetes Dataset
This repository provides an in-depth implementation of ensemble modeling techniques including Boosting, Bagging, and Voting on the Pima Indians Diabetes Dataset. Ensemble methods combine the predictions from multiple models to improve accuracy and robustness, making them powerful tools in the field of machine learning.## Table of Contents
- [Introduction](#introduction)
- [Dataset](#dataset)
- [Ensemble Techniques](#ensemble-techniques)
- [Boosting](#boosting)
- [Bagging](#bagging)
- [Voting](#voting)
- [Results](#results)## Introduction
This project demonstrates how to apply ensemble learning methods to classify whether a patient has diabetes based on diagnostic measurements. The three main ensemble techniques implemented are Boosting, Bagging, and Voting.## Dataset
The Pima Indians Diabetes Dataset is used in this project. It consists of 768 samples of female patients of at least 21 years old of Pima Indian heritage. The dataset includes several medical predictor variables and one target variable indicating whether the patient has diabetes. The datset was taken from kaggle.- **Features:**
- Number of pregnancies
- Plasma glucose concentration
- Diastolic blood pressure
- Triceps skinfold thickness
- 2-Hour serum insulin
- Body mass index (BMI)
- Diabetes pedigree function
- Age
- **Target:**
- `0` (no diabetes)
- `1` (diabetes)## Ensemble Techniques
### Boosting
Boosting builds models sequentially, with each model attempting to correct the errors of the previous one. In this project, we implement:
- **AdaBoost**### Bagging
Bagging involves training multiple models in parallel on different subsets of the data created through random sampling with replacement. In this project, we implement:
- **Bagging Classifier**### Voting
Voting combines the predictions from multiple models (either bagging, boosting, or standalone models) and uses a majority vote or average for classification. In this project, we implement:## Results
The results section in the Jupyter Notebook includes detailed evaluation metrics and visualizations to compare the performance of different ensemble methods. Key metrics include accuracy, precision, recall, F1-score.