https://github.com/mehwishferoz/bee-friendly-plants

This repository contains my first-place submission for the DataCamp Bee Friendly Plants Competition 🐝🌸
https://github.com/mehwishferoz/bee-friendly-plants

competition datacamp exploratory-data-analysis

Last synced: about 2 months ago
JSON representation

This repository contains my first-place submission for the DataCamp Bee Friendly Plants Competition 🐝🌸

Host: GitHub
URL: https://github.com/mehwishferoz/bee-friendly-plants
Owner: mehwishferoz
Created: 2024-07-21T17:52:35.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2024-07-21T18:13:26.000Z (about 2 years ago)
Last Synced: 2025-07-25T20:19:38.261Z (about 1 year ago)
Topics: competition, datacamp, exploratory-data-analysis
Language: Jupyter Notebook
Homepage: https://app.datacamp.com/learn/competitions/bee-friendly-plants
Size: 2.61 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# 🐝 Bee Friendly Plants Competition 🏆

![Bee Friendly Plants](image.png)

## 🌟 Overview
This repository contains my submission for the [DataCamp Bee Friendly Plants Competition](https://app.datacamp.com/learn/competitions/bee-friendly-plants), where I placed first and won DataCamp merchandise. The competition challenged participants to identify plants that are beneficial for bees using data science techniques.

## 🌸 Project Description
In this competition, the goal was to analyze various plant species to determine their suitability for bees. The dataset included features such as flower color, bloom time, and habitat.

## 🔍 Approach

### 🛠️ Data Preprocessing
- **Data Cleaning:** Handled missing values and outliers.
- **Normalization:** Scaled numerical features for better model performance.

### 🧩 Feature Engineering
- **New Features:** Created additional features based on domain knowledge.
- **Encoding:** Converted categorical variables into numerical format using techniques like one-hot encoding.

### 🤖 Model Building
- **Models Used:** Tried multiple models including Random Forest, Gradient Boosting, and Neural Networks.
- **Hyperparameter Tuning:** Used Grid Search and Random Search to find the best parameters.
- **Cross-Validation:** Implemented k-fold cross-validation to ensure model robustness.

### 📊 Evaluation
- **Metrics:** Evaluated models using metrics such as accuracy, precision, recall, and F1 score.
- **Final Model:** Selected the best-performing model based on evaluation metrics and used it for the final predictions.

## 🏅 Results
The final model achieved outstanding results, leading to a first-place finish in the competition. Here are some key findings:
- **Important Features:** Identified the most important features influencing the suitability for bees.
- **Model Performance:** Detailed performance metrics of the final model.

## 📂 Notebooks and Code
- **Data Analysis and Model Building:** All the steps mentioned above are detailed in the [notebook](notebook.ipynb).
- **Data:** The dataset used for this project can be found in the `data` directory ([plants_and_bees.csv](data/plants_and_bees.csv)).

## 🙏 Acknowledgments
I would like to thank DataCamp for organizing this competition and providing a platform to showcase and enhance our data science skills.

![DataCamp Winner](win.png)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mehwishferoz/bee-friendly-plants

Awesome Lists containing this project

README