https://github.com/mohith202/grasp_module

Last synced: 4 months ago
JSON representation

Host: GitHub
URL: https://github.com/mohith202/grasp_module
Owner: Mohith202
Created: 2025-02-13T12:29:13.000Z (11 months ago)
Default Branch: master
Last Pushed: 2025-09-05T08:38:48.000Z (4 months ago)
Last Synced: 2025-09-05T10:25:57.011Z (4 months ago)
Language: Python
Size: 3.56 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# CNN-Based Grasp Detection with Attention Mechanism on GraspNet Dataset (forked form Graspnet repo)

## Abstract
This project presents a hybrid approach for robotic grasp detection by combining **CNN-based image segmentation** and a **simple attention mechanism** for robust feature extraction. These modules were evaluated in conjunction with the **GraspNet baseline architecture**. The final model demonstrated an improvement in **Average Precision (AP)**, reaching **55%**, by effectively identifying graspable regions on seen objects.

---

## Methodology

### 1. Image Segmentation using CNN
We developed a lightweight CNN architecture to segment images and isolate graspable objects.
The segmentation mask reduces the search space for the grasp detection module.

### 2. Attention Mechanism
An attention block was introduced to refine grasp detection by focusing on high-importance regions.
The module used **softmax-weighted activations** to highlight potential grasp points from fused CNN features and **PointNet++** features.

### 3. Integration with GraspNet Baseline
The modified architecture was integrated into the **GraspNet-baseline pipeline**.
We experimented with:
- CNN-only
- Attention
- CNN + Attention

Each configuration was trained and evaluated on standard GraspNet benchmarks.

---

## Experiments
We trained and evaluated all model configurations on the **GraspNet dataset**, using **Average Precision (AP)** as the primary metric.

**Ablation study:**
- CNN-only (Final Model)
- Attention
- CNN + Attention

---

## Results

| Configuration | Average Precision (AP) |
|-----------------|-------------------------|
| Attention | 35% |
| CNN + Attention | 31% |
| CNN | **55%** |

---

## Conclusion
The integration of **CNN-based segmentation**, **Vision Mamba feature extraction**, and **attention mechanisms** improves grasp detection performance on the GraspNet dataset.

This multi-module pipeline enables the model to focus on grasp-relevant regions, resulting in better accuracy.

**Future Work:**
- Extend the approach to **real-time robotic systems**
- Evaluate performance on **physical hardware**

---

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mohith202/grasp_module

Awesome Lists containing this project

README