Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/robotlocomotion/pytorch-dense-correspondence
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
3d artificial-intelligence computer-vision deep-learning manipulation pytorch robotics self-supervised-learning vision
- Host: GitHub
- URL: https://github.com/robotlocomotion/pytorch-dense-correspondence
- Owner: RobotLocomotion
- License: other
- Created: 2018-02-13T16:41:18.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2023-05-09T09:22:39.000Z (over 1 year ago)
- Last Synced: 2024-11-14T05:34:10.860Z (4 days ago)
- Topics: 3d, artificial-intelligence, computer-vision, deep-learning, manipulation, pytorch, robotics, self-supervised-learning, vision
- Language: Python
- Homepage: https://arxiv.org/pdf/1806.08756.pdf
- Size: 73.2 MB
- Stars: 558
- Watchers: 28
- Forks: 133
- Open Issues: 11
- Metadata Files:
  - Readme: README.md
  - License: LICENSE.txt
Awesome Lists containing this project
README
### Updates
- September 4, 2018: Tutorial and data now available! [We have a tutorial now available here](./doc/tutorial_getting_started.md), which walks step-by-step through getting this repo running.
- June 26, 2019: We have updated the repo to PyTorch 1.1 and CUDA 10. For the code used for the experiments in the paper, see [here](https://github.com/RobotLocomotion/pytorch-dense-correspondence/releases/tag/pytorch-0.3).

## Dense Correspondence Learning in PyTorch
In this project we learn Dense Object Nets, i.e. dense descriptor networks for previously unseen, potentially deformable objects, and potentially classes of objects:
![](./doc/caterpillar_trim.gif) | ![](./doc/shoes_trim.gif) | ![](./doc/hats_trim.gif)
:-------------------------:|:-------------------------:|:-------------------------:

We also demonstrate using Dense Object Nets for robotic manipulation tasks:
![](./doc/caterpillar_grasps.gif) | ![](./doc/shoe_tongue_grasps.gif)
:-------------------------:|:-------------------------:
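
At a high level, a Dense Object Net is a fully-convolutional network that maps an RGB image to a D-dimensional descriptor at every pixel. The sketch below illustrates that idea; the ResNet-34 trunk, the single 1x1 projection head, and the bilinear upsampling are illustrative assumptions rather than a faithful copy of this repo's architecture.

```python
import torch.nn as nn
import torch.nn.functional as F
import torchvision

class DenseDescriptorNet(nn.Module):
    """Sketch of a dense descriptor network: RGB in, per-pixel descriptors out."""

    def __init__(self, descriptor_dim=3):
        super().__init__()
        backbone = torchvision.models.resnet34()
        # Keep only the convolutional trunk so the output stays spatial.
        self.trunk = nn.Sequential(*list(backbone.children())[:-2])
        self.head = nn.Conv2d(512, descriptor_dim, kernel_size=1)

    def forward(self, rgb):
        feats = self.trunk(rgb)          # (B, 512, H/32, W/32)
        desc = self.head(feats)          # (B, D, H/32, W/32)
        # Upsample so every input pixel gets a descriptor.
        return F.interpolate(desc, size=rgb.shape[-2:],
                             mode="bilinear", align_corners=False)
```

Calling this on a `(B, 3, H, W)` image batch returns a `(B, D, H, W)` descriptor image that downstream code can index per pixel.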
### Dense Object Nets: Learning Dense Visual Descriptors by and for Robotic Manipulation

This is the reference implementation for our paper:
[PDF](https://arxiv.org/pdf/1806.08756.pdf) | [Video](https://www.youtube.com/watch?v=L5UW1VapKNE)
[Pete Florence*](http://www.peteflorence.com/), [Lucas Manuelli*](http://lucasmanuelli.com/), [Russ Tedrake](https://groups.csail.mit.edu/locomotion/russt.html)
Abstract: What is the right object representation for manipulation? We would like robots to visually perceive scenes and learn an understanding of the objects in them that (i) is task-agnostic and can be used as a building block for a variety of manipulation tasks, (ii) is generally applicable to both rigid and non-rigid objects, (iii) takes advantage of the strong priors provided by 3D vision, and (iv) is entirely learned from self-supervision. This is hard to achieve with previous methods: much recent work in grasping does not extend to grasping specific objects or other tasks, whereas task-specific learning may require many trials to generalize well across object configurations or other tasks. In this paper we present Dense Object Nets, which build on recent developments in self-supervised dense descriptor learning, as a consistent object representation for visual understanding and manipulation. We demonstrate they can be trained quickly (approximately 20 minutes) for a wide variety of previously unseen and potentially non-rigid objects. We additionally present novel contributions to enable multi-object descriptor learning, and show that by modifying our training procedure, we can either acquire descriptors which generalize across classes of objects, or descriptors that are distinct for each object instance. Finally, we demonstrate the novel application of learned dense descriptors to robotic manipulation. We demonstrate grasping of specific points on an object across potentially deformed object configurations, and demonstrate using class general descriptors to transfer specific grasps across objects in a class.
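
Concretely, the self-supervision sketched in the abstract pairs pixels across two views of the same scene (using the 3D reconstruction to decide which pixels correspond) and trains the network with a pixelwise contrastive objective: descriptors of matching pixels are pulled together, and descriptors of non-matching pixels are pushed at least a margin apart. Below is a minimal sketch of such a loss; the tensor shapes, margin value, and simple averaging are assumptions, not the exact loss configured in this repo.

```python
import torch

def pixelwise_contrastive_loss(desc_a, desc_b, matches_a, matches_b,
                               nonmatches_a, nonmatches_b, margin=0.5):
    """Sketch of a pixelwise contrastive loss over two descriptor images.

    desc_a, desc_b:        (D, H, W) descriptor images for images A and B.
    matches_a, matches_b:  (N, 2) long tensors of (row, col) pixel coordinates
                           of corresponding points in A and B.
    nonmatches_*:          (M, 2) long tensors of non-corresponding points.
    """
    def sample(desc, uv):
        # Gather the D-dimensional descriptor at each integer pixel location.
        return desc[:, uv[:, 0], uv[:, 1]].t()                       # (N, D)

    diff_match = sample(desc_a, matches_a) - sample(desc_b, matches_b)
    diff_nonmatch = sample(desc_a, nonmatches_a) - sample(desc_b, nonmatches_b)

    # Matches: squared L2 distance between descriptors should be small.
    match_loss = diff_match.pow(2).sum(dim=1).mean()
    # Non-matches: penalize only pairs closer than the margin (hinge).
    nonmatch_loss = torch.clamp(margin - diff_nonmatch.norm(dim=1), min=0).pow(2).mean()
    return match_loss + nonmatch_loss
```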
#### Citing
If you find this code useful in your work, please consider citing:
```
@article{florencemanuelli2018dense,
  title={Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation},
  author={Florence, Peter and Manuelli, Lucas and Tedrake, Russ},
  journal={Conference on Robot Learning},
  year={2018}
}
```

### Tutorial
- [getting started with pytorch-dense-correspondence](./doc/tutorial_getting_started.md)
### Code Setup
- [setting up docker image](doc/docker_build_instructions.md)
- [recommended docker workflow](doc/recommended_workflow.md)

### Dataset
- [data organization](doc/data_organization.md)
- [data pre-processing for a single scene](doc/data_processing_single_scene.md)

### Training and Evaluation
- [training a network](doc/training.md)
- [evaluating a trained network](doc/dcn_evaluation.md)
- [pre-trained models](doc/model_zoo.md)
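
Once a network is trained, a correspondence between two images can be found by nearest-neighbor lookup in descriptor space: take the descriptor at a query pixel in image A and pick the pixel in image B whose descriptor is closest in L2 distance. The snippet below is a minimal sketch of that lookup (function name and tensor shapes are assumptions); the evaluation docs linked above describe the project's actual metrics.

```python
import torch

def find_best_match(desc_a, desc_b, pixel_a):
    """Return the (row, col) in image B whose descriptor is nearest (L2) to the
    descriptor at pixel_a in image A.

    desc_a, desc_b: (D, H, W) descriptor images produced by a trained network.
    pixel_a:        (row, col) query location in image A.
    """
    d = desc_a.shape[0]
    w = desc_b.shape[2]
    query = desc_a[:, pixel_a[0], pixel_a[1]].view(d, 1, 1)   # (D, 1, 1)
    distances = (desc_b - query).norm(dim=0)                  # (H, W) per-pixel L2 distance
    best = torch.argmin(distances).item()                     # index into flattened H*W
    return divmod(best, w)                                    # back to (row, col)
```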
### Miscellaneous

- [coordinate conventions](doc/coordinate_conventions.md)
- [testing](doc/testing.md)

### Git management
To prevent the repo from growing in size, we recommend always using "restart and clear outputs" before committing any Jupyter notebook. If you'd like to save what a notebook looks like with its outputs, you can always use "download as .html", which is a great way to snapshot and share the state of that notebook.
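
If you prefer to script that cleanup, the sketch below clears outputs programmatically with the `nbformat` library; the helper name and example path are hypothetical, and the notebook menu's "restart and clear outputs" works just as well.

```python
import nbformat

def clear_outputs(notebook_path):
    """Strip rendered outputs and execution counts from a notebook in place,
    mirroring Jupyter's "restart and clear outputs" before a commit."""
    nb = nbformat.read(notebook_path, as_version=4)
    for cell in nb.cells:
        if cell.cell_type == "code":
            cell.outputs = []            # drop cached images/text output
            cell.execution_count = None  # drop the In[ ] counters
    nbformat.write(nb, notebook_path)

# Hypothetical usage:
# clear_outputs("my_notebook.ipynb")
```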