Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wx405557858/neural_tracking
https://github.com/wx405557858/neural_tracking
Last synced: 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/wx405557858/neural_tracking
- Owner: wx405557858
- License: mit
- Created: 2022-07-14T21:55:10.000Z (over 2 years ago)
- Default Branch: master
- Last Pushed: 2022-07-15T17:57:12.000Z (over 2 years ago)
- Last Synced: 2024-08-02T08:06:38.888Z (6 months ago)
- Language: Python
- Size: 16.6 KB
- Stars: 8
- Watchers: 3
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-tactile-sensing - Neural Network based Marker Tracking
README
# Marker Tracking with Neural Networks
This project provides robust marker tracking for camera-based tactile sensors with Sim2Real neural networks.
## Getting Started
**System**: Ubuntu, OSX
**Install packages**
```
pip install tensorflow-gpu opencv-python
```**Download models**
```
bash ./models/download_model.sh
```**Download test GelSight video**
```
bash ./data/download_video.sh
```**Marker tracking for GelSight video**
```
python example_tracking_video.py
```**Note**: The model is implemented initially with Keras/TensorFlow to be compatible with [Coral](https://coral.ai/products/accelerator), using TPU as USB Accelerator for Raspberry Pi on-device computation. Please feel free to switch the model to other frameworks, like PyTorch, for your purpose.
## Examples
### `python example_tracking_sim.py`The interactive tracking demo. The mouse distorts (translates and rotates) the marker flow. The yellow arrow shows the marker tracking predictions from the neural network. The model can robustly track markers, even with extreme and complex interactions. The model is trained with 10x14 markers.
The model is also robust to marker sizes and background disturbances, due to added domain randomization during training.
User interface:
- **Mouse `Click and Drag`"**: Translate the flow.
- **Key `s` and `Drag` mouse**: Rotate the flow. (The mouse needs to be pressed while hitting `s`.)
- **Key `c`**: Random background.
- **Key `z`**: Change marker sizes.
- **Key `r`**: Reset.### `python example_tracking_sim_generic.py`
The generic model is trained on variable grid patterns so that it can be invariant to different numbers of markers. The output is the flow with the same size of the input.
Additional user interface:
- **Key `d`**: Random marker patterns.
**Note**: We suggest trying the generic model for preliminary experiments, and training your fixed model for best performance. The generic one can work on more cases directly, and the fixed one is more accurate for a certain marker pattern.
### `python example_tracking_video.py`
The model can be transferred to real sensor data robustly, with large forces, multiple contacts, and wrinkles.
## Train
You can train your own model to optimize the performance for a specific marker pattern. Here are two training example that is trained on 10x14 markers (`train.py`), and variable marker patterns (`train_generic.py`). Please feel free to customize them for your purposes.
### `python train.py`
It takes data pairs `(X, Y)` from `generate_img()` in `generate_data.py`.
* **Input**: `X` is the synthesized image. It has random backgrounds, and 10x14 markers. The marker positions are randomly distorted (translated and rotated) with Gaussian-based smoothing. It imitates smooth elastomer distortions with simple operations. The markers are then rendered as regions darker than surrounding pixels, given the distorted positions.
* **Output**: `Y` has a dimension of 10x14x2, which represents the ground-truth horizontal and vertical displacement in pixel for each marker.
* **Model**: The model consists multiple Convolutional layers and Pooling layers, defined as `build_model_small()` in `train.py`.### `python train_generic.py`
It takes the data pairs `(X, Y)` from `generate_img()` in `generate_data_generic.py`.
* **Input**: `X` is generated similarly to the previous one. The difference is that the number of rows and columns of the markers is randomly selected from [4, 15]. Besides the current frame, it also takes the initial frame.
* **Output**: `Y` is the flow with the same width and height as the input image. Each point represents the horizontal and vertical displacement for each pixel. We use multiple resolutions of Y to accelerate training.
* **Model**: The model is in the Encoder-Decoder style, which consists of Convolutional, Pooling, and Upsampling layers to generate output with the same dimensions as the input. It is defined as `build_model_ae()` in `train_generic.py`.**Note**: Please customize the `generate_img()` function to fit the marker patterns to your sensor.
## Citation
If you use this code for your research, please cite our paper: [Gelsight Wedge: Measuring High-Resolution 3D Contact Geometry with a Compact Robot Finger](https://arxiv.org/pdf/2106.08851.pdf):```
@inproceedings{wang2021gelsight,
title={Gelsight wedge: Measuring high-resolution 3d contact geometry with a compact robot finger},
author={Wang, Shaoxiong and She, Yu and Romero, Branden and Adelson, Edward},
booktitle={2021 IEEE International Conference on Robotics and Automation (ICRA)},
pages={6468--6475},
year={2021},
organization={IEEE}
}
```