# [AAAI'2024] - ODTrack
The official implementation for the **AAAI 2024** paper \[[_ODTrack: Online Dense Temporal Token Learning for Visual Tracking_](https://arxiv.org/abs/2401.01686)\].
[[Models](https://drive.google.com/drive/folders/17LacrfRO01R75bxU4bgA87eo1b_rX5Gj?usp=sharing)], [[Raw Results](https://drive.google.com/drive/folders/10I7aHb2J4SFTMuQ_LN33VbaiykD_M2hi?usp=sharing)], [[Training logs](https://drive.google.com/drive/folders/1BXnYmnGnSZIA0IR_gwdlDczl0ex4DgFF?usp=sharing)]
## Highlights
### :star2: New Video-Level Tracking Framework
ODTrack is a simple, flexible, and effective **video-level tracking pipeline** that densely associates the contextual relationships of video frames through online token propagation. ODTrack accepts video frames of arbitrary length, captures the spatio-temporal trajectory of an instance, and compresses the target's discriminative features (localization information) into a token sequence to achieve frame-to-frame association.
This new solution brings the following benefits:
1. The purified token sequences can serve as prompts for inference on the next video frame, so past information is leveraged to guide future inference.
2. Complex online update strategies are avoided through the iterative propagation of token sequences, so ODTrack achieves a more efficient model representation and computation.
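To make the propagation loop concrete, here is a minimal sketch of the idea in PyTorch-style code; the names (`track_video`, `backbone`, `box_head`) are hypothetical placeholders for illustration, not the repository's actual API.
```python
import torch

# Illustrative sketch of video-level tracking with online token propagation.
# `backbone` is assumed to jointly encode template, search region, and the
# propagated temporal token; names and shapes are hypothetical, not the repo's API.
@torch.no_grad()
def track_video(backbone, box_head, template, frames, token_dim=768):
    # The temporal token starts empty and is refreshed on every frame.
    temporal_token = torch.zeros(1, 1, token_dim)
    results = []
    for search_region in frames:
        # The token distilled from past frames acts as a prompt for this frame.
        features, temporal_token = backbone(template, search_region, temporal_token)
        box = box_head(features)  # frame-level localization
        results.append(box)
        # No explicit template or online-model update is needed: the updated
        # token itself carries the target's spatio-temporal trajectory cues.
    return results
```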
### :star2: Strong Performance
| Tracker | GOT-10K (AO) | LaSOT (AUC) | TrackingNet (AUC) | LaSOT_ext (AUC) | VOT2020 (EAO) | TNL2K (AUC) | OTB (AUC) |
|:-----------:|:------------:|:-----------:|:-----------------:|:-----------:|:-----------:|:-----------:|:-----------:|
| ODTrack-L | 78.2 | 74.0 | 86.1 | 53.9 | 0.605 | 61.7 | 72.4 |
| ODTrack-B | 77.0 | 73.1 | 85.1 | 52.4 | 0.581 | 60.9 | 72.3 |

## Install the environment
```
conda create -n odtrack python=3.8
conda activate odtrack
bash install.sh
```

## Data Preparation
Put the tracking datasets in ./data. It should look like:
```
${PROJECT_ROOT}
-- data
-- lasot
|-- airplane
|-- basketball
|-- bear
...
-- got10k
|-- test
|-- train
|-- val
-- coco
|-- annotations
|-- images
-- trackingnet
|-- TRAIN_0
|-- TRAIN_1
...
|-- TRAIN_11
|-- TEST
```
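Before training, you can sanity-check the layout with a small standalone script like the one below; it is not part of the repository, just an illustrative helper that verifies the expected top-level directories exist.
```python
from pathlib import Path

# Hypothetical helper (not part of the repo): verify the ./data layout above.
EXPECTED = {
    "lasot": [],
    "got10k": ["train", "val", "test"],
    "coco": ["annotations", "images"],
    "trackingnet": ["TRAIN_0", "TEST"],  # spot-check a few chunks
}

data_root = Path("./data")
for dataset, subdirs in EXPECTED.items():
    base = data_root / dataset
    print(f"{base}: {'ok' if base.is_dir() else 'MISSING'}")
    for sub in subdirs:
        print(f"  {base / sub}: {'ok' if (base / sub).is_dir() else 'MISSING'}")
```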
## Set project paths
Run the following command to set the paths for this project:
```
python tracking/create_default_local_file.py --workspace_dir . --data_dir ./data --save_dir ./output
```
After running this command, you can also modify paths by editing these two files
```
lib/train/admin/local.py # paths about training
lib/test/evaluation/local.py # paths about testing
```
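For orientation, in STARK-style codebases these generated files typically define a settings class whose attributes are absolute dataset and output paths. The excerpt below is illustrative only; the attribute names are assumptions, so treat the files generated in your checkout as authoritative.
```python
# Illustrative excerpt of a generated lib/train/admin/local.py
# (attribute names are indicative; your generated file is authoritative).
class EnvironmentSettings:
    def __init__(self):
        self.workspace_dir = '/path/to/ODTrack'  # base dir for checkpoints
        self.tensorboard_dir = '/path/to/ODTrack/tensorboard'
        self.pretrained_networks = '/path/to/ODTrack/pretrained_networks'
        self.lasot_dir = '/path/to/ODTrack/data/lasot'
        self.got10k_dir = '/path/to/ODTrack/data/got10k/train'
        self.coco_dir = '/path/to/ODTrack/data/coco'
        self.trackingnet_dir = '/path/to/ODTrack/data/trackingnet'
```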
## Training
Download the pre-trained [MAE ViT-Base weights](https://dl.fbaipublicfiles.com/mae/pretrain/mae_pretrain_vit_base.pth) and put them under `$PROJECT_ROOT$/pretrained_networks` (different pretrained models can also be used; see [MAE](https://github.com/facebookresearch/mae) for more details).

```
python tracking/train.py \
--script odtrack --config baseline \
--save_dir ./output \
--mode multiple --nproc_per_node 4 \
--use_wandb 1
```

Replace `--config` with the desired model config under `experiments/odtrack`.
We use [wandb](https://github.com/wandb/client) to record detailed training logs; if you don't want to use wandb, set `--use_wandb 0`.
## Test and Evaluation
- LaSOT or other offline-evaluated benchmarks (modify `--dataset` accordingly)
```
python tracking/test.py odtrack baseline --dataset lasot --runid 300 --threads 8 --num_gpus 2
python tracking/analysis_results.py # need to modify tracker configs and names
```
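`analysis_results.py` needs to know which tracker and config to evaluate. The sketch below shows the kind of edit involved; the imports and argument names follow the STARK/PyTracking convention that this codebase builds on and should be treated as assumptions, so follow the script actually shipped in the repository.
```python
# Indicative sketch of the edits in tracking/analysis_results.py
# (imports and signatures follow the STARK/PyTracking convention; assumptions).
from lib.test.analysis.plot_results import print_results
from lib.test.evaluation import get_dataset, trackerlist

trackers = trackerlist(name='odtrack', parameter_name='baseline',
                       dataset_name='lasot', run_ids=300,
                       display_name='ODTrack-B')
dataset = get_dataset('lasot')
print_results(trackers, dataset, 'lasot', merge_results=True,
              plot_types=('success', 'norm_prec', 'prec'))
```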
- GOT10K-test
```
python tracking/test.py odtrack baseline_got --dataset got10k_test --runid 100 --threads 8 --num_gpus 2
python lib/test/utils/transform_got10k.py --tracker_name odtrack --cfg_name baseline_got_100
```
- TrackingNet
```
python tracking/test.py odtrack baseline --dataset trackingnet --runid 300 --threads 8 --num_gpus 2
python lib/test/utils/transform_trackingnet.py --tracker_name odtrack --cfg_name baseline_300
```

- VOT2020
```
cd external/vot20/
bash exp.sh
```

## Test FLOPs and Speed
*Note:* The speeds reported in our paper were tested on a single RTX 2080 Ti GPU.

```
python tracking/profile_model.py --script odtrack --config baseline
```
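If you want to time a model independently of the repository's profiling script, a generic PyTorch timing loop like the following illustrates how per-frame speed is usually measured (the input shape and iteration counts below are arbitrary choices):
```python
import time
import torch

# Generic GPU timing sketch (not the repo's profiler): measure average
# throughput of any torch model under a fixed input size.
def measure_fps(model, input_shape=(1, 3, 384, 384), n_warmup=20, n_iters=100):
    device = 'cuda' if torch.cuda.is_available() else 'cpu'
    model = model.eval().to(device)
    x = torch.randn(*input_shape, device=device)
    with torch.no_grad():
        for _ in range(n_warmup):      # warm up kernels / cudnn autotune
            model(x)
        if device == 'cuda':
            torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(n_iters):
            model(x)
        if device == 'cuda':
            torch.cuda.synchronize()   # wait for all queued kernels to finish
    return n_iters / (time.perf_counter() - start)
```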
## Acknowledgments
* Thanks to the [STARK](https://github.com/researchmm/Stark) and [PyTracking](https://github.com/visionml/pytracking) libraries, which helped us quickly implement our ideas.

## Citation
If our work is useful for your research, please consider citing:

```bibtex
@inproceedings{zheng2024odtrack,
title={ODTrack: Online Dense Temporal Token Learning for Visual Tracking},
author={Yaozong Zheng and Bineng Zhong and Qihua Liang and Zhiyi Mo and Shengping Zhang and Xianxian Li},
booktitle={AAAI},
year={2024}
}
```