Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Tyhye/beyond-part-models-gluon
Implementation of <Beyond Part Models: Person Retrieval with Refined Part Pooling>, using gluon(mxnet)
https://github.com/Tyhye/beyond-part-models-gluon
Last synced: 2 months ago
JSON representation
Implementation of <Beyond Part Models: Person Retrieval with Refined Part Pooling>, using gluon(mxnet)
- Host: GitHub
- URL: https://github.com/Tyhye/beyond-part-models-gluon
- Owner: Tyhye
- License: mit
- Created: 2018-07-21T00:56:25.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-08-13T12:35:41.000Z (over 6 years ago)
- Last Synced: 2024-08-01T22:41:11.521Z (5 months ago)
- Language: Python
- Size: 92.8 KB
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- Awesome-MXNet - beyond-part-models
README
# beyond-part-models-gluon
Implementation of <Beyond Part Models: Person Retrieval with Refined Part Pooling>, using gluon(mxnet)## Memo
* [x] Model Implementation
* [x] Metric Coding.
* [x] Data Loading.
* [x] Result Saving.
* [x] Process Control.
* [x] Train Process.
* [ ] Test Process.
* [ ] Eval Process.
* [ ] Show Result.
* [ ] Market-1501 Prepare
* [ ] Duke Prepare
* [ ] ...## Result
The model is based on resnet50.
Input images are resized to 384x128.
Feature channels are set 256.
Batchsize is set 32, which is different from the paper.
Here we just show some results.**Market-1501**
| BS | Network | PCB | PN | RPP | FT | CMC1 | CMC5 | CMC10 | mAP | Note |
| -- | ------- | --- | -- | --- | -- | ---- | ---- | ----- | --- | ---- |
| 32 | Resnet50_v2 | w/o | | w/o | | 89.76 | 96.20 | 97.51 | 75.22 | (512dim) |
| 64 | Resnet50_v2 | w/o | | w/o | | 85.3 | - | - | - | 68.5 | (in paper) |
| 32 | Resnet50_v2 | w/o | | w/o | | 91.75 | 96.59 | 98.22 | 77.68 | |
| 32 | Resnet50_v1 | w/o | | w/o | | 88.39 | 94.80 | 96.76 | 71.78 | |
| 64 | Resnet50_v2 | w | 6 | w/o | | 92.3 | 97.2 | 98.2 | 77.4 | (in paper) |
| 32 | Resnet50_v2 | w | 6 | w/o | | 92.61 | 96.82 | 97.74 | 77.33 | |
| 32 | Resnet50_v1 | w | 6 | w/o | | - | - | - | - | |
| 32 | Resnet50_v2 | w | 6 | w | no | 91.66 | 96.59 | 97.86 | 76.83 | |
| 32 | Resnet50_v1 | w | 6 | w | no | - | - | - | - | |
| 64 | Resnet50_v2 | w | 6 | w | yes | 93.8 | 97.5 | 98.5 | 81.6 | (in paper) |
| 32 | Resnet50_v2 | w | 6 | w | yes | 92.64 | 96.97 | 98.01 | 77.09 | (processing) |
| 32 | Resnet50_v1 | w | 6 | w | yes | - | - | - | - | |## Usage
```
Usage:
main.py [options]
main.py --withpcb [options]
main.py --withpcb --withrpp [options]General Options:
-h, --help Print this message
--logfile= File path for saving log message.
--device_type= Device Type for running the model [default: cpu]
--device_id= Device ID for running the model [default: 0]
Network Options:
--basenet_type= BaseNet type for Model [default: resnet50_v2]
--classes_num= Output classes number of the network [default: 751]
--feature_channels= Feature channels of the network [default: 512]
--partnum= The number of the pcb parts. [default: 6]
--feature_weight_share If the six partnum share weights.
--base_not_pretrained If the base network don't pretrained on ImageNet
--pretrain_path= Path to pretrained model.Training Setting Options:
--Optim= Optimizer Type [default: sgd]
--LRpolicy= Learning rate policy [default: multistep]
--milestones= Step milestone for multistep policy [default: [40,]]
--gamma= Gamma for multistep policy [default: 0.1]
--max_epochs= Max Train epochs [default: 60]
--val_epochs= Val step stone [default: 5]
--snap_epochs= Snap step stone [default: 5]
--Snap= Model state dict file path [default: saved/]Data Options:
--resize_size= Image resize size tuple (height, width) [default: (384, 128)]
--crop_size= Image crop size tuple (height, width) [default: (384, 128)]
--batchsize= Batchsize [default: 32]Train Data Options:
--trainList= Train files list txt [default: datas/Market1501/train.txt]
--trainIMpath= Train sketch images path prefix [default: datas/Market1501/]
Test Data Options:
--queryList= Query files list txt [default: datas/Market1501/query.txt]
--queryIMpath= Query sketch images path prefix [default: datas/Market1501/]
--galleryList= Gallery files list txt [default: datas/Market1501/gallery.txt]
--galleryIMpath= Gallery sketch images path prefix [default: datas/Market1501/]
Learning Rate Options:
--learning_rate= Learning rate for training process [default: 0.01]
--weight_decay= Weight decay for training process [default: 0.0005]
--momentum= Momentum for the SGD Optimizer [default: 0.9]--base_not_train If don't train base network.
--base_lr_scale= Learing rate scale rate for the base network [default: 0.1]
--tail_not_train If don't train tail module, when w/o pcb and w/o rpp.
--tail_lr_scale= Learing rate scale rate for the tail module.
--rpp_not_train If don't train the rpp module.
--rpp_lr_scale= Learing rate scale rate for the rpp module.
```## Note
I will introduce my code in this part.
The `experiment` contains all scripts used for experiments.### data
In the `data` package, I put the scripts about *loading data* and *save snapshots*.
+ `textdataset.py` defines the TextDataset, which define the dataset defined by a text file.
+ `transform.py` defines the classes Transformer.
- `ListTransformer` is used for training.
- `Market1501_Transformer` is used for market_1501 and duke testing.
+ `saver.py` defines a object for saving snapshots, only when the result is better.### metric
In the `reidmetric.py`, which define a metric class designed for Re-ID.### model
The `model` package contains the design of the model structures.
The `PCBRPPNet` is the implementation of the **beyond part model**.
Params `withpcb`, `withrpp` and `feature_shared_weight` are designed for the different situation.### process
I defined two kind of processor to control the processes of the training and testing.
We could implement function for our own experiments.
+ `epochprocessor.py` is designed for using *epoch* to control the number of training times
+ `iterprocessor.py` is designed for using *iter* to control the number of training times## Acknowledgement