https://github.com/ahmdtaha/tf_learning_to_count

(ICCV2017) Tensorflow implementation of Representation Learning by Learning to Count
computer-vision deep-learning deep-neural-networks machine-learning nvidia-dali self-supervised-learning tensorflow tensorpack unsupervised-learning

Host: GitHub
URL: https://github.com/ahmdtaha/tf_learning_to_count
Owner: ahmdtaha
Created: 2020-11-25T21:50:39.000Z (almost 5 years ago)
Default Branch: main
Last Pushed: 2021-02-20T20:30:51.000Z (over 4 years ago)
Last Synced: 2025-05-15T13:51:24.123Z (5 months ago)
Topics: computer-vision, deep-learning, deep-neural-networks, machine-learning, nvidia-dali, self-supervised-learning, tensorflow, tensorpack, unsupervised-learning
Language: Python
Homepage:
Size: 264 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# Learning to Count

This is an _unofficial_ TensorFlow implementation of **Representation Learning by Learning to Count (Lrn-Cnt)**. Lrn-Cnt is a self-supervised visual representation learning approach. It was published in 2017 and no longer achieves SOTA results. However, it is a simple approach that serves as a nice tutorial for self-supervised learning. Lrn-Cnt is illustrated in this [blog](https://medium.com/p/787ba421611e).

### TL;DR

The AlexNet used is [here](https://github.com/ahmdtaha/tf_unsupervised_count/blob/main/nets/alexnet.py). This AlexNet version is inspired by [Zhang _et al._ [2]](https://github.com/richzhang/colorization/blob/caffe/train_alexnet/train_val_nobn.prototxt). The self-supervised representation-count loss is [here](https://github.com/ahmdtaha/tf_unsupervised_count/blob/4f9d4717047f50712e46cad15d9ed09de9351fdb/nets/model.py#L199). The supervised linear-classifier loss is [here](https://github.com/ahmdtaha/tf_unsupervised_count/blob/4f9d4717047f50712e46cad15d9ed09de9351fdb/nets/model.py#L120).
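For readers who want the idea before opening `model.py`, the following is a minimal NumPy sketch of the counting loss as formulated in the paper [1]: the counting vector of a downsampled image should equal the sum of the counting vectors of its four tiles, while a contrastive term with a margin keeps the network from collapsing to the all-zero solution. The function name, argument names, array shapes, and the margin default are assumptions made for illustration. This is not a copy of the linked TensorFlow code, which may differ in details.

```python
import numpy as np


def count_loss(phi_down_x, phi_tiles_x, phi_down_y, margin=10.0):
    """Illustrative counting loss (hypothetical helper, not the repo's API).

    phi_down_x : (B, K) counting vectors of the downsampled image x
    phi_tiles_x: (B, 4, K) counting vectors of the 2x2 tiles of x
    phi_down_y : (B, K) counting vectors of a different downsampled image y
    margin     : contrastive margin (assumed default)
    """
    # Sum the counting vectors of the four tiles of x.
    tile_sum = phi_tiles_x.sum(axis=1)

    # Counting constraint: downsampled x should "count" the same as its tiles.
    same = np.sum((phi_down_x - tile_sum) ** 2, axis=1)

    # Contrastive term: a different image y should count differently,
    # by at least `margin` in squared distance.
    diff = np.sum((phi_down_y - tile_sum) ** 2, axis=1)
    contrast = np.maximum(0.0, margin - diff)

    # Average over the batch.
    return float(np.mean(same + contrast))
```

Note that if the network outputs all zeros, the first term vanishes but the contrastive term equals the margin, so the trivial solution is penalized.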

## Requirements

* Python 3+ [Tested on 3.6.10]

* Tensorflow 1.X [Tested on 1.14]

* TensorPack [Tested on 0.10.1]

* Nvidia-DALI [Tested on 0.20.0]

* Nvidia-DALI-Plugin [Tested on 0.20.0]

We use CUDA 10.0.130 and cuDNN v7.6.5.

Our TensorFlow model and loss function are simple. However, to train the model *efficiently*, we needed the [TensorPack](https://tensorpack.readthedocs.io/) and [Nvidia-DALI](https://docs.nvidia.com/deeplearning/dali/user-guide/docs/) libraries. If linking these libraries is challenging, feel free to remove them from the code and use TensorFlow only (e.g., `tf.data.Dataset`). The requirements.txt lists all our installed packages and their versions.

[//]: # "## ImageNet Pretrained Models"

## Usage example

To run the unsupervised representation-count phase:

`python counter_trn.py --batch_size 256 --exp_name cnt_gpu4_bz256_adam_250 --net alexnet --learning_rate 0.0001 --gpu 0,1,2,3 --logits_dim 1000 --opt adam --epoch 250 --learning_rate_var_name lr_cnt`

To run the supervised linear-classifier phase:

`python classifier_trn.py --exp_name cls_gpu2_bz256_adam_250 --pretrained --cnt_exp_name cnt_gpu4_bz256_adam_250 --batch_size 256 --learning_rate 0.001 --opt adam --epoch 250 --net alexnet --learning_rate_var_name lr_cls --gpu 0,1`
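The two commands above are linked: the classifier's `--cnt_exp_name` repeats the counter's `--exp_name`, presumably so that the second phase can locate the weights trained in the first. The sketch below makes that dependency explicit by building both argument lists from one shared experiment name. The helper names `build_commands` and `as_shell` are hypothetical and not part of the repository; the flags and values are copied from the commands above, with the learning-rate variable flag written out in full as in the first command.

```python
import shlex


def build_commands(cnt_exp="cnt_gpu4_bz256_adam_250",
                   cls_exp="cls_gpu2_bz256_adam_250"):
    """Return (counter_cmd, classifier_cmd) as argument lists."""
    counter_cmd = [
        "python", "counter_trn.py",
        "--batch_size", "256",
        "--exp_name", cnt_exp,
        "--net", "alexnet",
        "--learning_rate", "0.0001",
        "--gpu", "0,1,2,3",
        "--logits_dim", "1000",
        "--opt", "adam",
        "--epoch", "250",
        "--learning_rate_var_name", "lr_cnt",
    ]
    classifier_cmd = [
        "python", "classifier_trn.py",
        "--exp_name", cls_exp,
        "--pretrained",
        # Must match the counter phase's --exp_name.
        "--cnt_exp_name", cnt_exp,
        "--batch_size", "256",
        "--learning_rate", "0.001",
        "--opt", "adam",
        "--epoch", "250",
        "--net", "alexnet",
        "--learning_rate_var_name", "lr_cls",
        "--gpu", "0,1",
    ]
    return counter_cmd, classifier_cmd


def as_shell(cmd):
    """Render an argument list as a shell-safe command string."""
    return " ".join(shlex.quote(arg) for arg in cmd)
```

To actually launch the phases, each list could be passed in order to `subprocess.run(cmd, check=True)`, so that the classifier only starts if the counter phase exits successfully.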

The following table compares our implementation with the results reported in the paper [1].

| ImageNet Performance          | conv1 | conv2 | conv3 | conv4 | conv5 |
|-------------------------------|-------|-------|-------|-------|-------|
| Noroozi et al. [1] (Table 2)  | 18.0  | 30.6  | 34.3  | 32.5  | 25.7  |
| Ours                          | 18.6  | 30.3  | 33.6  | 30.1  | 24.9  |

    

![Our implementation performance](./imgs/cls_performance.jpg)    
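To read the table at a glance, the per-layer gap between our numbers and the paper's can be computed directly. The snippet below is only arithmetic on the values in the table above; the variable names are illustrative.

```python
layers = ["conv1", "conv2", "conv3", "conv4", "conv5"]
paper = [18.0, 30.6, 34.3, 32.5, 25.7]  # paper [1] row of the table
ours = [18.6, 30.3, 33.6, 30.1, 24.9]   # "Ours" row of the table

# Positive gap: our implementation scores higher than the paper.
gaps = {layer: round(o - p, 1) for layer, p, o in zip(layers, paper, ours)}

# Layer where the two sets of numbers disagree the most.
largest = max(gaps, key=lambda layer: abs(gaps[layer]))

for layer in layers:
    print(f"{layer}: {gaps[layer]:+.1f}")
print("largest gap:", largest)
```

Our implementation is within one point of the paper on every layer except conv4, where it trails by 2.4 points.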

### TODO LIST

* Highlight the image resize issue

* Highlight the I/O performance issue

* Document the discrepancy between the paper and our implementation

Contributor list
----------------

1. [Ahmed Taha](http://www.ahmed-taha.com)

2. [Alex Hanson](https://github.com/j-alex-hanson)

* We train the unsupervised representation-count phase for 250 epochs and the supervised linear-classifier phase for 250 epochs. This leads to a long training time. If someone has an idea for converging faster (e.g., a better learning rate scheduler), please share it through a GitHub issue.

* It would be great if someone re-implemented this in PyTorch. Let me know and I will add a link to your PyTorch implementation here.

### MISC Notes

* Our implementation is inspired by [CLVR's implementation](https://github.com/clvrai/Representation-Learning-by-Learning-to-Count). However, CLVR's implementation has a serious bug and performance issues that need to be fixed. These issues are discussed [here](https://github.com/ahmdtaha/tf_unsupervised_count/blob/main/docs/clvr_bug.md).

* Our implementation diverges from the technical details of the paper [1]. We explain this discrepancy [here](https://github.com/ahmdtaha/tf_unsupervised_count/blob/main/docs/paper_discrepancy.md).

## Release History

* 1.0.0

    * First commit on 25 Nov 2020

    * First code commit on 2 Dec 2020

## References

[1] Noroozi, M., Pirsiavash, H., Favaro, P. Representation Learning by Learning to Count. ICCV 2017.

[2] Zhang, R., Isola, P., Efros, A. A. Colorful Image Colorization. ECCV 2016.
