https://github.com/philipperemy/tensorflow-class-activation-mapping

Learning Deep Features for Discriminative Localization (2016)
https://github.com/philipperemy/tensorflow-class-activation-mapping

cnn deep-learning discriminative-image-regions lenet machine-learning tensorflow

Last synced: about 2 months ago
JSON representation

Learning Deep Features for Discriminative Localization (2016)

Host: GitHub
URL: https://github.com/philipperemy/tensorflow-class-activation-mapping
Owner: philipperemy
License: mit
Created: 2017-01-27T02:41:31.000Z (over 8 years ago)
Default Branch: master
Last Pushed: 2022-11-15T05:01:19.000Z (almost 3 years ago)
Last Synced: 2024-12-10T10:02:34.561Z (10 months ago)
Topics: cnn, deep-learning, discriminative-image-regions, lenet, machine-learning, tensorflow
Language: Python
Size: 54 MB
Stars: 151
Watchers: 9
Forks: 42
Open Issues: 0
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE

Awesome Lists containing this project

README

          # Learning Deep Features for Discriminative Localization

**Tensorflow Implementation**

https://arxiv.org/abs/1512.04150

## How to use it?

```bash

# For Python 3. Should also work for Python2 :)

git clone git@github.com:philipperemy/tensorflow-class-activation-mapping.git

cd tensorflow-class-activation-mapping

pip3 install -r requirements.txt

mkdir /tmp/img

tar xvzf data/mnist_cluttered.tar.gz -C /tmp/img

python3 mnist.py # The heatmaps are available in out/

```

## Abstract

*In this work, we revisit the global average pooling layer and shed light on how it explicitly enables the convolutional neural network to have remarkable localization ability despite being trained on image-level labels. While this technique was previously proposed as a means for regularizing training, we find that it actually builds a generic localizable deep representation that can be applied to a variety of tasks. Despite the apparent simplicity of global average pooling, we are able to achieve 37.1% top-5 error for object localization on ILSVRC 2014, which is remarkably close to the 34.2% top-5 error achieved by a fully supervised CNN approach. We demonstrate that our network is able to localize the discriminative image regions on a variety of tasks despite not being trained for them.*

## Class Activation Mapping and Class-specific Saliency Map

*We propose a technique for generating class activation maps using the global average pooling (GAP) in CNNs. A class activation map for a particular category indicates the discriminative image regions used by the CNN to identify that category. The procedure for generating these maps is illustrated as follows:*



  





*Class activation maps could be used to intepret the prediction decision made by the CNN. The left image below shows the class activation map of top 5 predictions respectively, you can see that the CNN is triggered by different semantic regions of the image for different predictions. The right image below shows the CNN learns to localize the common visual patterns for the same object class.*



  





## MNIST cluttered dataset with LeNet

*LeNet* was used on the [MNIST cluttered dataset]((https://github.com/deepmind/mnist-cluttered)), provided by DeepMind.

We removed the last fully connected layers and replaced them with the **Class Activation Map**.

The accuracy was **97%** after 50 epochs (Final accuracy was **97.6%**). Here is an example of an input image (9):



  





The output of the **Class Activation Map** is provided below, at different steps:



  





The overall accuracy on the testing set, per step:



  





## CALTECH256 dataset with AlexNet pretrained with ImageNet

Coming soon!

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/philipperemy/tensorflow-class-activation-mapping

Awesome Lists containing this project

README