Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sayakpaul/convnext-tf
Includes PyTorch -> Keras model porting code for ConvNeXt family of models with fine-tuning and inference notebooks.
- Host: GitHub
- URL: https://github.com/sayakpaul/convnext-tf
- Owner: sayakpaul
- License: apache-2.0
- Created: 2022-01-13T07:55:19.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-03-27T06:56:09.000Z (over 2 years ago)
- Last Synced: 2024-10-23T09:59:22.465Z (14 days ago)
- Topics: computer-vision, convolutional-neural-networks, efficient-models, imagenet-1k, imagenet-21k, keras, tensorflow
- Language: Jupyter Notebook
- Homepage: https://tfhub.dev/sayakpaul/collections/convnext/1
- Size: 2.47 MB
- Stars: 100
- Watchers: 6
- Forks: 13
- Open Issues: 1
- Metadata Files:
  - Readme: README.md
  - License: LICENSE
README
# ConvNeXt-TF
This repository provides TensorFlow / Keras implementations of different ConvNeXt
[1] variants. It also provides the TensorFlow / Keras models that have been
populated with the original ConvNeXt pre-trained weights available from [2]. These
models are not black-box SavedModels, i.e., they can be fully expanded into `tf.keras.Model`
objects, and all the usual utility functions can be called on them (for example, `.summary()`).

As of today, all the TensorFlow / Keras variants of the models listed
[here](https://github.com/facebookresearch/ConvNeXt#results-and-pre-trained-models)
are available in this repository except for the
[isotropic ones](https://github.com/facebookresearch/ConvNeXt#imagenet-1k-trained-models-isotropic).
This list includes the ImageNet-1k as well as ImageNet-21k models.

Refer to the ["Using the models"](https://github.com/sayakpaul/ConvNeXt-TF#using-the-models)
section to get started. Additionally, here's a [related blog post](https://sayak.dev/convnext-tfhub/)
that jots down my experience.

## Conversion
TensorFlow / Keras implementations are available in `models/convnext_tf.py`.
Conversion utilities are in `convert.py`.
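For intuition, here is a minimal sketch of the kind of axis shuffling such a port typically involves. It is illustrative only: the `port_conv2d` / `port_dense` helpers below are hypothetical and not part of this repo; `convert.py` is the authoritative reference.

```py
import tensorflow as tf
import torch

# Illustrative helpers (not from convert.py): copying PyTorch weights into
# already-built Keras layers of matching shape.

def port_conv2d(pt_conv: torch.nn.Conv2d, tf_conv: tf.keras.layers.Conv2D):
    # PyTorch Conv2d kernels are (out, in, kH, kW); Keras expects (kH, kW, in, out).
    kernel = pt_conv.weight.detach().numpy().transpose(2, 3, 1, 0)
    bias = pt_conv.bias.detach().numpy()
    tf_conv.set_weights([kernel, bias])

def port_dense(pt_linear: torch.nn.Linear, tf_dense: tf.keras.layers.Dense):
    # PyTorch Linear kernels are (out, in); Keras Dense expects (in, out).
    kernel = pt_linear.weight.detach().numpy().T
    bias = pt_linear.bias.detach().numpy()
    tf_dense.set_weights([kernel, bias])
```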
## Models

The converted models are available on [TF-Hub](https://tfhub.dev/sayakpaul/collections/convnext/1).
There should be a total of 15 different models, each having two variants: classifier and
feature extractor. You can load any model and get started like so:

```py
import tensorflow as tf

model_gcs_path = "gs://tfhub-modules/sayakpaul/convnext_tiny_1k_224/1/uncompressed"
model = tf.keras.models.load_model(model_gcs_path)
print(model.summary(expand_nested=True))
```

The model names are interpreted as follows:
* `convnext_large_21k_1k_384`: This means that the model was first pre-trained
on the ImageNet-21k dataset and was then fine-tuned on the ImageNet-1k dataset.
Resolution used during pre-training and fine-tuning: 384x384. `large` denotes
the topology of the underlying model.
* `convnext_large_1k_224`: This means that the model was pre-trained on the ImageNet-1k
dataset with a resolution of 224x224.
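As a quick sanity check of the naming scheme, a `*_224` classifier expects 224x224 inputs and an ImageNet-1k variant should produce 1000-way predictions. A minimal sketch, reusing the GCS path from the example above (a real image with proper preprocessing would replace the random tensor):

```py
import tensorflow as tf

# convnext_tiny_1k_224 -> ImageNet-1k classifier trained at 224x224.
model = tf.keras.models.load_model(
    "gs://tfhub-modules/sayakpaul/convnext_tiny_1k_224/1/uncompressed"
)

dummy = tf.random.uniform((1, 224, 224, 3))  # stand-in for a preprocessed image batch
preds = model(dummy, training=False)
print(preds.shape)  # expected: (1, 1000) for an ImageNet-1k classifier
```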
## Results

Results are on the ImageNet-1k validation set (top-1 accuracy).
| name | original acc@1 | keras acc@1 |
|:---:|:---:|:---:|
| convnext_tiny_1k_224 | 82.1 | 81.312 |
| convnext_small_1k_224 | 83.1 | 82.392 |
| convnext_base_1k_224 | 83.8 | 83.28 |
| convnext_base_1k_384 | 85.1 | 84.876 |
| convnext_large_1k_224 | 84.3 | 83.844 |
| convnext_large_1k_384 | 85.5 | 85.376 |
| | | |
| convnext_base_21k_1k_224 | 85.8 | 85.364 |
| convnext_base_21k_1k_384 | 86.8 | 86.79 |
| convnext_large_21k_1k_224 | 86.6 | 86.36 |
| convnext_large_21k_1k_384 | 87.5 | 87.504 |
| convnext_xlarge_21k_1k_224 | 87.0 | 86.732 |
| convnext_xlarge_21k_1k_384 | 87.8 | 87.68 |

Differences in the results are primarily due to differences in the library
implementations, especially in how image resizing is implemented in PyTorch and
TensorFlow. Results can be verified with the code in `i1k_eval`. Logs
are available at [this URL](https://tensorboard.dev/experiment/odN7OPCqQvGYCRpJP1GhRQ/).
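To see how much the resizing choice alone can matter, here is a small illustrative comparison of two bicubic resizes in TensorFlow. This is not the exact preprocessing used in either evaluation pipeline; it only shows that seemingly equivalent resizing options produce different pixel values, which shifts model inputs and hence accuracy.

```py
import tensorflow as tf

# Two "bicubic" resizes of the same image that differ only in antialiasing.
image = tf.random.uniform((1, 500, 375, 3))
resized_a = tf.image.resize(image, (224, 224), method="bicubic", antialias=True)
resized_b = tf.image.resize(image, (224, 224), method="bicubic", antialias=False)

# The per-pixel difference is non-zero even though both calls are "bicubic".
print(float(tf.reduce_max(tf.abs(resized_a - resized_b))))
```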
## Using the models

**Pre-trained models**:
* Off-the-shelf classification: [Colab Notebook](https://colab.research.google.com/github/sayakpaul/ConvNeXt-TF/blob/main/notebooks/classification.ipynb)
* Fine-tuning: [Colab Notebook](https://colab.research.google.com/github/sayakpaul/ConvNeXt-TF/blob/main/notebooks/finetune.ipynb) (a minimal sketch follows below)
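For orientation, a minimal fine-tuning sketch along these lines is shown below. The feature-extractor path and the 10-class head are placeholders, not actual handles from this collection; the fine-tuning notebook is the authoritative reference.

```py
import tensorflow as tf

# Hypothetical path to a ConvNeXt feature-extractor SavedModel; substitute the
# actual handle from the TF-Hub collection.
backbone = tf.keras.models.load_model(
    "gs://tfhub-modules/sayakpaul/convnext_tiny_1k_224_fe/1/uncompressed"
)
backbone.trainable = False  # linear probing; set to True for full fine-tuning

inputs = tf.keras.Input(shape=(224, 224, 3))
features = backbone(inputs, training=False)
outputs = tf.keras.layers.Dense(10, activation="softmax")(features)  # placeholder head

model = tf.keras.Model(inputs, outputs)
model.compile(
    optimizer="adam",
    loss="sparse_categorical_crossentropy",
    metrics=["accuracy"],
)
# model.fit(train_ds, validation_data=val_ds, ...)  # supply your own tf.data pipelines
```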
**Randomly initialized models**:
```py
from models.convnext_tf import get_convnext_model

# Builds a randomly initialized ConvNeXt model with the default configuration.
convnext_tiny = get_convnext_model()
print(convnext_tiny.summary(expand_nested=True))
```
To view different model configurations, refer [here](https://github.com/sayakpaul/ConvNeXt-TF/blob/main/models/model_configs.py).
## Upcoming (contributions welcome)

- [ ] Align layer initializers (useful if someone wanted to train the models
from scratch)
- [ ] Allow the models to accept arbitrary shapes (useful for downstream tasks)
- [ ] Convert the [isotropic models](https://github.com/facebookresearch/ConvNeXt#imagenet-1k-trained-models-isotropic) as well
- [x] Fine-tuning notebook (thanks to [awsaf49](https://github.com/awsaf49))
- [x] Off-the-shelf-classification notebook
- [x] Publish models on TF-Hub

## References
[1] ConvNeXt paper: https://arxiv.org/abs/2201.03545
[2] Official ConvNeXt code: https://github.com/facebookresearch/ConvNeXt
## Acknowledgements
* [Vasudev Gupta](https://github.com/vasudevgupta7)
* [Gus](https://twitter.com/gusthema)
* [Willi](https://ch.linkedin.com/in/willi-gierke)
* [ML-GDE program](https://developers.google.com/programs/experts/)