Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/matlab-deep-learning/convmixer-patches-are-all-you-need
ConvMixer - Patches Are All You Need?
https://github.com/matlab-deep-learning/convmixer-patches-are-all-you-need
cifar10 cifar10-classification deep-learning example image-classification matlab matlab-deep-learning pretrained-models
Last synced: 3 months ago
JSON representation
ConvMixer - Patches Are All You Need?
- Host: GitHub
- URL: https://github.com/matlab-deep-learning/convmixer-patches-are-all-you-need
- Owner: matlab-deep-learning
- License: other
- Created: 2021-12-06T18:05:34.000Z (almost 3 years ago)
- Default Branch: master
- Last Pushed: 2021-12-08T18:59:14.000Z (almost 3 years ago)
- Last Synced: 2024-07-27T12:47:05.220Z (4 months ago)
- Topics: cifar10, cifar10-classification, deep-learning, example, image-classification, matlab, matlab-deep-learning, pretrained-models
- Language: MATLAB
- Homepage: https://www.mathworks.com/products/deep-learning.html
- Size: 7.7 MB
- Stars: 6
- Watchers: 3
- Forks: 2
- Open Issues: 5
-
Metadata Files:
- Readme: README.md
- License: license.txt
- Security: SECURITY.md
Awesome Lists containing this project
- MATLAB-Deep-Learning-Model-Hub - ConvMixer - |[GitHub](https://github.com/matlab-deep-learning/convmixer-patches-are-all-you-need) | (Image Classification <a name="ImageClassification"/> / Robotics)
README
# ConvMixer -- Patches are all you need?
This demo shows how to implement and train a ConvMixer architecture for image classification with MATLAB®, as described in the paper "Patches are all you need?" https://openreview.net/forum?id=TVHS5Y4dNvM
The ConvMixer architecture employs a Patch Embedding representation of the input followed by repeated fully-convolutional blocks.
![ConvMixer Architecture](images/convMixer.png)
## How to get started
Start the project ConvMixer.prj to add to the path the relevant functions. There are examples in the `convmixer/examples` folder to get you started with training a ConvMixer for the digits dataset and the CIFAR-10 dataset [1].
The latter employs the ADAM algorithm with fixed weight decay regularization, as described in [2].
Training a ConvMixer for the CIFAR-10 architecture can be demanding in terms of computational resources: in the same `convmixer/examples` folder you can find a pretrained network. This model was trained on the CIFAR-10, available at https://www.cs.toronto.edu/~kriz/cifar-10-matlab.tar.gz
The source code for building the architecture is in the `convmixer/convmixer` directory.
## Requirements
- MATLAB® R2021b or later
- Deep Learning Toolbox™## License
The license is available in the license file within this repository.
Copyright 2021 The MathWorks, Inc.
[1] Krizhevsky, Alex. "Learning multiple layers of features from tiny images." (2009). https://www.cs.toronto.edu/~kriz/learning-features-2009-TR.pdf
[2] Loshchilov, Ilya, and Frank Hutter. "Fixing weight decay regularization in ADAM." (2018). https://openreview.net/forum?id=rk6qdGgCZ