{"id":22066379,"url":"https://github.com/jaketae/mlp-mixer","last_synced_at":"2025-09-07T10:33:51.008Z","repository":{"id":100151060,"uuid":"365688533","full_name":"jaketae/mlp-mixer","owner":"jaketae","description":"PyTorch implementation of MLP-Mixer: An all-MLP Architecture for Vision","archived":false,"fork":false,"pushed_at":"2021-05-11T13:31:47.000Z","size":14,"stargazers_count":23,"open_issues_count":2,"forks_count":6,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-05-13T02:01:45.730Z","etag":null,"topics":["computer-vision","image-classification","mlp","pytorch"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jaketae.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-05-09T06:59:38.000Z","updated_at":"2025-03-15T16:09:18.000Z","dependencies_parsed_at":"2023-04-07T20:47:01.726Z","dependency_job_id":null,"html_url":"https://github.com/jaketae/mlp-mixer","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/jaketae/mlp-mixer","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jaketae%2Fmlp-mixer","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jaketae%2Fmlp-mixer/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jaketae%2Fmlp-mixer/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jaketae%2Fmlp-mixer/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jaketae","download_url":"https://codeload.github.com/jaketae/mlp-mixer/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jaketae%2Fmlp-mixer/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":274026691,"owners_count":25209739,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-07T02:00:09.463Z","response_time":67,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["computer-vision","image-classification","mlp","pytorch"],"created_at":"2024-11-30T19:27:55.370Z","updated_at":"2025-09-07T10:33:50.892Z","avatar_url":"https://github.com/jaketae.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# MLP Mixer\n\nPyTorch implementation of [MLP-Mixer: An all-MLP Architecture for Vision](https://arxiv.org/abs/2105.01601).\n\n![alt text](https://miro.medium.com/max/3398/1*cUHd6G9jjwl9F7xXv_SBEw.jpeg)\n\n## Quickstart\n\nClone this repository.\n\n```\ngit clone https://github.com/jaketae/mlp-mixer.git\n```\n\nNavigate to the cloned directory. You can start using the model via\n\n```python\n\u003e\u003e\u003e from mlp_mixer import MLPMixer\n\u003e\u003e\u003e model = MLPMixer()\n```\n\nBy default, the model comes with the following parameters:\n\n```python\nMLPMixer(\n    image_size=256,\n    patch_size=16,\n    in_channels=3,\n    num_features=128,\n    expansion_factor=2,\n    num_layers=8,\n    num_classes=10,\n    dropout=0.5,\n)\n```\n\n## Summary\n\nConvolutional Neural Networks (CNNs) and transformers are two mainstream model architectures currently dominating computer vision and natural language processing. The authors of the paper, however, empirically show that neither convolution nor self-attenion are necessary; in fact, muti-layered perceptrons (MLPs) can also serve as a strong baseline. The authors present MLP-Mixer, an all-MLP mode architecture, that contains two types of layers: a token-mixing layer and a channel-mixing layer. Each of the layers \"mix\" per-location and per-feature information in the input. MLP-Mixer performs comparably to other state-of-the-art models, such as [ViT](https://arxiv.org/abs/2010.11929) or [EfficientNet](https://arxiv.org/abs/1905.11946).\n\n## Resources\n\n- [Original Paper](https://arxiv.org/abs/2105.01601)\n- [Phil Wang's implementation](https://github.com/lucidrains/mlp-mixer-pytorch)\n- [Rishikesh's implementation](https://github.com/rishikksh20/MLP-Mixer-pytorch)\n- [Image from Nabil Madali's Medium article](https://medium.com/@nabil.madali/an-all-mlp-architecture-for-vision-7e7e1270fd33)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjaketae%2Fmlp-mixer","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjaketae%2Fmlp-mixer","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjaketae%2Fmlp-mixer/lists"}