https://github.com/sail-sg/adan
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
- Host: GitHub
- URL: https://github.com/sail-sg/adan
- Owner: sail-sg
- License: apache-2.0
- Created: 2022-09-01T10:34:27.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2025-06-08T14:35:41.000Z (8 months ago)
- Last Synced: 2025-06-08T15:29:34.654Z (8 months ago)
- Topics: adan, artificial-intelligence, bert-model, convnext, cuda-programming, deep-learning, diffusion, dreamfusion, fairseq, gpt2, llm-training, llms, mae, moe, optimizer, pytorch, resnet, timm, transformer-xl, vit
- Language: Python
- Homepage:
- Size: 1.3 MB
- Stars: 792
- Watchers: 7
- Forks: 68
- Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE