https://github.com/guyulongcs/Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising/blob/master/06_LLM/02_LLM_Classical/2022%20%28Google%29%20%28JMLR%29%20%5BSwitchTransfomers%5D%20Switch%20Transformers%20-%20Scaling%20to%20Trillion%20Parameter%20Models%20with%20Simple%20and%20Efficient%20Sparsity.pdf
Last synced: 4 months ago
JSON representation