https://github.com/guyulongcs/Deep-Learning-for-Search-Recommendation-Advertisements/blob/master/07_LLM/LLM/2022%20%28Google%29%20%28JMLR%29%20%5BSwitchTransfomers%5D%20Switch%20Transformers%20-%20Scaling%20to%20Trillion%20Parameter%20Models%20with%20Simple%20and%20Efficient%20Sparsity.pdf
Last synced: 14 days ago
JSON representation