https://github.com/guyulongcs/Deep-Learning-for-Search-Recommendation-Advertisements/blob/master/07_Large_Model/MOE/2017%20%28Google%29%20%28ICLR%29%20%5BSparsely-Gated%20MOE%5D%20Outrageously%20large%20neural%20networks%20-%20The%20sparsely-gated%20mixture-of-experts%20layer.pdf
Last synced: 3 months ago
JSON representation