FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement

Description scientific article published on 30 May 2023
Author/s Xupeng Miao, Xiaonan Nie, Jilong Xue, Lingxiao Ma, Zichao Yang, Bin Cui, Gang Cao, Zilong Wang

Publication date May 30, 2023