Improving Computation and Memory Efficiency for Real-world Transformer Inference on GPUs

Description scientific article published on 26 August 2023
Authors Jiazhi Jiang, Jiangsu Du, Jiang Zheng, Hongbin Zhang, Yutong Lu, Dan Huang

Publication date August 26, 2023