Search filters

SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification

Image Image of a generic work. The text above it indicates that there is no free image of the work available, and that if you own one, you can click on the placeholder link to upload it.
Description scientific article published on 24 April 2024
Author/s

author: Xupeng Miao  Xinhao Cheng  Gabriele Oliaro  Alan Zhu  Zhengxin Zhang  Rae Ying Yee Wong  Xiaoxiang Shi  Zhuoming Chen  Chunan Shi  Reyna Abhyankar  Daiyaan Arfeen  Zeyu Wang  Zhihao Zhang  Lijie Yang  Zhihao Jia 

Publication date April 24, 2024
Language
Country of origin
Wikipedia link
Copyright status
Missing/wrong data? Edit Wikidata item