Search filters

TwinPilots: A New Computing Paradigm for GPU-CPU Parallel LLM Inference

Image Image of a generic work. The text above it indicates that there is no free image of the work available, and that if you own one, you can click on the placeholder link to upload it.
Description scientific article published on 16 September 2024
Author/s

author: Chengye Yu  Linjie Zhu  Zili Shao  Xu Zhou  Song Jiang  Tianyu Wang 

Publication date September 16, 2024
Language
Country of origin
Wikipedia link
Copyright status
Missing/wrong data? Edit Wikidata item