Activities

Works (3)

CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving

2024-08-04 | Conference paper
Contributors: Yuhan Liu; Hanchen Li; Yihua Cheng; Siddhant Ray; Yuyang Huang; Qizheng Zhang; Kuntai Du; Jiayi Yao; Shan Lu; Ganesh Ananthanarayanan et al.
Source: Crossref

Eloquent: A More Robust Transmission Scheme for LLM Token Streaming

2024-08-04 | Conference paper
Contributors: Hanchen Li; Yuhan Liu; Yihua Cheng; Siddhant Ray; Kuntai Du; Junchen Jiang
Source: Crossref

Optimizing Real-Time Video Experience with Data Scalable Codec

2023-09-10 | Conference paper
Contributors: Hanchen Li; Yihua Cheng; Ziyi Zhang; Qizheng Zhang; Anton Arapin; Nick Feamster; Amrita Mazumdar
Source: Crossref