Personal information

Activities

Employment (2)

University of Washington: Seattle, Washington, US

2021-03 to present | Research Assistant (Paul G. Allen School of Computer Science & Engineering)
Employment
Source: Self-asserted source
Zihao Ye

Amazon Web Services Inc: Shanghai, CN

2018-09 to 2021-02
Employment
Source: Self-asserted source
Zihao Ye

Education and qualifications (2)

University of Washington: Seattle, Washington, US

2021-03 to 2026 | PhD student (Paul G. Allen School of Computer Science & Engineering)
Education
Source: Self-asserted source
Zihao Ye

Shanghai Jiao Tong University: Shanghai, Shanghai, CN

2014-09 to 2018-06 | undergraduate (Department of Computer Science and Engineering)
Education
Source: Self-asserted source
Zihao Ye

Works (6)

Atom: Low-Bit Quantization for Efficient and Accurate LLM Serving

Proceedings of Machine Learning and Systems
2024 | Conference paper
URI:

https://proceedings.mlsys.org/paper_files/paper/2024/file/5edb57c05c81d04beb716ef1d542fe9e-Paper-Conference.pdf

Contributors: Zhao, Yilong; Lin, Chien-Yu; Zhu, Kan; Ye, Zihao; Chen, Lequn; Zheng, Size; Ceze, Luis; Krishnamurthy, Arvind; Chen, Tianqi; Kasikci, Baris et al.
Source: Self-asserted source
Zihao Ye

Punica: Multi-Tenant LoRA Serving

Proceedings of Machine Learning and Systems
2024 | Conference paper
URI:

https://proceedings.mlsys.org/paper_files/paper/2024/file/054de805fcceb78a201f5e9d53c85908-Paper-Conference.pdf

Contributors: Chen, Lequn; Ye, Zihao; Wu, Yongji; Zhuo, Danyang; Ceze, Luis; Krishnamurthy, Arvind; P. Gibbons; G. Pekhimenko; C. De Sa
Source: Self-asserted source
Zihao Ye

SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3
2023-03-25 | Conference paper
Contributors: Zihao Ye; Ruihang Lai; Junru Shao; Tianqi Chen; Luis Ceze
Source: Self-asserted source
Zihao Ye

TensorIR: An Abstraction for Automatic Tensorized Program Optimization

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2
2023-01-27 | Conference paper
Contributors: Siyuan Feng; Bohan Hou; Hongyi Jin; Wuwei Lin; Junru Shao; Ruihang Lai; Zihao Ye; Lianmin Zheng; Cody Hao Yu; Yong Yu et al.
Source: Self-asserted source
Zihao Ye

Graphiler: Optimizing Graph Neural Networks with Message Passing Data Flow Graph

Proceedings of Machine Learning and Systems 2022, MLSys 2022, Santa Clara, CA, USA, August 29 - September 1, 2022
2022 | Conference paper
URI:

https://proceedings.mlsys.org/paper/2022/hash/a87ff679a2f3e71d9181a67b7542122c-Abstract.html

Contributors: Zhiqiang Xie; Minjie Wang; Zihao Ye; Zheng Zhang; Rui Fan; Diana Marculescu; Yuejie Chi; Carole-Jean Wu
Source: Self-asserted source
Zihao Ye

FeatGraph: A Flexible and Efficient Backend for Graph Neural Network Systems

SC20: International Conference for High Performance Computing, Networking, Storage and Analysis
2020-11 | Conference paper
Contributors: Yuwei Hu; Zihao Ye; Minjie Wang; Jiali Yu; Da Zheng; Mu Li; Zheng Zhang; Zhiru Zhang; Yida Wang
Source: Self-asserted source
Zihao Ye