Iman Mirzadeh
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Keivan Alizadeh, Iman Mirzadeh, Dmitry Belenko, Karen Khatamifard, Minsik Cho, Carlo C Del Mundo, Mohammad Rastegari, Mehrdad Farajtabar
April, 2024
PDF
Hacker News
Financial Times
Type: Conference paper
Publication: The 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024
Iman Mirzadeh
Machine Learning Research Engineer