LLMA

Creator

Seonglae Cho

Created

2023 Apr 12 14:52

Editor

Seonglae Cho

Edited

2023 Aug 29 8:29

Refs

Accelerates inference, which is currently the bottleneck.

Inference with Reference: Lossless Acceleration of Large Language Models

We propose LLMA, an LLM accelerator to losslessly speed up Large Language Model (LLM) inference with references. LLMA is motivated by the observation that there are abundant identical text spans...

https://arxiv.org/abs/2304.04487
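
The idea of reusing identical spans from a reference can be illustrated with a toy sketch. Everything below is an assumption made for illustration and not the paper's implementation: the "model" is a hypothetical lookup table rather than an LLM, the names `find_draft` and `reference_decode` and the parameters `match_len` and `copy_len` are invented here, and the verification that a real system would do in one parallel forward pass is simulated with a sequential loop. The sketch only shows why the output stays identical to plain greedy decoding while the number of decoding steps drops.

```python
from typing import Callable, List, Sequence, Tuple

NextTokenFn = Callable[[Sequence[str]], str]
EOS = "<eos>"


def make_toy_model(text: str) -> NextTokenFn:
    """Hypothetical stand-in for an LLM: the next token depends only on the last token."""
    tokens = text.split()
    table = {}
    for prev, nxt in zip(tokens, tokens[1:]):
        table.setdefault(prev, nxt)
    return lambda prefix: table.get(prefix[-1], EOS)


def greedy_decode(next_token: NextTokenFn, prompt: List[str], max_new: int) -> List[str]:
    """Baseline: one model call per generated token."""
    out = list(prompt)
    for _ in range(max_new):
        tok = next_token(out)
        out.append(tok)
        if tok == EOS:
            break
    return out[len(prompt):]


def find_draft(generated: Sequence[str], reference: Sequence[str],
               match_len: int, copy_len: int) -> List[str]:
    """Copy up to copy_len reference tokens that follow the first match of the generated tail."""
    if len(generated) < match_len:
        return []
    tail = list(generated[-match_len:])
    for i in range(len(reference) - match_len + 1):
        if list(reference[i:i + match_len]) == tail:
            return list(reference[i + match_len:i + match_len + copy_len])
    return []


def reference_decode(next_token: NextTokenFn, prompt: List[str], reference: Sequence[str],
                     max_new: int, match_len: int = 2, copy_len: int = 4) -> Tuple[List[str], int]:
    """Decode with copied drafts; returns (new tokens, number of decoding steps)."""
    out = list(prompt)
    n_prompt = len(prompt)
    steps = 0
    finished = False
    while not finished and len(out) - n_prompt < max_new:
        draft = find_draft(out, reference, match_len, copy_len)
        steps += 1  # one step; a real model would score all draft positions at once
        for j in range(len(draft) + 1):
            pred = next_token(out)  # only the model's own prediction is ever emitted
            out.append(pred)
            if pred == EOS or len(out) - n_prompt >= max_new:
                finished = True
                break
            if j >= len(draft) or pred != draft[j]:
                break  # draft exhausted or rejected: start a new step
    return out[n_prompt:], steps


model = make_toy_model("the quick brown fox jumps over a lazy dog")
reference = "quick brown fox jumps over a sleepy cat".split()

baseline = greedy_decode(model, ["the"], max_new=20)
fast, steps = reference_decode(model, ["the"], reference, max_new=20)
print(fast == baseline, len(baseline), steps)
```

Because every emitted token is the model's own prediction, a rejected draft costs nothing in output quality; the draft only decides how many tokens can be confirmed within a single step.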
