Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/NLP/Language Model/Causal language model/
LLMA
Search

LLMA

Creator
Creator
Seonglae Cho
Created
Created
2023 Apr 12 14:52
Editor
Editor
Seonglae Cho
Edited
Edited
2023 Aug 29 8:29
Refs
Refs
현재 병목인 추론을 가속
 
 
 
 
 
 
 
Inference with Reference: Lossless Acceleration of Large Language Models
We propose LLMA, an LLM accelerator to losslessly speed up Large Language Model (LLM) inference with references. LLMA is motivated by the observation that there are abundant identical text spans...
Inference with Reference: Lossless Acceleration of Large Language Models
https://arxiv.org/abs/2304.04487
Inference with Reference: Lossless Acceleration of Large Language Models
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/NLP/Language Model/Causal language model/
LLMA
Copyright Seonglae Cho