Lookahead Decoding

Created
2023 Nov 25 10:11
Creator
Seonglae Cho
Edited
2024 Dec 9 23:34
Motivated by
Jacobi Decoding
Lookahead decoding generates multiple tokens in parallel, unlike traditional autoregressive decoding, which generates tokens one at a time.

lookahead branch

The lookahead branch maintains a fixed-size 2D window and uses it to generate n-grams from the Jacobi iteration trajectory.
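To make the window idea concrete, here is a minimal Python sketch. It is not the reference implementation. It assumes a hypothetical deterministic `toy_next_token` function standing in for a greedy LLM step, keeps the last `n` Jacobi iterations as the rows of the 2D window, and reads n-grams off the diagonals of that window (one token per iteration, one position further right each time). All function names here are illustrative.

```python
from typing import Dict, List, Tuple

Token = int
NGramPool = Dict[Token, List[Tuple[Token, ...]]]


def toy_next_token(context: List[Token]) -> Token:
    """Hypothetical stand-in for a greedy LLM step (assumes a non-empty context)."""
    last = context[-1]
    prev = context[-2] if len(context) > 1 else 0
    return (last * 3 + prev + 1) % 11


def greedy_decode(prompt: List[Token], k: int) -> List[Token]:
    """Ordinary autoregressive decoding: one token per step."""
    ctx = list(prompt)
    for _ in range(k):
        ctx.append(toy_next_token(ctx))
    return ctx[len(prompt):]


def jacobi_step(prompt: List[Token], guess: List[Token]) -> List[Token]:
    """Update every position from the previous guess.

    Each position only reads the old guess, so a real model could compute
    all of them in a single parallel forward pass.
    """
    return [toy_next_token(prompt + guess[:i]) for i in range(len(guess))]


def lookahead_branch(prompt: List[Token], window: int, n: int, steps: int):
    """Run Jacobi iterations and collect n-grams from the trajectory."""
    guess = [0] * window           # arbitrary initial guess
    trajectory = [guess]           # rows = iterations, columns = positions
    pool: NGramPool = {}           # n-grams indexed by their first token
    for _ in range(steps):
        guess = jacobi_step(prompt, guess)
        trajectory = (trajectory + [guess])[-n:]   # keep a fixed-size window
        if len(trajectory) < n:
            continue
        for start in range(window - n + 1):
            # Diagonal read: the token at position start+k from iteration k.
            gram = tuple(trajectory[k][start + k] for k in range(n))
            bucket = pool.setdefault(gram[0], [])
            if gram not in bucket:
                bucket.append(gram)
    return pool, guess


pool, final_guess = lookahead_branch(prompt=[3, 5], window=4, n=3, steps=4)
```

With greedy decoding, the Jacobi guess reaches the autoregressive output after at most `window` iterations, so `final_guess` here equals `greedy_decode([3, 5], 4)`. The n-grams collected along the way are only candidates and may be wrong, which is why a separate verification step is needed.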
 
 

verification branch

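The verification branch identifies candidate n-grams whose first token matches the last input token. These candidates are then checked against the model's own predictions, and only tokens the model confirms are accepted.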
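The matching and acceptance logic can be sketched as follows. This is a simplified illustration under stated assumptions: `toy_next_token` is a hypothetical deterministic stand-in for a greedy LLM step, the n-gram pool is a plain dictionary keyed by first token, and candidates are checked one token at a time rather than in a single batched forward pass as a real implementation would do.

```python
from typing import Dict, List, Sequence, Tuple

Token = int
NGramPool = Dict[Token, List[Tuple[Token, ...]]]


def toy_next_token(context: Sequence[Token]) -> Token:
    """Hypothetical stand-in for a greedy LLM step."""
    return (context[-1] * 3 + 1) % 11


def match_candidates(pool: NGramPool, last_token: Token) -> List[Tuple[Token, ...]]:
    """Select n-grams whose first token equals the last input token."""
    return pool.get(last_token, [])


def verify(context: List[Token], candidates: List[Tuple[Token, ...]]) -> List[Token]:
    """Return the longest candidate continuation that greedy decoding agrees with.

    Falls back to a single ordinary greedy token when no candidate token is
    confirmed, so every call makes progress (an assumption of this sketch).
    """
    best: List[Token] = []
    for gram in candidates:
        ctx = list(context)
        accepted: List[Token] = []
        for tok in gram[1:]:            # gram[0] is already the last input token
            if toy_next_token(ctx) != tok:
                break                   # first mismatch ends this candidate
            accepted.append(tok)
            ctx.append(tok)
        if len(accepted) > len(best):
            best = accepted
    return best or [toy_next_token(context)]


# Illustrative pool; in practice it would be filled by the lookahead branch.
pool: NGramPool = {
    2: [(2, 7, 0), (2, 7, 5), (2, 3, 0)],
    5: [(5, 5, 5)],
}
accepted = verify([9, 2], match_candidates(pool, 2))
fallback = verify([9, 4], match_candidates(pool, 4))
```

Because only tokens that match the model's own greedy prediction are accepted, the output is the same as plain greedy decoding under these assumptions. The gain comes from accepting several tokens in one step whenever a candidate n-gram turns out to be right.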
 
 
 
 
 
