Global attention
soft attention over all source hidden states: each encoder output is scored against the current decoder state, the scores are softmax-normalized, and the states are elementwise weighted and summed into a context vector
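A minimal NumPy sketch of this global soft attention, using the simple dot-product score; shapes, sizes, and the random states here are illustrative assumptions, not values from the paper.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Assumed toy shapes: T source positions, hidden size d
T, d = 5, 8
rng = np.random.default_rng(0)
h_s = rng.normal(size=(T, d))   # all encoder (source) hidden states
h_t = rng.normal(size=d)        # current decoder (target) hidden state

# Dot-product alignment score for every source position
scores = h_s @ h_t              # shape (T,)
a_t = softmax(scores)           # soft alignment weights over ALL positions
c_t = a_t @ h_s                 # context vector: weighted sum of source states
```

Every source position gets a nonzero weight, which is what makes this "global" and fully differentiable.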
Local Attention
hard selection of a small source window (predict an aligned position, keep only positions near it) combined with soft attention inside that window
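A sketch of the predictive local variant under stated assumptions: the aligned position p_t is hard-coded here (the paper learns to predict it), and the soft scores inside the window of radius D are damped by a Gaussian centered at p_t with sigma = D/2.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Assumed toy shapes: T source positions, hidden size d, window radius D
T, d, D = 10, 8, 2
rng = np.random.default_rng(1)
h_s = rng.normal(size=(T, d))   # encoder hidden states
h_t = rng.normal(size=d)        # current decoder hidden state

p_t = 5.0                       # assumed aligned position (learned in the paper)
lo = max(0, int(p_t) - D)
hi = min(T, int(p_t) + D + 1)
window = np.arange(lo, hi)      # hard window: positions outside get zero weight

scores = h_s[window] @ h_t      # soft scores only inside the window
gauss = np.exp(-((window - p_t) ** 2) / (2 * (D / 2) ** 2))
a_t = softmax(scores) * gauss   # soft weights favoring positions near p_t
c_t = a_t @ h_s[window]         # context vector from the window only
```

The hard cut keeps the cost independent of source length, while the Gaussian keeps the weighting smooth around the predicted position.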
2015, 10000+ citations
Effective Approaches to Attention-based Neural Machine Translation
Seonglae Cho