Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

Recurrent Memory Transformer
Transformer-based models show their effectiveness across multiple domains and tasks. Self-attention allows information from all sequence elements to be combined into context-aware representations. ...
https://arxiv.org/abs/2207.06881
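
For reference, here is a minimal sketch of single-head self-attention in Python/NumPy, illustrating the generic mechanism the abstract snippet refers to: every output position mixes information from all sequence elements into a context-aware representation. This is only an illustration under assumed names (`self_attention`, `w_q`, `w_k`, `w_v` are hypothetical), not the Recurrent Memory Transformer's memory mechanism.

```python
# Illustrative single-head self-attention sketch (not the RMT implementation).
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(x, w_q, w_k, w_v):
    """x: (seq_len, d_model); w_q/w_k/w_v: (d_model, d_head).
    Each output row is a weighted mix of values from *all* positions,
    which is how self-attention builds context-aware representations."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])  # pairwise similarity between positions
    weights = softmax(scores, axis=-1)       # attention distribution over the sequence
    return weights @ v                       # context-aware output per position

# Toy usage with random inputs and weights
rng = np.random.default_rng(0)
seq_len, d_model, d_head = 5, 8, 4
x = rng.normal(size=(seq_len, d_model))
out = self_attention(
    x,
    rng.normal(size=(d_model, d_head)),
    rng.normal(size=(d_model, d_head)),
    rng.normal(size=(d_model, d_head)),
)
print(out.shape)  # (5, 4)
```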