Texonom
Texonom
/
Science
Science
/Mathematics/Math Field/Statistics/Statistical Model/Model Generalization/
Length Generalization
Loading views...
Search

Length Generalization

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2026 May 26 16:4
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2026 May 26 16:13
Refs
Refs
Extrapolation
 
 
 
 
 
arithmetic
Teaching Arithmetic to Small Transformers
Large language models like GPT-4 exhibit emergent capabilities across general-purpose tasks, such as basic arithmetic, when trained on extensive text data, even though these tasks are not...
Teaching Arithmetic to Small Transformers
https://openreview.net/forum?id=dsUB4bst9S
 
arxiv.org
https://arxiv.org/pdf/2506.09251
 

Recommendations

Texonom
Texonom
/
Science
Science
/Mathematics/Math Field/Statistics/Statistical Model/Model Generalization/
Length Generalization
Copyright Seonglae Cho