A technique that randomly skips byte pair merges, exposing the model to more diverse subword segmentations
BPE Dropout
Creator
Creator
Seonglae ChoCreated
Created
2024 Mar 10 13:36Editor
Editor
Seonglae ChoEdited
Edited
2025 Nov 14 11:36Refs
Refs
Dropout 