Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Industry/AI Scaling/Model Layer Scaling/
LayerSkip
Search

LayerSkip

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Oct 30 20:48
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2025 Sep 10 8:56
Refs
Refs
Attention Sink
LayerSkip
facebookresearch • Updated 2024 Dec 29 18:31
 
 
 
 
 
LayerSkip - a facebook Collection
Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710
LayerSkip - a facebook Collection
https://huggingface.co/collections/facebook/layerskip-666b25c50c8ae90e1965727a
LayerSkip - a facebook Collection
Paper page - LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
Join the discussion on this paper page
Paper page - LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
https://huggingface.co/papers/2404.16710
Paper page - LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding
TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference
arxiv.org
https://arxiv.org/pdf/2105.11618
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Industry/AI Scaling/Model Layer Scaling/
LayerSkip
Copyright Seonglae Cho