Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Neural Network/Neural Network Structure/Seq2Seq/Transformer Model/
Transformer Modeling
Search

Transformer Modeling

Creator
Creator
Seonglae Cho
Created
Created
2024 Mar 31 15:47
Editor
Editor
Seonglae Cho
Edited
Edited
2024 Jun 12 4:48
Refs
Refs
  • Attention Mechanism Optimization
    • Grouped-query Attention
  • Activation Function
    • SwiGLU
  • Relative Positional Encoding
    • RoPE
  • Transformer Training
    • Ring Attention
  • Multimodal AI
  • Model Merging
    • Depth Up-Scaling
  • Text Tokenizer
 
 
 
 
 
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Neural Network/Neural Network Structure/Seq2Seq/Transformer Model/
Transformer Modeling
Copyright Seonglae Cho