Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Industry/AI Scaling/Model Layer Scaling/
Depth Up-Scaling
Search

Depth Up-Scaling

Creator
Creator
Seonglae Cho
Created
Created
2024 Mar 5 9:43
Editor
Editor
Seonglae Cho
Edited
Edited
2025 Jan 12 15:52
Refs
Refs

DUS

  • 기존 weight와 함게 layer 복제
 
 
 
 
SOLAR 10.7B: Scaling Large Language Models with Simple yet...
We introduce SOLAR 10.7B, a large language model (LLM) with 10.7 billion parameters, demonstrating superior performance in various natural language processing (NLP) tasks. Inspired by recent...
SOLAR 10.7B: Scaling Large Language Models with Simple yet...
https://arxiv.org/abs/2312.15166
SOLAR 10.7B: Scaling Large Language Models with Simple yet...
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Industry/AI Scaling/Model Layer Scaling/
Depth Up-Scaling
Copyright Seonglae Cho