Loading views...

Time scale fast weights 개념으로 lora 처럼 이용하도록?

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 May 29 15:21
Editor
Edited
Edited
2024 Nov 30 21:38
Specific
Specific
Specific
Computable
Computable
Computable

inference level computing으로 fast weight 구현하는게 핵심

Currently, there are only two time scales, we need temporary weight changing. We can’t do it now since there are batch processing for production which is more efficient.
  • Vector context scale
  • Weight learning scale
Neuroplasticity
는 새로운 모델 구조를 필요로 하고
Exploding gradient
같은 테크닉을 통해
AI Feature Dimensionality
를 빠르게 높은
Sample efficiency
로 변화시킬 수 있어야 한다. 즉 pre-training computational path와
Fast Weight
용 path가 달라야
 
 
 
 
 
 
 
 

Recommendations