
Hymba

Creator: Seonglae Cho
Created: 2024 Nov 28 11:38
Editor: Seonglae Cho
Edited: 2024 Nov 28 11:39

hybrid architecture with Mamba and Attention heads running in parallel

outperforms popular small language models with 6-12x less training
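
To make the "running in parallel" idea concrete, here is a minimal, dependency-free sketch. It is a toy illustration, not NVIDIA's implementation: `causal_attention` uses no learned weights (Q = K = V = input), `diagonal_ssm` is a fixed linear recurrence standing in for a real selective (Mamba) head, and fusing the two branches by normalizing each output and averaging is an assumption of this sketch. All function names and the `decay` parameter are hypothetical.

```python
import math


def causal_attention(xs):
    """Single-head causal softmax attention with Q = K = V = xs (no learned weights)."""
    d = len(xs[0])
    out = []
    for t, q in enumerate(xs):
        # Scaled dot-product scores against positions 0..t only (causal mask).
        scores = [
            sum(qi * ki for qi, ki in zip(q, xs[j])) / math.sqrt(d)
            for j in range(t + 1)
        ]
        top = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(sc - top) for sc in scores]
        total = sum(weights)
        out.append(
            [sum(w / total * xs[j][i] for j, w in enumerate(weights)) for i in range(d)]
        )
    return out


def diagonal_ssm(xs, decay=0.9):
    """Fixed diagonal recurrence h_t = decay * h_{t-1} + (1 - decay) * x_t.

    Stand-in for a Mamba head: a real selective SSM makes its parameters
    depend on the input, which this toy version does not.
    """
    h = [0.0] * len(xs[0])
    out = []
    for x in xs:
        h = [decay * hi + (1.0 - decay) * xi for hi, xi in zip(h, x)]
        out.append(list(h))
    return out


def rms_normalize(v, eps=1e-6):
    """Scale a vector to roughly unit root-mean-square."""
    rms = math.sqrt(sum(vi * vi for vi in v) / len(v) + eps)
    return [vi / rms for vi in v]


def hybrid_block(xs):
    """Run both branches on the same input, then average the normalized outputs."""
    attn_out = causal_attention(xs)  # branch 1: attention over the past tokens
    ssm_out = diagonal_ssm(xs)       # branch 2: recurrent summary of the past
    return [
        [0.5 * (a + s) for a, s in zip(rms_normalize(a_vec), rms_normalize(s_vec))]
        for a_vec, s_vec in zip(attn_out, ssm_out)
    ]


if __name__ == "__main__":
    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    for row in hybrid_block(tokens):
        print(row)
```

Both branches see the same token sequence and produce one vector per position. The attention branch can look up any earlier token directly, while the recurrent branch carries a fixed-size running summary. Combining them per position is what distinguishes a parallel hybrid from stacking attention and SSM layers one after another.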
 
 
 
 
Hymba - a nvidia Collection
A series of Hybrid Small Language Models.
https://huggingface.co/collections/nvidia/hymba-673c35516c12c4b98b5e845f
 
 

Backlinks

On-device AI

Copyright Seonglae Cho