Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Neural Network/Neural Network Structure/SSM/Selective State Space/
Zamba
Search

Zamba

Creator
Creator
Seonglae Cho
Created
Created
2024 Aug 29 3:40
Editor
Editor
Seonglae Cho
Edited
Edited
2024 Oct 16 15:21
Refs
Refs
 
 
 
 
 
Zyphra
Zyphra is excited to release Zamba2-mini, a state-of-the-art small language model for on-device applications. Zamba2-mini achieves highly competitive evaluation scores and performance numbers and fits in a tiny memory footprint of <700MB at 4bit quantization.
Zyphra
https://www.zyphra.com/post/zamba2-mini
2
Zyphra
Zyphra is excited to release Zamba2-7B, a state-of-the-art small language model. At the 7B scale, we outperform the leading models of Mistral, Google’s Gemma and Meta’s Llama3 series in both quality and performance. We believe Zamba2-7B is the leading model for running on-device and on consumer GPUs as well as for many enterprise applications which require a powerful but compact and efficient model for natural-language tasks.
Zyphra
https://www.zyphra.com/post/zamba2-7b
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Neural Network/Neural Network Structure/SSM/Selective State Space/
Zamba
Copyright Seonglae Cho