Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Neural Network/Neural Network Structure/SSM/Selective State Space/Mamba Model/
Mamba-2
Search

Mamba-2

Creator
Creator
Seonglae Cho
Created
Created
2024 Jun 6 17:5
Editor
Editor
Seonglae Cho
Edited
Edited
2024 Jul 17 15:13
Refs
Refs

Structured State Space Duality (SSD)

Mamba-2 shows slightly better scaling compared to Mamba-1, with faster training times.
 
 
 
 
Codestral Mamba
As a tribute to Cleopatra, whose glorious destiny ended in tragic snake circumstances, we are proud to release Codestral Mamba, a Mamba2 language model specialised in code generation, available under an Apache 2.0 license.
Codestral Mamba
https://mistral.ai/news/codestral-mamba/
Codestral Mamba
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Neural Network/Neural Network Structure/SSM/Selective State Space/Mamba Model/
Mamba-2
Copyright Seonglae Cho