Hymba is a hybrid architecture with Mamba and attention heads running in parallel. It outperforms popular small language models with 6-12x less training.

Hymba, an NVIDIA collection on Hugging Face: a series of hybrid small language models.
https://huggingface.co/collections/nvidia/hymba-673c35516c12c4b98b5e845f
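
To make "Mamba and attention heads running in parallel" concrete, here is a minimal pure-Python sketch of the idea: both heads read the same input sequence, and their outputs are fused. This is a toy illustration, not NVIDIA's implementation. The function names (`attention_head`, `ssm_head`, `hybrid_layer`), the fixed-decay recurrence standing in for a Mamba head, and the normalize-then-average fusion are all assumptions made for this example.

```python
import math


def attention_head(xs):
    """Toy causal softmax self-attention with identity projections (assumption)."""
    d = len(xs[0])
    out = []
    for t, q in enumerate(xs):
        # Score the current token against itself and all earlier tokens.
        scores = [sum(a * b for a, b in zip(q, xs[s])) / math.sqrt(d)
                  for s in range(t + 1)]
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        out.append([sum(w * xs[s][i] for s, w in enumerate(weights)) / z
                    for i in range(d)])
    return out


def ssm_head(xs, decay=0.9):
    """Toy linear recurrence h_t = decay * h_{t-1} + (1 - decay) * x_t.

    Stands in for a Mamba head. A real Mamba layer uses input-dependent
    (selective) parameters; the fixed decay here is a simplification.
    """
    h = [0.0] * len(xs[0])
    out = []
    for x in xs:
        h = [decay * hi + (1 - decay) * xi for hi, xi in zip(h, x)]
        out.append(h)
    return out


def rms_normalize(v, eps=1e-6):
    """Scale a vector to roughly unit root-mean-square magnitude."""
    rms = math.sqrt(sum(a * a for a in v) / len(v) + eps)
    return [a / rms for a in v]


def hybrid_layer(xs):
    """Run both heads on the same input in parallel, then fuse the outputs.

    Fusion rule (an assumption of this sketch): normalize each head's
    output so neither dominates, then average them.
    """
    attn = attention_head(xs)
    ssm = ssm_head(xs)
    return [[0.5 * (a + s) for a, s in zip(rms_normalize(av), rms_normalize(sv))]
            for av, sv in zip(attn, ssm)]


if __name__ == "__main__":
    tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # 3 tokens, width 2
    for row in hybrid_layer(tokens):
        print(row)
```

The attention head can look back at every earlier token directly, while the recurrent head carries a compressed running summary. Running them side by side on the same input, rather than stacking them in sequence, lets each layer draw on both kinds of memory at once.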