We exhibit that BlackMamba performs competitively against both equally Mamba and transformer baselines, and outperforms in inference and schooling FLOPs. We completely coach and open-source 340M/one.5B and 630M/two.8B https://k2spiceshop.com/product/liquid-k2-on-paper-online/