
Ai2 Releases OLMo Hybrid: An Open Transformer-RNN That Halves Token Cost
OLMo Hybrid combines transformer attention with Gated DeltaNet to match OLMo 3 accuracy using 49% fewer tokens, while delivering 75% higher throughput on long contexts. The release is fully open: weights, checkpoints, training code, and technical report.