Multimodal

Qwen3.5-Flash

Qwen3.5-Flash is Alibaba's hosted production model with 1M context, built-in tools, and multimodal support at $0.10/M input tokens - one of the cheapest frontier-tier APIs available.

Gemini 3.1 Pro

Google DeepMind's Gemini 3.1 Pro leads on 13 of 16 benchmarks with 77.1% ARC-AGI-2, 94.3% GPQA Diamond, and a 1M-token context window at $2/M input.

Alibaba Drops Qwen3.5: 397B Parameters, 17B Active, and It Trades Blows with GPT-5.2

Alibaba's Qwen team releases Qwen3.5-397B-A17B, an open-weight mixture-of-experts model with native multimodal support and a hybrid attention architecture that runs 8x faster than its predecessor. Apache 2.0 licensed.

ByteDance's Seedance 2.0 Is the Best AI Video Generator You Might Not Get to Use

ByteDance's Seedance 2.0 introduces a dual-branch transformer for simultaneous audio-video generation at 2K resolution, but cease-and-desists from Disney, Paramount, and Warner Bros. threaten its global rollout.

xAI Teases Grok 4.20 With Improved Multimodal and Reduced Hallucinations

xAI previews Grok 4.20 with enhanced multimodal capabilities and further reduced hallucinations, building on Grok 4.1's success. The company also teases a 6 trillion parameter Grok 5.

Alibaba Drops Qwen 3.5: 397B Parameters of Open-Source Power

Alibaba releases Qwen 3.5, a 397B parameter open-source multimodal model with 256K context, Apache 2.0 license, and performance that tops Python coding and math reasoning benchmarks.

← Previous