
DeepSeek V4
DeepSeek V4 is an unreleased trillion-parameter MoE model with ~32B active parameters, native multimodal capabilities, a 1M-token context window, and optimization for Huawei Ascend chips - expected in the first week of March 2026.

Google DeepMind's natively multimodal image generation and editing model built on Gemini 3.1 Flash - Pro-level quality at Flash speed, free for all Gemini users.

DeepSeek's V4 Lite model has leaked through inference provider testing under strict NDAs, revealing a 1M token context window, native multimodal capabilities, and the internal codename sealion-lite.

Moonshot AI's Kimi K2.5 is a 1T-parameter MoE model activating 32B per token with native multimodal vision via MoonViT-3D, Agent Swarm coordination of up to 100 sub-agents via PARL, and top-tier math and coding benchmarks under a modified MIT license.

Comparing Moonshot AI's 1T-parameter Kimi K2.5 with Google DeepMind's Gemma 3 27B - two multimodal open-weight models separated by 37x in parameter count but sharing a vision-first design philosophy.