
Cohere Command A+
Cohere Command A+ is a 218B sparse MoE model with Apache 2.0 license, native citations, and a 128K context window that runs on just two H100 GPUs.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

Cohere Command A+ is a 218B sparse MoE model with Apache 2.0 license, native citations, and a 128K context window that runs on just two H100 GPUs.

Verified June 2026: real cost per million tokens for self-hosting Llama 4 Scout, Maverick, Qwen3-235B, and DeepSeek V3.2 - GPU requirements, cost formulas, and when cheap APIs actually win.

Mistral rebrands Le Chat as Vibe, ships Work Mode with enterprise integrations, a VS Code extension, and remote coding agents powered by Mistral Medium 3.5 at 77.6% SWE-Bench.

NVIDIA Cosmos 3 is an open physical AI omnimodel with Mixture-of-Transformers architecture that natively handles text, images, video, sound, and robot actions in a single 16B or 64B model.

Nvidia's Cosmos 3 is the first fully open omnimodel for physical AI, trained on 20 trillion tokens to teach robots and autonomous vehicles how to reason and act in the real world.

Google is shutting down free access to its open-source Gemini CLI on June 18, replacing it with the proprietary Antigravity CLI - after accepting 6,000+ community pull requests.

IBM and Red Hat's Project Lightwell deploys 20,000 engineers and AI to patch open source vulnerabilities against exact deployed versions - no forced upgrades, commercial subscription model.

Mistral acquired Vienna-based Emmi AI and launched Physics AI - models that replace multi-day engineering simulations with seconds of inference on a single GPU.

NVIDIA's SANA-WM is a 2.6B-parameter hybrid linear diffusion transformer that generates 60-second 720p video with 6-DoF camera control on a single H100, built for embodied AI and robotics simulation.

NVIDIA NVLabs open-sourced SANA-WM, a 2.6B-parameter world model that generates 60-second 720p camera-controlled video on a single GPU, outperforming 14B+ competitors that need 8 GPUs.

Mistral AI's smallest open-weight model - 3B parameters, 256K context, Apache 2.0 license, built for edge and cost-sensitive deployments.

Cursor's Composer 2.5 scores within one point of Claude Opus 4.7 on SWE-Bench Multilingual at $0.50 per million tokens - a tenth of Anthropic's price - but the training disclosures deserve scrutiny.