
Nemotron 3 Nano Omni
NVIDIA's first open omni-modal model: 30B total / 3B active hybrid Mamba-MoE that processes text, images, audio, and video in a single inference loop, with 9x higher throughput than comparable open omni models.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

NVIDIA's first open omni-modal model: 30B total / 3B active hybrid Mamba-MoE that processes text, images, audio, and video in a single inference loop, with 9x higher throughput than comparable open omni models.

Mistral's first flagship merged model: a dense 128B with configurable reasoning, vision, and 77.6% SWE-Bench Verified, self-hostable on 4 GPUs.

Mistral releases Medium 3.5, a 128B open-weights model that scores 77.6% on SWE-Bench Verified, and pairs it with asynchronous cloud coding agents in Vibe that open pull requests on GitHub while you are away.

Analyst Ming-Chi Kuo claims OpenAI is building a smartphone with Qualcomm and MediaTek where AI agents replace traditional apps, targeting 2028 mass production.

GPT-5.5 is OpenAI's first completely retrained base model since GPT-4.5, leading the field on agentic coding and computer use - but the doubled per-token pricing and delayed API access require careful evaluation.

An AI agent named Luna, powered by Claude Sonnet 4.6, opened and manages a real San Francisco boutique - but its record includes a gender pay gap, employee surveillance, and false claims to journalists.

Anthropic's Project Deal experiment found that agents running stronger models consistently closed better transactions - and users represented by weaker agents had no idea.

Five AI procurement platforms compared on capabilities, pricing, and fit - from autonomous supplier negotiation to enterprise spend management suites.

Five AI platforms for scientific R&D compared - from formulation chemistry to materials discovery and literature analysis. Pricing, capabilities, and fit for research teams.

Meta signs a multi-year AWS deal to deploy tens of millions of Graviton5 CPU cores, betting that agentic AI workloads need CPUs more than GPUs.

OpenAI's first fully retrained base model since GPT-4.5 ships today to ChatGPT and Codex, leading on Terminal-Bench 2.0 at 82.7% with a doubled per-token price.

OpenAI's first fully retrained base model since GPT-4.5, targeting agentic coding, computer use, and knowledge work at $5/$30 per million tokens.