
Leanstral Outperforms Claude Sonnet at Formal Code Proofs
Mistral's new open-source Lean 4 agent scores higher than Claude Sonnet on formal proofs at one-fifteenth the cost, raising the bar for trustworthy AI code generation.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

Mistral's new open-source Lean 4 agent scores higher than Claude Sonnet on formal proofs at one-fifteenth the cost, raising the bar for trustworthy AI code generation.

LTX-2.3 is a 22-billion-parameter open-source video and audio generation model from Lightricks that rivals closed commercial tools - at zero cloud cost.

A developer leaked the model ID for Cursor's Composer 2: kimi-k2p5-rl-0317-s515-fast. Moonshot AI says Cursor violated the Kimi K2.5 license by not displaying attribution in a $2B ARR product.

OpenAI is acquiring Astral, the startup behind Python's dominant uv package manager and Ruff linter, folding critical developer infrastructure into its Codex coding agent team.

NVIDIA's Nemotron 3 Nano 4B packs a Mamba-dominant hybrid architecture, 262K token context, and 95.4% on MATH500 into a model that fits an 8GB Jetson Orin Nano.

Mistral Small 4 packs reasoning, vision, and agentic coding into a 119B MoE under Apache 2.0 - a serious small-model contender at a price that's hard to ignore.

A 1-trillion-parameter model called Hunter Alpha appeared anonymously on OpenRouter on March 11. Developers say it's DeepSeek V4 in disguise. The signals are strong but the precedent cuts both ways.
MiniMax M2.7 is a 230B MoE coding agent that handles 30-50% of MiniMax's own RL research workflow, scoring 56.22% on SWE-Pro and 78% on SWE-bench Verified at $0.30/M input tokens.

Seven AI and cloud companies pool $12.5M through OpenSSF and Alpha-Omega to build tools that help open-source maintainers cope with a flood of AI-generated vulnerability reports they can't triage.

NVIDIA released OpenShell at GTC 2026 - an open-source runtime that sandboxes AI agents with locked filesystems, blocked networks, and YAML-defined policies. One command to secure Claude Code, Codex, or OpenClaw.

Microsoft Azure's Foundry platform now runs Fireworks AI's inference engine, bringing DeepSeek V3.2, Kimi K2.5, and MiniMax M2.5 into enterprise AI under a unified control plane.

Mistral AI's unified MoE model - 119B total parameters, 6B active per token, 128 experts, 256K context, configurable reasoning, Apache 2.0 license.