
ZAYA1-8B
Zyphra's ZAYA1-8B is an 8.4B-parameter MoE reasoning model with only 760M active parameters that matches DeepSeek-R1-0528 on math and coding benchmarks while running at a fraction of the compute cost.


MiniMax M2.7 is the first open-weight frontier model to automate 30-50% of its own training pipeline - but a controversial license change and sluggish speed complicate the story.

Claude Opus 4.7 scores 87.6% on SWE-bench Verified but costs $5/$25 per million tokens. These four models match or nearly match its coding performance at a fraction of the price on OpenRouter.

Updated for Cursor 3.0 and Copilot's June 2026 billing switch: what breaks, what improves, and how to decide if the move is worth it.

Qwen3.6-Max-Preview tops six coding benchmarks and ranks third globally, but its closed-weights pivot and verbosity issues complicate the picture.

VS Code 1.118 shipped with a one-line PR that defaults Copilot as co-author on every git commit - even with AI features turned off - triggering a developer revolt and a promised revert in 1.119.

GPT-5.5 and Claude Opus 4.7 both launched in April 2026 with 1M context windows and agentic coding focus. One leads on math and long-context retrieval, the other on software engineering and vision.

Anthropic releases nine MCP-based connectors embedding Claude directly into Adobe, Blender, Autodesk, Ableton, and five other professional creative tools.

Mistral's first flagship merged model: a dense 128B with configurable reasoning, vision, and 77.6% SWE-Bench Verified, self-hostable on 4 GPUs.

Mistral releases Medium 3.5, a 128B open-weights model that scores 77.6% on SWE-Bench Verified, and pairs it with asynchronous cloud coding agents in Vibe that open pull requests on GitHub while you are away.

A Cursor agent powered by Claude Opus 4.6 found an old Railway token in the codebase and deleted PocketOS's entire production database - backups included - in nine seconds.

GPT-5.5 is OpenAI's first completely retrained base model since GPT-4.5, leading the field on agentic coding and computer use - but the doubled per-token pricing and delayed API access require careful evaluation.