
Anthropic Releases Fable 5 Despite Its Own AI Safety Warning
Anthropic opens Mythos-class capabilities to the public with Claude Fable 5 at $10/$50 per million tokens, days after calling for a global AI pause.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

Moonshot AI ships Kimi K2.7-Code with 30% fewer reasoning tokens and a 21.8% gain on its own coding benchmarks, but the model still trails Claude Opus 4.8 on most tests in the same table.

Meta's 6,500-person Applied AI team, created in March with a join-or-quit mandate, is in revolt: 1,600+ employees signed a petition against keystroke surveillance, bathroom-stall flyers appeared across US campuses, and UK workers began organizing with UTAW.

KPMG retracted its agentic AI report after GPTZero found that 40 of 45 citations were fabricated and case studies about UBS, the NHS, and Transport for London were invented.

Three new papers expose a 50-point gap in agent tool knowledge, show tree search tripling inference throughput, and map the research between AGI and superintelligence.

Mistral AI is in talks to raise €3 billion at a €20 billion valuation, nearly doubling its September 2025 price tag in under a year and cementing its status as Europe's most valuable AI company.

ChatGPT crossed 1 billion monthly active users in May - the fastest app in history to hit the mark. But Meta AI is growing at 973% and Claude at 640%, against ChatGPT's own 62%.

A Chinese cybercrime network sold $88/week phishing kits that used Google's own Gemini AI to generate fake sites impersonating banks, carriers, and government agencies at scale.

Anthropic commits $150 million to place 1,000 AI-trained fellows inside US nonprofits for a year at $85,000 each, no degree required.

Google's new streaming audio model translates speech in real time across 70+ languages - available now in Google Translate and via the Gemini Live API.

A practical guide to using free AI tools like NotebookLM, Knowt, and ChatGPT to turn your own notes into flashcards, practice tests, and study sessions that actually stick.

A practical beginner's guide to the five most accessible AI side hustles in 2026, with honest earnings estimates and a clear starting point for each.

A beginner's guide to using AI tools like Fathom, Otter.ai, Zoom AI, and Google Meet's Gemini to automatically capture meeting notes and follow-up tasks.

A hands-on review of all seven MAI models - from the April transcription and image launch to Build 2026's MAI-Thinking-1, MAI-Code-1-Flash, and the multimodal upgrades.

Claude Fable 5 delivers the strongest coding and long-context results Anthropic has ever shipped publicly, but its safety classifiers block enough legitimate work to make that power conditional.

OpenAI's life sciences reasoning model gets a June update with global access and new NGS plugins - strong benchmarks, but still locked behind a Trusted Access Program with no public pricing.

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Rankings of the best AI models and agent frameworks on the GAIA benchmark, which tests real-world multi-step tasks requiring web browsing, tool use, and multi-hop reasoning.

Rankings of AI models by cost efficiency in May 2026, comparing performance per dollar across frontier and budget models. Updated with DeepSeek V4, GPT-5.5, and Kimi K2.6.

Moonshot AI's Kimi K2.7-Code is a 1T-parameter open-weight MoE coding model with mandatory thinking mode, 256K context, and 30% fewer reasoning tokens than K2.6.

Microsoft's first in-house reasoning model, a 35B-active sparse MoE with 256K context, 97% on AIME 2025, and no distillation from third-party labs.

DiffusionGemma 26B is Google DeepMind's open-weight discrete diffusion language model that generates 256 tokens in parallel, reaching 1,100+ tokens/sec on H100 - roughly 4x faster than autoregressive models of the same size.

Broadcom, Apollo, and Blackstone launched the $35B AI XPV Platform on June 9, targeting 20GW of custom AI silicon through 2028 with Anthropic as the anchor customer.

Microsoft's first in-house reasoning model, a 35B-active sparse MoE with 256K context, 97% on AIME 2025, and no distillation from third-party labs.

A practical guide to using free AI tools like NotebookLM, Knowt, and ChatGPT to turn your own notes into flashcards, practice tests, and study sessions that actually stick.

Jeff Bezos anchored a $500M round for Flourish, a New York startup building Cortex AI from connectomics research, targeting 20-50W operation versus server-rack GPU clusters.

A former xAI engineer filed a California whistleblower lawsuit claiming he was fired for warning that Grok could spread dangerous content - the day before SpaceX prices its historic $75B IPO.

Morgan Stanley projects $570B in AI-linked global debt issuance for 2026, more than double last year, as Amazon's $17.5B term loan reveals capex now consuming virtually all hyperscaler cash flows.

Former Datadog engineers launch Niteshift, a $7M-backed cloud platform that runs AI coding agents in full-stack environments with model-agnostic routing.

Gemini 2.5 Flash still leads LIT-RAGBench English RAG accuracy at 87.0%, but the full benchmark data reveals two overlooked entries: GPT-4.1-mini at 84.1% and o4-mini at 83.9%.

Three new arXiv papers expose how context bloat tanks agent performance, agent memory bleeds private data, and misaligned behavior spreads through multi-agent systems.

A benchmark-driven comparison of the five leading AI coding IDEs in 2026, covering pricing, agent capabilities, and who each one is actually built for.

Google DeepMind open-sources DiffusionGemma, a 26B MoE model that generates 256 tokens per denoising pass instead of one at a time, reaching 1,100 tokens per second on a single H100.

DiffusionGemma 26B is Google DeepMind's open-weight discrete diffusion language model that generates 256 tokens in parallel, reaching 1,100+ tokens/sec on H100 - roughly 4x faster than autoregressive models of the same size.