
DeepSeek Nears $7.4B Close With Tencent and CATL
DeepSeek's maiden external funding round is nearing completion at up to $59B valuation, with Tencent and EV battery giant CATL as the biggest outside investors.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

Google DeepMind's new QAT checkpoints shrink the Gemma 4 E2B model to under 1GB, making serious on-device AI viable for phones and budget laptops.

The Trump administration is in talks with OpenAI about donating equity to a US sovereign-style fund, a deal that would make American taxpayers co-owners of the most valuable AI startup on Earth.

Google will pay SpaceX $920 million per month for 110,000 NVIDIA GPUs at Colossus 1, citing unexpected demand for its Gemini Enterprise agent platform.

NVIDIA's 550B Nemotron 3 Ultra, released June 4, tops the US open-weight leaderboard with a hybrid Mamba-Transformer MoE architecture and 300-plus tokens per second throughput.

OpenAI's Dreaming V3 replaces ChatGPT's flat memory with a hierarchical relational system, kicking off a four-way race for AI personalization dominance.

Three new arXiv papers expose how developers miss AI sabotage 94% of the time, why LLMs converge structurally in code evolution, and how ZK proofs could verify frontier AI training.

NVIDIA's Agent Toolkit lands 110+ verified skills on GitHub covering robotics, autonomous vehicles, vision AI, and industrial systems - turning complex physical AI pipelines into single agent calls.

A bipartisan Congressional bill would freeze state AI laws for three years and require frontier developers to publish catastrophic risk plans, submit to federal audits, and face $1M daily fines.

NVIDIA's Dynamo Snapshot uses CRIU and cuda-checkpoint to freeze and restore GPU inference containers in seconds, cutting Kubernetes cold-start times by up to 21x for large models.

A beginner's guide to using AI tools like Fathom, Otter.ai, Zoom AI, and Google Meet's Gemini to automatically capture meeting notes and follow-up tasks.

Learn how to use ChatGPT, Perplexity, Gemini, and Amazon's AI assistant to research products, compare prices, and spot fake reviews before you buy.

A practical beginner's guide to using AI tools to write a stronger resume, craft tailored cover letters, and prepare confidently for job interviews.

MiniMax M3 arrives as the first open-weight model to combine frontier coding, 1M-token context, and native multimodality - at a fraction of proprietary pricing - but every benchmark figure is self-reported and the weights weren't even shipped at launch.

Claude Opus 4.8 sets new highs on SWE-bench Pro and long-context tasks while a 4x improvement in code flaw detection may matter more than any benchmark number.

Google's Antigravity 2.0 rewrites the platform from a browser IDE into a five-surface agent suite. The architecture is ambitious, the launch was a mess.

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Rankings of the best AI models and agent frameworks on the GAIA benchmark, which tests real-world multi-step tasks requiring web browsing, tool use, and multi-hop reasoning.

Rankings of AI models by cost efficiency in May 2026, comparing performance per dollar across frontier and budget models. Updated with DeepSeek V4, GPT-5.5, and Kimi K2.6.

NVIDIA's 550B open-weight MoE model with 55B active parameters, hybrid Mamba-Transformer architecture, and 1M token context - the top-scoring US open model on the Artificial Analysis Intelligence Index.

MiniMax M3 is an open-weight frontier model with a 1M-token context window, native multimodal input, and strong agentic coding at $0.60/M input tokens.

Meta's Llama 3.3 70B Instruct matches Llama 3.1 405B on instruction following and math while running at 4-5x lower cost, with the lowest hallucination rate of any open-weight model on the Vectara summarization leaderboard.

Mistral AI secures $830M in debt financing from seven banks to build a 13,800-GPU Nvidia GB300 cluster near Paris, targeting 200MW of European compute by 2027.

Google's Gemini 3.1 Flash Live beats GPT-4 Realtime 1.5 on Scale AI's Audio MultiChallenge and takes Search Live to 200+ countries - but it doesn't lead every benchmark.

NVIDIA's DSX Flex library and Emerald AI's Conductor platform let AI factories ramp GPU power up or down in seconds, unlocking faster grid connections and up to 100GW of new U.S. capacity.

Shopify activated Agentic Storefronts by default on March 24, making products from millions of merchants discoverable and purchasable inside ChatGPT, Microsoft Copilot, and Google's AI channels.

A technical comparison of how Claude, GPT-4o, Gemini, Grok, Pixtral, Qwen, and DeepSeek handle image inputs - resizing pipelines, token math, and undocumented gotchas.

Meta releases SAM 3.1 with Object Multiplex, processing all tracked objects in one shared pass for 7x faster inference at 128 objects and improvements on 6 of 7 VOS benchmarks.

Google launched two new tools on March 26 that let users transfer memories and full chat logs from ChatGPT or Claude into Gemini - 24 days after Anthropic launched the same concept first.

Anthropic confirmed paid Claude subscriptions more than doubled in 2026 while annualized revenue climbed from $1B to $19B in roughly 15 months.

Cohere releases its first audio model - a 2B-parameter open-source ASR system beating Whisper Large v3 by 27% on the HuggingFace Open ASR Leaderboard.

Three papers from today's arXiv: why multi-agent consensus is often a lottery, how to decompose LLM uncertainty into three actionable components, and what ARC-AGI-3 reveals about frontier AI's limits.

Microsoft signs a deal with Crusoe for a new 900 MW AI factory campus in Abilene, Texas, adjacent to the Stargate site Oracle and OpenAI walked away from three weeks ago.

GStack turns Claude Code into a virtual dev team with 28 slash commands for planning, review, QA, security, and deployment - all open source and free.