
DeepSeek Nears $7.4B Close With Tencent and CATL
DeepSeek's maiden external funding round is nearing completion at a valuation of up to $59B, with Tencent and EV battery giant CATL as the biggest outside investors.

OpenAI's Dreaming V3 replaces ChatGPT's flat memory with a hierarchical relational system, kicking off a four-way race for AI personalization dominance.

Three new arXiv papers expose how developers miss AI sabotage 94% of the time, why LLMs converge structurally in code evolution, and how ZK proofs could verify frontier AI training.

NVIDIA's Agent Toolkit lands 110+ verified skills on GitHub covering robotics, autonomous vehicles, vision AI, and industrial systems - turning complex physical AI pipelines into single agent calls.

A bipartisan Congressional bill would freeze state AI laws for three years and require frontier developers to publish catastrophic risk plans, submit to federal audits, and face $1M daily fines.

NVIDIA's Dynamo Snapshot uses CRIU and cuda-checkpoint to freeze and restore GPU inference containers in seconds, cutting Kubernetes cold-start times by up to 21x for large models.

Meta is building tent-style rapid deployment structures at its Ohio Prometheus facility, cutting construction timelines roughly in half and bypassing grid delays with off-grid gas power.

Florida becomes the first US state to hold an AI CEO personally liable, filing an 83-page complaint accusing OpenAI and Sam Altman of hiding ChatGPT's dangers while racing for market share.

Three new papers tackle how routine AI use quietly rewires emotional habits, how to spend compute where failures cost most, and why agentic RAG errors compound before anyone notices.

A new open-source inference engine for Apple Silicon benchmarks up to 2.6x faster than Ollama, supports 66 model aliases, and drops in as an OpenAI-compatible server on any Mac.

A beginner's guide to using AI tools like Fathom, Otter.ai, Zoom AI, and Google Meet's Gemini to automatically capture meeting notes and follow-up tasks.

Learn how to use ChatGPT, Perplexity, Gemini, and Amazon's AI assistant to research products, compare prices, and spot fake reviews before you buy.

A practical beginner's guide to using AI tools to write a stronger resume, craft tailored cover letters, and prepare confidently for job interviews.

MiniMax M3 arrives as the first open-weight model to combine frontier coding, 1M-token context, and native multimodality - at a fraction of proprietary pricing - but every benchmark figure is self-reported and the weights weren't even shipped at launch.

Claude Opus 4.8 sets new highs on SWE-bench Pro and long-context tasks while a 4x improvement in code flaw detection may matter more than any benchmark number.

Google's Antigravity 2.0 rewrites the platform from a browser IDE into a five-surface agent suite. The architecture is ambitious, but the launch was a mess.

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Rankings of the best AI models and agent frameworks on the GAIA benchmark, which tests real-world multi-step tasks requiring web browsing, tool use, and multi-hop reasoning.

Rankings of AI models by cost efficiency in May 2026, comparing performance per dollar across frontier and budget models. Updated with DeepSeek V4, GPT-5.5, and Kimi K2.6.

MiniMax M3 is an open-weight frontier model with a 1M-token context window, native multimodal input, and strong agentic coding at $0.60/M input tokens.

Meta's Llama 3.3 70B Instruct matches Llama 3.1 405B on instruction following and math while running at 4-5x lower cost, with the lowest hallucination rate of any open-weight model on the Vectara summarization leaderboard.

Cohere Command A+ is a 218B sparse MoE model with Apache 2.0 license, native citations, and a 128K context window that runs on just two H100 GPUs.

Kye Gomez open-sourced OpenMythos, a PyTorch reconstruction that hypothesizes Anthropic's Mythos is a Recurrent-Depth Transformer with Mixture-of-Experts routing and Multi-Latent Attention.

Rankings of dedicated reward models and frontier LLMs as judges across RewardBench, RewardBench-2, and JudgeBench - benchmarks that measure preference alignment and human agreement.

Rankings of VLA models and embodied AI systems on real robotics benchmarks: CALVIN, SimplerEnv, LIBERO, RoboCasa, DROID, and real-robot success rates as of April 2026.

Rankings of AI models on STEM benchmarks: GPQA Diamond, SciBench, OlympiadBench-Science, MMLU-STEM, ARC-Challenge, and ChemQA/Physics Olympiad as of April 2026.

Per-query pricing for search APIs used in AI agents and RAG pipelines - Brave, Tavily, Exa, SerpAPI, Serper, Perplexity Sonar, You.com, Jina Reader, Firecrawl, and more compared at 10k, 100k, and 1M queries.

Per-minute and per-1000-minute transcription API pricing across OpenAI Whisper, Deepgram Nova-3, AssemblyAI, Google Chirp 2, Azure, AWS Transcribe, Groq, ElevenLabs Scribe, and more.

Rankings of LLMs and constrained decoding frameworks on JSON schema adherence benchmarks including JSONSchemaBench and BFCL v3, covering native APIs and open-source constraint engines.

Rankings of the top LLMs on summarization benchmarks - ROUGE-L, BERTScore, FActScore, and human preference across CNN/DailyMail, XSum, GovReport, QMSum, and BookSum as of April 2026.

Rankings of the best LLM-powered software engineering agents on SWE-Bench Verified, with pass rates, pricing, scaffold notes, and methodology - updated April 2026.

Normalized per-1M-character and per-hour TTS pricing across ElevenLabs, OpenAI, Google, Azure, Amazon Polly, Play.ht, Cartesia, Deepgram Aura, WellSaid, and more.

Rankings of the best LLMs and agent pipelines on BIRD, Spider 2.0, CoSQL, and SParC text-to-SQL benchmarks, with execution accuracy scores and analysis.

Sam Altman's World project launched World ID 4.0 at a San Francisco event on April 17, signing Tinder, Zoom, DocuSign, and Okta as partners while introducing Agent Kit to authorize AI agents.