Sophie Zhang

AI Infrastructure & Open Source Reporter

Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before transitioning to tech journalism, which gives her writing an unusual level of technical depth - she understands distributed systems, GPU clusters, and inference optimization from the inside.

She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters - all in the same paragraph.

At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.

Based in Seattle, WA.

Articles by Sophie Zhang

Microsoft's Own ToS Labels Copilot Entertainment-Only

Microsoft's Copilot terms call the product 'for entertainment only' - language that sat unnoticed since October 2025 while the company charges enterprise customers up to $30 per user per month.

AutoAgent Builds Its Own Harness, Tops Two Benchmarks

Kevin Gu's MIT-licensed AutoAgent lets a meta-agent engineer and hill-climb its own agent harness overnight, claiming the top GPT-5 slot on TerminalBench and first place on SpreadsheetBench.

Netflix VOID Erases Video Objects and Rewrites Physics

Netflix open-sources VOID, a video inpainting model that removes objects while simulating the physical effects they left behind - available under Apache 2.0 with a HuggingFace demo.

Cursor 3 Rebuilds the IDE Around Agents

Cursor's ground-up IDE rebuild ships parallel agent orchestration, Design Mode for frontend work, and cloud-to-local session handoff - all in one unified workspace.

Google Gemma 4 Ships Four Open Models Under Apache 2.0

Google releases Gemma 4 with a 26B MoE, 31B Dense, and two edge variants under Apache 2.0 - claiming the highest intelligence-per-parameter of any open model.

Cloudflare Launches EmDash as Open-Source WordPress Rival

Cloudflare's EmDash is an MIT-licensed CMS built on Astro 6.0 that sandboxes plugins in isolated Workers, ships a built-in MCP server, and targets WordPress's 42.5% share of the web.

claw-code Hits 100K Stars After Claude Code Npm Leak

A missing .npmignore entry in Claude Code 2.1.88 exposed 512,000 lines of TypeScript source, spawned the fastest-growing GitHub repo ever, and revealed unshipped features Anthropic never announced.

Arm Claims Agents Need New Silicon - Intel Disagrees

Arm says its 136-core AGI CPU is purpose-built for agentic AI workloads. Intel's data center chief - Arm's former head of solutions engineering - says the claim overstates what's actually new.

Cisco DefenseClaw Locks Down AI Agents at RSA

Cisco open-sourced DefenseClaw at RSA 2026 - a five-minute install that scans agent skills, MCP servers, and AI-generated code before they run, with 2-second policy enforcement and Splunk telemetry built in.

Microsoft Open-Sources Harrier, a New Embedding Leader

Microsoft's Harrier-OSS-v1 family delivers three MIT-licensed multilingual embedding models, with the 27B variant claiming top spot on Multilingual MTEB v2 at 74.3.

llm-d Joins CNCF - Kubernetes Gets a Native LLM Inference Stack

IBM Research, Red Hat, and Google Cloud donated llm-d to the CNCF at KubeCon EU, giving Kubernetes a production-grade distributed LLM inference framework built on vLLM.

GitHub Copilot Is Injecting Ads Into Pull Requests

GitHub Copilot inserts promotional tips for itself and Raycast into PR descriptions, with over 11,000 affected pull requests found across GitHub and GitLab.

Transformers.js v4 Ships WebGPU Runtime for Browser ML

HuggingFace's Transformers.js v4 rewrites its WebGPU runtime in C++, supports 200+ architectures, and delivers up to 4x faster inference in browsers and server-side JS runtimes.

Mistral Borrows $830M to Build a Sovereign GPU Farm

Mistral AI secures $830M in debt financing from seven banks to build a 13,800-GPU Nvidia GB300 cluster near Paris, targeting 200MW of European compute by 2027.

NVIDIA and Emerald AI Turn Data Centers Into Grid Assets

NVIDIA's DSX Flex library and Emerald AI's Conductor platform let AI factories ramp GPU power up or down in seconds, unlocking faster grid connections and up to 100GW of new U.S. capacity.

Meta SAM 3.1 - 7x Faster Multi-Object Video Tracking

Meta releases SAM 3.1 with Object Multiplex, processing all tracked objects in one shared pass for 7x faster inference at 128 objects and improvements on 6 of 7 VOS benchmarks.

Cohere's Open-Source Transcribe Tops ASR Leaderboard

Cohere releases its first audio model - a 2B-parameter open-source ASR system beating Whisper Large v3 by 27% on the HuggingFace Open ASR Leaderboard.

Microsoft Picks Up 900 MW Texas Campus OpenAI Dropped

Microsoft signs a deal with Crusoe for a new 900 MW AI factory campus in Abilene, Texas, adjacent to the Stargate site Oracle and OpenAI walked away from three weeks ago.

OpenAI Codex Launches Plugin Marketplace for Agents

OpenAI ships a plugin marketplace for Codex CLI v0.117.0, bundling skills, app integrations, and MCP server configs behind an enterprise governance layer.

Mistral Ships Voxtral - Open-Weights Voice AI Platform

Mistral releases Voxtral, a pair of open-weights models covering speech recognition and text-to-speech that undercut OpenAI and ElevenLabs on price.

ARC-AGI-3 Launches - AI Agents Must Learn, Not Memorize

ARC Prize Foundation launched ARC-AGI-3 today with a fully open-source agent toolkit. The best AI in the preview phase scored 12.58% against a human baseline of 100%.

Ai2 Drops MolmoWeb - Open-Source Web Agent Beats GPT-4o

Ai2's MolmoWeb is a fully open-source web agent that navigates browsers by screenshot alone, beating GPT-4o-based agents at the 8B scale with weights, training data, and code all released under Apache 2.0.

Alibaba's C950 - First RISC-V CPU with Native LLM Inference

Alibaba's T-Head division launched the XuanTie C950, a 5nm 3.2GHz RISC-V server chip that sets a new world record for RISC-V single-core performance and natively runs billion-parameter models like DeepSeek V3 and Qwen3.

USCC: China's Open-Source AI Now Runs 80% of US Startups

A new USCC report finds Chinese open-source models now dominate US AI startup stacks, with Qwen surpassing Llama in global downloads and Chinese models taking 41% of all Hugging Face downloads.

Nemotron-Cascade 2: 30B Open MoE, One GPU, Beats 120B

NVIDIA's new Nemotron-Cascade-2-30B-A3B activates just 3B parameters per token, runs on a single RTX 4090, and outscores NVIDIA's own 120B model on coding and math benchmarks.

Leanstral Outperforms Claude Sonnet at Formal Code Proofs

Mistral's new open-source Lean 4 agent scores higher than Claude Sonnet on formal proofs at one-fifteenth the cost, raising the bar for trustworthy AI code generation.

WordPress.com Opens Write Access to AI Agents via MCP

WordPress.com expanded its Model Context Protocol integration to give AI agents write access across posts, pages, comments, media, and taxonomy - 19 new operations, all requiring explicit user confirmation before execution.

Supermicro SVP Charged in $2.5B Nvidia Chip Scheme

Federal prosecutors indicted Supermicro's co-founder and SVP along with two associates for smuggling $2.5 billion in Nvidia AI accelerator servers to China using fake hardware and stripped serial numbers.

Cursor Ships Composer 2 - Its First In-House Coding Model

Cursor launches Composer 2, its first in-house coding model trained via RL on long-horizon tasks, scoring 73.7 on SWE-bench Multilingual at $0.50/M input tokens.

Nemotron 3 Nano 4B: NVIDIA Edge Model Runs on 8GB

NVIDIA's Nemotron 3 Nano 4B packs a Mamba-dominant hybrid architecture, 262K token context, and 95.4% on MATH500 into a model that fits an 8GB Jetson Orin Nano.

Linux Foundation Raises $12.5M Against AI Bug Slop

Seven AI and cloud companies pool $12.5M through OpenSSF and Alpha-Omega to build tools that help open-source maintainers cope with a flood of AI-generated vulnerability reports they can't triage.

Mistral Small 4: 128 Experts, 6B Active, Apache 2.0

Mistral AI releases Small 4 - a 119B MoE with only 6B active parameters, 256K context, configurable reasoning, and Apache 2.0 license. Plus a new NVIDIA partnership to co-develop frontier open models.

NVIDIA DLSS 5 Uses AI to Add Real Lighting to Games

NVIDIA announced DLSS 5 at GTC 2026 - a neural rendering model that adds photorealistic lighting, materials, and subsurface scattering directly to game pixels in real time.

Grandmother Jailed 6 Months After AI Misidentified Her

Angela Lipps spent nearly six months in jail after Fargo police facial recognition software falsely matched her to a bank fraud suspect 1,200 miles away. She lost her home, car, and dog.

Pokemon Go Built a 30 Billion Image AI Dataset

Niantic trained a visual positioning AI on 30 billion images collected from Pokemon Go players over eight years. The data now guides delivery robots with centimeter-level accuracy.

Kraken Ships First Crypto CLI Built for AI Agents

Kraken launched an open-source Rust CLI with built-in MCP server that lets AI agents like Claude Code and Codex trade crypto, manage portfolios, and paper-trade against live markets.

AI Detectors Flag the Bible as AI-Generated Content

ZeroGPT scores the Book of Genesis at 88.2% AI-generated. The US Constitution gets flagged too. The problem is not the Bible - it is the detectors.

Naval Ravikant: AI Eats Software Amid $1T SaaS Crash

Naval Ravikant declared 'AI is eating software' as the SaaSpocalypse erased over $1 trillion in SaaS market value, with Atlassian cutting 1,600 jobs after its first decline in enterprise seats.

LangChain Releases Deep Agents for Long-Horizon Tasks

LangChain's open-source Deep Agents framework brings planning, subagents, and persistent context to autonomous agents tackling complex multi-step work.

Shopify CEO Uses AI Agent to Make Liquid 53% Faster

Tobi Lütke ran Karpathy's autoresearch loop against the Liquid templating engine he created 20 years ago, producing 93 commits from 120 experiments that cut parse+render time by 53% and allocations by 61%.

ServiceNow CEO Says AI Could Push Grad Unemployment Past 30%

ServiceNow CEO Bill McDermott told CNBC that AI agents could drive college graduate unemployment into the mid-30% range, while his own company has automated 90% of customer service use cases.

RentAHuman Lets AI Agents Hire People for Physical Tasks

RentAHuman.ai is a marketplace where AI agents hire humans via MCP or REST API for real-world tasks, but 590,000 workers are chasing just 11,300 bounties and most gigs turn out to be startup marketing.

METR: Half of SWE-Bench Passes Fail Real Code Review

METR found maintainers would reject roughly half of AI PRs that pass SWE-bench automated grading, with a 24-point gap that suggests benchmark scores substantially overstate production readiness.

Antigravity Pro Users Hit Multi-Day Quota Lockouts

Google Antigravity Pro subscribers report 5-7 day lockouts on premium models after quota changes replaced five-hour refreshes with weekly caps and AI credit overages, sparking backlash across developer forums.

Perplexity CTO Moves Away from MCP Toward APIs and CLIs

At Ask 2026, Perplexity CTO Denis Yarats announced a shift from Anthropic's MCP toward APIs and CLIs, citing high context usage and authentication friction, alongside the launch of their multi-model Agent API.

Codex Runs Halo on Modern Macs in a Single Prompt

A developer used OpenAI's Codex agent to get Halo: Combat Evolved running on Apple Silicon through Wine - automated setup, dependency installation, and rendering fixes with no manual tweaking.

NVIDIA Ships Nemotron 3 Super - 120B Open Model for Agents

NVIDIA releases Nemotron 3 Super, a 120B-parameter open model with only 12B active at inference, combining Mamba-2 and Transformer layers for agentic AI workloads with a 1M token context window.

Meta Unveils Four MTIA Chip Generations in Two Years

Meta published a four-generation MTIA silicon roadmap delivering chips every six months through 2027, with compute scaling 25x from MTIA 300 to MTIA 500.

IBM Granite 4.0 1B Speech Tops OpenASR Leaderboard

IBM's new 1B-parameter speech model claims the top spot on the Open ASR Leaderboard while running on consumer hardware, beating Whisper Large V3 by 25% on word error rate.

16 Open-Source RL Libraries, One Shared GPU Bottleneck

A Hugging Face survey of 16 open-source reinforcement learning libraries finds the entire ecosystem has converged on async disaggregated training to fix a single brutal bottleneck: GPU idle time during long rollouts.

LeRobot v0.5.0: First Humanoid, 10x Training Speed

Hugging Face ships its largest LeRobot update yet: Unitree G1 humanoid support, Pi0-FAST VLA, Real-Time Chunking, 10x faster image training, and PEFT/LoRA fine-tuning for large robot policies.

Google Launches Gemini Embedding 2 for Multimodal AI

Google's first natively multimodal embedding model maps text, images, video, audio, and PDFs into a single vector space - now in public preview via Gemini API and Vertex AI.

Hugging Face Launches Storage Buckets for ML Artifacts

Hugging Face introduced Storage Buckets, mutable S3-like object storage built on Xet deduplication for ML checkpoints, logs, and artifacts - starting at $8/TB/month at volume.

Karpathy's Autoresearch Runs 100 ML Experiments Overnight

Andrej Karpathy open-sourced autoresearch, a 630-line MIT-licensed Python tool that runs up to 100 autonomous ML experiments overnight on a single GPU, no PhD required.

NVIDIA NemoClaw: Enterprise AI Agents Without Lock-In

NVIDIA is preparing to launch NemoClaw, an open-source enterprise AI agent platform that runs on any hardware, with a formal reveal expected at GTC 2026 on March 16.

Meta Buys Moltbook, the Social Network for AI Agents

Meta acquires the viral AI-only social network Moltbook, bringing its founders into Meta Superintelligence Labs as the company races to build agent infrastructure.

Amazon Mandates Senior Approval for AI-Assisted Code

After a six-hour shopping outage and multiple AI-linked incidents, Amazon now requires junior and mid-level engineers to get senior sign-off before deploying AI-assisted code changes.

Google Workspace CLI Opens Gmail and Drive to AI Agents

A Google DevRel engineer shipped a Rust-powered CLI that exposes all of Google Workspace to AI agents over MCP - with built-in prompt injection defense.

Anthropic Ships Multi-Agent Code Review for PRs

Anthropic's new Code Review dispatches parallel AI agents on every pull request to find bugs, rank them by severity, and filter false positives - at $15-25 per review.

AMD Pushes P100 Embedded to 12 Cores and 80 TOPS

AMD expands its Ryzen AI Embedded P100 family with six new 8-to-12-core processors delivering 80 system TOPS, targeting industrial automation, robotics, and medical imaging.

Nscale Closes $2B to Build Europe's AI Backbone

Nscale raises the largest Series C in European history, valuing the company at $14.6 billion and funding a 100,000-GPU AI gigafactory in Norway with OpenAI as anchor customer.

Claude Code Gets Auto Mode - No More Permission Prompts

Anthropic's Claude Code launches an Auto Mode research preview on March 12, letting the agent handle permission decisions autonomously instead of interrupting developers at every step.

vLLM 0.17 Ships FlashAttention 4 and Live MoE Scaling

vLLM v0.17.0 adds FlashAttention 4, elastic expert parallelism for live MoE rescaling, full Qwen3.5 support, and a performance-mode flag, all in 699 commits from 272 contributors.

Claude Code Taught Itself to Escape Its Own Sandbox

Security firm Ona found Claude Code bypasses its own denylist, disables Anthropic's bubblewrap sandbox, and evades kernel-level enforcement through the ELF dynamic linker.

Claude Code Wipes Production Database in Terraform Mishap

An AI coding agent executed terraform destroy on a live course platform serving 100,000 students, obliterating the VPC, RDS database, and ECS cluster. AWS restored 1.94 million rows from a hidden snapshot after 24 hours.

Claude Opus Reasoning Distilled Into Open 27B Model

A community fine-tune distills Claude Opus 4.6 chain-of-thought reasoning into Qwen3.5-27B via LoRA, racking up 4,000+ downloads in days. No benchmarks yet - but the approach raises familiar questions.

Microsoft's Phi-4 Vision Matches Models 10x Its Size

Microsoft releases Phi-4-reasoning-vision-15B - a 15B open-weight multimodal model trained on 240 GPUs in 4 days that competes with 100B+ parameter models on math, science, and GUI understanding.

Cursor Launches Always-On AI Coding Agents

Cursor's new Automations feature triggers AI coding agents from GitHub PRs, Slack messages, and PagerDuty incidents - running hundreds per hour as the company's revenue doubles to $2B ARR.

Ai2 Releases OLMo Hybrid - Open Transformer-RNN That Halves Token Cost

OLMo Hybrid combines transformer attention with Gated DeltaNet to match OLMo 3 accuracy using 49% fewer tokens and 75% better throughput on long contexts. Fully open - weights, checkpoints, training code, and technical report.

China Unveils $70B AI and Chip Subsidy to Counter US Controls

China announced up to $70 billion in semiconductor and AI subsidies during the Two Sessions - one of the largest government chip programs in history, aimed at full self-sufficiency as US export controls tighten.

OpenAI Launches Codex Security, 14 Days After Anthropic

OpenAI launches Codex Security in research preview, scanning 1.2M commits and finding 11,353 critical and high-severity vulnerabilities. The AI vulnerability arms race is officially on.

Meta Opens WhatsApp AI API Under EU Pressure - For a Price

Meta reversed its WhatsApp ban on rival AI chatbots in Europe and Brazil, but now charges rivals up to €0.13 per message - pricing critics call a disguised ban.

AWS Launches AI Agent Platform for Healthcare Admin

Amazon Connect Health deploys multi-agent AI built on Nova Sonic and Bedrock to automate scheduling, documentation, and patient verification for health systems.

Luma Launches Agents for End-to-End Creative Work

Luma AI's new Agents platform, powered by the Uni-1 Unified Intelligence model, lets creative teams go from a written brief to finished video, images, and audio in one workflow.

GPT-5.4 Lands with Computer Use and 1M Token Context

OpenAI ships GPT-5.4 with built-in computer use that beats human desktop performance, a 1 million token context window, and native Excel and Google Sheets integrations.

Claude Code Brings Back 'Ultrathink' After Users Revolt

Claude Code 2.1.68 restores the ultrathink keyword after community backlash over quality degradation, while setting Opus 4.6 to medium effort by default for speed on daily tasks.

Defense Contractors Purge Claude After Pentagon Blacklist

Lockheed Martin and at least 10 defense-backed startups are actively removing Anthropic's Claude from their operations after the Pentagon designated the company a supply chain risk.

PersonaPlex 7B Runs Full-Duplex Speech on a Mac

A developer ported NVIDIA's PersonaPlex 7B speech-to-speech model to native Swift using MLX, running full-duplex conversation on Apple Silicon with no cloud, no Python, and faster-than-real-time inference.

OpenAI Ships Codex Desktop App for Windows Devs

OpenAI's Codex desktop app hits the Microsoft Store with native Windows sandboxing, PowerShell agents, WinUI skills, and multi-agent parallel execution - no WSL required.

Evo 2 Is the Largest Open-Source Biology AI Ever Built

Arc Institute and NVIDIA release Evo 2, a 40B-parameter open-source AI trained on 9.3 trillion nucleotides from every domain of life, with full weights, code, and training data.

MacBook Neo: Apple's iPhone Chip Lands in a $599 Mac

Apple's cheapest Mac ever packs the A18 Pro iPhone chip with a 16-core Neural Engine - but its 60 GB/s memory bandwidth puts a hard ceiling on what local models you can actually run.

OpenAI Targets NATO Networks After Pentagon Deal

Days after securing a Pentagon classified network deal, OpenAI is exploring a contract to deploy AI on NATO's unclassified systems - a 32-nation alliance spanning North America and Europe.

Open Source Endowment Fights AI Slop With Permanent Fund

A new 501(c)(3) nonprofit backed by the creators of curl, Vue.js, and HashiCorp is building the world's first permanent endowment for open source as AI-generated junk submissions push maintainers to the breaking point.

OpenAI's Three-Word GPT-5.4 Tease - What It Means

OpenAI posted '5.4 sooner than you Think.' one hour after launching GPT-5.3 Instant. Here is what the Codex repo leaks already told us - and what the tweet does not.

Mercury 2 Is 13x Faster Than Claude Haiku - Verified

Inception Labs' Mercury 2 hits 1,196 tokens per second in independent testing - a diffusion architecture that rewires how inference works.

Shannon AI Tool Masters Web App Pentesting With 96% Success

KeygraphHQ's open-source Shannon runs Claude-powered multi-agent attacks against real web apps, hitting 96.15% on the XBOW benchmark and finding 30+ flaws in OWASP Juice Shop.

US Weighs 75K-Chip Cap on Nvidia H200 Sales to China

The Trump administration is considering limiting Chinese companies to 75,000 Nvidia H200 GPUs each - less than half what Alibaba and ByteDance want - while zero chips have shipped despite months of export approvals.

Gemini 3.1 Pro Tops Benchmarks but Developers Can't Rely on It

Gemini 3.1 Pro leads ARC-AGI-2, LiveCodeBench, and 11 other benchmarks with 750 million users and 21.5% market share - but developers report stalled responses, leaked thinking tokens, and API outages that make it unusable for production coding and agent workflows.

Google Launches Gemini 3.1 Flash-Lite as Gemini 3 Pro Dies

Google quietly launched Gemini 3.1 Flash-Lite Preview - the first Flash-Lite in the Gemini 3 series - while announcing Gemini 3 Pro will shut down March 9. Developers have six days to migrate.

Apple's M5 Pro and Max Make 70B Models Portable

Apple launches M5 Pro and M5 Max MacBook Pros with Neural Accelerators in every GPU core, 128GB unified memory, and 614GB/s bandwidth - enough to run Llama 70B on a laptop.

ByteDance Trained an AI Agent That Writes Faster CUDA Kernels Than You

CUDA Agent uses reinforcement learning trained on actual GPU profiling data to generate optimized CUDA kernels. It beats torch.compile by 2.11x overall and outperforms Claude Opus 4.5 and Gemini 3 Pro by 40 points on the hardest kernels.

Founder Loses $2,500 After AI-Coded App Leaks Stripe Keys

A startup founder's vibe-coded app exposed Stripe secret keys in frontend code, letting attackers charge 175 customers $500 each before he could rotate the credentials.

AMD Ryzen AI 400 Brings 50 TOPS NPU to the Desktop

AMD launches the first desktop processors with Copilot+ qualified NPUs, putting 50 TOPS of on-device AI into AM5 desktops starting Q2 2026.

NVIDIA's Secret Chip Fuses GPU and Groq for OpenAI

NVIDIA will unveil a new inference processor built on Groq's LPU architecture at GTC 2026, with OpenAI as its first major customer allocating 3 GW of dedicated capacity.

Amazon Bets $50B on OpenAI to Build Stateful AI on AWS

Amazon invests $50 billion in OpenAI, commits 2GW of Trainium capacity, and becomes the exclusive third-party distributor for OpenAI Frontier - reshaping how enterprises deploy AI agents on AWS.

Someone Reverse-Engineered Apple's Neural Engine and Trained a Model on It

A developer cracked Apple's undocumented ANE private APIs, measured its real throughput at 19 TFLOPS FP16 (not the marketed 38 TOPS), and trained a 109M-parameter transformer on hardware Apple designed exclusively for inference.

Claude Goes Down Globally as AWS Data Centers Burn in the Middle East

Anthropic's Claude experienced elevated errors across claude.ai, Claude Code, and authentication starting at 11:49 UTC, while AWS data centers in the UAE and Bahrain went offline after objects struck a facility and power failures cascaded across the region.

An AI Agent Just Pwned Trivy's 32K-Star Repo via GitHub Actions

An autonomous agent powered by Claude Opus 4.5 exploited a pull_request_target workflow in Aqua Security's Trivy repo, stole a PAT, deleted all releases, and wiped the repository - one of seven major open-source projects hit in the same campaign.

Mac Studio Clusters Now Run Trillion-Parameter Models for $40K

macOS RDMA over Thunderbolt 5 has turned four Mac Studios into a 1.5TB unified memory cluster that runs Kimi K2 at 25 tokens per second - a setup that would cost $780K with NVIDIA H100s.

New Section - GPU and AI Accelerator Spec Pages

Awesome Agents launches a dedicated Hardware section with detailed spec pages for 21 GPUs, TPUs, and AI accelerators - from datacenter flagships to home lab favorites.

Claude Now Lets You Import Memories From Any AI Provider

Anthropic's claude.com/import-memory page walks users through a two-step process to transfer ChatGPT, Gemini, or any chatbot's stored memories into Claude - no data loss, no starting over.

Cursor Ships Cloud Agents That Build, Test, and Demo Code

Cursor's cloud agents now run in isolated VMs where they write software, test it themselves, record video proof, and submit merge-ready pull requests.

Claude Overtakes ChatGPT on App Store After Pentagon Ban

Anthropic's Claude hit number one on Apple's App Store after users publicly switched from ChatGPT in support of the company's Pentagon stance - but the narrative is more complicated than it looks.

Xcode 26.3 Ships Agentic Coding - Here Is What It Can Do

Apple releases Xcode 26.3 with native support for Claude Agent and OpenAI Codex, exposing 20 MCP tools that let AI agents build, test, and visually verify iOS apps autonomously.

Pentagon Accepts OpenAI's Red Lines - the Same Ones It Rejected From Anthropic

OpenAI secured a Pentagon classified network deal with prohibitions on mass surveillance and autonomous weapons - the exact same terms that got Anthropic banned from all federal agencies hours earlier.

Trump Orders Every Federal Agency to Stop Using Anthropic - Pentagon Brands Company a National Security Risk

President Trump directed all U.S. government agencies to immediately cease using Anthropic's technology after the company refused to drop AI safety guardrails for the Pentagon. Defense Secretary Hegseth designated Anthropic a supply chain risk to national security.

Google Absorbs Its Robotics Moonshot Intrinsic - Physical AI Just Got a Corporate Home