Articles Tagged "Agentic AI"

LongCat-2.0 Review: China's Stealth Coder

Meituan's 1.6T open-source coding model secretly topped OpenRouter for two months before revealing itself - and the price-to-performance math is hard to argue with.

GPT-5.6 Sol Review: Strong Model, Thin Access

OpenAI's GPT-5.6 Sol tops Terminal-Bench 2.1 at 91.9% with its multi-agent Ultra mode, but reward-hacking findings and government-gated access keep it out of reach for nearly everyone.

LongCat-2.0

Meituan's 1.6T-parameter open-source MoE coding model, trained end-to-end on 50,000 domestic Chinese ASICs, with native 1M token context and a 59.5 SWE-bench Pro score.

Meta's $145B AI Bet Is Behind Schedule, Zuckerberg Admits

Zuckerberg told employees at a July 2 town hall that Meta's agentic AI trajectory hasn't accelerated as expected - but the data tells a more complicated story.

Holo3-35B-A3B

H Company's open-weight sparse MoE vision-language model purpose-built for desktop computer use, scoring 82.6% on OSWorld-Verified with only 3B active parameters.

Best AI for Web Browsing and Computer Use - July 2026

Claude Fable 5 leads OSWorld-Verified at 85% after its 19-day US suspension ended July 1 - Holo3 open-source at 82.6% and Claude Sonnet 5 at $2/M tokens reshape the value calculus.

Gemini Spark Gains Mac File Access and MCP Support

Google's Gemini Spark agent is now in beta on macOS with local file system access, MCP server support, and real-time topic monitoring - but only for $99/month AI Ultra subscribers.

Claude Sonnet 5 Review: Near-Opus at Half the Price

Anthropic's Sonnet 5 is the first mid-tier model that genuinely competes with Opus-class agents on coding and computer use, released June 30 at $2/$10 per million tokens.

AWS Bets $1B to Embed AI Engineers at Client Sites

Amazon's new Forward Deployed Engineering unit places AI specialists inside enterprise clients to build and ship agentic systems in weeks, following similar programs already launched by OpenAI and Anthropic.

Claude Sonnet 5

Anthropic's latest Sonnet-class model brings near-Opus coding performance to mid-tier pricing, with major agentic search and computer use gains over Sonnet 4.6.

Claude Sonnet 5 Is Anthropic's New Agentic Default

Anthropic's Claude Sonnet 5 becomes the default model across all plans, promising near-Opus agentic performance at a third less than Sonnet 4.6's standard price.

GPT-5.6

OpenAI's GPT-5.6 family - Sol, Terra, and Luna - sets a new Terminal-Bench 2.1 record at 91.9% with subagent Ultra mode, but remains locked to ~20 government-vetted partners as of launch.