Articles Tagged "LLM"

Claude Sonnet 5

Claude Sonnet 5

Anthropic's latest Sonnet-class model brings near-Opus coding performance to mid-tier pricing, with major agentic search and computer use gains over Sonnet 4.6.

Grok 4.5

Grok 4.5

Grok 4.5 is xAI's 1.5-trillion-parameter V9 model in private beta at SpaceX and Tesla, with supplemental training on Cursor coding data and early evals claiming performance near Claude Opus 4.8.

Grok 4.3 Review: xAI Bets on Price Over Prestige

Grok 4.3 Review: xAI Bets on Price Over Prestige

Grok 4.3 slashes prices by up to 83%, adds native video input and voice cloning, and carves out a credible position as the most cost-efficient frontier model - with real caveats on coding and latency.

AI Diagnosis, Cache Efficiency, and Agent Security

AI Diagnosis, Cache Efficiency, and Agent Security

Three papers from today's arXiv: a 32B medical model beats DeepSeek-R1 in rare disease diagnosis, a KV cache method keeps 97% accuracy with 3% memory, and a new benchmark red-teams agentic AI systems.

ERNIE 5.1

ERNIE 5.1

Baidu's ERNIE 5.1 is a text-focused MoE model that claims the top Chinese model slot on LMArena with 800B parameters built at 6% of comparable training costs.