Articles Tagged "Benchmarks"

Kling 3.0

Kuaishou's Kling 3.0 is the first commercially available AI video model to ship native 4K at 60fps, with multilingual audio, multi-shot storyboarding, and a $0.075/s API.

Grok Imagine Video 1.5

xAI's Grok Imagine Video 1.5 is the #1-ranked image-to-video model on Artificial Analysis, generating 720p clips with native audio at $0.14/s - 86% cheaper than Sora 2 Pro.

Dreamina Seedance 2.0

ByteDance's top-ranked AI video generation model with native joint audio-video synthesis, multi-shot support, and multimodal reference inputs across up to 12 files per generation.

HappyHorse-1.0

HappyHorse-1.0 is Alibaba's 15-billion-parameter video generation model that ranked #1 on Artificial Analysis, producing 720p-1080p clips with joint audio-video synthesis in a single forward pass.

SkyReels V4

SkyReels V4 is Skywork AI's unified multi-modal video model that jointly generates 1080p/32FPS video and synchronized audio from a single dual-stream diffusion transformer.

Best AI Models for Video Generation - June 2026

HappyHorse-1.0 from Alibaba-ATH leads the Artificial Analysis blind-vote rankings at Elo 1,290, but Seedance 2.0 is now globally available via fal.ai and still tops the with-audio leaderboard at 1,218.

AI Patched Firefox Before Pwn2Own - OpenAI's Security Pivot

OpenAI's GPT-5.5-Cyber found CVE-2026-8390 in Firefox's WebAssembly engine before Pwn2Own Berlin - five of six registered exploit entries withdrew.

Sakana Fugu

Sakana AI's orchestrator model that dynamically coordinates Claude Opus 4.8, GPT-5.5, and Gemini 3.1 Pro to beat each of them individually on SWE-Bench Pro, GPQA-Diamond, and eight other benchmarks.

AI Research: Orchestration Beats Scale, Small Models Win

Sakana Fugu tops SWE-Bench Pro by routing tasks across rival LLMs, Microsoft's 9B browser agent beats OpenAI Operator, and a 3B model from Weibo matches DeepSeek V3.2 on math.

VibeThinker-3B

WeiboAI's 3B dense reasoning model fine-tuned from Qwen2.5-Coder-3B, posting AIME 2026 scores that match DeepSeek V3.2 (671B) using the Spectrum-to-Signal training pipeline.

Fara-1.5

Microsoft Research's family of open-weight browser computer use agents (4B, 9B, 27B) that beat OpenAI Operator and Gemini 2.5 Computer Use on Online-Mind2Web.

ERNIE 5.1

Baidu's ERNIE 5.1 is a text-focused MoE model that claims the top Chinese model slot on LMArena with 800B parameters built at 6% of comparable training costs.

← Previous