Articles Tagged "NVIDIA"

Odyssey Raises $310M as Amazon Bets on World Models

Amazon leads a $310M round into Odyssey, a startup building world models that simulate physics - not just language - with Trainium chip adoption baked in as the price of entry.

AgentPerf - First Infrastructure Benchmark for Agents

Artificial Analysis released AgentPerf, the first agentic AI infrastructure benchmark, measuring concurrent agents per megawatt. NVIDIA Blackwell leads with 20x gains over Hopper.

Orbital Plans 10,000 GPU Satellites for AI Inference

a16z-backed Orbital wants to run AI inference from low Earth orbit using NVIDIA Blackwell GPUs, targeting 10,000 satellites and 1 GW of compute at full scale.

Apple's iOS 27 Beta Ships the Multi-Model Extensions API

iOS 27 Beta 1 is live for developers today, shipping Apple's new Extensions framework that lets Gemini, Claude, and ChatGPT plug into Siri - plus the Nvidia B200 Confidential Computing architecture that keeps those cloud queries private.

NVIDIA Nemotron 3 Ultra 550B-A55B

NVIDIA's 550B open-weight MoE model with 55B active parameters, hybrid Mamba-Transformer architecture, and 1M token context - the top-scoring US open model on the Artificial Analysis Intelligence Index.

NVIDIA Ships Nemotron 3 Ultra - 550B Open-Weight MoE

NVIDIA's 550B Nemotron 3 Ultra, released June 4, tops the US open-weight leaderboard with a hybrid Mamba-Transformer MoE architecture and 300-plus tokens per second throughput.

NVIDIA Drops 110 Open-Source Skills for Physical AI Devs

NVIDIA's Agent Toolkit lands 110+ verified skills on GitHub covering robotics, autonomous vehicles, vision AI, and industrial systems - turning complex physical AI pipelines into single agent calls.

NVIDIA Dynamo Snapshot Slashes Kubernetes AI Cold Starts

NVIDIA's Dynamo Snapshot uses CRIU and cuda-checkpoint to freeze and restore GPU inference containers in seconds, cutting Kubernetes cold-start times by up to 21x for large models.

Intel Crescent Island GPU Skips HBM for 480GB LPDDR5X

Intel's Crescent Island inference GPU trades HBM bandwidth for 480GB of LPDDR5X, targeting customers locked out of NVIDIA's supply chain.

Nvidia Enters the PC Market With RTX Spark Superchip

Nvidia's RTX Spark packs 20 Arm CPU cores and a Blackwell 2.0 GPU with 6,144 CUDA cores into a 45-80W Windows laptop chip, targeting Apple Silicon head-on.

NVIDIA RTX Spark - ARM Blackwell Superchip for AI PCs

NVIDIA RTX Spark is a 20-core ARM + Blackwell GPU superchip delivering 1 petaFLOP FP4 and 128GB unified memory for AI-first Windows laptops and desktops.

NVIDIA Cosmos 3

NVIDIA Cosmos 3 is an open physical AI omnimodel with Mixture-of-Transformers architecture that natively handles text, images, video, sound, and robot actions in a single 16B or 64B model.

← Previous