Articles Tagged "NVIDIA"

NVIDIA Vera Rubin NVL144

NVIDIA's Rubin-based rack system with 144 R200 GPUs, 3.6 ExaFLOPS FP4, 20 TB HBM4 - arriving H2 2026.

Thinking Machines Lands Gigawatt Compute from Nvidia

Nvidia commits a gigawatt of Vera Rubin chips to Mira Murati's startup, a supply the FT values at tens of billions of dollars, alongside an undisclosed cash investment.

NVIDIA Nemotron 3 Super 120B-A12B

NVIDIA Nemotron 3 Super is a 120B-parameter open model with 12B active at inference, combining Mamba-2, LatentMoE, and Multi-Token Prediction for agentic workloads with a 1M token context window.

NVIDIA Ships Nemotron 3 Super - 120B Open Model for Agents

NVIDIA releases Nemotron 3 Super, a 120B-parameter open model with only 12B active at inference, combining Mamba-2 and Transformer layers for agentic AI workloads with a 1M token context window.

Iran Targets US Tech Facilities - What It Means for AI

Iran's IRGC designated facilities of Amazon, Nvidia, Microsoft, Google, Oracle, IBM, and Palantir across Israel and the Gulf as legitimate targets - with AWS data centers already struck by drones.

Oracle and OpenAI Scrap Texas Stargate Data Center Expansion

Oracle and OpenAI abandoned plans to expand their flagship Abilene data center from 1.2 GW to 2 GW after financing talks collapsed and a winter cooling outage strained relations with operator Crusoe.

NVIDIA NemoClaw: Enterprise AI Agents Without Lock-In

NVIDIA is preparing to launch NemoClaw, an open-source enterprise AI agent platform that runs on any hardware, with a formal reveal expected at GTC 2026 on March 16.

LeCun Raises $1B Seed to Build AI Beyond LLMs

Yann LeCun's AMI Labs closes a $1.03 billion seed round at a $3.5 billion valuation, betting that world models - not large language models - will define the next era of AI.

Nscale Closes $2B to Build Europe's AI Backbone

Nscale raises the largest Series C in European history, valuing the company at $14.6 billion and funding a 100,000-GPU AI gigafactory in Norway with OpenAI as anchor customer.

vLLM 0.17 Ships FlashAttention 4 and Live MoE Scaling

vLLM v0.17.0 adds FlashAttention 4, elastic expert parallelism for live MoE rescaling, full Qwen3.5 support, and a performance-mode flag, all in 699 commits from 272 contributors.

OpenClaw Hits 250K GitHub Stars, Surpasses React

The open-source AI agent framework crossed 250,000 GitHub stars in roughly 60 days, surpassing React's decade-long total. NVIDIA CEO Jensen Huang called it the most important software release ever.

US Drafts Rules Requiring Permits for All AI Chip Exports

The Commerce Department has drafted regulations requiring government approval for all AI chip exports worldwide - not just to China - giving Washington unprecedented gatekeeper power over global AI development.

← Previous