Articles Tagged "GPU"

OpenAI Open-Sources MRC to Fix AI Supercomputer Jams

Six companies just released MRC, an open networking protocol that routes AI training traffic across hundreds of simultaneous paths to end GPU idle time at supercomputer scale.

Fine-Tuning Costs Comparison - Train Your Own AI

May 2026: Together AI adds Llama 4 and DeepSeek fine-tuning, Fireworks raised deployment prices by $1/hr, and H100 rentals fell below $2.40/hr.

Best AI Home Workstations 2026 - Full Buying Guide

Complete buying guide for AI home workstations in 2026 - pre-built machines and DIY builds for running local LLMs from 3B to 70B+ parameters, with benchmarks, part lists, and price-tier comparisons.

Cloud GPU Rental Pricing Compared - April 2026

Raw GPU rental rates across 20+ providers normalized to per-GPU-hour - H100, H200, A100, L40S, RTX 4090, on-demand vs spot vs reserved, with hidden costs and value-tier recommendations.

Best Open-Source LLM Inference Servers 2026

A benchmark-driven comparison of the top open-source LLM inference servers - vLLM, SGLang, TGI, llama.cpp, TensorRT-LLM, LMDeploy, and more.

A $900 RTX 3090 Now Beats an M5 Max at LLM Inference

Two researchers fused all 24 layers of Qwen 3.5-0.8B into a single CUDA kernel launch, making a five-year-old RTX 3090 deliver 1.8x the throughput of an M5 Max at equal or better efficiency. The gap was software, not silicon.

AMD Instinct MI430X - Dual-Precision CDNA 5 Accelerator

The AMD Instinct MI430X is AMD's CDNA 5 HPC accelerator with 432GB HBM4, full FP64 support, and 19.6 TB/s bandwidth - designed for sovereign AI and scientific supercomputing alongside the MI455X AI GPU.

A Shoe Company Ditched Shoes for GPUs and Surged 373%

Allbirds sold its entire footwear business for $39 million - roughly 1% of its $4 billion peak valuation - and is rebranding as NewBird AI to buy GPUs and rent compute to AI developers. The stock quadrupled in a day.

Meta Commits $21B More to CoreWeave, Total Hits $35B

Meta expands its CoreWeave partnership by $21 billion through December 2032, bringing total commitments to $35 billion and locking in early NVIDIA Vera Rubin deployments.

Intel Arc Pro B70 Brings 32GB VRAM to Local AI for $949

Intel's Arc Pro B70 launched on March 25 with 32GB GDDR6 and 367 TOPS for $949, undercutting NVIDIA's RTX Pro 4000 by $850. The hardware case is strong. The software story is not.

AutoKernel - AI Agents That Write Faster GPU Kernels

RightNow AI releases AutoKernel, an open-source MIT-licensed framework that runs an autonomous LLM agent loop overnight to produce optimized Triton kernels for any PyTorch model.

Meta's KernelEvolve Automates Kernel Tuning in Production

Meta's KernelEvolve AI agent autonomously generates and optimizes hardware kernels across NVIDIA, AMD, and MTIA chips, delivering over 60% inference gains in production.
