Articles Tagged "Llama"

GPT-4 to Self-Hosted Llama 4 Migration Guide

Migrate from GPT-4o (now retired) or GPT-5.1 to self-hosted Llama 4 with near-zero code changes, but plan carefully for hardware, EU licensing, and realistic context window limits.

Llama 3.3 70B Instruct

Meta's Llama 3.3 70B Instruct matches Llama 3.1 405B on instruction following and math while running at 4-5x lower cost, with the lowest hallucination rate of any open-weight model on the Vectara summarization leaderboard.

Llama 4 Maverick

Meta's Llama 4 Maverick packs 400B total parameters into a 128-expert MoE architecture with only 17B active per token, beating GPT-4o on Chatbot Arena while matching DeepSeek V3 on reasoning at half the active parameters.

Llama 4 Scout

Meta's Llama 4 Scout is a 109B-total, 17B-active MoE model with 16 experts, native multimodal support for text and images, and a 10M-token context window, the longest of any open-weight model.