Articles Tagged "Multimodal"

Cohere Command A Vision

Cohere Command A Vision

Cohere Command A Vision is a 112B multimodal model that leads on document and OCR benchmarks, beating GPT-4.1 across seven visual understanding tasks.