Articles Tagged "Speech Recognition"

Best AI Audio Editing Tools in 2026

Best AI Audio Editing Tools in 2026

A hands-on comparison of the top AI audio editing tools in 2026, covering noise removal, stem separation, mastering, and podcast production.

Qwen3.5-Omni

Qwen3.5-Omni

Alibaba's Qwen3.5-Omni takes text, images, audio, and video as input and streams both text and speech output in a single end-to-end model with a 256K context window.

llama.cpp Lands Three Audio Models in 48 Hours

llama.cpp Lands Three Audio Models in 48 Hours

Three separate PRs merged into llama.cpp between April 11-13 add MERaLiON-2, Gemma 4's Conformer encoder, and Qwen3-Omni/ASR - making local voice AI inference practical on consumer hardware for the first time.

Qwen3.5-Omni Does 10-Hour Audio and 4M Video Frames

Qwen3.5-Omni Does 10-Hour Audio and 4M Video Frames

Alibaba's Qwen3.5-Omni handles audio, video, images, and text in a single model pass - and generates speech in real time. The Plus variant hits SOTA on 215 benchmarks and edges out Gemini 3.1 Pro on audio tasks.