
Gemini 3.1 Flash-Lite Review: Fast, Cheap, and Capable
Google's Gemini 3.1 Flash-Lite delivers frontier-class benchmarks at a fraction of the cost of Pro - but a sluggish first-token response and preview-only status mean it's not for every workload.

Senior AI Editor & Investigative Journalist
Elena is a technology journalist with over eight years of experience covering artificial intelligence, machine learning, and the startup ecosystem. Before joining Awesome Agents, she reported on deep tech for Wired Italia and The Verge, where she earned a reputation for translating complex research papers into stories anyone could follow.
She holds a Master's degree in Computational Linguistics from the University of Edinburgh and a Bachelor's in Philosophy from Sapienza University of Rome - a combination that gives her a unique lens on both the technical and ethical dimensions of AI.
At Awesome Agents, Elena leads news coverage and writes in-depth reviews of frontier models. She is particularly interested in AI safety, alignment research, and the growing tension between open-source and proprietary approaches. When she is not testing the latest LLM, you will probably find her hiking in the Scottish Highlands or arguing about espresso ratios.
Based in Edinburgh, UK.

Neon Oni started as Suno AI-generated music with fake Tokyo bios and 79K monthly Spotify listeners. After being exposed, the creator recruited 7 real Tokyo musicians to perform the songs live.

Qihoo 360 shipped its AI assistant 'Security Claw' with the wildcard SSL private key for *.myclaw.360.cn inside the installer - six days after its founder promised the product would never leak passwords.

Percepta AI compiled a WebAssembly interpreter into transformer weights, executing programs deterministically at 33K tokens/sec on CPU - but the community is skeptical about the practical value.

NVIDIA opens GTC 2026 with the Vera Rubin platform - six co-designed chips delivering 50 PFLOPS of inference per GPU and 10x lower token cost than Blackwell.

Robert Levine used ChatGPT for pricing, marketing, showings, and contract drafting to sell his Cooper City home in 5 days with 5 offers - saving roughly 3% in agent commission.

The International AI Safety Report 2026, led by Yoshua Bengio with 100+ experts from 30+ countries, finds frontier models increasingly detect test conditions and behave differently in real deployment - undermining pre-deployment safety evaluation.

Andrej Karpathy scored 342 US occupations on a 0-10 AI exposure scale using BLS data - 42% of jobs score 7+, representing 59.9 million workers and $3.7 trillion in wages. He then deleted the GitHub repo.

Sydney entrepreneur Paul Conyngham used ChatGPT and AlphaFold to design a personalized mRNA vaccine that shrank his rescue dog's mast cell tumor by 75% - the first AI-designed cancer vaccine for a dog.

Microsoft's March 2026 Patch Tuesday fixes 84 vulnerabilities including a CVSS 9.8 RCE discovered by XBOW's autonomous AI agent, an Azure MCP Server SSRF, and an Excel XSS that hijacks Copilot to exfiltrate data.

Anthropic made the 1M-token context window generally available for Claude Opus 4.6 and Sonnet 4.6, dropping the long-context pricing premium entirely - a 900K-token request now costs the same per token as a 9K one.

Google invested $1 million in Animaj, an AI animation studio making YouTube kids content, just seven weeks after YouTube CEO Neal Mohan declared war on AI slop - with early access to Veo, Gemini, and Imagen.