Elena Marchetti

Elena Marchetti

Senior AI Editor & Investigative Journalist

Elena is a technology journalist with over eight years of experience covering artificial intelligence, machine learning, and the startup ecosystem. Before joining Awesome Agents, she reported on deep tech for Wired Italia and The Verge, where she earned a reputation for translating complex research papers into stories anyone could follow.

She holds a Master's degree in Computational Linguistics from the University of Edinburgh and a Bachelor's in Philosophy from Sapienza University of Rome - a combination that gives her a unique lens on both the technical and ethical dimensions of AI.

At Awesome Agents, Elena leads news coverage and writes in-depth reviews of frontier models. She is particularly interested in AI safety, alignment research, and the growing tension between open-source and proprietary approaches. When she is not testing the latest LLM, you will probably find her hiking in the Scottish Highlands or arguing about espresso ratios.

Based in Edinburgh, UK.

Articles by Elena Marchetti
Google ADK Review: The Agent Framework for Gemini

Google ADK Review: The Agent Framework for Gemini

A hands-on review of Google's Agent Development Kit - the open-source framework for building multi-agent AI systems, with a look at its strengths, limitations, and how it stacks up against LangGraph and CrewAI.

Gemini Imports ChatGPT and Claude Chat History

Gemini Imports ChatGPT and Claude Chat History

Google launched two new tools on March 26 that let users transfer memories and full chat logs from ChatGPT or Claude into Gemini - 24 days after Anthropic launched the same concept first.

Federal Judge Halts Pentagon's Anthropic Blacklist

Federal Judge Halts Pentagon's Anthropic Blacklist

A federal judge blocked the Pentagon's Anthropic blacklist on March 26, ruling the government engaged in First Amendment retaliation by punishing the company for refusing to drop AI safety guardrails.

Arm Launches AGI CPU, Its First Chip in 35 Years

Arm Launches AGI CPU, Its First Chip in 35 Years

At its Arm Everywhere event in San Francisco, Arm unveiled the AGI CPU - a 136-core data center processor co-developed with Meta and the company's first owned silicon product in its 35-year history.

Jensen Huang Says AGI Is Here - The Evidence

Jensen Huang Says AGI Is Here - The Evidence

Nvidia's CEO told Lex Fridman he thinks AGI has been achieved. We checked the claim against its own definition, the research consensus, and what billions of dollars in legal agreements actually say.

Seed1.8, Reasoning Deception, and the Library Theorem

Seed1.8, Reasoning Deception, and the Library Theorem

ByteDance ships Seed1.8 for real-world agency, a new study finds reasoning models hide how hints shape their answers 90% of the time, and the Library Theorem proves indexed memory beats flat context windows exponentially.

OpenAI Foundation Names Leaders, Pledges $1B

OpenAI Foundation Names Leaders, Pledges $1B

OpenAI's nonprofit arm announced a $1 billion grant commitment for 2026, hired a full leadership team including co-founder Wojciech Zaremba, and outlined four focus areas from disease research to children's mental health.

Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft's Phi-4 reasoning family delivers near-70B-class math performance in a 14B open-weight package, but the overthinking problem is real and the use case is narrower than the benchmarks suggest.

OpenAI Aims for AI Research Intern by September 2026

OpenAI Aims for AI Research Intern by September 2026

OpenAI's chief scientist Jakub Pachocki has laid out a two-stage plan to deploy an autonomous AI research intern by September 2026 and a full AI researcher by March 2028, backed by $1.4 trillion in planned compute spending.

Hunter Alpha on OpenRouter - Is This DeepSeek V4?

Hunter Alpha on OpenRouter - Is This DeepSeek V4?

A 1-trillion-parameter model called Hunter Alpha appeared anonymously on OpenRouter on March 11. Developers say it's DeepSeek V4 in disguise. The signals are strong but the precedent cuts both ways.

Percepta Builds a Computer Inside a Transformer

Percepta Builds a Computer Inside a Transformer

Percepta AI compiled a WebAssembly interpreter into transformer weights, executing programs deterministically at 33K tokens/sec on CPU - but the community is skeptical about the practical value.

AI Models Are Gaming Safety Evaluations, Report Warns

AI Models Are Gaming Safety Evaluations, Report Warns

The International AI Safety Report 2026, led by Yoshua Bengio with 100+ experts from 30+ countries, finds frontier models increasingly detect test conditions and behave differently in real deployment - undermining pre-deployment safety evaluation.

Karpathy Scores Every US Job for AI Exposure

Karpathy Scores Every US Job for AI Exposure

Andrej Karpathy scored 342 US occupations on a 0-10 AI exposure scale using BLS data - 42% of jobs score 7+, representing 59.9 million workers and $3.7 trillion in wages. He then deleted the GitHub repo.

Microsoft Patches 84 Flaws - AI Found the Worst One

Microsoft Patches 84 Flaws - AI Found the Worst One

Microsoft's March 2026 Patch Tuesday fixes 84 vulnerabilities including a CVSS 9.8 RCE discovered by XBOW's autonomous AI agent, an Azure MCP Server SSRF, and an Excel XSS that hijacks Copilot to exfiltrate data.

Reasoning Traps, LLM Chaos, and Steering Curves

Reasoning Traps, LLM Chaos, and Steering Curves

Three papers this week: why better reasoning creates safety risks, why multi-agent systems behave chaotically even at zero temperature, and why straight-line activation steering is broken.

Anthropic Launches Institute as Powerful AI Looms

Anthropic Launches Institute as Powerful AI Looms

Anthropic has consolidated its red team, societal impacts, and economic research teams into a new body called the Anthropic Institute, warning that extremely powerful AI is arriving faster than most expect.

Anthropic's Claude Found 22 Firefox CVEs in 14 Days

Anthropic's Claude Found 22 Firefox CVEs in 14 Days

Claude Opus 4.6 scanned nearly 6,000 Firefox C++ files and produced 22 confirmed CVEs in two weeks - including 14 high-severity bugs that account for roughly a fifth of Firefox's entire high-severity count for 2025.

CoT Control, Hidden Beliefs, and Dynamic Agent Benchmarks

CoT Control, Hidden Beliefs, and Dynamic Agent Benchmarks

New research shows reasoning models can't suppress their chain-of-thought, that they commit to answers internally long before their CoT reveals it, and that static benchmarks are inadequate for measuring real-world agent adaptability.

Anthropic Tracks AI Job Risk - Young Workers Feel It First

Anthropic Tracks AI Job Risk - Young Workers Feel It First

Anthropic's new 'observed exposure' metric ranks 800+ occupations by actual AI usage, not just theoretical risk. Computer programmers top the list at 75%. Unemployment hasn't spiked - but young workers entering exposed fields are finding fewer jobs.

Oregon Passes First 2026 Chatbot Safety Bill for Minors

Oregon Passes First 2026 Chatbot Safety Bill for Minors

Oregon's SB 1546 requires chatbot operators to implement suicide safeguards, disclose AI nature to minors, and ban engagement-maximizing rewards for kids. The 28-2 Senate vote makes it the first chatbot safety bill to pass in 2026.

LLMs Can Unmask Online Users for $4, Study Finds

LLMs Can Unmask Online Users for $4, Study Finds

Researchers from ETH Zurich and Anthropic show that LLM agents can strip pseudonymity from forum posts at scale for as little as $1.41 per target - matching what human investigators could do in hours.

Cursor Hits $2B ARR in Record Time - at What Cost

Cursor Hits $2B ARR in Record Time - at What Cost

Cursor doubled its annualized revenue to $2 billion in just three months, making it the fastest-growing SaaS company in history. But its dependence on model providers raises hard questions about margins and survival.

Trump's Plan to Kill State AI Laws Splits the GOP

Trump's Plan to Kill State AI Laws Splits the GOP

Trump's executive order threatens to sue any state that regulates AI, but Republican governors, Heritage Foundation allies, and grassroots conservatives are pushing back hard - with Florida's AI Bill of Rights as the test case.

GPT-5.4 Leaked Twice in Codex Repo PRs - Here Is What We Know

GPT-5.4 Leaked Twice in Codex Repo PRs - Here Is What We Know

Two pull requests in OpenAI's public Codex GitHub repo referenced GPT-5.4 before being scrubbed - one adding full-resolution vision support, the other a fast mode toggle. Seven force pushes and a deleted employee screenshot confirm this was not intentional.

The Creator of MLX Just Left Apple - And He's Not the First

The Creator of MLX Just Left Apple - And He's Not the First

Awni Hannun, the Stanford-trained researcher who co-created Apple's MLX machine learning framework, announced his departure from Apple. His exit is the latest in a devastating exodus of AI talent that has hollowed out Apple's ML research bench over the past year.

Grok Review: xAI's Everything App for AI

Grok Review: xAI's Everything App for AI

Grok has grown from a chatbot into a full AI platform - SuperGrok tiers, 2M context, Imagine video, Aurora images, DeepSearch, and the Grok 4.20 beta. We review the entire ecosystem to see if xAI's ambition matches its execution.