Articles Tagged "AI Safety"

GPT-5.4-Cyber

GPT-5.4-Cyber

OpenAI's GPT-5.4-Cyber is a cyber-permissive fine-tune of GPT-5.4 Thinking with binary reverse engineering, 88.23% on professional CTFs, and access gated through the Trusted Access for Cyber program.

GitHub Bans Engineer Who Shipped 500 Agent PRs in 72 Hours

GitHub Bans Engineer Who Shipped 500 Agent PRs in 72 Hours

A Korean CTO ran a 13-step agent harness against 100+ major open-source repos over three days, landing 500+ commits and 130+ PRs - some merged by Kubernetes, Hugging Face, and Ollama maintainers. Then GitHub banned his account for spam, confirming that platform abuse detection cannot yet tell a disciplined harness from a bot.

Tesla Hid Thousands of Fatal Autopilot Incidents, RTS Says

Tesla Hid Thousands of Fatal Autopilot Incidents, RTS Says

Swiss broadcaster RTS reopens the 2023 Tesla Files leak in context of the confirmed $243M Miami verdict. The combined record: 2,400+ concealed sudden-acceleration complaints, 1,000+ undisclosed crashes, and a federal court that found Tesla knew.

LLM Jailbreak and Red-Team Resistance Leaderboard

LLM Jailbreak and Red-Team Resistance Leaderboard

Rankings of 14 frontier LLMs by adversarial robustness - how well they resist jailbreaks, prompt injection, and harmful-behavior elicitation across HarmBench, AdvBench, StrongREJECT, JailbreakBench, and AgentHarm.