AI Safety and Security

Focuses on reducing risks, improving reliability, and protecting systems from misuse, failure, and harmful outcomes.

Prompt Injection Became Serious Enough for ICML to Police in Peer Review
By James McCallef | June 13, 2026 | June 29, 2026

Prompt injection is an LLM attack that makes a model follow untrusted instructions hidden in user input or external content,…

VS Code Token Theft Lands; Soundbar Becomes a Keyboard; Web PKI Starts Moving; Espressif Raises the Floor; Elixir Typing Gets Real
By Geoff Dyers | June 4, 2026 | June 29, 2026

The biggest security story today is VS Code token theft, not because one bug landed, but because it exposed how…

Red Hat Scope Turns Hostile; Weather Balloons Beat Public Models; FriendliAI Sells Spare GPU Cycles; CERN Anomaly Stays Below Discovery
By Sarah Fraser | June 2, 2026 | June 29, 2026

The top story is the Red Hat npm incident, because it breaks the usual safety shortcut. Red Hat npm compromise…

Heretic Turns Guardrails Into Forks; AI Security Adds Another Alert Stream; Transformer Doubt Goes Public
By James McCallef | May 26, 2026 | June 21, 2026

The sharpest story today is Heretic, because it turns model safety from a lab policy into a forkable artifact. Elsewhere,…

GitHub says poisoned VS Code extension exposed 3,800 repos
By James McCallef | May 22, 2026 | June 16, 2026

GitHub said on 20 May that a compromised employee device running a poisoned VS Code extension led to the exfiltration…

Firefox Zero-Day: Mozilla Says Claude Mythos Found 271 Bugs
By James McCallef | May 10, 2026 | June 23, 2026

Mozilla said this week that its Firefox zero-day hardening work with an early version of Claude Mythos Preview helped identify…

11 Minutes, $1.73, and GPT-5.5 Cybersecurity Simulation
By Priscilla Li | May 2, 2026 | June 16, 2026

The UK AI Security Institute says GPT-5.5 cybersecurity simulation results now look a lot less like a one-off milestone and…

125 Words, No Account Cues: AI Identifies Writer From Style
By Priscilla Li | April 27, 2026 | June 23, 2026

Anthropic’s Claude Opus 4.7 reportedly identified journalist Kelsey Piper from 125 words of unpublished text, and the details of her…

SMS Blaster Bust Exposes the Limits of SMS Trust
By Priscilla Li | April 26, 2026 | June 23, 2026

Toronto police say they seized several devices they describe as SMS blasters, fake-cell-tower tools used to send fraudulent…
