Heretic Turns Guardrails Into Forks; AI Security Adds Another Alert Stream; Transformer Doubt Goes Public
The sharpest story today is Heretic, because it turns model safety from a lab policy into a forkable artifact. Elsewhere,…
Focuses on reducing risks, improving reliability, and protecting systems from misuse, failure, and harmful outcomes.
The sharpest story today is Heretic, because it turns model safety from a lab policy into a forkable artifact. Elsewhere,…
GitHub said on 20 May that a compromised employee device running a poisoned VS Code extension led to the exfiltration…
Mozilla said this week that its Firefox zero-day hardening work with an early version of Claude Mythos Preview helped identify…
The UK AI Security Institute says GPT-5.5 cybersecurity simulation results now look a lot less like a one-off milestone and…
Anthropic’s Claude Opus 4.7 reportedly identified journalist Kelsey Piper from 125 words of unpublished text, and the details of her…
Toronto police say they seized several devices they describe as an SMS blaster, a fake-cell-tower tool used to send fraudulent…
The headline sounds like satire. It isn’t. Anthropic bans are a real policy and product issue: Anthropic’s own documentation describes…
AI deathbots are software services that simulate a dead person from their digital traces: photos, voice recordings, chat logs, emails,…
A common AI illiteracy failure looks mundane: staff paste confidential text into a consumer chatbot to rewrite it in a…