AI alignment research, safety evaluations, and the organisations working to make AI trustworthy.
AI Weekly Issue #488: OpenAI lost three things in five days
OpenAI faces three simultaneous setbacks that expose vulnerabilities across legal, financial, and strategic fronts. Elon Musk testified Tuesday in a …
The US government’s Anthropic models ban was never about an AI jailbreak
The Trump administration forced Anthropic to withdraw recently released cybersecurity models from public access, citing unspecified national security …
Heart protection from COVID shots remains amid updates, study finds
Researchers have found that COVID-19 vaccines continue to protect hearts from inflammation and related damage, even as updated vaccine formulations ro…
Attackers scale deception with AI. Defenders need truth at machine speed.
Attackers now weaponize AI to generate thousands of convincing phishing messages, fake identities, and social engineering attacks faster than defender…
Roblox exec says ticking a box for age verification is ‘not enough anymore’
Roblox is rolling out facial age estimation technology to replace simple checkbox age verification. The platform's vice president of safety product po…
Anthropic shuts down Fable, Mythos models following Trump admin directive
Anthropic has discontinued its Fable and Mythos AI model lines following a directive from the Trump administration. The Commerce Department cited nati…
This Week in AI: The Next-Gen Recommendation Experience
Miguel Fierro, formerly a principal researcher at Microsoft, launched RecoMind to tackle next-generation recommendation systems. He joined data and AI…
SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift
SpaceX's planned initial public offering presents a murky investment landscape for lower-tier SPV (special purpose vehicle) investors, who may not und…
Claude Fable 5: Anthropic admits "wrong tradeoff" after invisibly throttling rival AI researchers
Anthropic has reversed a controversial policy that would have secretly degraded performance for AI researchers using competing models. The company ack…
Job titles of the future: Nature’s drug designer
Tim Cernak spent nearly two decades at Merck developing precision therapies for cancer, HIV, and diabetes. His work focused on targeting disease while…