AI alignment research, safety evaluations, and the organisations working to make AI trustworthy.

Safety

AI Weekly Issue #488: OpenAI lost three things in five days

OpenAI faces three simultaneous setbacks that expose vulnerabilities across legal, financial, and strategic fronts. Elon Musk testified Tuesday in a …

12h ago
Safety

The US government’s Anthropic models ban was never about an AI jailbreak

The Trump administration forced Anthropic to withdraw recently released cybersecurity models from public access, citing unspecified national security …

12h ago
Safety

Heart protection from COVID shots remains amid updates, study finds

Researchers have found that COVID-19 vaccines continue to protect hearts from inflammation and related damage, even as updated vaccine formulations ro…

12h ago
Safety

Attackers scale deception with AI. Defenders need truth at machine speed.

Attackers now weaponize AI to generate thousands of convincing phishing messages, fake identities, and social engineering attacks faster than defender…

Yesterday
Safety

Roblox exec says ticking a box for age verification is ‘not enough anymore’

Roblox is rolling out facial age estimation technology to replace simple checkbox age verification. The platform's vice president of safety product po…

Yesterday
Safety

Anthropic shuts down Fable, Mythos models following Trump admin directive

Anthropic has discontinued its Fable and Mythos AI model lines following a directive from the Trump administration. The Commerce Department cited nati…

3 days ago
Safety

This Week in AI: The Next-Gen Recommendation Experience

Miguel Fierro, formerly a principal researcher at Microsoft, launched RecoMind to tackle next-generation recommendation systems. He joined data and AI…

4 days ago
Safety

SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift

SpaceX's planned initial public offering presents a murky investment landscape for lower-tier SPV (special purpose vehicle) investors, who may not und…

4 days ago
Safety

Claude Fable 5: Anthropic admits "wrong tradeoff" after invisibly throttling rival AI researchers

Anthropic has reversed a controversial policy that would have secretly degraded performance for AI researchers using competing models. The company ack…

5 days ago
Safety

Job titles of the future: Nature’s drug designer

Tim Cernak spent nearly two decades at Merck developing precision therapies for cancer, HIV, and diabetes. His work focused on targeting disease while…

5 days ago

Get Daily AIWireDaily

The best stories, delivered to your inbox each morning.