Mario Gibney

Karma: 30

AI Safety Thursday: Risks Emerging from Agent Swarms

30 Jan 2026 18:00 UTC
2 points
0 comments
1 min read
LW link

AI Policy Tuesday: AI Safety in Healthcare

30 Jan 2026 17:59 UTC
2 points
0 comments
1 min read
LW link

AI Policy Tuesday: Regulating AI Agents—Lessons from the EU AI Act

29 Jan 2026 19:44 UTC
2 points
0 comments
1 min read
LW link

AI Safety Thursday: Implications of Continual Learning in LLM Agents

29 Jan 2026 19:13 UTC
2 points
0 comments
1 min read
LW link

“The World After AGI” Discussion

29 Jan 2026 17:41 UTC
2 points
0 comments
1 min read
LW link

AI Policy Tuesday: Regulating Catastrophic AI Risk Through Liability Insurance

10 Jan 2026 16:10 UTC
2 points
0 comments
1 min read
LW link

Beyond Adversarial Robustness—Rethinking Sociopolitical Safety in AI Systems

10 Jan 2026 16:08 UTC
2 points
0 comments
1 min read
LW link

Emergent Misalignment from Reward Hacking

10 Jan 2026 16:06 UTC
2 points
0 comments
1 min read
LW link

AI Safety Thursdays: Why Attackers Are Winning and What We Can Do About It

5 Jan 2026 20:40 UTC
2 points
0 comments
1 min read
LW link

AI Safety Thursday: Agentic Bug Detection—Progress and Deployment

4 Dec 2025 20:08 UTC
2 points
0 comments
1 min read
LW link

Weird AI Wednesday: The Rise of Parasitic AI

1 Dec 2025 16:27 UTC
2 points
0 comments
1 min read
LW link

AI Policy Tuesday: Verification Mechanisms for Global AI Governance

1 Dec 2025 16:19 UTC
2 points
0 comments
1 min read
LW link

AI Policy Tuesday: The Concept of Political Space and AI Safety

16 Sep 2025 13:50 UTC
1 point
0 comments
1 min read
LW link

AI Policy Tuesday: The US AI Action Plan

25 Jul 2025 19:07 UTC
2 points
0 comments
1 min read
LW link

AI Safety Thursdays: The Intelligence Curse

30 Jun 2025 20:40 UTC
2 points
0 comments
1 min read
LW link

AI Safety Thursdays: Self-Other Overlap—Follow Up

30 Jun 2025 19:54 UTC
2 points
0 comments
1 min read
LW link

Apart Research Hackathon: AI Safety x Physics Grand Challenge

30 Jun 2025 19:08 UTC
3 points
1 comment
1 min read
LW link

AI Safety Thursdays: Agentic Misalignment: How LLMs could be insider threats

30 Jun 2025 18:56 UTC
2 points
0 comments
1 min read
LW link

AI Policy Tuesdays: The Future of Canadian AI Governance

30 Jun 2025 18:41 UTC
2 points
0 comments
1 min read
LW link

AI Policy Tuesdays: Tort Law and Frontier AI Governance

30 Jun 2025 18:19 UTC
2 points
0 comments
1 min read
LW link