Anthony DiGiovanni

Karma: 1,220

Researcher at the Center on Long-Term Risk. All opinions my own.

A high-level model of AI bargaining

Anthony DiGiovanni · 21 Jun 2026 15:37 UTC
24 points
1 comment · 5 min read · LW link

[Linkpost] Evals for “SPI-incompatible” behavior & reasoning: Guide to initial research

Anthony DiGiovanni · 9 Jun 2026 13:44 UTC
23 points
0 comments · 1 min read · LW link
(docs.google.com)

CLR’s Safe Pareto Improvements Research Agenda

Anthony DiGiovanni · 20 Apr 2026 9:28 UTC
45 points
7 comments · 14 min read · LW link