
MIRI 2024 Communications Strategy

Gretta Duleba · 29 May 2024 19:33 UTC
202 points
43 comments · 7 min read · LW link

Value Claims (In Particular) Are Usually Bullshit

johnswentworth · 30 May 2024 6:26 UTC
61 points
6 comments · 2 min read · LW link

Awakening

lsusr · 30 May 2024 7:03 UTC
52 points
6 comments · 9 min read · LW link

The Pearly Gates

lsusr · 30 May 2024 4:01 UTC
65 points
1 comment · 3 min read · LW link

AI #66: Oh to Be Less Online

Zvi · 30 May 2024 14:20 UTC
11 points
1 comment · 56 min read · LW link
(thezvi.wordpress.com)

OpenAI: Fallout

Zvi · 28 May 2024 13:20 UTC
184 points
19 comments · 36 min read · LW link
(thezvi.wordpress.com)

Apollo Research 1-year update

29 May 2024 17:44 UTC
74 points
0 comments · 7 min read · LW link

Response to nostalgebraist: proudly waving my moral-antirealist battle flag

Steven Byrnes · 29 May 2024 16:48 UTC
62 points
10 comments · 11 min read · LW link

Thoughts on SB-1047

ryan_greenblatt · 29 May 2024 23:26 UTC
45 points
0 comments · 11 min read · LW link

Notifications Received in 30 Minutes of Class

tanagrabeast · 26 May 2024 17:02 UTC
263 points
8 comments · 8 min read · LW link

Maybe Anthropic’s Long-Term Benefit Trust is powerless

Zach Stein-Perlman · 27 May 2024 13:00 UTC
182 points
19 comments · 2 min read · LW link

US Presidential Election: Tractability, Importance, and Urgency

kuhanj · 29 May 2024 23:52 UTC
35 points
1 comment · 3 min read · LW link

Hardshipification

Jonathan Moregård · 28 May 2024 20:02 UTC
73 points
17 comments · 2 min read · LW link
(honestliving.substack.com)

Truthseeking is the ground in which other principles grow

Elizabeth · 27 May 2024 1:09 UTC
167 points
8 comments · 16 min read · LW link

AXRP Episode 32 - Understanding Agency with Jan Kulveit

DanielFilan · 30 May 2024 3:50 UTC
21 points
0 comments · 53 min read · LW link

AI companies aren’t really using external evaluators

Zach Stein-Perlman · 24 May 2024 16:01 UTC
239 points
14 comments · 4 min read · LW link

Reward hacking behavior can generalize across tasks

28 May 2024 16:33 UTC
56 points
0 comments · 21 min read · LW link

When Are Circular Definitions A Problem?

johnswentworth · 28 May 2024 20:00 UTC
48 points
12 comments · 3 min read · LW link

[Question] How to get nerds fascinated about mysterious chronic illness research?

riceissa · 27 May 2024 22:58 UTC
78 points
33 comments · 2 min read · LW link

I am the Golden Gate Bridge

Zvi · 27 May 2024 14:40 UTC
91 points
6 comments · 27 min read · LW link
(thezvi.wordpress.com)