RSS

From seeded com­plex­ity to con­scious­ness—yes, it’s all the same.

eschatail8 Oct 2024 21:31 UTC
0 points
0 comments2 min readLW link

Limits of safe and al­igned AI

Shivam8 Oct 2024 21:30 UTC
1 point
0 comments4 min readLW link

[Question] What con­sti­tutes an in­fo­haz­ard?

K1r4d4rk.v18 Oct 2024 21:29 UTC
0 points
3 comments1 min readLW link

[In­tu­itive self-mod­els] 4. Trance

Steven Byrnes8 Oct 2024 13:30 UTC
26 points
2 comments25 min readLW link

Schel­ling game eval­u­a­tions for AI control

Olli Järviniemi8 Oct 2024 12:01 UTC
44 points
0 comments11 min readLW link

Overview of strong hu­man in­tel­li­gence am­plifi­ca­tion methods

TsviBT8 Oct 2024 8:37 UTC
122 points
43 comments10 min readLW link

Si­mu­lat­ing near-death experiences

Declan Molony8 Oct 2024 6:34 UTC
4 points
1 comment3 min readLW link

The un­rea­son­able effec­tive­ness of plas­mid se­quenc­ing as a service

Abhishaike Mahajan8 Oct 2024 2:02 UTC
22 points
0 comments13 min readLW link
(www.owlposting.com)

There is a globe in your LLM

jacob_drori8 Oct 2024 0:43 UTC
44 points
0 comments1 min readLW link

MATS AI Safety Strat­egy Cur­ricu­lum v2

7 Oct 2024 22:44 UTC
38 points
4 comments13 min readLW link

2025 Color Trends

sarahconstantin7 Oct 2024 21:20 UTC
29 points
4 comments6 min readLW link
(sarahconstantin.substack.com)

Clar­ify­ing Align­ment Fun­da­men­tals Through the Lens of Ontology

eternal/ephemera7 Oct 2024 20:57 UTC
10 points
0 comments24 min readLW link

Ethics on Cos­mic Scale, Outer Space Treaty, Directed Pansper­mia, For­wards-Con­tam­i­na­tion, Tech­nol­ogy Assess­ment, Plane­tary Pro­tec­tion, and Fermi’s Paradox

MrFantastic7 Oct 2024 20:56 UTC
−5 points
0 comments1 min readLW link

Do­main-spe­cific SAEs

jacob_drori7 Oct 2024 20:15 UTC
22 points
0 comments5 min readLW link

Re­search up­date: Towards a Law of Iter­ated Ex­pec­ta­tions for Heuris­tic Estimators

Eric Neyman7 Oct 2024 19:29 UTC
75 points
2 comments22 min readLW link

AI Model Registries: A Foun­da­tional Tool for AI Governance

7 Oct 2024 19:27 UTC
20 points
1 comment4 min readLW link
(www.convergenceanalysis.org)

Eval­u­at­ing the truth of state­ments in a world of am­bigu­ous lan­guage.

Hastings7 Oct 2024 18:08 UTC
44 points
17 comments2 min readLW link

Ad­vice for journalists

Nathan Young7 Oct 2024 16:46 UTC
48 points
15 comments9 min readLW link
(nathanpmyoung.substack.com)

Time Effi­cient Re­sis­tance Training

romeostevensit7 Oct 2024 15:15 UTC
38 points
5 comments3 min readLW link

A Nar­row Path: a plan to deal with AI ex­tinc­tion risk

7 Oct 2024 13:02 UTC
73 points
8 comments2 min readLW link
(www.narrowpath.co)