RSS

Con­tra pa­pers claiming su­per­hu­man AI forecasting

12 Sep 2024 18:10 UTC
90 points
4 comments7 min readLW link

How to Give in to Threats (with­out in­cen­tiviz­ing them)

Mikhail Samin12 Sep 2024 15:55 UTC
27 points
12 comments5 min readLW link

Open Prob­lems in AIXI Agent Foundations

Cole Wyeth12 Sep 2024 15:38 UTC
26 points
0 comments10 min readLW link

Op­ti­mis­ing un­der ar­bi­trar­ily many con­straint equations

dkl912 Sep 2024 14:59 UTC
5 points
0 comments3 min readLW link
(dkl9.net)

[Question] When can I be nu­mer­ate?

FinalFormal212 Sep 2024 4:05 UTC
21 points
1 comment1 min readLW link

A Non­con­struc­tive Ex­is­tence Proof of Aligned Superintelligence

Roko12 Sep 2024 3:20 UTC
9 points
29 comments1 min readLW link
(transhumanaxiology.substack.com)

Col­laps­ing the Belief/​Knowl­edge Distinction

Jeremias11 Sep 2024 21:24 UTC
−7 points
2 comments1 min readLW link

[Paper] Pro­gram­ming Re­fusal with Con­di­tional Ac­ti­va­tion Steering

Bruce W. Lee11 Sep 2024 20:57 UTC
34 points
0 comments11 min readLW link
(arxiv.org)

Re­fac­tor­ing cry­on­ics as struc­tural brain preservation

Andy_McKenzie11 Sep 2024 18:36 UTC
75 points
14 comments3 min readLW link

[Question] Is this a Pivotal Weak Act? Creat­ing bac­te­ria that de­com­pose metal

doomyeser11 Sep 2024 18:07 UTC
8 points
4 comments3 min readLW link

How to dis­cover the na­ture of sen­tience, and ethics

Gustavo Ramires11 Sep 2024 17:22 UTC
−2 points
3 comments4 min readLW link

Seek­ing Mechanism De­signer for Re­search into In­ter­nal­iz­ing Catas­trophic Externalities

c.trout11 Sep 2024 15:09 UTC
23 points
2 comments3 min readLW link

Could Things Be Very Differ­ent?—How His­tor­i­cal In­er­tia Might Blind Us To Op­ti­mal Solutions

James Stephen Brown11 Sep 2024 9:53 UTC
6 points
0 comments8 min readLW link
(nonzerosum.games)

Re­for­ma­tive Hypocrisy, and Pay­ing Close Enough At­ten­tion to Selec­tively Re­ward It.

Andrew_Critch11 Sep 2024 4:41 UTC
47 points
6 comments3 min readLW link

A nec­es­sary Mem­brane for­mal­ism feature

ThomasCederborg10 Sep 2024 21:33 UTC
18 points
3 comments11 min readLW link

For­mal­iz­ing the In­for­mal (event in­vite)

abramdemski10 Sep 2024 19:22 UTC
41 points
0 comments1 min readLW link

The Best Lay Ar­gu­ment is not a Sim­ple English Yud Essay

J Bostock10 Sep 2024 17:34 UTC
179 points
4 comments5 min readLW link

What boot­straps in­tel­li­gence?

invertedpassion10 Sep 2024 7:11 UTC
2 points
2 comments1 min readLW link

Phys­i­cal Ther­apy Sucks (but have you tried hid­ing it in some peanut but­ter?)

Declan Molony10 Sep 2024 5:54 UTC
14 points
12 comments2 min readLW link

Si­mon DeDeo on Ex­plore vs Ex­ploit in Science

Elizabeth10 Sep 2024 3:40 UTC
20 points
0 comments1 min readLW link
(acesounderglass.com)