So8res (Nate Soares)

Karma: 13,792

A mind needn’t be curious to reap the benefits of curiosity

So8res · 2 Jun 2023 18:00 UTC · 78 points · 14 comments · 1 min read · LW link

Cosmopolitan values don’t come free

So8res · 31 May 2023 15:58 UTC · 127 points · 81 comments · 1 min read · LW link

Sentience matters

So8res · 29 May 2023 21:25 UTC · 129 points · 93 comments · 2 min read · LW link

Request: stop advancing AI capabilities

So8res · 26 May 2023 17:42 UTC · 150 points · 23 comments · 1 min read · LW link

Would we even want AI to solve all our problems?

So8res · 21 Apr 2023 18:04 UTC · 96 points · 15 comments · 2 min read · LW link

How could you possibly choose what an AI wants?

So8res · 19 Apr 2023 17:08 UTC · 104 points · 19 comments · 1 min read · LW link

But why would the AI kill us?

So8res · 17 Apr 2023 18:42 UTC · 112 points · 86 comments · 2 min read · LW link

Misgeneralization as a misnomer

So8res · 6 Apr 2023 20:43 UTC · 126 points · 21 comments · 4 min read · LW link

If interpretability research goes well, it may get dangerous

So8res · 3 Apr 2023 21:48 UTC · 197 points · 10 comments · 2 min read · LW link

Hooray for stepping out of the limelight

So8res · 1 Apr 2023 2:45 UTC · 278 points · 23 comments · 1 min read · LW link

A rough and incomplete review of some of John Wentworth’s research

So8res · 28 Mar 2023 18:52 UTC · 168 points · 16 comments · 18 min read · LW link

A stylized dialogue on John Wentworth’s claims about markets and optimization

So8res · 25 Mar 2023 22:32 UTC · 156 points · 21 comments · 8 min read · LW link

Truth and Advantage: Response to a draft of “AI safety seems hard to measure”

So8res · 22 Mar 2023 3:36 UTC · 98 points · 9 comments · 5 min read · LW link

Deep Deceptiveness

So8res · 21 Mar 2023 2:51 UTC · 215 points · 56 comments · 14 min read · LW link

Comments on OpenAI’s “Planning for AGI and beyond”

So8res · 3 Mar 2023 23:01 UTC · 148 points · 2 comments · 14 min read · LW link

Enemies vs Malefactors

So8res · 28 Feb 2023 23:38 UTC · 200 points · 61 comments · 1 min read · LW link

AI alignment researchers don’t (seem to) stack

So8res · 21 Feb 2023 0:48 UTC · 182 points · 38 comments · 3 min read · LW link

Hashing out long-standing disagreements seems low-value to me

So8res · 16 Feb 2023 6:20 UTC · 126 points · 33 comments · 4 min read · LW link

Focus on the places where you feel shocked everyone’s dropping the ball

So8res · 2 Feb 2023 0:27 UTC · 385 points · 59 comments · 4 min read · LW link

What I mean by “alignment is in large part about making cognition aimable at all”

So8res · 30 Jan 2023 15:22 UTC · 151 points · 24 comments · 2 min read · LW link