“Rudeness”, a useful coordination mechanic

Raemon · 11 Nov 2022 22:27 UTC
51 points
20 comments · 2 min read · LW link

Internalizing the damage of bad-acting partners creates incentives for due diligence

tailcalled · 11 Nov 2022 20:57 UTC
17 points
7 comments · 1 min read · LW link

Speculation on Current Opportunities for Unusually High Impact in Global Health

johnswentworth · 11 Nov 2022 20:47 UTC
114 points
31 comments · 4 min read · LW link

[Question] Is acausal extortion possible?

sisyphus · 11 Nov 2022 19:48 UTC
−20 points
35 comments · 3 min read · LW link

Catharsis in Bb

jefftk · 11 Nov 2022 17:40 UTC
6 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Instrumental convergence is what makes general intelligence possible

tailcalled · 11 Nov 2022 16:38 UTC
105 points
11 comments · 4 min read · LW link

Weekly Roundup #5

Zvi · 11 Nov 2022 16:20 UTC
33 points
0 comments · 6 min read · LW link
(thezvi.wordpress.com)

Charging for the Dharma

jchan · 11 Nov 2022 14:02 UTC
32 points
18 comments · 5 min read · LW link

[Question] EA (& AI Safety) has overestimated its projected funding — which decisions must be revised?

Cleo Nardo · 11 Nov 2022 13:50 UTC
22 points
7 comments · 1 min read · LW link
(forum.effectivealtruism.org)

Where the logical fallacy is not (Generalization From Fictional Evidence)

banev · 11 Nov 2022 10:41 UTC
−12 points
14 comments · 1 min read · LW link

Why I’m Working On Model Agnostic Interpretability

Jessica Rumbelow · 11 Nov 2022 9:24 UTC
27 points
9 comments · 2 min read · LW link

How likely are malign priors over objectives? [aborted WIP]

David Johnston · 11 Nov 2022 5:36 UTC
−1 points
0 comments · 8 min read · LW link

Do Timeless Decision Theorists reject all blackmail from other Timeless Decision Theorists?

myren · 11 Nov 2022 0:38 UTC
7 points
8 comments · 3 min read · LW link

We must be very clear: fraud in the service of effective altruism is unacceptable

evhub · 10 Nov 2022 23:31 UTC
42 points
56 comments · 3 min read · LW link

[simulation] 4chan user claiming to be the attorney hired by Google’s sentient chatbot LaMDA shares wild details of encounter

janus · 10 Nov 2022 21:39 UTC
19 points
1 comment · 13 min read · LW link
(generative.ink)

divine carrot

Alok Singh · 10 Nov 2022 20:50 UTC
18 points
2 comments · 1 min read · LW link
(alok.github.io)

Metaculus Announces The Million Predictions Hackathon

ChristianWilliams · 10 Nov 2022 20:00 UTC
7 points
0 comments · 1 min read · LW link
(metaculus.medium.com)

The harnessing of complexity

geduardo · 10 Nov 2022 18:44 UTC
6 points
2 comments · 3 min read · LW link

[Question] Is there a demo of “You can’t fetch the coffee if you’re dead”?

Ram Rachum · 10 Nov 2022 18:41 UTC
8 points
9 comments · 1 min read · LW link

Mastodon Linking Norms

jefftk · 10 Nov 2022 15:10 UTC
9 points
8 comments · 2 min read · LW link
(www.jefftk.com)

Covid 11/10/22: Into the Background

Zvi · 10 Nov 2022 13:40 UTC
31 points
5 comments · 4 min read · LW link
(thezvi.wordpress.com)

LessWrong Poll on AGI

Niclas Kupper · 10 Nov 2022 13:13 UTC
12 points
6 comments · 1 min read · LW link

The optimal angle for a solar boiler is different than for a solar panel

Yair Halberstadt · 10 Nov 2022 10:32 UTC
42 points
4 comments · 2 min read · LW link

What it’s like to dissect a cadaver

Alok Singh · 10 Nov 2022 6:40 UTC
208 points
24 comments · 5 min read · LW link
(alok.github.io)

I Converted Book I of The Sequences Into A Zoomer-Readable Format

dkirmani · 10 Nov 2022 2:59 UTC
200 points
32 comments · 2 min read · LW link

Adversarial Priors: Not Paying People to Lie to You

eva_ · 10 Nov 2022 2:29 UTC
22 points
9 comments · 3 min read · LW link

Is full self-driving an AGI-complete problem?

kraemahz · 10 Nov 2022 2:04 UTC
10 points
5 comments · 1 min read · LW link

[Question] What are examples of problems that were caused by intelligence, that couldn’t be solved with intelligence?

Peter O'Malley · 10 Nov 2022 2:04 UTC
1 point
2 comments · 1 min read · LW link

Desiderata for an Adversarial Prior

Shmi · 9 Nov 2022 23:45 UTC
13 points
2 comments · 1 min read · LW link

Chord Notation

jefftk · 9 Nov 2022 21:30 UTC
12 points
5 comments · 1 min read · LW link
(www.jefftk.com)

[ASoT] Instrumental convergence is useful

Ulisse Mini · 9 Nov 2022 20:20 UTC
5 points
9 comments · 1 min read · LW link

Mesatranslation and Metatranslation

jdp · 9 Nov 2022 18:46 UTC
25 points
4 comments · 11 min read · LW link

Trying to Make a Treacherous Mesa-Optimizer

MadHatter · 9 Nov 2022 18:07 UTC
95 points
14 comments · 4 min read · LW link
(attentionspan.blog)

A caveat to the Orthogonality Thesis

Wuschel Schulz · 9 Nov 2022 15:06 UTC
38 points
10 comments · 2 min read · LW link

Wednesday South Bay Meetups, November 16

Leonard Zabarsky · 9 Nov 2022 2:21 UTC
1 point
0 comments · 1 min read · LW link

FTX Crisis. What we know and some forecasts on what will happen next

Nathan Young · 9 Nov 2022 2:14 UTC
60 points
21 comments · 3 min read · LW link

A first success story for Outer Alignment: InstructGPT

Noosphere89 · 8 Nov 2022 22:52 UTC
6 points
1 comment · 1 min read · LW link
(openai.com)

Trying Mastodon

jefftk · 8 Nov 2022 19:10 UTC
12 points
4 comments · 1 min read · LW link
(www.jefftk.com)

Inverse scaling can become U-shaped

Edouard Harris · 8 Nov 2022 19:04 UTC
27 points
15 comments · 1 min read · LW link
(arxiv.org)

People care about each other even though they have imperfect motivational pointers?

TurnTrout · 8 Nov 2022 18:15 UTC
33 points
25 comments · 7 min read · LW link

Applying superintelligence without collusion

Eric Drexler · 8 Nov 2022 18:08 UTC
109 points
63 comments · 4 min read · LW link

[Question] Binance is buying FTX.com: How did it happen and what are the implications?

Caerulean · 8 Nov 2022 17:14 UTC
16 points
6 comments · 1 min read · LW link

Some advice on independent research

Marius Hobbhahn · 8 Nov 2022 14:46 UTC
56 points
5 comments · 10 min read · LW link

Mysteries of mode collapse

janus · 8 Nov 2022 10:37 UTC
284 points
57 comments · 14 min read · LW link · 1 review

[ASoT] Thoughts on GPT-N

Ulisse Mini · 8 Nov 2022 7:14 UTC
8 points
0 comments · 1 min read · LW link

Michael Simm—Introducing Myself

Michael Simm · 8 Nov 2022 5:45 UTC
4 points
0 comments · 2 min read · LW link

EA & LW Forums Weekly Summary (31st Oct − 6th Nov 22′)

Zoe Williams · 8 Nov 2022 3:58 UTC
12 points
1 comment · 18 min read · LW link

[Question] Value of Querying 100+ People About Humanity’s Future

T431 · 8 Nov 2022 0:41 UTC
9 points
3 comments · 2 min read · LW link

How could we know that an AGI system will have good consequences?

So8res · 7 Nov 2022 22:42 UTC
111 points
25 comments · 5 min read · LW link

A Walkthrough of Interpretability in the Wild (w/ authors Kevin Wang, Arthur Conmy & Alexandre Variengien)

Neel Nanda · 7 Nov 2022 22:39 UTC
30 points
15 comments · 3 min read · LW link
(youtu.be)