28 points

3 comments1 min readLW link

[Question] Could a Supreme Court suit work to solve NEPA problems?

ChristianKl3 Nov 2022 21:10 UTC

15 points

0 comments1 min readLW link

[Video] How having Fast Fourier Transforms sooner could have helped with Nuclear Disarmament—Veritasium

mako yass3 Nov 2022 21:04 UTC

17 points

1 comment1 min readLW link

(www.youtube.com)

Further considerations on the Evidentialist’s Wager

Martín Soto3 Nov 2022 20:06 UTC

3 points

9 comments8 min readLW link

AI as a Civilizational Risk Part 6/6: What can be done

PashaKamyshev3 Nov 2022 19:48 UTC

2 points

4 comments4 min readLW link

A Mystery About High Dimensional Concept Encoding

Fabien Roger3 Nov 2022 17:05 UTC

46 points

13 comments7 min readLW link

Why do we post our AI safety plans on the Internet?

Peter S. Park3 Nov 2022 16:02 UTC

4 points

4 comments11 min readLW link

Multiple Deploy-Key Repos

jefftk3 Nov 2022 15:10 UTC

15 points

0 comments1 min readLW link

(www.jefftk.com)

Covid 11/3/22: Asking Forgiveness

Zvi3 Nov 2022 13:50 UTC

23 points

3 comments6 min readLW link

(thezvi.wordpress.com)

Adversarial Policies Beat Professional-Level Go AIs

sanxiyn3 Nov 2022 13:27 UTC

31 points

35 comments1 min readLW link

(goattack.alignmentfund.org)

K-types vs T-types — what priors do you have?

Cleo Nardo3 Nov 2022 11:29 UTC

74 points

25 comments7 min readLW link

Information Markets 2: Optimally Shaped Reward Bets

eva_3 Nov 2022 11:08 UTC

9 points

0 comments3 min readLW link

The Rational Utilitarian Love Movement (A Historical Retrospective)

Caleb Biddulph3 Nov 2022 7:11 UTC

3 points

0 comments1 min readLW link

(ratutilove.substack.com)

The Mirror Chamber: A short story exploring the anthropic measure function and why it can matter

mako yass3 Nov 2022 6:47 UTC

30 points

13 comments10 min readLW link

Open Letter Against Reckless Nuclear Escalation and Use

Max Tegmark3 Nov 2022 5:34 UTC

27 points

25 comments1 min readLW link

Lazy Python Argument Parsing

jefftk3 Nov 2022 2:20 UTC

20 points

3 comments1 min readLW link

(www.jefftk.com)

AI as a Civilizational Risk Part 5/6: Relationship between C-risk and X-risk

PashaKamyshev3 Nov 2022 2:19 UTC

2 points

0 comments7 min readLW link

[Question] Is there a good way to award a fixed prize in a prediction contest?

jchan2 Nov 2022 21:37 UTC

18 points

5 comments1 min readLW link

“Are Experiments Possible?” Seeds of Science call for reviewers

rogersbacon2 Nov 2022 20:05 UTC

8 points

0 comments1 min readLW link

Humans do acausal coordination all the time

Adam Jermyn2 Nov 2022 14:40 UTC

57 points

35 comments3 min readLW link

Far-UVC Light Update: No, LEDs are not around the corner (tweetstorm)

Davidmanheim2 Nov 2022 12:57 UTC

74 points

29 comments4 min readLW link

(twitter.com)

Housing and Transit Thoughts #1

Zvi2 Nov 2022 12:10 UTC

35 points

5 comments16 min readLW link

(thezvi.wordpress.com)

Mind is uncountable

Filip Sondej2 Nov 2022 11:51 UTC

18 points

22 comments3 min readLW link

AI Safety Needs Great Product Builders

goodgravy2 Nov 2022 11:33 UTC

14 points

2 comments6 min readLW link

Why is fiber good for you?

braces2 Nov 2022 2:04 UTC

18 points

2 comments2 min readLW link

Information Markets

eva_2 Nov 2022 1:24 UTC

46 points

6 comments12 min readLW link

Sequence Reread: Fake Beliefs [plus sequence spotlight meta]

Raemon2 Nov 2022 0:09 UTC

27 points

3 comments1 min readLW link

Real-Time Research Recording: Can a Transformer Re-Derive Positional Info?

Neel Nanda1 Nov 2022 23:56 UTC

69 points

16 comments1 min readLW link

(youtu.be)

All AGI Safety questions welcome (especially basic ones) [~monthly thread]

Robert Miles1 Nov 2022 23:23 UTC

68 points

106 comments2 min readLW link

[Question] Which Issues in Conceptual Alignment have been Formalised or Observed (or not)?

ojorgensen1 Nov 2022 22:32 UTC

4 points

0 comments1 min readLW link

AI as a Civilizational Risk Part 4/6: Bioweapons and Philosophy of Modification

PashaKamyshev1 Nov 2022 20:50 UTC

7 points

1 comment8 min readLW link

Open & Welcome Thread—November 2022

MondSemmel1 Nov 2022 18:47 UTC

14 points

46 comments1 min readLW link

Mildly Against Donor Lotteries

jefftk1 Nov 2022 18:10 UTC

10 points

9 comments3 min readLW link

(www.jefftk.com)

Progress links and tweets, 2022-11-01

jasoncrawford1 Nov 2022 17:48 UTC

16 points

4 comments3 min readLW link

(rootsofprogress.org)

On the correspondence between AI-misalignment and cognitive dissonance using a behavioral economics model

Stijn Bruers1 Nov 2022 17:39 UTC

4 points

0 comments6 min readLW link

Threat Model Literature Review

zac_kenton, Rohin Shah, David Lindner, Vikrant Varma, Vika, Mary Phuong, Ramana Kumar and Elliot Catt

1 Nov 2022 11:03 UTC

79 points

4 comments25 min readLW link

Clarifying AI X-risk

zac_kenton, Rohin Shah, David Lindner, Vikrant Varma, Vika, Mary Phuong, Ramana Kumar and Elliot Catt

1 Nov 2022 11:03 UTC

127 points

24 comments4 min readLW link 1 review

Auditing games for high-level interpretability

Paul Colognese1 Nov 2022 10:44 UTC

33 points

1 comment7 min readLW link

Remember to translate your thoughts back again

brook1 Nov 2022 8:49 UTC

25 points

11 comments3 min readLW link

(forum.effectivealtruism.org)

Conversations on Alcohol Consumption

Annapurna1 Nov 2022 5:09 UTC

20 points

6 comments9 min readLW link

ML Safety Scholars Summer 2022 Retrospective

TW1231 Nov 2022 3:09 UTC

29 points

0 comments21 min readLW link

EA & LW Forums Weekly Summary (24 − 30th Oct 22′)

Zoe Williams1 Nov 2022 2:58 UTC

13 points

1 comment14 min readLW link

Caution when interpreting Deepmind’s In-context RL paper

Sam Marks1 Nov 2022 2:42 UTC

106 points

8 comments4 min readLW link

What sorts of systems can be deceptive?

Andrei Alexandru31 Oct 2022 22:00 UTC

17 points

0 comments7 min readLW link

“Cars and Elephants”: a handwavy argument/analogy against mechanistic interpretability

David Scott Krueger (formerly: capybaralet)31 Oct 2022 21:26 UTC

51 points

25 comments2 min readLW link

Superintelligent AI is necessary for an amazing future, but far from sufficient

So8res31 Oct 2022 21:16 UTC

134 points

48 comments34 min readLW link

Sanity-checking in an age of hyperbole

Ciprian Elliu Ivanof31 Oct 2022 20:04 UTC

2 points

4 comments2 min readLW link

Why Aren’t There More Schelling Holidays?

johnswentworth31 Oct 2022 19:31 UTC

63 points

21 comments1 min readLW link

The circular problem of epistemic irresponsibility

Roman Leventov31 Oct 2022 17:23 UTC

5 points

2 comments8 min readLW link

AI as a Civilizational Risk Part 3/6: Anti-economy and Signal Pollution

PashaKamyshev31 Oct 2022 17:03 UTC

7 points

4 comments14 min readLW link