27 Apr 2023 20:32 UTC

192 points

59 comments 9 min read LW link 1 review

(bewelltuned.com)

The LW crossroads of purpose

Caerulea-Lawrence 27 Apr 2023 19:53 UTC

11 points

2 comments 2 min read LW link

Metaculus Event: Forecast Friday, April 28th at 12pm ET — Speed Forecasting Session!

ChristianWilliams 27 Apr 2023 19:50 UTC

0 points

0 comments 1 min read LW link

Infrafunctions Proofs

Diffractor 27 Apr 2023 19:25 UTC

12 points

1 comment 10 min read LW link

Infrafunctions and Robust Optimization

Diffractor 27 Apr 2023 19:25 UTC

61 points

11 comments 15 min read LW link

What are the limits of superintelligence?

rainy 27 Apr 2023 18:29 UTC

4 points

3 comments 5 min read LW link

A Proposal for AI Alignment: Using Directly Opposing Models

Arne B 27 Apr 2023 18:05 UTC

0 points

5 comments 3 min read LW link

My views on “doom”

paulfchristiano 27 Apr 2023 17:50 UTC

257 points

38 comments 2 min read LW link 1 review

(ai-alignment.com)

[untitled post]

NeuralSystem_e5e1 27 Apr 2023 17:37 UTC

3 points

0 comments 1 min read LW link

An International Manhattan Project for Artificial Intelligence

Glenn Clayton 27 Apr 2023 17:34 UTC

−9 points

2 comments 5 min read LW link

Quote quiz: “drifting into dependence”

jasoncrawford 27 Apr 2023 15:13 UTC

7 points

6 comments 1 min read LW link

(rootsofprogress.org)

Second-Level Empiricism: Reframing the Two-Child Puzzle

Richard Henage 27 Apr 2023 15:04 UTC

16 points

5 comments 3 min read LW link

Interview with Paul Christiano: How We Prevent the AI’s from Killing us

Dalmert 27 Apr 2023 14:39 UTC

12 points

0 comments 1 min read LW link

(www.youtube.com)

AI #9: The Merge and the Million Tokens

Zvi 27 Apr 2023 14:20 UTC

36 points

8 comments 53 min read LW link

(thezvi.wordpress.com)

AI doom from an LLM-plateau-ist perspective

Steven Byrnes 27 Apr 2023 13:58 UTC

166 points

24 comments 6 min read LW link

Romance, misunderstanding, social stances, and the human LLM

Kaj_Sotala 27 Apr 2023 12:59 UTC

80 points

32 comments 16 min read LW link

AI chatbots don’t know why they did it

skybrian 27 Apr 2023 6:57 UTC

18 points

11 comments 2 min read LW link

(skybrian.substack.com)

The Great Ideological Conflict: Intuitionists vs. Establishmentarians

Thoth Hermes 27 Apr 2023 1:49 UTC

3 points

0 comments 11 min read LW link

(thothhermes.substack.com)

Automating the Breath Pulse

jefftk 27 Apr 2023 0:10 UTC

11 points

0 comments 1 min read LW link

(www.jefftk.com)

Freedom Is All We Need

Leo Glisic 27 Apr 2023 0:09 UTC

−1 points

8 comments 10 min read LW link

Contra Yudkowsky on Doom from Foom #2

jacob_cannell 27 Apr 2023 0:07 UTC

95 points

76 comments 6 min read LW link

A very non-technical explanation of the basics of infra-Bayesianism

David Matolcsi 26 Apr 2023 22:57 UTC

73 points

14 comments 9 min read LW link

LM Situational Awareness, Evaluation Proposal: Violating Imitation

Jacob Pfau 26 Apr 2023 22:53 UTC

16 points

2 comments 2 min read LW link

Recent Database Migration—Report Bugs

RobertM 26 Apr 2023 22:19 UTC

38 points

2 comments 1 min read LW link

Infra-Bayesianism naturally leads to the monotonicity principle, and I think this is a problem

David Matolcsi 26 Apr 2023 21:39 UTC

22 points

6 comments 4 min read LW link

Understanding new terms via etymology

corruptedCatapillar 26 Apr 2023 20:48 UTC

3 points

1 comment 2 min read LW link

(forum.effectivealtruism.org)

Chad Jones paper modeling AI and x-risk vs. growth

jasoncrawford 26 Apr 2023 20:07 UTC

39 points

7 comments 2 min read LW link

(web.stanford.edu)

I was Wrong, Simulator Theory is Real

Robert_AIZI 26 Apr 2023 17:45 UTC

75 points

7 comments 3 min read LW link

(aizi.substack.com)

$250 prize for checking Jake Cannell’s Brain Efficiency

Alexander Gietelink Oldenziel 26 Apr 2023 16:21 UTC

123 points

170 comments 2 min read LW link

My version of Simulacra Levels

Daniel Kokotajlo 26 Apr 2023 15:50 UTC

42 points

15 comments 3 min read LW link

[Question] Is the fact that we don’t observe any obvious glitch evidence that we’re not in a simulation?

Jim Buhler 26 Apr 2023 14:57 UTC

8 points

16 comments 1 min read LW link

Transcript and Brief Response to Twitter Conversation between Yann LeCun and Eliezer Yudkowsky

Zvi 26 Apr 2023 13:10 UTC

190 points

51 comments 10 min read LW link

(thezvi.wordpress.com)

What comes after?

rogersbacon 26 Apr 2023 12:44 UTC

3 points

0 comments 2 min read LW link

(www.secretorum.life)

Accidental Terraforming

Sable 26 Apr 2023 6:49 UTC

9 points

16 comments 5 min read LW link

(affablyevil.substack.com)

Philosophy by Paul Graham Link

EniScien 26 Apr 2023 5:36 UTC

24 points

4 comments 1 min read LW link

Boxing at the gym

yakimoff 26 Apr 2023 5:10 UTC

1 point

0 comments 1 min read LW link

Sibelius + drinks

yakimoff 26 Apr 2023 5:08 UTC

1 point

0 comments 1 min read LW link

A simple presentation of AI risk arguments

Seth Herd 26 Apr 2023 2:19 UTC

19 points

0 comments 2 min read LW link

Archetypal Transfer Learning: a Proposed Alignment Solution that solves the Inner & Outer Alignment Problem while adding Corrigible Traits to GPT-2-medium

MiguelDev 26 Apr 2023 1:37 UTC

14 points

5 comments 10 min read LW link

[Question] How Many Bits Of Optimization Can One Bit Of Observation Unlock?

johnswentworth 26 Apr 2023 0:26 UTC

67 points

32 comments 3 min read LW link

Believe in Yourself and don’t stop Improving

Johannes C. Mayer 25 Apr 2023 22:34 UTC

0 points

0 comments 1 min read LW link

Should LW have an official list of norms?

Ruby 25 Apr 2023 21:20 UTC

58 points

31 comments 5 min read LW link

Implementing a Transformer from scratch in PyTorch—a write-up on my experience

Mislav Jurić 25 Apr 2023 20:51 UTC

20 points

0 comments 10 min read LW link

Exploring the Lottery Ticket Hypothesis

Rauno Arike 25 Apr 2023 20:06 UTC

60 points

3 comments 11 min read LW link

Genetic Sequencing of Wastewater: Prevalence to Relative Abundance

jefftk 25 Apr 2023 19:30 UTC

17 points

2 comments 2 min read LW link

(www.jefftk.com)

[Feedback please] New User’s Guide to LessWrong

Ruby 25 Apr 2023 18:54 UTC

38 points

18 comments 6 min read LW link

Reframing the burden of proof: Companies should prove that models are safe (rather than expecting auditors to prove that models are dangerous)

Orpheus16 25 Apr 2023 18:49 UTC

27 points

11 comments 3 min read LW link

(childrenoficarus.substack.com)

LLMs for online discussion moderation

Dave92F1 25 Apr 2023 16:53 UTC

12 points

3 comments 3 min read LW link

AI Safety Newsletter #3: AI policy proposals and a new challenger approaches

ozhang 25 Apr 2023 16:15 UTC

33 points

0 comments 4 min read LW link

(newsletter.safe.ai)

EA might systematically generate a scarcity mindset that produces low-integrity actors

Severin T. Seehrich 25 Apr 2023 15:50 UTC

26 points

2 comments 4 min read LW link