16 Oct 2023 22:40 UTC

83 points

8 comments · 10 min read · LW link

ACX Mérida Meetup

Silvia Fernández

16 Oct 2023 19:39 UTC

1 point

0 comments · 1 min read · LW link

An EPUB of Arbital’s AI Alignment section

mesaoptimizer

16 Oct 2023 19:36 UTC

53 points

1 comment · 1 min read · LW link

(git.sr.ht)

How should TurnTrout handle his DeepMind equity situation?

habryka and TurnTrout

16 Oct 2023 18:25 UTC

63 points

38 comments · 6 min read · LW link · 1 review

Pascal’s Mugging: The Word Wars

johncrox

16 Oct 2023 17:54 UTC

9 points

1 comment · 6 min read · LW link

Massapequa (Long Island), NY, USA ACX December Meetup

Gabriel Weil

16 Oct 2023 17:37 UTC

2 points

1 comment · 1 min read · LW link

The price is right

Elliott Thornley

16 Oct 2023 16:34 UTC

42 points

3 comments · 4 min read · LW link

(openairopensea.substack.com)

[Question] Dating in 2023 sucks. Why isn’t AI helping?

Andreas Chrysopoulos

16 Oct 2023 12:31 UTC

5 points

24 comments · 1 min read · LW link

Knowledge Base 4: General applications

iwis

16 Oct 2023 12:26 UTC

3 points

0 comments · 1 min read · LW link

UNGA General Debate speeches on AI

Odd anon

16 Oct 2023 6:36 UTC

6 points

0 comments · 21 min read · LW link

AI Alignment [Incremental Progress Units] this week (10/08/23)

Logan Zoellner

16 Oct 2023 1:46 UTC

14 points

5 comments · 4 min read · LW link

(midwitalignment.substack.com)

[Question] Does a broad overview of Mechanistic Interpretability exist?

kourabi

16 Oct 2023 1:16 UTC

1 point

0 comments · 1 min read · LW link

Goodhart’s Law in Reinforcement Learning

jacek, Joar Skalse, OliverHH, Charlie Griffin and Xingjian Bai

16 Oct 2023 0:54 UTC

126 points

22 comments · 7 min read · LW link

My AI Predictions 2023 − 2026

HunterJay

16 Oct 2023 0:50 UTC

62 points

34 comments · 5 min read · LW link

Taxonomy of AI-risk counterarguments

Odd anon

16 Oct 2023 0:12 UTC

66 points

12 comments · 8 min read · LW link

Thoughts on Hardware limits to Prevent AGI?

jrincayc

15 Oct 2023 23:45 UTC

4 points

4 comments · 9 min read · LW link

[Question] Training a RL Model with Continuous State & Action Space in a Real-World Scenario

Alexander Ries

15 Oct 2023 22:59 UTC

0 points

0 comments · 1 min read · LW link

On Frequentism and Bayesian Dogma

DanielFilan and Adrià Garriga-alonso

15 Oct 2023 22:23 UTC

59 points

27 comments · 6 min read · LW link

More or Fewer Fights over Principles and Values?

Ben Pace and Vaniver

15 Oct 2023 21:35 UTC

24 points

10 comments · 14 min read · LW link

Mapping ChatGPT’s ontological landscape, gradients and choices [interpretability]

Bill Benzon

15 Oct 2023 20:12 UTC

1 point

0 comments · 18 min read · LW link

Arguments for optimism on AI Alignment (I don’t endorse this version, will reupload a new version soon.)

Noosphere89

15 Oct 2023 14:51 UTC

28 points

129 comments · 25 min read · LW link

Discovering Latent Knowledge in the Human Brain: Part 1 – Clarifying the concepts of belief and knowledge

Joseph Emerson

15 Oct 2023 9:02 UTC

5 points

0 comments · 12 min read · LW link

[Question] Rationalist horror movies

Elizabeth

15 Oct 2023 7:42 UTC

46 points

35 comments · 1 min read · LW link

Unity Gridworlds

WillPetillo

15 Oct 2023 4:36 UTC

9 points

0 comments · 1 min read · LW link

In memory of Louise Glück

Joe Carlsmith

15 Oct 2023 2:59 UTC

46 points

1 comment · 8 min read · LW link

[Question] One-on-one tutoring for any subject

yakimoff

14 Oct 2023 20:58 UTC

8 points

5 comments · 1 min read · LW link

The Puritans would one-box: evidential decision theory in the 17th century

Jacob G-W

14 Oct 2023 20:23 UTC

86 points

5 comments · 3 min read · LW link

(jacobgw.com)

Natural Abstraction: Convergent Preferences Over Information Structures

paulom

14 Oct 2023 18:34 UTC

28 points

1 comment · 36 min read · LW link

ChatGPT tells 20 versions of its prototypical story, with a short note on method

Bill Benzon

14 Oct 2023 15:27 UTC

7 points

0 comments · 5 min read · LW link

Will no one rid me of this turbulent pest?

Metacelsus

14 Oct 2023 15:27 UTC

154 points

23 comments · 10 min read · LW link

(denovo.substack.com)

Which Anaesthetic To Choose?

dadadarren

14 Oct 2023 14:55 UTC

10 points

15 comments · 1 min read · LW link

Is the Wave non-disparagement thingy okay?

Ruby, Linch and Auckland

14 Oct 2023 5:31 UTC

29 points

13 comments · 11 min read · LW link

The Gods of Straight Lines

Richard_Ngo

14 Oct 2023 4:10 UTC

70 points

13 comments · 5 min read · LW link

(www.narrativeark.xyz)

Eight Magic Lamps

Richard_Ngo

14 Oct 2023 4:10 UTC

42 points

0 comments · 6 min read · LW link

(www.narrativeark.xyz)

RSPs are pauses done right

evhub

14 Oct 2023 4:06 UTC

166 points

79 comments · 7 min read · LW link · 1 review

Dishonorable Gossip and Going Crazy

Ben Pace and Unreal

14 Oct 2023 4:00 UTC

29 points

31 comments · 23 min read · LW link

Disentangling Our Terminal and Instrumental Values

PeterMcCluskey

14 Oct 2023 3:35 UTC

11 points

1 comment · 4 min read · LW link

(bayesianinvestor.com)

Global Pause AI Protest 10/21

Holly_Elmore, Joseph Miller and joepio

14 Oct 2023 3:20 UTC

5 points

0 comments · 1 min read · LW link

[Question] Literature On Existential Risk From Atmospheric Contamination?

Yitz

13 Oct 2023 22:27 UTC

6 points

3 comments · 1 min read · LW link

How to partition teams to move fast? Debating “low-dimensional cuts”

Bird Concept and kave

13 Oct 2023 21:43 UTC

41 points

2 comments · 11 min read · LW link

Gothenburg LW / ACX meetup

Stefan

13 Oct 2023 21:39 UTC

2 points

0 comments · 1 min read · LW link

Meta-Regulations

Sable

13 Oct 2023 21:23 UTC

18 points

5 comments · 10 min read · LW link

(affablyevil.substack.com)

Hiring: Lighthaven Events & Venue Lead

Raemon

13 Oct 2023 21:02 UTC

69 points

3 comments · 4 min read · LW link

Prediction markets covered in the NYT podcast “Hard Fork”

Austin Chen

13 Oct 2023 18:43 UTC

56 points

6 comments · 9 min read · LW link

(www.nytimes.com)

[Paper] All’s Fair In Love And Love: Copy Suppression in GPT-2 Small

CallumMcDougall, Arthur Conmy, Tom McGrath and Neel Nanda

13 Oct 2023 18:32 UTC

82 points

4 comments · 8 min read · LW link

FLI podcast series, “Imagine A World”, about aspirational futures with AGI

Jackson Wagner

13 Oct 2023 16:07 UTC

9 points

0 comments · 4 min read · LW link

To open-source or to not open-source, that is (an oversimplification of) the question.

Justin Bullock

13 Oct 2023 15:10 UTC

12 points

5 comments · 5 min read · LW link

Combination Lock Boxes

jefftk

13 Oct 2023 12:50 UTC

17 points

9 comments · 1 min read · LW link

(www.jefftk.com)

Circle of Support (Oct 14th @ 10am PST)

Alexei

13 Oct 2023 9:24 UTC

19 points

1 comment · 1 min read · LW link

[Question] How can the world handle the HAMAS situation?

Annapurna

13 Oct 2023 9:15 UTC

5 points

43 comments · 1 min read · LW link