All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 151617 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Thoughts on Hardware limits to Prevent AGI?

jrincayc15 Oct 2023 23:45 UTC

4 points

4 comments9 min readLW link

[Question] Training a RL Model with Continuous State & Action Space in a Real-World Scenario

Alexander Ries15 Oct 2023 22:59 UTC

0 points

0 comments1 min readLW link

On Frequentism and Bayesian Dogma

DanielFilan and Adrià Garriga-alonso

15 Oct 2023 22:23 UTC

59 points

27 comments6 min readLW link

More or Fewer Fights over Principles and Values?

Ben Pace and Vaniver

15 Oct 2023 21:35 UTC

24 points

10 comments14 min readLW link

Mapping ChatGPT’s ontological landscape, gradients and choices [interpretability]

Bill Benzon15 Oct 2023 20:12 UTC

1 point

0 comments18 min readLW link

Arguments for optimism on AI Alignment (I don’t endorse this version, will reupload a new version soon.)

Noosphere8915 Oct 2023 14:51 UTC

28 points

129 comments25 min readLW link

Discovering Latent Knowledge in the Human Brain: Part 1 – Clarifying the concepts of belief and knowledge

Joseph Emerson15 Oct 2023 9:02 UTC

5 points

0 comments12 min readLW link

[Question] Rationalist horror movies

Elizabeth15 Oct 2023 7:42 UTC

46 points

35 comments1 min readLW link

Unity Gridworlds

WillPetillo15 Oct 2023 4:36 UTC

9 points

0 comments1 min readLW link

In memory of Louise Glück

Joe Carlsmith15 Oct 2023 2:59 UTC

46 points

1 comment8 min readLW link

[Question] One-on-one tutoring for any subject

yakimoff14 Oct 2023 20:58 UTC

8 points

5 comments1 min readLW link

The Puritans would one-box: evidential decision theory in the 17th century

Jacob G-W14 Oct 2023 20:23 UTC

86 points

5 comments3 min readLW link

(jacobgw.com)

Natural Abstraction: Convergent Preferences Over Information Structures

paulom14 Oct 2023 18:34 UTC

28 points

1 comment36 min readLW link

ChatGPT tells 20 versions of its prototypical story, with a short note on method

Bill Benzon14 Oct 2023 15:27 UTC

7 points

0 comments5 min readLW link

Will no one rid me of this turbulent pest?

Metacelsus14 Oct 2023 15:27 UTC

154 points

23 comments10 min readLW link

(denovo.substack.com)

Which Anaesthetic To Choose?

dadadarren14 Oct 2023 14:55 UTC

10 points

15 comments1 min readLW link

Is the Wave non-disparagement thingy okay?

Ruby, Linch and Auckland

14 Oct 2023 5:31 UTC

29 points

13 comments11 min readLW link

The Gods of Straight Lines

Richard_Ngo14 Oct 2023 4:10 UTC

70 points

13 comments5 min readLW link

(www.narrativeark.xyz)

Eight Magic Lamps

Richard_Ngo14 Oct 2023 4:10 UTC

42 points

0 comments6 min readLW link

(www.narrativeark.xyz)

RSPs are pauses done right

evhub14 Oct 2023 4:06 UTC

166 points

79 comments7 min readLW link 1 review

Dishonorable Gossip and Going Crazy

Ben Pace and Unreal

14 Oct 2023 4:00 UTC

29 points

31 comments23 min readLW link

Disentangling Our Terminal and Instrumental Values

PeterMcCluskey14 Oct 2023 3:35 UTC

11 points

1 comment4 min readLW link

(bayesianinvestor.com)

Global Pause AI Protest 10/21

Holly_Elmore, Joseph Miller and joepio

14 Oct 2023 3:20 UTC

5 points

0 comments1 min readLW link

[Question] Literature On Existential Risk From Atmospheric Contamination?

Yitz13 Oct 2023 22:27 UTC

6 points

3 comments1 min readLW link

How to partition teams to move fast? Debating “low-dimensional cuts”

Bird Concept and kave

13 Oct 2023 21:43 UTC

41 points

2 comments11 min readLW link

Gothenburg LW / ACX meetup

Stefan13 Oct 2023 21:39 UTC

2 points

0 comments1 min readLW link

Meta-Regulations

Sable13 Oct 2023 21:23 UTC

18 points

5 comments10 min readLW link

(affablyevil.substack.com)

Hiring: Lighthaven Events & Venue Lead

Raemon13 Oct 2023 21:02 UTC

69 points

3 comments4 min readLW link

Prediction markets covered in the NYT podcast “Hard Fork”

Austin Chen13 Oct 2023 18:43 UTC

56 points

6 comments9 min readLW link

(www.nytimes.com)

[Paper] All’s Fair In Love And Love: Copy Suppression in GPT-2 Small

CallumMcDougall, Arthur Conmy, Tom McGrath and Neel Nanda

13 Oct 2023 18:32 UTC

82 points

4 comments8 min readLW link

FLI podcast series, “Imagine A World”, about aspirational futures with AGI

Jackson Wagner13 Oct 2023 16:07 UTC

9 points

0 comments4 min readLW link

To open-source or to not open-source, that is (an oversimplification of) the question.

Justin Bullock13 Oct 2023 15:10 UTC

12 points

5 comments5 min readLW link

Combination Lock Boxes

jefftk13 Oct 2023 12:50 UTC

17 points

9 comments1 min readLW link

(www.jefftk.com)

Circle of Support (Oct 14th @ 10am PST)

Alexei13 Oct 2023 9:24 UTC

19 points

1 comment1 min readLW link

[Question] How can the world handle the HAMAS situation?

Annapurna13 Oct 2023 9:15 UTC

5 points

43 comments1 min readLW link

UVic AI Ethics Conference

TristanTrim and Leo Mckee-Reid

13 Oct 2023 7:31 UTC

3 points

1 comment1 min readLW link

LW UI features you might not have tried

Elizabeth13 Oct 2023 3:04 UTC

49 points

6 comments1 min readLW link

Revisiting Guide Dogs and Blindness Prevention

jefftk13 Oct 2023 2:30 UTC

22 points

0 comments2 min readLW link

(www.jefftk.com)

Paper: Understanding and Controlling a Maze-Solving Policy Network

TurnTrout, Ulisse Mini, peligrietzer, mrinank_sharma, Austin Meek, Monte M and lisathiergart

13 Oct 2023 1:38 UTC

70 points

0 comments1 min readLW link

(arxiv.org)

OPTIC: Announcing Intercollegiate Forecasting Tournaments in SF, DC, Boston

Saul Munn, Jingyi Wang and toms

13 Oct 2023 1:36 UTC

6 points

0 comments1 min readLW link

Progress links digest, 2023-10-12: Dyson sphere thermodynamics and a cure for cavities

jasoncrawford13 Oct 2023 0:41 UTC

15 points

1 comment10 min readLW link

(rootsofprogress.org)

What do Marginal Grants at EAIF Look Like? Funding Priorities and Grantmaking Thresholds at the EA Infrastructure Fund

Linch12 Oct 2023 21:40 UTC

20 points

0 comments5 min readLW link

unRLHF—Efficiently undoing LLM safeguards

Pranav Gade, Jeffrey Ladish and Simon Lermen

12 Oct 2023 19:58 UTC

117 points

15 comments20 min readLW link

LoRA Fine-tuning Efficiently Undoes Safety Training from Llama 2-Chat 70B

Simon Lermen and Jeffrey Ladish

12 Oct 2023 19:58 UTC

151 points

29 comments14 min readLW link

[Question] Looking for reading recommendations: Theories of right/justice that safeguard against having one’s job automated?

bulKlub12 Oct 2023 19:40 UTC

−1 points

2 comments1 min readLW link

The International PauseAI Protest: Activism under uncertainty

Joseph Miller12 Oct 2023 17:36 UTC

37 points

1 comment4 min readLW link

AI #33: Cool New Interpretability Paper

Zvi12 Oct 2023 16:20 UTC

46 points

18 comments46 min readLW link

(thezvi.wordpress.com)

Noticing confusion in physics

Jacob G-W12 Oct 2023 15:21 UTC

20 points

27 comments2 min readLW link

(jacobgw.com)

[Question] How to make to-do lists (and to get things done)?

TeaTieAndHat12 Oct 2023 14:26 UTC

9 points

13 comments2 min readLW link

Relevance of ‘Harmful Intelligence’ Data in Training Datasets (WebText vs. Pile)

MiguelDev12 Oct 2023 12:08 UTC

12 points

0 comments9 min readLW link