All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025

All Jan Feb Mar Apr May Jun Jul Aug SepOctNov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 222324 25 26 27 28 29 30 31

Telic intuitions across the sciences

mrcbarbier22 Oct 2022 21:31 UTC

4 points

0 comments17 min readLW link

A basic lexicon of telic concepts

mrcbarbier22 Oct 2022 21:28 UTC

2 points

0 comments3 min readLW link

Do we have the right kind of math for roles, goals and meaning?

mrcbarbier22 Oct 2022 21:28 UTC

13 points

5 comments7 min readLW link

[Question] The Last Year - is there an existing novel about the last year before AI doom?

Luca Petrolati22 Oct 2022 20:44 UTC

4 points

4 comments1 min readLW link

The highest-probability outcome can be out of distribution

tailcalled22 Oct 2022 20:00 UTC

14 points

5 comments1 min readLW link

Newsletter for Alignment Research: The ML Safety Updates

Esben Kran22 Oct 2022 16:17 UTC

26 points

0 comments7 min readLW link

Crypto loves impact markets: Notes from Schelling Point Bogotá

Rachel Shu22 Oct 2022 15:58 UTC

17 points

2 comments7 min readLW link

[Question] When trying to define general intelligence is ability to achieve goals the best metric?

jmh22 Oct 2022 3:09 UTC

5 points

0 comments1 min readLW link

[Question] Simple question about corrigibility and values in AI.

jmh22 Oct 2022 2:59 UTC

6 points

1 comment1 min readLW link

Moorean Statements

David Udell22 Oct 2022 0:50 UTC

11 points

11 comments1 min readLW link

Wisdom Cannot Be Unzipped

Sable22 Oct 2022 0:28 UTC

75 points

17 comments7 min readLW link 1 review

(affablyevil.substack.com)

A framework and open questions for game theoretic shard modeling

Garrett Baker21 Oct 2022 21:40 UTC

11 points

4 comments4 min readLW link

Cooperators are more powerful than agents

Ivan Vendrov21 Oct 2022 20:02 UTC

29 points

7 comments3 min readLW link

Intelligent behaviour across systems, scales and substrates

Nora_Ammann21 Oct 2022 17:09 UTC

11 points

0 comments10 min readLW link

Deepfake(?) Phishing

jefftk21 Oct 2022 14:30 UTC

37 points

9 comments1 min readLW link

(www.jefftk.com)

acronyms ftw

Emrik21 Oct 2022 13:36 UTC

−2 points

5 comments2 min readLW link

Crossword puzzle: LessWrong Halloween 2022

jchan21 Oct 2022 12:41 UTC

11 points

11 comments1 min readLW link

Weekly Roundup #2

Zvi21 Oct 2022 12:10 UTC

37 points

2 comments11 min readLW link

(thezvi.wordpress.com)

Improved Security to Prevent Hacker-AI and Digital Ghosts

Erland Wittkotter21 Oct 2022 10:11 UTC

4 points

3 comments12 min readLW link

Two Guts

chanamessinger21 Oct 2022 10:01 UTC

21 points

0 comments2 min readLW link

(chanamessinger.com)

The importance of studying subjective experience

Q Home21 Oct 2022 8:43 UTC

10 points

3 comments7 min readLW link

Legal Brief: Plurality Voting is Unconstitutional

c.trout21 Oct 2022 4:55 UTC

6 points

20 comments11 min readLW link

(medium.com)

Learning societal values from law as part of an AGI alignment strategy

John Nay21 Oct 2022 2:03 UTC

5 points

18 comments54 min readLW link

Covid 10/20/22: Wait, We Did WHAT?

Zvi20 Oct 2022 21:50 UTC

55 points

16 comments16 min readLW link

(thezvi.wordpress.com)

When apparently positive evidence can be negative evidence

cata20 Oct 2022 21:47 UTC

32 points

5 comments1 min readLW link

(www.ncbi.nlm.nih.gov)

Plans Are Predictions, Not Optimization Targets

johnswentworth20 Oct 2022 21:17 UTC

110 points

20 comments4 min readLW link 1 review

Introduction to abstract entropy

Alex_Altair20 Oct 2022 21:03 UTC

246 points

78 comments18 min readLW link 1 review

Trajectories to 2036

ukc1001420 Oct 2022 20:23 UTC

3 points

1 comment14 min readLW link

[Question] Rough Sketch for Product to Enhance Citizen Participation in Politics

T43120 Oct 2022 20:04 UTC

13 points

5 comments1 min readLW link

The heritability of human values: A behavior genetic critique of Shard Theory

geoffreymiller20 Oct 2022 15:51 UTC

82 points

63 comments21 min readLW link

A Conflict Between Longtermism and Veganism, Pick One.

Connor Tabarrok20 Oct 2022 14:30 UTC

−3 points

3 comments5 min readLW link

(alltrades.substack.com)

AI Research Program Prediction Markets

tailcalled20 Oct 2022 13:42 UTC

38 points

10 comments1 min readLW link

[Question] Is the meaning of words chosen/interpreted to maximize correlations with other relevant queries?

tailcalled20 Oct 2022 10:03 UTC

9 points

9 comments1 min readLW link

How to Write Readable Posts

David Hartsough20 Oct 2022 7:48 UTC

7 points

0 comments7 min readLW link

(davidhartsough.com)

Notes on “Can you control the past”

So8res20 Oct 2022 3:41 UTC

64 points

41 comments21 min readLW link

Rhythmic Baby Toys

jefftk20 Oct 2022 1:50 UTC

15 points

1 comment1 min readLW link

(www.jefftk.com)

[Question] What Does AI Alignment Success Look Like?

Shmi20 Oct 2022 0:32 UTC

23 points

7 comments1 min readLW link

Scaling Laws for Reward Model Overoptimization

leogao, John Schulman and Jacob_Hilton

20 Oct 2022 0:20 UTC

103 points

13 comments1 min readLW link

(arxiv.org)

What is Consciousness?

belkarx19 Oct 2022 21:14 UTC

3 points

2 comments2 min readLW link

What to do if a nuclear weapon is used in Ukraine?

Valentin202619 Oct 2022 18:43 UTC

13 points

9 comments3 min readLW link

[Question] If I asked for an explanation of a perfect Utopia, could you give one?

Akkira19 Oct 2022 17:56 UTC

−4 points

2 comments1 min readLW link

[Question] Should we push for requiring AI training data to be licensed?

ChristianKl19 Oct 2022 17:49 UTC

37 points

32 comments1 min readLW link

Hacker-AI and Digital Ghosts – Pre-AGI

Erland Wittkotter19 Oct 2022 15:33 UTC

9 points

7 comments8 min readLW link

The reward function is already how well you manipulate humans

Kerry19 Oct 2022 1:52 UTC

20 points

9 comments2 min readLW link

Response to Katja Grace’s AI x-risk counterarguments

Erik Jenner and Johannes Treutlein

19 Oct 2022 1:17 UTC

77 points

18 comments15 min readLW link

(OLD) An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers

Neel Nanda18 Oct 2022 21:08 UTC

72 points

5 comments12 min readLW link

(www.neelnanda.io)

Distilled Representations Research Agenda

Hoagy and mishajw

18 Oct 2022 20:59 UTC

15 points

2 comments8 min readLW link

Drafting a Covid Survey

jefftk18 Oct 2022 19:30 UTC

15 points

2 comments2 min readLW link

(www.jefftk.com)

How To Make Prediction Markets Useful For Alignment Work

johnswentworth18 Oct 2022 19:01 UTC

97 points

18 comments2 min readLW link

A conversation about Katja’s counterarguments to AI risk

Matthew Barnett and Ege Erdil

18 Oct 2022 18:40 UTC

43 points

9 comments33 min readLW link