All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 293031

Why and how to write things on the Internet

benkuhn29 Dec 2022 22:40 UTC

22 points

2 comments15 min readLW link

(www.benkuhn.net)

Friendly and Unfriendly AGI are Indistinguishable

ErgoEcho29 Dec 2022 22:13 UTC

−4 points

4 comments4 min readLW link

(neologos.co)

200 COP in MI: Looking for Circuits in the Wild

Neel Nanda29 Dec 2022 20:59 UTC

16 points

5 comments13 min readLW link

Thoughts on the implications of GPT-3, two years ago and NOW [here be dragons, we’re swimming, flying and talking with them]

Bill Benzon29 Dec 2022 20:05 UTC

0 points

0 comments5 min readLW link

Covid 12/29/22: Next Up is XBB.1.5

Zvi29 Dec 2022 18:20 UTC

33 points

4 comments10 min readLW link

(thezvi.wordpress.com)

Entrepreneurship ETG Might Be Better Than 80k Thought

Xodarap29 Dec 2022 17:51 UTC

33 points

0 comments2 min readLW link

Internal Interfaces Are a High-Priority Interpretability Target

Thane Ruthenis29 Dec 2022 17:49 UTC

26 points

6 comments7 min readLW link

CFP for Rebellion and Disobedience in AI workshop

Ram Rachum29 Dec 2022 16:08 UTC

15 points

0 comments1 min readLW link

My scorched-earth policy on New Year’s resolutions

PatrickDFarley29 Dec 2022 14:45 UTC

29 points

2 comments4 min readLW link

Don’t feed the void. She is fat enough!

Johannes C. Mayer29 Dec 2022 14:18 UTC

11 points

0 comments1 min readLW link

[Question] Is there any unified resource on Eliezer’s fatigue?

Johannes C. Mayer29 Dec 2022 14:04 UTC

9 points

2 comments1 min readLW link

Logical Probability of Goldbach’s Conjecture: Provable Rule or Coincidence?

avturchin29 Dec 2022 13:37 UTC

5 points

15 comments8 min readLW link

Where do you get your capabilities from?

tailcalled29 Dec 2022 11:39 UTC

38 points

28 comments6 min readLW link

The commercial incentive to intentionally train AI to deceive us

Derek M. Jones29 Dec 2022 11:30 UTC

5 points

1 comment4 min readLW link

(shape-of-code.com)

Infinite necklace: the line as a circle

Alok Singh29 Dec 2022 10:41 UTC

5 points

2 comments1 min readLW link

Privacy Tradeoffs

jefftk29 Dec 2022 3:40 UTC

13 points

1 comment2 min readLW link

(www.jefftk.com)

Against John Searle, Gary Marcus, the Chinese Room thought experiment and its world

philosophybear29 Dec 2022 3:26 UTC

21 points

43 comments8 min readLW link

Large Language Models Suggest a Path to Ems

anithite29 Dec 2022 2:20 UTC

17 points

2 comments5 min readLW link

[Question] Book recommendations for the history of ML?

Eleni Angelou28 Dec 2022 23:50 UTC

2 points

2 comments1 min readLW link

Rock-Paper-Scissors Can Be Weird

winwonce28 Dec 2022 23:12 UTC

14 points

3 comments1 min readLW link

200 COP in MI: The Case for Analysing Toy Language Models

Neel Nanda28 Dec 2022 21:07 UTC

40 points

3 comments7 min readLW link

200 Concrete Open Problems in Mechanistic Interpretability: Introduction

Neel Nanda28 Dec 2022 21:06 UTC

108 points

0 comments10 min readLW link

Effective ways to find love?

anonymoususer28 Dec 2022 20:46 UTC

9 points

6 comments1 min readLW link

Classical logic based on propositions-as-subsingleton-types

Thomas Kehrenberg28 Dec 2022 20:16 UTC

6 points

0 comments16 min readLW link

In Defense of Wrapper-Minds

Thane Ruthenis28 Dec 2022 18:28 UTC

24 points

38 comments3 min readLW link

[Question] What is the best way to approach Expected Value calculations when payoffs are highly skewed?

jmh28 Dec 2022 14:42 UTC

8 points

16 comments1 min readLW link

Bandwagon effect: Bias in Evaluating AGI X-Risks

Remmelt and flandry19

28 Dec 2022 7:54 UTC

1 point

0 comments1 min readLW link

Getting up to Speed on the Speed Prior in 2022

robertzk28 Dec 2022 7:49 UTC

36 points

5 comments65 min readLW link

[Question] What does “probability” really mean?

sisyphus28 Dec 2022 3:20 UTC

6 points

20 comments1 min readLW link

Zooming the Chrome Audio Player

jefftk28 Dec 2022 2:30 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

What AI Safety Materials Do ML Researchers Find Compelling?

VG and Collin

28 Dec 2022 2:03 UTC

175 points

34 comments2 min readLW link

South Bay ACX/LW Meetup

IS28 Dec 2022 1:59 UTC

3 points

0 comments1 min readLW link

Regarding Blake Lemoine’s claim that LaMDA is ‘sentient’, he might be right (sorta), but perhaps not for the reasons he thinks

philosophybear28 Dec 2022 1:55 UTC

9 points

1 comment6 min readLW link

Fundamental Uncertainty: Chapter 5 - How do we know what we know?

Gordon Seidoh Worley28 Dec 2022 1:28 UTC

10 points

2 comments12 min readLW link

Is checking that a state of the world is not dystopian easier than constructing a non-dystopian state?

No77e27 Dec 2022 20:57 UTC

5 points

3 comments1 min readLW link

Crypto-currency as pro-alignment mechanism

False Name27 Dec 2022 17:45 UTC

−10 points

2 comments2 min readLW link

My Reservations about Discovering Latent Knowledge (Burns, Ye, et al)

Robert_AIZI27 Dec 2022 17:27 UTC

50 points

0 comments4 min readLW link

(aizi.substack.com)

Things that can kill you quickly: What everyone should know about first aid

jasoncrawford27 Dec 2022 16:23 UTC

172 points

21 comments2 min readLW link 1 review

(jasoncrawford.org)

[Question] Why The Focus on Expected Utility Maximisers?

DragonGod27 Dec 2022 15:49 UTC

120 points

85 comments3 min readLW link

Presumptive Listening: sticking to familiar concepts and missing the outer reasoning paths

Remmelt27 Dec 2022 15:40 UTC

−16 points

8 comments2 min readLW link

(mflb.com)

Mere exposure effect: Bias in Evaluating AGI X-Risks

Remmelt and flandry19

27 Dec 2022 14:05 UTC

0 points

2 comments1 min readLW link

Housing and Transportation Roundup #2

Zvi27 Dec 2022 13:10 UTC

25 points

0 comments12 min readLW link

(thezvi.wordpress.com)

[Question] Are tulpas moral patients?

ChristianKl27 Dec 2022 11:30 UTC

18 points

28 comments1 min readLW link

Reflections on my 5-month alignment upskilling grant

Jay Bailey27 Dec 2022 10:51 UTC

83 points

4 comments8 min readLW link

Institutions Cannot Restrain Dark-Triad AI Exploitation

Remmelt and flandry19

27 Dec 2022 10:34 UTC

5 points

0 comments5 min readLW link

(mflb.com)

Introduction: Bias in Evaluating AGI X-Risks

Remmelt and flandry19

27 Dec 2022 10:27 UTC

1 point

0 comments3 min readLW link

MDPs and the Bellman Equation, Intuitively Explained

Jack O'Brien27 Dec 2022 5:50 UTC

11 points

3 comments14 min readLW link

How ‘Human-Human’ dynamics give way to ‘Human-AI’ and then ‘AI-AI’ dynamics

Remmelt and flandry19

27 Dec 2022 3:16 UTC

−2 points

5 comments2 min readLW link

(mflb.com)

Nine Points of Collective Insanity

Remmelt and flandry19

27 Dec 2022 3:14 UTC

−2 points

3 comments1 min readLW link

(mflb.com)

Fractional Resignation

jefftk27 Dec 2022 2:30 UTC

37 points

12 comments1 min readLW link

(www.jefftk.com)