All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 171819 20 21 22 23 24 25 26 27 28 29 30

AGIs may value intrinsic rewards more than extrinsic ones

catubc17 Nov 2022 21:49 UTC

8 points

6 comments4 min readLW link

LLMs may capture key components of human agency

catubc17 Nov 2022 20:14 UTC

27 points

0 comments4 min readLW link

Mastodon Replies as Comments

jefftk17 Nov 2022 20:10 UTC

20 points

0 comments1 min readLW link

(www.jefftk.com)

Announcing the Progress Forum

jasoncrawford17 Nov 2022 19:26 UTC

83 points

9 comments1 min readLW link

[Question] What kind of bias is this?

Daniel Samuel17 Nov 2022 18:44 UTC

3 points

2 comments1 min readLW link

AI Forecasting Research Ideas

Jsevillamol17 Nov 2022 17:37 UTC

21 points

2 comments1 min readLW link

(docs.google.com)

Results from the interpretability hackathon

Esben Kran and Neel Nanda

17 Nov 2022 14:51 UTC

81 points

0 comments6 min readLW link

(alignmentjam.com)

Covid 11/17/22: Slow Recovery

Zvi17 Nov 2022 14:50 UTC

33 points

3 comments4 min readLW link

(thezvi.wordpress.com)

Sadly, FTX

Zvi17 Nov 2022 14:30 UTC

133 points

18 comments47 min readLW link

(thezvi.wordpress.com)

Deontology and virtue ethics as “effective theories” of consequentialist ethics

Jan_Kulveit17 Nov 2022 14:11 UTC

72 points

9 comments10 min readLW link 1 review

The Ground Truth Problem (Or, Why Evaluating Interpretability Methods Is Hard)

Jessica Rumbelow17 Nov 2022 11:06 UTC

27 points

2 comments2 min readLW link

[Question] [Personal Question] Can anyone help me navigate this potentially painful interpersonal dynamic rationally?

SlainLadyMondegreen17 Nov 2022 8:53 UTC

9 points

3 comments4 min readLW link

Massive Scaling Should be Frowned Upon

harsimony17 Nov 2022 8:43 UTC

5 points

6 comments5 min readLW link

[Question] Why are profitable companies laying off staff?

Yair Halberstadt17 Nov 2022 6:19 UTC

15 points

10 comments1 min readLW link

[Question] [retracted] Discussion: Was SBF a naive utilitarian, or a sociopath?

Nicholas Kross17 Nov 2022 2:52 UTC

0 points

4 comments1 min readLW link

Kelsey Piper’s recent interview of SBF

agucova16 Nov 2022 20:30 UTC

51 points

29 comments2 min readLW link

(www.vox.com)

The Echo Principle

Jonathan Moregård16 Nov 2022 20:09 UTC

4 points

0 comments3 min readLW link

(honestliving.substack.com)

[Question] Is there some reason LLMs haven’t seen broader use?

tailcalled16 Nov 2022 20:04 UTC

25 points

27 comments1 min readLW link

When should we be surprised that an invention took “so long”?

jasoncrawford16 Nov 2022 20:04 UTC

32 points

11 comments4 min readLW link

(rootsofprogress.org)

Questions about Value Lock-in, Paternalism, and Empowerment

Sam F. Brown16 Nov 2022 15:33 UTC

13 points

2 comments12 min readLW link

(sambrown.eu)

If Professional Investors Missed This...

jefftk16 Nov 2022 15:00 UTC

37 points

18 comments3 min readLW link

(www.jefftk.com)

Disagreement with bio anchors that lead to shorter timelines

Marius Hobbhahn16 Nov 2022 14:40 UTC

75 points

17 comments7 min readLW link 1 review

Current themes in mechanistic interpretability research

Lee Sharkey, Sid Black and beren

16 Nov 2022 14:14 UTC

89 points

2 comments12 min readLW link

Unpacking “Shard Theory” as Hunch, Question, Theory, and Insight

Jacy Reese Anthis16 Nov 2022 13:54 UTC

31 points

9 comments2 min readLW link

Miracles and why not to believe them

mruwnik16 Nov 2022 12:07 UTC

4 points

0 comments2 min readLW link

[Question] How do people do remote research collaborations effectively?

Krieger16 Nov 2022 11:51 UTC

8 points

0 comments1 min readLW link

Method of statements: an alternative to taboo

Q Home16 Nov 2022 10:57 UTC

7 points

0 comments41 min readLW link

The two conceptions of Active Inference: an intelligence architecture and a theory of agency

Roman Leventov16 Nov 2022 9:30 UTC

18 points

0 comments4 min readLW link

Developer experience for the motivation

Adam Zerner16 Nov 2022 7:12 UTC

49 points

7 comments4 min readLW link

Progress links and tweets, 2022-11-15

jasoncrawford16 Nov 2022 3:21 UTC

9 points

0 comments2 min readLW link

(rootsofprogress.org)

EA & LW Forums Weekly Summary (7th Nov − 13th Nov 22′)

Zoe Williams16 Nov 2022 3:04 UTC

19 points

0 comments14 min readLW link

The FTX Saga—Simplified

Annapurna16 Nov 2022 2:42 UTC

44 points

10 comments7 min readLW link

(jorgevelez.substack.com)

Utilitarianism and the idea of a “rational agent” are fundamentally inconsistent with reality

banev16 Nov 2022 0:19 UTC

−4 points

1 comment1 min readLW link

[Question] Is the speed of training large models going to increase significantly in the near future due to Cerebras Andromeda?

Amal 15 Nov 2022 22:50 UTC

13 points

11 comments1 min readLW link

[Question] What is our current best infohazard policy for AGI (safety) research?

Roman Leventov15 Nov 2022 22:33 UTC

12 points

2 comments1 min readLW link

ACX/SSC Meetup 1 pm Sunday Nov 20

svfritz15 Nov 2022 20:39 UTC

2 points

0 comments1 min readLW link

SBF x LoL

Nicholas Kross15 Nov 2022 20:24 UTC

17 points

6 comments4 min readLW link

Some research ideas in forecasting

Jsevillamol15 Nov 2022 19:47 UTC

35 points

2 comments6 min readLW link

Strategy of Inner Conflict

Jonathan Moregård15 Nov 2022 19:38 UTC

9 points

4 comments6 min readLW link

(honestliving.substack.com)

The limited upside of interpretability

Peter S. Park15 Nov 2022 18:46 UTC

13 points

11 comments10 min readLW link

Why bet Kelly?

AlexMennen15 Nov 2022 18:12 UTC

32 points

14 comments5 min readLW link

Entropy Scaling And Intrinsic Memory

Alexander Gietelink Oldenziel and Adam Shai

15 Nov 2022 18:11 UTC

20 points

5 comments5 min readLW link

[Question] Will nanotech/biotech be what leads to AI doom?

tailcalled15 Nov 2022 17:38 UTC

4 points

9 comments2 min readLW link

Value Formation: An Overarching Model

Thane Ruthenis15 Nov 2022 17:16 UTC

34 points

20 comments34 min readLW link

Internal communication framework

rosehadshar and Nora_Ammann

15 Nov 2022 12:41 UTC

38 points

13 comments12 min readLW link

Better Mastodon Aliases

jefftk15 Nov 2022 12:10 UTC

14 points

3 comments1 min readLW link

(www.jefftk.com)

The economy as an analogy for advanced AI systems

rosehadshar and particlemania

15 Nov 2022 11:16 UTC

28 points

0 comments5 min readLW link

We need better prediction markets

eigen15 Nov 2022 4:54 UTC

9 points

8 comments1 min readLW link

Preventing, reversing, and addressing data leakage: some thoughts

VipulNaik15 Nov 2022 2:09 UTC

14 points

4 comments25 min readLW link

Winners of the AI Safety Nudge Competition

Marc Carauleanu15 Nov 2022 1:06 UTC

4 points

0 comments1 min readLW link