All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 222324 25 26 27 28 29 30 31

SociaLLM: proposal for a language model design for personalised apps, social science, and AI safety research

Roman LeventovDec 19, 2023, 4:49 PM

17 points

5 comments3 min readLW link

Chording “The Next Right Thing”

jefftkDec 19, 2023, 3:40 PM

11 points

0 comments2 min readLW link

(www.jefftk.com)

Monthly Roundup #13: December 2023

ZviDec 19, 2023, 3:10 PM

32 points

5 comments26 min readLW link

(thezvi.wordpress.com)

Effective Aspersions: How the Nonlinear Investigation Went Wrong

TracingWoodgrainsDec 19, 2023, 12:00 PM

188 points

172 comments31 min readLW link 2 reviews

A Universal Emergent Decomposition of Retrieval Tasks in Language Models

Alexandre Variengien and Eric Winsor

Dec 19, 2023, 11:52 AM

84 points

3 comments10 min readLW link

(arxiv.org)

Assessment of AI safety agendas: think about the downside risk

Roman LeventovDec 19, 2023, 9:00 AM

13 points

1 comment1 min readLW link

Constellations are Younger than Continents

Jeffrey HeningerDec 19, 2023, 6:12 AM

264 points

21 comments2 min readLW link

The Dark Arts

lsusr and Lyrongolem

Dec 19, 2023, 4:41 AM

134 points

49 comments9 min readLW link

When scientists consider whether their research will end the world

HarlanDec 19, 2023, 3:47 AM

30 points

4 comments11 min readLW link

(blog.aiimpacts.org)

Is the far future inevitably zero sum?

Srdjan MileticDec 19, 2023, 1:45 AM

8 points

2 comments2 min readLW link

(dissent.blog)

The ‘Neglected Approaches’ Approach: AE Studio’s Alignment Agenda

Cameron Berg, Judd Rosenblatt, AE Studio and Marc Carauleanu

Dec 18, 2023, 8:35 PM

178 points

23 comments12 min readLW link 1 review

The Shortest Path Between Scylla and Charybdis

Thane RuthenisDec 18, 2023, 8:08 PM

50 points

8 comments5 min readLW link

OpenAI: Preparedness framework

Zach Stein-PerlmanDec 18, 2023, 6:30 PM

70 points

23 comments4 min readLW link

(openai.com)

[Valence series] 5. “Valence Disorders” in Mental Health & Personality

Steven ByrnesDec 18, 2023, 3:26 PM

45 points

13 comments13 min readLW link

Discussion: Challenges with Unsupervised LLM Knowledge Discovery

Seb Farquhar, Vikrant Varma, zac_kenton, gasteigerjo, Vlad Mikulik and Rohin Shah

Dec 18, 2023, 11:58 AM

147 points

21 comments10 min readLW link

Interpreting the Learning of Deceit

RogerDearnaleyDec 18, 2023, 8:12 AM

30 points

14 comments9 min readLW link

Talk: “AI Would Be A Lot Less Alarming If We Understood Agents”

johnswentworthDec 17, 2023, 11:46 PM

58 points

3 comments1 min readLW link

(www.youtube.com)

∀: a story

Richard_NgoDec 17, 2023, 10:42 PM

39 points

1 comment8 min readLW link

(www.narrativeark.xyz)

Reviving a 2015 MacBook

jefftkDec 17, 2023, 9:00 PM

11 points

0 comments1 min readLW link

(www.jefftk.com)

A Common-Sense Case For Mutually-Misaligned AGIs Allying Against Humans

Thane RuthenisDec 17, 2023, 8:28 PM

29 points

7 comments11 min readLW link

The Limits of Artificial Consciousness: A Biology-Based Critique of Chalmers’ Fading Qualia Argument

Štěpán LosDec 17, 2023, 7:11 PM

−6 points

9 comments17 min readLW link

What makes teaching math special

ViliamDec 17, 2023, 2:15 PM

45 points

27 comments11 min readLW link

The predictive power of dissipative adaptation

dr_sDec 17, 2023, 2:01 PM

56 points

14 comments19 min readLW link

Linkpost: Francesca v Harvard

LinchDec 17, 2023, 6:18 AM

5 points

5 comments2 min readLW link

(www.francesca-v-harvard.org)

Lessons from massaging myself, others, dogs, and cats

Chris LakinDec 17, 2023, 4:28 AM

2 points

27 comments5 min readLW link

(chipmonk.blog)

The Serendipity of Density

jefftkDec 17, 2023, 3:50 AM

40 points

4 comments1 min readLW link

(www.jefftk.com)

Bounty: Diverse hard tasks for LLM agents

Beth Barnes and Megan Kinniment

Dec 17, 2023, 1:04 AM

49 points

31 comments16 min readLW link

2022 (and All Time) Posts by Pingback Count

RaemonDec 16, 2023, 9:17 PM

53 points

14 comments6 min readLW link

“Humanity vs. AGI” Will Never Look Like “Humanity vs. AGI” to Humanity

Thane RuthenisDec 16, 2023, 8:08 PM

191 points

34 comments5 min readLW link

A visual analogy for text generation by LLMs?

Bill BenzonDec 16, 2023, 5:58 PM

3 points

0 comments1 min readLW link

cold aluminum for medicine

bhauthDec 16, 2023, 2:38 PM

42 points

4 comments4 min readLW link

(www.bhauth.com)

Scalable Oversight and Weak-to-Strong Generalization: Compatible approaches to the same problem

Ansh Radhakrishnan, Buck, ryan_greenblatt and Fabien Roger

Dec 16, 2023, 5:49 AM

76 points

4 comments6 min readLW link 1 review

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

leogaoDec 16, 2023, 5:39 AM

55 points

5 comments1 min readLW link

Pope Francis shares thoughts on responsible AI development

corruptedCatapillarDec 16, 2023, 3:49 AM

15 points

4 comments1 min readLW link

(www.vatican.va)

Current AIs Provide Nearly No Data Relevant to AGI Alignment

Thane RuthenisDec 15, 2023, 8:16 PM

132 points

157 comments8 min readLW link 1 review

Agglomeration of ‘Ought’

DavidAndresBloomDec 15, 2023, 7:07 PM

1 point

1 comment11 min readLW link

Predicting the future with the power of the Internet (and pissing off Rob Miles)

WriterDec 15, 2023, 5:37 PM

23 points

9 comments4 min readLW link

(youtu.be)

Progress links digest, 2023-12-15: Vitalik on d/acc, $100M+ in prizes, and more

jasoncrawfordDec 15, 2023, 3:52 PM

20 points

0 comments12 min readLW link

(rootsofprogress.org)

“AI Alignment” is a Dangerously Overloaded Term

RokoDec 15, 2023, 2:34 PM

108 points

100 comments3 min readLW link

[Valence series] 4. Valence & Social Status (deprecated)

Steven ByrnesDec 15, 2023, 2:24 PM

35 points

19 comments11 min readLW link

Contra Scott on Abolishing the FDA

Maxwell TabarrokDec 15, 2023, 2:00 PM

46 points

3 comments6 min readLW link

(maximumprogress.substack.com)

[Paper] Trajectories through semantic spaces in schizophrenia and the relationship to ripple bursts

bvbvbvbvbvbvbvbvbvbvbvDec 15, 2023, 1:37 PM

3 points

0 comments1 min readLW link

(www.pnas.org)

Takeaways from a Mechanistic Interpretability project on “Forbidden Facts”

Tony Wang, Miles Wang and kaivu

Dec 15, 2023, 11:05 AM

33 points

8 comments10 min readLW link

Refinement of Active Inference agency ontology

Roman LeventovDec 15, 2023, 9:31 AM

16 points

0 comments5 min readLW link

(arxiv.org)

EU policymakers reach an agreement on the AI Act

tlevinDec 15, 2023, 6:02 AM

78 points

7 comments7 min readLW link

Where Does Adversarial Pressure Come From?

quetzal_rainbowDec 14, 2023, 10:31 PM

17 points

1 comment2 min readLW link

Epoch wise critical periods, and singular learning theory

Garrett BakerDec 14, 2023, 8:55 PM

16 points

1 comment5 min readLW link

OpenAI Superalignment: Weak-to-strong generalization

DalmertDec 14, 2023, 7:47 PM

25 points

3 comments1 min readLW link

(openai.com)

Applications for EA Global are still open!

Eli_NathanDec 14, 2023, 7:10 PM

1 point

0 comments1 min readLW link

Personal Development System: Winning Repeatedly and Growing Effectively With The BIG4

Paul RohdeDec 14, 2023, 6:49 PM

13 points

0 comments33 min readLW link

(blog.paul-rohde.com)

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer