[Question] When to mention irrelevant accusations?

philh · Jan 14, 2023, 9:58 PM
20 points
50 comments · 1 min read · LW link

World-Model Interpretability Is All We Need

Thane Ruthenis · Jan 14, 2023, 7:37 PM
36 points
22 comments · 21 min read · LW link

Current AI Models Seem Sufficient for Low-Risk, Beneficial AI

harsimony · Jan 14, 2023, 6:55 PM
17 points
1 comment · 2 min read · LW link

[Question] Basic Question about LLMs: how do they know what task to perform

Garak · Jan 14, 2023, 1:13 PM
1 point
3 comments · 1 min read · LW link

Aligned with what?

Program Den · Jan 14, 2023, 10:28 AM
3 points
41 comments · 1 min read · LW link

Wokism, rethinking priorities and the Bostrom case

Arturo Macias · Jan 14, 2023, 2:27 AM
−25 points
2 comments · 4 min read · LW link

A general comment on discussions of genetic group differences

anonymous8101 · Jan 14, 2023, 2:11 AM
71 points
46 comments · 3 min read · LW link

Abstractions as morphisms between (co)algebras

Erik Jenner · Jan 14, 2023, 1:51 AM
17 points
1 comment · 8 min read · LW link

Concrete Reasons for Hope about AI

Zac Hatfield-Dodds · Jan 14, 2023, 1:22 AM
94 points
13 comments · 1 min read · LW link

Negative Expertise

Jonas Kgomo · Jan 14, 2023, 12:51 AM
4 points
0 comments · 1 min read · LW link
(twitter.com)

Mid-Atlantic AI Alignment Alliance Unconference

Quinn · Jan 13, 2023, 8:33 PM
7 points
2 comments · 1 min read · LW link

Smallpox vaccines are widely available, for now

David Hornbein · Jan 13, 2023, 8:02 PM
26 points
5 comments · 1 min read · LW link

How does GPT-3 spend its 175B parameters?

Robert_AIZI · Jan 13, 2023, 7:21 PM
41 points
14 comments · 6 min read · LW link
(aizi.substack.com)

[ASoT] Simulators show us behavioural properties by default

Jozdien · Jan 13, 2023, 6:42 PM
36 points
3 comments · 3 min read · LW link

Wheel of Consent Theory for Rationalists and Effective Altruists

adamwilder · Jan 13, 2023, 5:59 PM
1 point
0 comments · 2 min read · LW link

Money is a way of thanking strangers

DirectedEvolution · Jan 13, 2023, 5:06 PM
13 points
5 comments · 4 min read · LW link

Tracr: Compiled Transformers as a Laboratory for Interpretability | DeepMind

DragonGod · Jan 13, 2023, 4:53 PM
62 points
12 comments · 1 min read · LW link
(arxiv.org)

How we could stumble into AI catastrophe

HoldenKarnofsky · Jan 13, 2023, 4:20 PM
71 points
18 comments · 18 min read · LW link
(www.cold-takes.com)

Robustness & Evolution [MLAISU W02]

Esben Kran · Jan 13, 2023, 3:47 PM
10 points
0 comments · 3 min read · LW link
(newsletter.apartresearch.com)

On Cooking With Gas

Zvi · Jan 13, 2023, 2:20 PM
38 points
60 comments · 6 min read · LW link
(thezvi.wordpress.com)

Beware safety-washing

Lizka · Jan 13, 2023, 1:59 PM
51 points
2 comments · 4 min read · LW link

Some Arguments Against Strong Scaling

Joar Skalse · Jan 13, 2023, 12:04 PM
25 points
21 comments · 16 min read · LW link

[Question] Where do you find people who actually do things?

Ulisse Mini · Jan 13, 2023, 6:57 AM
7 points
12 comments · 1 min read · LW link

[Question] Could Simulating an AGI Taking Over the World Actually Lead to a LLM Taking Over the World?

simeon_c · Jan 13, 2023, 6:33 AM
15 points
1 comment · 1 min read · LW link

Burning Uptime: When your Sandbox of Empathy is Leaky and also an Hourglass

Cedar · Jan 13, 2023, 5:18 AM
13 points
2 comments · 3 min read · LW link

Disentangling Shard Theory into Atomic Claims

Leon Lang · Jan 13, 2023, 4:23 AM
86 points
6 comments · 18 min read · LW link

AGISF adaptation for in-person groups

Jan 13, 2023, 3:24 AM
44 points
2 comments · 3 min read · LW link

Actions and Flows

Alok Singh · Jan 13, 2023, 3:20 AM
5 points
0 comments · 1 min read · LW link
(alok.github.io)

A Thorough Introduction to Abstraction

RohanS · Jan 13, 2023, 12:30 AM
9 points
1 comment · 18 min read · LW link

The AI Control Problem in a wider intellectual context

philosophybear · Jan 13, 2023, 12:28 AM
11 points
3 comments · 12 min read · LW link

The Alignment Problems

Martín Soto · Jan 12, 2023, 10:29 PM
20 points
0 comments · 4 min read · LW link

Proposal for Inducing Steganography in LMs

Logan Riggs · Jan 12, 2023, 10:15 PM
22 points
3 comments · 2 min read · LW link

Announcing the 2023 PIBBSS Summer Research Fellowship

Jan 12, 2023, 9:31 PM
32 points
0 comments · 1 min read · LW link

Victoria Krakovna on AGI Ruin, The Sharp Left Turn and Paradigms of AI Alignment

Michaël Trazzi · Jan 12, 2023, 5:09 PM
40 points
3 comments · 4 min read · LW link
(www.theinsideview.ai)

[Question] What is a disagreement you have around AI safety?

tailcalled · Jan 12, 2023, 4:58 PM
16 points
7 comments · 1 min read · LW link

Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning

Roman Leventov · Jan 12, 2023, 4:43 PM
17 points
2 comments · 2 min read · LW link
(arxiv.org)

ChatGPT struggles to respond to the real world

Alex Flint · Jan 12, 2023, 4:02 PM
31 points
9 comments · 24 min read · LW link

Covid 1/12/23: Unexpected Spike in Deaths

Zvi · Jan 12, 2023, 2:30 PM
31 points
2 comments · 8 min read · LW link
(thezvi.wordpress.com)

[Linkpost] Scaling Laws for Generative Mixed-Modal Language Models

Amal · Jan 12, 2023, 2:24 PM
15 points
2 comments · 1 min read · LW link
(arxiv.org)

ea.domains - Domains Free to a Good Home

plex · Jan 12, 2023, 1:32 PM
24 points
0 comments · LW link

VIRTUA: a novel about AI alignment

Karl von Wendt · Jan 12, 2023, 9:37 AM
46 points
12 comments · 1 min read · LW link

Iron deficiencies are very bad and you should treat them

Elizabeth · Jan 12, 2023, 9:10 AM
108 points
34 comments · 11 min read · LW link · 1 review
(acesounderglass.com)

Nonstandard analysis in ethics

Alok Singh · Jan 12, 2023, 5:58 AM
−1 points
0 comments · 78 min read · LW link
(nickbostrom.com)

Example of the nameless rationalist virtue

Alok Singh · Jan 12, 2023, 5:45 AM
−9 points
2 comments · 1 min read · LW link

FFMI Gains: A List of Vitalities

porby · Jan 12, 2023, 4:48 AM
26 points
3 comments · 7 min read · LW link

[Linkpost] DreamerV3: A General RL Architecture

simeon_c · Jan 12, 2023, 3:55 AM
23 points
3 comments · 1 min read · LW link
(arxiv.org)

Microsoft Plans to Invest $10B in OpenAI; $3B Invested to Date | Fortune

DragonGod · Jan 12, 2023, 3:55 AM
23 points
10 comments · 2 min read · LW link
(fortune.com)

Progress and research disruptiveness

Eleni Angelou · Jan 12, 2023, 3:51 AM
3 points
2 comments · 1 min read · LW link
(www.nature.com)

The Fable of the AI Coomer: Why the Social Prowess of Machines is AI’s Most Proximal Threat

Ace Delgado · Jan 12, 2023, 1:15 AM
−10 points
4 comments · 4 min read · LW link

Write to Think

Michael Samoilov · Jan 12, 2023, 12:33 AM
10 points
2 comments · 2 min read · LW link