All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 121314 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

[Question] How bad would AI progress need to be for us to think general technological progress is also bad?

Jim BuhlerJul 9, 2024, 10:43 AM

9 points

5 comments1 min readLW link

How LLMs Learn: What We Know, What We Don’t (Yet) Know, and What Comes Next

JonasbJul 9, 2024, 9:58 AM

2 points

0 comments16 min readLW link

(www.denominations.io)

WTF is with the Infancy Gospel of Thomas?!? A deep dive into satire, philosophy, and more

kromemJul 9, 2024, 9:29 AM

18 points

2 comments11 min readLW link

Book Review: Safe Enough? A History of Nuclear Power and Accident Risk

ErickBallJul 9, 2024, 1:12 AM

10 points

0 comments28 min readLW link

Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs

L Rudolf L, bilalchughtai, Jan Betley, kaivu, Jérémy Scheurer, Mikita Balesni, AlexMeinke, Owain_Evans and Marius Hobbhahn

Jul 8, 2024, 10:24 PM

109 points

37 comments5 min readLW link

Robin Hanson & Liron Shapira Debate AI X-Risk

LironJul 8, 2024, 9:45 PM

34 points

4 comments1 min readLW link

(www.youtube.com)

“The Singularity Is Nearer” by Ray Kurzweil—Review

LavenderJul 8, 2024, 9:32 PM

22 points

0 comments4 min readLW link

Sample Prevalence vs Global Prevalence

jefftkJul 8, 2024, 9:00 PM

11 points

0 comments2 min readLW link

(www.jefftk.com)

Advice to junior AI governance researchers

Orpheus16Jul 8, 2024, 7:19 PM

66 points

1 comment5 min readLW link

Pantheon Interface

NicholasKees and Sofia Vanhanen

Jul 8, 2024, 7:03 PM

127 points

22 comments6 min readLW link

Launching the AI Forecasting Benchmark Series Q3 | $30k in Prizes

ChristianWilliamsJul 8, 2024, 5:20 PM

5 points

0 comments1 min readLW link

(www.metaculus.com)

The Golden Mean of Scientific Virtues

adamShimiJul 8, 2024, 5:16 PM

12 points

4 comments8 min readLW link

(epistemologicalfascinations.substack.com)

Massapequa (Long Island), New York, USA – ACX Meetup

Gabriel WeilJul 8, 2024, 5:01 PM

2 points

0 comments1 min readLW link

Dialogue introduction to Singular Learning Theory

Olli JärviniemiJul 8, 2024, 4:58 PM

102 points

15 comments8 min readLW link

Announcing The Techno-Humanist Manifesto: A new philosophy of progress for the 21st century

jasoncrawfordJul 8, 2024, 4:33 PM

18 points

4 comments5 min readLW link

(blog.rootsofprogress.org)

Response to Dileep George: AGI safety warrants planning ahead

Steven ByrnesJul 8, 2024, 3:27 PM

27 points

7 comments27 min readLW link

Why not parliamentarianism? [book by Tiago Ribeiro dos Santos]

Arturo MaciasJul 8, 2024, 2:57 PM

2 points

1 comment4 min readLW link

Games of My Childhood: The Troops

Kaj_SotalaJul 8, 2024, 11:20 AM

18 points

0 comments5 min readLW link

(kajsotala.fi)

Towards shutdownable agents via stochastic choice

EJT, alexr, christosi and LAThomson

Jul 8, 2024, 10:14 AM

59 points

11 comments23 min readLW link

(arxiv.org)

On scalable oversight with weak LLMs judging strong LLMs

zac_kenton, Noah Siegel, janos, Jonah Brown-Cohen, Samuel Albanie, David Lindner and Rohin Shah

Jul 8, 2024, 8:59 AM

49 points

18 comments7 min readLW link

(arxiv.org)

Poker is a bad game for teaching epistemics. Figgie is a better one.

rossryJul 8, 2024, 6:05 AM

105 points

47 comments11 min readLW link

(blog.rossry.net)

Controlled Creative Destruction

Martin SustrikJul 8, 2024, 4:36 AM

11 points

0 comments2 min readLW link

On saying “Thank you” instead of “I’m Sorry”

Michael CohnJul 8, 2024, 3:13 AM

136 points

16 comments3 min readLW link

How can I get over my fear of becoming an emulated consciousness?

James DowdellJul 7, 2024, 10:02 PM

6 points

8 comments5 min readLW link

An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2

Neel NandaJul 7, 2024, 5:39 PM

137 points

16 comments25 min readLW link

Joint mandatory donation as a way to increase the number of donations

Crazy philosopherJul 7, 2024, 10:56 AM

3 points

3 comments2 min readLW link

Rationality vs Alignment

Donatas LučiūnasJul 7, 2024, 10:12 AM

−14 points

14 comments2 min readLW link

Beyond Biomarkers: Understanding Multiscale Causality

Matěj NekoranecJul 7, 2024, 9:56 AM

13 points

0 comments7 min readLW link

Goodhart’s Law and Emotions

Zero ContradictionsJul 7, 2024, 8:32 AM

1 point

5 comments1 min readLW link

(expandingrationality.substack.com)

Reflections on Less Online

ErrorJul 7, 2024, 3:49 AM

89 points

15 comments18 min readLW link

LK-99 in retrospect

bhauthJul 7, 2024, 2:06 AM

72 points

21 comments3 min readLW link

(www.bhauth.com)

NYU Debate Training Update: Methods, Baselines, Preliminary Results

samarnesenJul 6, 2024, 6:28 PM

9 points

0 comments20 min readLW link

Scalable oversight as a quantitative rather than qualitative problem

BuckJul 6, 2024, 5:42 PM

86 points

11 comments3 min readLW link

An AI Manhattan Project is Not Inevitable

Maxwell TabarrokJul 6, 2024, 4:42 PM

39 points

25 comments4 min readLW link

(www.maximum-progress.com)

[Linkpost] A Case for AI Consciousness

cdkg and Simon Goldstein

Jul 6, 2024, 2:52 PM

21 points

2 comments1 min readLW link

(philpapers.org)

[Question] Can agents coordinate on randomness without outside sources?

Mikhail SaminJul 6, 2024, 1:43 PM

11 points

16 comments1 min readLW link

AI Alignment Research Engineer Accelerator (ARENA): Call for applicants v4.0

James Fox, Chloe Li, JamesH, Gracie Green and CallumMcDougall

Jul 6, 2024, 11:34 AM

57 points

7 comments6 min readLW link

Links and brief musings for June

Kaj_SotalaJul 6, 2024, 10:10 AM

26 points

0 comments10 min readLW link

(kajsotala.fi)

Indecision and internalized authority figures

Kaj_SotalaJul 6, 2024, 10:10 AM

69 points

1 comment2 min readLW link

(kajsotala.fi)

Free Will, Determinism, And Choice

Zero ContradictionsJul 6, 2024, 6:34 AM

7 points

3 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Travel Buffer

jefftkJul 6, 2024, 2:20 AM

17 points

3 comments1 min readLW link

(www.jefftk.com)

[Question] What progress have we made on automated auditing?

LawrenceCJul 6, 2024, 1:49 AM

38 points

1 comment1 min readLW link

A “Bitter Lesson” Approach to Aligning AGI and ASI

RogerDearnaleyJul 6, 2024, 1:23 AM

64 points

41 comments24 min readLW link

D&D.Sci: Whom Shall You Call?

abstractapplicJul 5, 2024, 8:53 PM

40 points

6 comments2 min readLW link

[Interim research report] Activation plateaus & sensitive directions in GPT2

StefanHex and jake_mendel

Jul 5, 2024, 5:05 PM

65 points

2 comments5 min readLW link

Minimalist And Maximalist Type Systems

adamShimiJul 5, 2024, 4:25 PM

17 points

6 comments3 min readLW link

(epistemologicalfascinations.substack.com)

ML4Good Summer Bootcamps—Applications Open [deadline extended]

YMJul 5, 2024, 1:59 PM

12 points

0 comments1 min readLW link

[Question] Are there any plans to launch a paperback version of “Rationality: From AI to Zombies”?

m_arjJul 5, 2024, 11:14 AM

2 points

1 comment1 min readLW link

Doomsday Argument and the False Dilemma of Anthropic Reasoning

Ape in the coatJul 5, 2024, 5:38 AM

49 points

61 comments7 min readLW link

Finding the Wisdom to Build Safe AI

Gordon Seidoh WorleyJul 4, 2024, 7:04 PM

36 points

10 comments9 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer