[Question] Pondering how good or bad things will be in the AGI future

Sherrinford · Jul 9, 2024, 10:46 PM
11 points
9 comments · 2 min read · LW link

Causal Graphs of GPT-2-Small’s Residual Stream

David Udell · Jul 9, 2024, 10:06 PM
53 points
7 comments · 7 min read · LW link

[Question] If AI starts to end the world, is suicide a good idea?

IlluminateReality · Jul 9, 2024, 9:53 PM
0 points
8 comments · 1 min read · LW link

Rationalist Purity Test

Gunnar_Zarncke · Jul 9, 2024, 8:30 PM
−9 points
5 comments · 1 min read · LW link
(ratpuritytest.com)

That which can be destroyed by the truth, should be assumed to should be destroyed by it

Thac0 · Jul 9, 2024, 7:39 PM
6 points
0 comments · 3 min read · LW link

AISN #38: Supreme Court Decision Could Limit Federal Ability to Regulate AI Plus, “Circuit Breakers” for AI systems, and updates on China’s AI industry

Jul 9, 2024, 7:28 PM
5 points
0 comments · 5 min read · LW link
(newsletter.safe.ai)

Summer Tour Stops

jefftk · Jul 9, 2024, 7:10 PM
10 points
0 comments · 3 min read · LW link
(www.jefftk.com)

Fix simple mistakes in ARC-AGI, etc.

Oleg Trott · Jul 9, 2024, 5:46 PM
9 points
9 comments · 1 min read · LW link

Paper Summary: The Effects of Communicating Uncertainty on Public Trust in Facts and Numbers

Jeffrey Heninger · Jul 9, 2024, 4:50 PM
42 points
2 comments · 2 min read · LW link
(blog.aiimpacts.org)

UC Berkeley course on LLMs and ML Safety

Dan H · Jul 9, 2024, 3:40 PM
36 points
1 comment · 1 min read · LW link
(rdi.berkeley.edu)

What and Why: Developmental Interpretability of Reinforcement Learning

Garrett Baker · Jul 9, 2024, 2:09 PM
68 points
4 comments · 6 min read · LW link

Medical Roundup #3

Zvi · Jul 9, 2024, 1:10 PM
39 points
4 comments · 19 min read · LW link
(thezvi.wordpress.com)

Consent across power differentials

Ramana Kumar · Jul 9, 2024, 11:42 AM
50 points
12 comments · 3 min read · LW link

[Question] How bad would AI progress need to be for us to think general technological progress is also bad?

Jim Buhler · Jul 9, 2024, 10:43 AM
9 points
5 comments · 1 min read · LW link

How LLMs Learn: What We Know, What We Don’t (Yet) Know, and What Comes Next

Jonasb · Jul 9, 2024, 9:58 AM
2 points
0 comments · 16 min read · LW link
(www.denominations.io)

WTF is with the Infancy Gospel of Thomas?!? A deep dive into satire, philosophy, and more

kromem · Jul 9, 2024, 9:29 AM
18 points
2 comments · 11 min read · LW link

Book Review: Safe Enough? A History of Nuclear Power and Accident Risk

ErickBall · Jul 9, 2024, 1:12 AM
10 points
0 comments · 28 min read · LW link

Me, Myself, and AI: the Situational Awareness Dataset (SAD) for LLMs

Jul 8, 2024, 10:24 PM
109 points
37 comments · 5 min read · LW link

Robin Hanson & Liron Shapira Debate AI X-Risk

Liron · Jul 8, 2024, 9:45 PM
34 points
4 comments · 1 min read · LW link
(www.youtube.com)

“The Singularity Is Nearer” by Ray Kurzweil—Review

Lavender · Jul 8, 2024, 9:32 PM
22 points
0 comments · 4 min read · LW link

Sample Prevalence vs Global Prevalence

jefftk · Jul 8, 2024, 9:00 PM
11 points
0 comments · 2 min read · LW link
(www.jefftk.com)

Advice to junior AI governance researchers

Orpheus16 · Jul 8, 2024, 7:19 PM
66 points
1 comment · 5 min read · LW link

Pantheon Interface

Jul 8, 2024, 7:03 PM
127 points
22 comments · 6 min read · LW link

Launching the AI Forecasting Benchmark Series Q3 | $30k in Prizes

ChristianWilliams · Jul 8, 2024, 5:20 PM
5 points
0 comments · LW link
(www.metaculus.com)

The Golden Mean of Scientific Virtues

adamShimi · Jul 8, 2024, 5:16 PM
12 points
4 comments · 8 min read · LW link
(epistemologicalfascinations.substack.com)

Massapequa (Long Island), New York, USA – ACX Meetup

Gabriel Weil · Jul 8, 2024, 5:01 PM
2 points
0 comments · 1 min read · LW link

Dialogue introduction to Singular Learning Theory

Olli Järviniemi · Jul 8, 2024, 4:58 PM
101 points
15 comments · 8 min read · LW link

Announcing The Techno-Humanist Manifesto: A new philosophy of progress for the 21st century

jasoncrawford · Jul 8, 2024, 4:33 PM
18 points
4 comments · 5 min read · LW link
(blog.rootsofprogress.org)

Response to Dileep George: AGI safety warrants planning ahead

Steven Byrnes · Jul 8, 2024, 3:27 PM
27 points
7 comments · 27 min read · LW link

Why not parliamentarianism? [book by Tiago Ribeiro dos Santos]

Arturo Macias · Jul 8, 2024, 2:57 PM
2 points
1 comment · 4 min read · LW link

Games of My Childhood: The Troops

Kaj_Sotala · Jul 8, 2024, 11:20 AM
18 points
0 comments · 5 min read · LW link
(kajsotala.fi)

Towards shutdownable agents via stochastic choice

Jul 8, 2024, 10:14 AM
59 points
11 comments · 23 min read · LW link
(arxiv.org)

On scalable oversight with weak LLMs judging strong LLMs

Jul 8, 2024, 8:59 AM
49 points
18 comments · 7 min read · LW link
(arxiv.org)

Poker is a bad game for teaching epistemics. Figgie is a better one.

rossry · Jul 8, 2024, 6:05 AM
105 points
47 comments · 11 min read · LW link
(blog.rossry.net)

Controlled Creative Destruction

Martin Sustrik · Jul 8, 2024, 4:36 AM
11 points
0 comments · 2 min read · LW link

On saying “Thank you” instead of “I’m Sorry”

Michael Cohn · Jul 8, 2024, 3:13 AM
136 points
16 comments · 3 min read · LW link

How can I get over my fear of becoming an emulated consciousness?

James Dowdell · Jul 7, 2024, 10:02 PM
6 points
8 comments · 5 min read · LW link

An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2

Neel Nanda · Jul 7, 2024, 5:39 PM
136 points
16 comments · 25 min read · LW link

Joint mandatory donation as a way to increase the number of donations

Crazy philosopher · Jul 7, 2024, 10:56 AM
3 points
3 comments · 2 min read · LW link

Rationality vs Alignment

Donatas Lučiūnas · Jul 7, 2024, 10:12 AM
−14 points
14 comments · 2 min read · LW link

Beyond Biomarkers: Understanding Multiscale Causality

Matěj Nekoranec · Jul 7, 2024, 9:56 AM
13 points
0 comments · 7 min read · LW link

Goodhart’s Law and Emotions

Zero Contradictions · Jul 7, 2024, 8:32 AM
1 point
5 comments · 1 min read · LW link
(expandingrationality.substack.com)

Reflections on Less Online

Error · Jul 7, 2024, 3:49 AM
89 points
15 comments · 18 min read · LW link

LK-99 in retrospect

bhauth · Jul 7, 2024, 2:06 AM
72 points
21 comments · 3 min read · LW link
(www.bhauth.com)

NYU Debate Training Update: Methods, Baselines, Preliminary Results

samarnesen · Jul 6, 2024, 6:28 PM
9 points
0 comments · 20 min read · LW link

Scalable oversight as a quantitative rather than qualitative problem

Buck · Jul 6, 2024, 5:42 PM
85 points
11 comments · 3 min read · LW link

An AI Manhattan Project is Not Inevitable

Maxwell Tabarrok · Jul 6, 2024, 4:42 PM
39 points
25 comments · 4 min read · LW link
(www.maximum-progress.com)

[Linkpost] A Case for AI Consciousness

Jul 6, 2024, 2:52 PM
21 points
2 comments · 1 min read · LW link
(philpapers.org)

[Question] Can agents coordinate on randomness without outside sources?

Mikhail Samin · Jul 6, 2024, 1:43 PM
11 points
16 comments · 1 min read · LW link

AI Alignment Research Engineer Accelerator (ARENA): Call for applicants v4.0

Jul 6, 2024, 11:34 AM
57 points
7 comments · 6 min read · LW link