Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
1
Nonprofit to retain control of OpenAI
Archimedes
May 5, 2025, 11:41 PM
37
points
1
comment
1
min read
LW
link
(openai.com)
Unexpected Conscious Entities
Gunnar_Zarncke
May 5, 2025, 10:14 PM
34
points
6
comments
6
min read
LW
link
The First Law of Conscious Agency: Linguistic Relativity and the Birth of “I”
Dima (lain)
May 5, 2025, 9:20 PM
−17
points
4
comments
2
min read
LW
link
Newton’s second law explained: it works in many universes
Tahp
May 5, 2025, 7:47 PM
19
points
10
comments
15
min read
LW
link
(quark.rodeo)
Replicator->Vehicle Alignment and Human->AI Alignment
derelict5432
May 5, 2025, 7:23 PM
0
points
3
comments
4
min read
LW
link
The Sweet Lesson: AI Safety Should Scale With Compute
Jesse Hoogland
May 5, 2025, 7:03 PM
95
points
3
comments
3
min read
LW
link
[Question]
Blue light, ‘Adrenal ASMR’: strange experiences I can’t find any literature about
vernichtung
May 5, 2025, 6:58 PM
16
points
6
comments
1
min read
LW
link
Tsinghua paper: Does RL Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
Thomas Kwa
May 5, 2025, 6:56 PM
68
points
21
comments
2
min read
LW
link
(arxiv.org)
Intro & Proposal for AGI Model
PickleBrine
May 5, 2025, 6:56 PM
0
points
0
comments
3
min read
LW
link
AI Superorganisms: An Alternative Pathway to Artificial Superintelligence
Aaron Vanzyl
May 5, 2025, 6:55 PM
4
points
5
comments
15
min read
LW
link
Karlsruhe ACX: The colours of her coat
wilm
May 5, 2025, 6:35 PM
2
points
0
comments
1
min read
LW
link
The Metaculus Cup Series Is Live, $5,000 Prize Pool
ChristianWilliams
May 5, 2025, 5:14 PM
4
points
0
comments
LW
link
(www.metaculus.com)
Community Feedback Request: AI Safety Intro for General Public
Algon
and
Vishakha
May 5, 2025, 4:38 PM
6
points
5
comments
3
min read
LW
link
GPT-4o Sycophancy Post Mortem
Zvi
May 5, 2025, 4:00 PM
55
points
1
comment
16
min read
LW
link
(thezvi.wordpress.com)
Legal Supervision of Frontier AI Labs is the answer.
Gauraventh
May 5, 2025, 1:36 PM
14
points
2
comments
3
min read
LW
link
(robertandgaurav.substack.com)
The crucible — how I think about the situation with AI
owencb
May 5, 2025, 1:18 PM
25
points
1
comment
8
min read
LW
link
(strangecities.substack.com)
Lightning Talks: Thought, Trick, Curiosity
marta_k
May 5, 2025, 11:49 AM
1
point
0
comments
1
min read
LW
link
Are standardized tests effective?
Hruss
May 5, 2025, 10:02 AM
1
point
1
comment
1
min read
LW
link
Proposal: Liquid Prediction Markets for AI Forecasting
Jesse Richardson
May 5, 2025, 5:13 AM
23
points
2
comments
3
min read
LW
link
Why “Solving Alignment” Is Likely a Category Mistake
Nate Sharpe
May 5, 2025, 4:26 AM
22
points
3
comments
3
min read
LW
link
AI, Animals, & Digital Minds 2025: apply to speak by Wednesday!
Alistair Stewart
May 5, 2025, 12:56 AM
4
points
0
comments
1
min read
LW
link
AI, Animals, & Digital Minds 2025
Alistair Stewart
May 5, 2025, 12:51 AM
2
points
0
comments
1
min read
LW
link
Notes on the Long Tasks METR paper, from a HCAST task contributor
abstractapplic
May 4, 2025, 11:17 PM
108
points
7
comments
2
min read
LW
link
Why I am not a successionist
Nina Panickssery
May 4, 2025, 7:08 PM
62
points
52
comments
2
min read
LW
link
(ninapanickssery.substack.com)
Overview: AI Safety Outreach Grassroots Orgs
Severin T. Seehrich
and
Benjamin Schmidt
May 4, 2025, 5:39 PM
46
points
8
comments
2
min read
LW
link
The Power Users We Forgot: Why AI Needs Them Now More Than Ever
Anthony Fox
May 4, 2025, 5:23 PM
1
point
6
comments
3
min read
LW
link
Fake AI lawsuits to drive links
Yair Halberstadt
May 4, 2025, 4:53 PM
22
points
0
comments
1
min read
LW
link
(www.rationalistjudaism.com)
Scott Aaronson at UT Austin on May 17 | Computational Complexity & Philosophy
ekkolápto
May 4, 2025, 4:42 PM
1
point
0
comments
1
min read
LW
link
Interpretability Will Not Reliably Find Deceptive AI
Neel Nanda
May 4, 2025, 4:32 PM
316
points
66
comments
7
min read
LW
link
80 concepts on my new version of AI: DecisionBots
Wes R
May 4, 2025, 2:04 PM
0
points
2
comments
15
min read
LW
link
Where have all the tokens gone?
braces
May 4, 2025, 1:52 PM
13
points
7
comments
6
min read
LW
link
The Ukraine War and the Kill Market
Martin Sustrik
May 4, 2025, 7:50 AM
98
points
13
comments
5
min read
LW
link
(250bpm.substack.com)
PSA: Before May 21 is a good time to sign up for cryonics
AlexMennen
May 4, 2025, 4:10 AM
53
points
0
comments
1
min read
LW
link
GTFO of the Social Internet Before you Can’t: The Miro & Yindi Story
keltan
May 4, 2025, 1:08 AM
30
points
12
comments
10
min read
LW
link
“Superhuman” Isn’t Well Specified
JustisMills
May 3, 2025, 11:42 PM
32
points
9
comments
3
min read
LW
link
(justismills.substack.com)
Navigating burnout
gw
May 3, 2025, 10:07 PM
73
points
1
comment
9
min read
LW
link
(www.georgeyw.com)
What is your favorite podcast?
ChristianKl
May 3, 2025, 9:25 PM
32
points
9
comments
1
min read
LW
link
[Question]
Does translating a post with an LLM affect its rating?
ReverendBayes
May 3, 2025, 2:45 PM
9
points
9
comments
2
min read
LW
link
SimpleStories: A Better Synthetic Dataset and Tiny Models for Interpretability
Lennart Finke
May 3, 2025, 2:04 PM
13
points
0
comments
1
min read
LW
link
What’s up with AI’s vision
Joachim Bartosik
May 3, 2025, 1:23 PM
12
points
19
comments
1
min read
LW
link
Sparsity is the enemy of feature extraction (ft. absorption)
7vik
,
chanind
and
Adrià Garriga-alonso
May 3, 2025, 10:13 AM
31
points
0
comments
6
min read
LW
link
Exploring out-of-context reasoning (OOCR) fine-tuning in LLMs to increase test-phase awareness
Sanyu Rajakumar
May 3, 2025, 3:33 AM
8
points
0
comments
6
min read
LW
link
Prison Journal: Building Better Thinking Skills—Altruistic Person Saved > 100 Gorillas saved
P. João
May 3, 2025, 1:34 AM
−30
points
2
comments
1
min read
LW
link
Updates from Comments on “AI 2027 is a Bet Against Amdahl’s Law”
snewman
May 2, 2025, 11:52 PM
40
points
2
comments
13
min read
LW
link
Attend SPAR’s virtual demo day! (career fair + talks)
agucova
May 2, 2025, 11:45 PM
9
points
0
comments
LW
link
(demoday.sparai.org)
Why does METR score o3 as effective for such a long time duration despite overall poor scores?
Cole Wyeth
May 2, 2025, 10:58 PM
19
points
3
comments
1
min read
LW
link
Short story: Who is nancygonzalez8451097
Anders Lindström
May 2, 2025, 9:01 PM
13
points
2
comments
5
min read
LW
link
Interim Research Report: Mechanisms of Awareness
Josh Engels
,
Neel Nanda
and
Senthooran Rajamanoharan
May 2, 2025, 8:29 PM
43
points
6
comments
8
min read
LW
link
Agents, Tools, and Simulators
WillPetillo
,
Sean Herrington
,
Adebayo Mubarak
,
Cancus
and
Spencer Ames
May 2, 2025, 8:19 PM
12
points
2
comments
10
min read
LW
link
Obstacles in ARC’s agenda: Low Probability Estimation
David Matolcsi
May 2, 2025, 7:38 PM
43
points
0
comments
6
min read
LW
link
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel