Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
New
Hot
Active
Old
Page
1
Annals of Counterfactual Han
GenericModel
12 Dec 2025 1:11 UTC
30
points
0
comments
6
min read
LW
link
(enrichedjamsham.substack.com)
Does dissolving newcomb’s paradox matter?
Srdjan Miletic
12 Dec 2025 1:06 UTC
9
points
2
comments
2
min read
LW
link
(www.dissent.blog)
Designing the World’s Safest AI based on Morality Models & Vipassana
shanzson
11 Dec 2025 23:33 UTC
−5
points
0
comments
21
min read
LW
link
Anhedoniapolis
Alex Beyman
11 Dec 2025 21:53 UTC
13
points
0
comments
77
min read
LW
link
ASI Already Knows About Torture—In Defense of Talking Openly About S-Risks
KatWoods
11 Dec 2025 21:15 UTC
−7
points
0
comments
2
min read
LW
link
Cognitive Tech from Algorithmic Information Theory
Cole Wyeth
11 Dec 2025 20:32 UTC
31
points
6
comments
1
min read
LW
link
Weird Generalization & Inductive Backdoors
Jorio Cocola
,
Owain_Evans
and
dylan_f
11 Dec 2025 18:18 UTC
88
points
0
comments
8
min read
LW
link
The tree, the fly, the ant, the dog, the farmer and the businessman
Alexandre Variengien
11 Dec 2025 17:56 UTC
13
points
2
comments
5
min read
LW
link
(alexandrevariengien.com)
Ships in the Night – A Short Story
Dhruv Sumathi
11 Dec 2025 17:11 UTC
7
points
0
comments
29
min read
LW
link
My AGI safety research—2025 review, ’26 plans
Steven Byrnes
11 Dec 2025 17:05 UTC
87
points
1
comment
12
min read
LW
link
Thinking through a lens of physiology
Vadim Golub
11 Dec 2025 16:55 UTC
1
point
0
comments
7
min read
LW
link
If Anyone Builds It Everyone Dies, another semi-outsider review
manueldelrio
11 Dec 2025 15:43 UTC
50
points
6
comments
8
min read
LW
link
North Sentinelese Post-Singularity
Cleo Nardo
11 Dec 2025 14:57 UTC
52
points
21
comments
1
min read
LW
link
Sea snails in a cocaine vaccine
Alexandre Variengien
11 Dec 2025 14:22 UTC
9
points
0
comments
2
min read
LW
link
Resources for parents
Viliam
11 Dec 2025 10:46 UTC
17
points
7
comments
2
min read
LW
link
Steganographic Chains of Thought Are Low-Probability but High-Stakes: Evidence and Arguments
artkpv
11 Dec 2025 7:40 UTC
6
points
1
comment
6
min read
LW
link
Brain-inspired LLM alignment
mtaran
11 Dec 2025 3:08 UTC
11
points
1
comment
3
min read
LW
link
Seven Perspectives on LLMs
GenericModel
11 Dec 2025 2:11 UTC
12
points
1
comment
12
min read
LW
link
(enrichedjamsham.substack.com)
Some evidence against the idea strange CoT stems from incentives to compress language
williawa
10 Dec 2025 22:43 UTC
12
points
0
comments
2
min read
LW
link
Rock Paper Scissors is Not Solved, In Practice
Linch
10 Dec 2025 21:37 UTC
45
points
11
comments
9
min read
LW
link
(inchpin.substack.com)
Back to top
Next