Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
Archive
Sequences
About
Search
Log In
All
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
All
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
All
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
Page
2
Clarifying Alignment Fundamentals Through the Lens of Ontology
Ben Ihrig
Oct 7, 2024, 8:57 PM
12
points
4
comments
24
min read
LW
link
Ethics on Cosmic Scale, Outer Space Treaty, Directed Panspermia, Forwards-Contamination, Technology Assessment, Planetary Protection, and Fermi’s Paradox
MrFantastic
Oct 7, 2024, 8:56 PM
−12
points
0
comments
1
min read
LW
link
Domain-specific SAEs
jacob_drori
Oct 7, 2024, 8:15 PM
28
points
2
comments
5
min read
LW
link
Metaculus Is Open Source
ChristianWilliams
Oct 7, 2024, 7:55 PM
13
points
0
comments
LW
link
(www.metaculus.com)
Research update: Towards a Law of Iterated Expectations for Heuristic Estimators
Eric Neyman
Oct 7, 2024, 7:29 PM
87
points
2
comments
22
min read
LW
link
AI Model Registries: A Foundational Tool for AI Governance
Elliot Mckernon
,
Deric Cheng
and
Gwyn Glasser
Oct 7, 2024, 7:27 PM
20
points
1
comment
4
min read
LW
link
(www.convergenceanalysis.org)
Evaluating the truth of statements in a world of ambiguous language.
Hastings
Oct 7, 2024, 6:08 PM
48
points
19
comments
2
min read
LW
link
Advice for journalists
Nathan Young
Oct 7, 2024, 4:46 PM
101
points
53
comments
9
min read
LW
link
(nathanpmyoung.substack.com)
Time Efficient Resistance Training
romeostevensit
Oct 7, 2024, 3:15 PM
42
points
12
comments
3
min read
LW
link
A Narrow Path: a plan to deal with AI extinction risk
Andrea_Miotti
,
davekasten
and
Tolga
Oct 7, 2024, 1:02 PM
73
points
12
comments
2
min read
LW
link
(www.narrowpath.co)
Toy Models of Feature Absorption in SAEs
chanind
,
hrdkbhatnagar
,
TomasD
and
Joseph Bloom
Oct 7, 2024, 9:56 AM
49
points
8
comments
10
min read
LW
link
An argument that consequentialism is incomplete
cousin_it
Oct 7, 2024, 9:45 AM
35
points
27
comments
1
min read
LW
link
An X-Ray is Worth 15 Features: Sparse Autoencoders for Interpretable Radiology Report Generation
hugofry
,
Ahmed Abdulaal
,
NMontanaBrown
and
a-ijishakin
Oct 7, 2024, 8:53 AM
40
points
1
comment
5
min read
LW
link
(arxiv.org)
Compelling Villains and Coherent Values
Cole Wyeth
Oct 6, 2024, 7:53 PM
42
points
4
comments
4
min read
LW
link
To Be Born in a Bag
Niko_McCarty
Oct 6, 2024, 5:21 PM
19
points
1
comment
16
min read
LW
link
(www.asimov.press)
Whimsical Thoughts on an AI Notepad: Exploring Non-Invasive Neural Integration via Viral and Stem Cell Pathways
Pug stanky
Oct 6, 2024, 4:37 PM
1
point
2
comments
4
min read
LW
link
Why I’m not a Bayesian
Richard_Ngo
Oct 6, 2024, 3:22 PM
215
points
104
comments
10
min read
LW
link
(www.mindthefuture.info)
European Progress Conference
Martin Sustrik
Oct 6, 2024, 11:10 AM
27
points
11
comments
3
min read
LW
link
(250bpm.substack.com)
Open Thread Fall 2024
habryka
Oct 5, 2024, 10:28 PM
44
points
193
comments
1
min read
LW
link
[Question]
Seeking AI Alignment Tutor/Advisor: $100–150/hr
MrThink
Oct 5, 2024, 9:28 PM
28
points
3
comments
2
min read
LW
link
Interpretability of SAE Features Representing Check in ChessGPT
Jonathan Kutasov
Oct 5, 2024, 8:43 PM
27
points
2
comments
8
min read
LW
link
2024 Election Forecasting Contest
mike20731
Oct 5, 2024, 8:43 PM
4
points
0
comments
1
min read
LW
link
(www.mikesblog.net)
5 ways to improve CoT faithfulness
Caleb Biddulph
Oct 5, 2024, 8:17 PM
44
points
40
comments
6
min read
LW
link
Consciousness As Recursive Reflections
Gunnar_Zarncke
Oct 5, 2024, 8:00 PM
7
points
2
comments
1
min read
LW
link
(www.astralcodexten.com)
What is it like to be psychologically healthy? Podcast ft. DaystarEld
Chipmonk
and
DaystarEld
Oct 5, 2024, 7:14 PM
31
points
8
comments
2
min read
LW
link
(chrislakin.blog)
Musings on Text Data Wall (Oct 2024)
Vladimir_Nesov
Oct 5, 2024, 7:00 PM
40
points
2
comments
5
min read
LW
link
Apply to the Cooperative AI PhD Fellowship by October 14th!
Lewis Hammond
Oct 5, 2024, 12:41 PM
23
points
0
comments
LW
link
AISafety.info: What is the “natural abstractions hypothesis”?
Algon
Oct 5, 2024, 12:31 PM
38
points
2
comments
3
min read
LW
link
(aisafety.info)
ARENA4.0 Capstone: Hyperparameter tuning for MELBO + replication on Llama-3.2-1b-Instruct
25Hour
and
submarat
Oct 5, 2024, 11:30 AM
34
points
2
comments
8
min read
LW
link
Exploring SAE features in LLMs with definition trees and token lists
mwatkins
Oct 4, 2024, 10:15 PM
38
points
5
comments
6
min read
LW
link
AXRP Episode 37 - Jaime Sevilla on Forecasting AI
DanielFilan
Oct 4, 2024, 9:00 PM
21
points
3
comments
56
min read
LW
link
[Question]
Seeking Solutions for Aggregating Classifier Outputs
Saeid Ghafouri
Oct 4, 2024, 5:39 PM
−1
points
0
comments
1
min read
LW
link
Amoeba roles in tech
Sindhu Shivaprasad
Oct 4, 2024, 5:25 PM
12
points
0
comments
4
min read
LW
link
LASR Labs Spring 2025 applications are open!
Erin Robertson
,
charlie_griffin
,
joehardie
and
Justin Olive
Oct 4, 2024, 1:44 PM
38
points
0
comments
4
min read
LW
link
(Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need
Sodium
Oct 3, 2024, 7:11 PM
35
points
17
comments
17
min read
LW
link
Does natural selection favor AIs over humans?
cdkg
Oct 3, 2024, 6:47 PM
20
points
1
comment
1
min read
LW
link
(link.springer.com)
What Hayek Taught Us About Nature
Ground Truth Data
Oct 3, 2024, 6:20 PM
−1
points
6
comments
2
min read
LW
link
Biasing VLM Response with Visual Stimuli
Jaehyuk Lim
Oct 3, 2024, 6:04 PM
5
points
0
comments
8
min read
LW
link
AI #84: Better Than a Podcast
Zvi
Oct 3, 2024, 3:00 PM
56
points
7
comments
52
min read
LW
link
(thezvi.wordpress.com)
[Question]
If I have some money, whom should I donate it to in order to reduce expected P(doom) the most?
KvmanThinking
Oct 3, 2024, 11:31 AM
35
points
37
comments
1
min read
LW
link
Shutting down all competing AI projects might not buy a lot of time due to Internal Time Pressure
ThomasCederborg
Oct 3, 2024, 12:01 AM
12
points
7
comments
12
min read
LW
link
“25 Lessons from 25 Years of Marriage” by honorary rationalist Ferrett Steinmetz
CronoDAS
Oct 2, 2024, 10:42 PM
24
points
2
comments
1
min read
LW
link
(theferrett.substack.com)
MIT FutureTech are hiring for a Head of Operations role
peterslattery
Oct 2, 2024, 5:11 PM
8
points
0
comments
4
min read
LW
link
Can AI Quantity beat AI Quality?
Gianluca Calcagni
Oct 2, 2024, 3:21 PM
2
points
0
comments
5
min read
LW
link
[Intuitive self-models] 3. The Homunculus
Steven Byrnes
Oct 2, 2024, 3:20 PM
78
points
38
comments
25
min read
LW
link
AI Safety University Organizing: Early Takeaways from Thirteen Groups
agucova
Oct 2, 2024, 3:14 PM
26
points
0
comments
LW
link
Three main arguments that AI will save humans and one meta-argument
avturchin
Oct 2, 2024, 11:39 AM
8
points
8
comments
2
min read
LW
link
Should we abstain from voting? (In nondeterministic elections)
B Jacobs
Oct 2, 2024, 10:07 AM
5
points
6
comments
4
min read
LW
link
(bobjacobs.substack.com)
AI Safety at the Frontier: Paper Highlights, September ’24
gasteigerjo
2 Oct 2024 9:49 UTC
13
points
0
comments
7
min read
LW
link
(aisafetyfrontier.substack.com)
Self-Help Corner: Loop Detection
adamShimi
2 Oct 2024 8:33 UTC
88
points
6
comments
2
min read
LW
link
(formethods.substack.com)
Previous
Back to top
Next
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel