Dangers of Closed-Loop AI
Gordon Seidoh Worley
Mar 22, 2024, 11:52 PM
35
points
9
comments
2
min read
LW
link
Why The Insects Scream
omnizoid
Mar 22, 2024, 7:47 PM
4
points
11
comments
9
min read
LW
link
What does “autodidact” mean?
bhauth
Mar 22, 2024, 6:37 PM
22
points
19
comments
1
min read
LW
link
[Linkpost] Vague Verbiage in Forecasting
trevor
Mar 22, 2024, 6:05 PM
11
points
9
comments
3
min read
LW
link
(goodjudgment.com)
Wolf and Rabbit
Richard Henage
Mar 22, 2024, 5:20 PM
14
points
4
comments
1
min read
LW
link
AI Model Registries: A Regulatory Review
Deric Cheng
and
Elliot Mckernon
Mar 22, 2024, 4:04 PM
9
points
0
comments
6
min read
LW
link
Video and transcript of presentation on Scheming AIs
Joe Carlsmith
Mar 22, 2024, 3:52 PM
32
points
1
comment
32
min read
LW
link
Benchmarking LLM Agents on Kaggle Competitions
aog
Mar 22, 2024, 1:09 PM
15
points
4
comments
5
min read
LW
link
American Acceleration vs Development
Maxwell Tabarrok
Mar 22, 2024, 1:01 PM
1
point
0
comments
4
min read
LW
link
(www.maximum-progress.com)
Transformative AI and Scenario Planning for AI X-risk
Elliot Mckernon
and
Justin Bullock
Mar 22, 2024, 9:38 AM
15
points
0
comments
8
min read
LW
link
The Pyromaniacs
Ted Sanders
Mar 22, 2024, 6:55 AM
4
points
1
comment
2
min read
LW
link
Vernor Vinge, who coined the term “Technological Singularity”, dies at 79
Kaj_Sotala
Mar 21, 2024, 10:14 PM
150
points
25
comments
1
min read
LW
link
(arstechnica.com)
ChatGPT can learn indirect control
Raymond Douglas
Mar 21, 2024, 9:11 PM
213
points
27
comments
1
min read
LW
link
“Deep Learning” Is Function Approximation
Zack_M_Davis
Mar 21, 2024, 5:50 PM
98
points
28
comments
10
min read
LW
link
(zackmdavis.net)
A Teacher vs. Everyone Else
ronak69
Mar 21, 2024, 5:45 PM
41
points
8
comments
2
min read
LW
link
Static vs Dynamic Alignment
Gracie Green
Mar 21, 2024, 5:44 PM
5
points
0
comments
12
min read
LW
link
On green
Joe Carlsmith
Mar 21, 2024, 5:38 PM
269
points
35
comments
31
min read
LW
link
Comparing Alignment to other AGI interventions: Extensions and analysis
Martín Soto
Mar 21, 2024, 5:30 PM
7
points
0
comments
4
min read
LW
link
The Comcast Problem
RamblinDash
Mar 21, 2024, 4:46 PM
1
point
15
comments
1
min read
LW
link
Vipassana Meditation and Active Inference: A Framework for Understanding Suffering and its Cessation
sturb
Mar 21, 2024, 12:32 PM
50
points
8
comments
19
min read
LW
link
AI #56: Blackwell That Ends Well
Zvi
Mar 21, 2024, 12:10 PM
34
points
16
comments
68
min read
LW
link
(thezvi.wordpress.com)
An Affordable CO2 Monitor
Pretentious Penguin
Mar 21, 2024, 3:06 AM
28
points
1
comment
1
min read
LW
link
DeepMind: Evaluating Frontier Models for Dangerous Capabilities
Zach Stein-Perlman
Mar 21, 2024, 3:00 AM
61
points
8
comments
1
min read
LW
link
(arxiv.org)
Where are the Contra Dances?
jefftk
Mar 21, 2024, 2:00 AM
9
points
0
comments
1
min read
LW
link
(www.jefftk.com)
Slim overview of work one could do to make AI go better (and a grab-bag of other career considerations)
Chi Nguyen
Mar 20, 2024, 11:17 PM
9
points
1
comment
LW
link
How does AI solve problems?
Dom Polsinelli
Mar 20, 2024, 10:29 PM
2
points
0
comments
7
min read
LW
link
What I Learned (Conclusion To “The Sense Of Physical Necessity”)
LoganStrohl
Mar 20, 2024, 9:24 PM
34
points
0
comments
3
min read
LW
link
Stagewise Development in Neural Networks
Jesse Hoogland
,
Liam Carroll
and
Daniel Murfet
Mar 20, 2024, 7:54 PM
96
points
1
comment
11
min read
LW
link
On the Gladstone Report
Zvi
Mar 20, 2024, 7:50 PM
64
points
11
comments
40
min read
LW
link
(thezvi.wordpress.com)
Natural Latents: The Concepts
johnswentworth
and
David Lorell
Mar 20, 2024, 6:21 PM
90
points
18
comments
19
min read
LW
link
Comparing Alignment to other AGI interventions: Basic model
Martín Soto
Mar 20, 2024, 6:17 PM
12
points
4
comments
7
min read
LW
link
New report: Safety Cases for AI
joshc
Mar 20, 2024, 4:45 PM
89
points
14
comments
1
min read
LW
link
(twitter.com)
User-inclination-guessing algorithms: registering a goal
ProgramCrafter
Mar 20, 2024, 3:55 PM
2
points
0
comments
2
min read
LW
link
My MATS Summer 2023 experience
James Chua
Mar 20, 2024, 11:26 AM
29
points
0
comments
3
min read
LW
link
(jameschua.net)
[Question]
What are the weirdest things a human may want for their own sake?
Mateusz Bagiński
Mar 20, 2024, 11:15 AM
7
points
16
comments
1
min read
LW
link
[Question]
Best *organization* red-pill books and posts?
lemonhope
Mar 20, 2024, 7:01 AM
10
points
2
comments
1
min read
LW
link
Parent-Friendly Dance Weekends
jefftk
Mar 20, 2024, 2:10 AM
16
points
0
comments
2
min read
LW
link
(www.jefftk.com)
[Question]
“I Can’t Believe It Both Is and Is Not Encephalitis!” Or: What do you do when the evidence is crazy?
Erhannis
Mar 19, 2024, 10:08 PM
20
points
3
comments
11
min read
LW
link
Delta’s of Change
Jonas Kgomo
Mar 19, 2024, 9:03 PM
1
point
0
comments
4
min read
LW
link
Increasing IQ by 10 Points is Possible
George3d6
Mar 19, 2024, 8:48 PM
23
points
51
comments
5
min read
LW
link
(morelucid.substack.com)
Are extreme probabilities for P(doom) epistemically justifed?
NathanBarnard
and
Alexander Gietelink Oldenziel
Mar 19, 2024, 8:32 PM
20
points
12
comments
7
min read
LW
link
Have I Solved the Two Envelopes Problem Once and For All?
JackOfAllTrades
Mar 19, 2024, 7:57 PM
−6
points
5
comments
3
min read
LW
link
[Question]
How can one be less wrong, if their conversation partner loses the interest on discussing the topic with them?
Ooker
Mar 19, 2024, 6:11 PM
−10
points
3
comments
1
min read
LW
link
Carlo: uncertainty analysis in Google Sheets
ProbabilityEnjoyer
19 Mar 2024 17:59 UTC
6
points
0
comments
1
min read
LW
link
(carlo.app)
NAIRA—An exercise in regulatory, competitive safety governance [AI Governance Institutional Design idea]
Heramb
19 Mar 2024 17:43 UTC
2
points
0
comments
1
min read
LW
link
(forum.effectivealtruism.org)
AI Safety Evaluations: A Regulatory Review
Elliot Mckernon
and
Deric Cheng
19 Mar 2024 15:05 UTC
22
points
1
comment
11
min read
LW
link
Mechanism for feature learning in neural networks and backpropagation-free machine learning models
Matt Goldenberg
19 Mar 2024 14:55 UTC
8
points
1
comment
1
min read
LW
link
(www.science.org)
Monthly Roundup #16: March 2024
Zvi
19 Mar 2024 13:10 UTC
33
points
4
comments
55
min read
LW
link
(thezvi.wordpress.com)
Experimentation (Part 7 of “The Sense Of Physical Necessity”)
LoganStrohl
18 Mar 2024 21:25 UTC
33
points
0
comments
10
min read
LW
link
INTERVIEW: Round 2 - StakeOut.AI w/ Dr. Peter Park
jacobhaimes
18 Mar 2024 21:21 UTC
5
points
0
comments
1
min read
LW
link
(into-ai-safety.github.io)