David Johnston
Karma:
510
A brief theory of why we think things are good or bad
David Johnston
Oct 20, 2024, 8:31 PM
7
points
10
comments
LW
link
Mechanistic Anomaly Detection Research Update
Nora Belrose and David Johnston
Aug 6, 2024, 10:33 AM
11
points
0
comments
1
min read
LW
link
(blog.eleuther.ai)
Opinion merging for AI control
David Johnston
May 4, 2023, 2:43 AM
6
points
0
comments
11
min read
LW
link
[Question] Is it worth avoiding detailed discussions of expectations about agency levels of powerful AIs?
David Johnston
Mar 16, 2023, 3:06 AM
11
points
6
comments
2
min read
LW
link
How likely are malign priors over objectives? [aborted WIP]
David Johnston
Nov 11, 2022, 5:36 AM
−1
points
0
comments
8
min read
LW
link
When can a mimic surprise you? Why generative models handle seemingly ill-posed problems
David Johnston
Nov 5, 2022, 1:19 PM
8
points
4
comments
16
min read
LW
link
There’s probably a tradeoff between AI capability and safety, and we should act like it
David Johnston
Jun 9, 2022, 12:17 AM
3
points
3
comments
1
min read
LW
link
Is evolutionary influence the mesa objective that we’re interested in?
David Johnston
May 3, 2022, 1:18 AM
3
points
2
comments
5
min read
LW
link
[Cross-post] Half baked ideas: defining and measuring Artificial Intelligence system effectiveness
David Johnston
Apr 5, 2022, 12:29 AM
2
points
0
comments
7
min read
LW
link
[Question] Are there any impossibility theorems for strong and safe AI?
David Johnston
Mar 11, 2022, 1:41 AM
5
points
3
comments
1
min read
LW
link
Counterfactuals from ensembles of peers
David Johnston
Jan 4, 2022, 7:01 AM
3
points
4
comments
7
min read
LW
link