Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Xander Davies
Karma:
283
Researcher at
UK AI Security Institute
.
All
Posts
Comments
New
Top
Old
Apply to HAIST/MAIA’s AI Governance Workshop in DC (Feb 17-20)
Phosphorous
,
Xander Davies
,
CMD
,
Paramedic
and
tlevin
Jan 31, 2023, 2:06 AM
28
points
0
comments
2
min read
LW
link
AGISF adaptation for in-person groups
Sam Marks
,
Xander Davies
and
Richard_Ngo
Jan 13, 2023, 3:24 AM
44
points
2
comments
3
min read
LW
link
Update on Harvard AI Safety Team and MIT AI Alignment
Xander Davies
,
Sam Marks
,
kaivu
,
tlevin
,
eleni
,
maxnadeau
and
Naomi Bashkansky
2 Dec 2022 0:56 UTC
60
points
4
comments
8
min read
LW
link
Recommend HAIST resources for assessing the value of RLHF-related alignment research
Sam Marks
and
Xander Davies
5 Nov 2022 20:58 UTC
26
points
9
comments
3
min read
LW
link
Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley
maxnadeau
,
Xander Davies
,
Buck
and
Nate Thomas
27 Oct 2022 1:32 UTC
135
points
14
comments
12
min read
LW
link
GD’s Implicit Bias on Separable Data
Xander Davies
17 Oct 2022 4:13 UTC
25
points
0
comments
7
min read
LW
link
Back to top
N
W
F
A
C
D
E
F
G
H
I
Customize appearance
Current theme:
default
A
C
D
E
F
G
H
I
Less Wrong (text)
Less Wrong (link)
Invert colors
Reset to defaults
OK
Cancel
Hi, I’m Bobby the Basilisk! Click on the minimize button (
) to minimize the theme tweaker window, so that you can see what the page looks like with the current tweaked values. (But remember,
the changes won’t be saved until you click “OK”!
)
Theme tweaker help
Show Bobby the Basilisk
OK
Cancel