Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
paulfchristiano
(Paul Christiano)
Karma:
24,593
All
Posts
Comments
New
Top
Old
Page
1
Prizes for matrix completion problems
paulfchristiano
3 May 2023 23:30 UTC
154
points
41
comments
1
min read
LW
link
(www.alignment.org)
My views on “doom”
paulfchristiano
27 Apr 2023 17:50 UTC
232
points
31
comments
2
min read
LW
link
(ai-alignment.com)
Christiano (ARC) and GA (Conjecture) Discuss Alignment Cruxes
Andrea_Miotti
,
paulfchristiano
,
Gabriel Alfour
and
Olivia Jimenez
24 Feb 2023 23:03 UTC
64
points
7
comments
47
min read
LW
link
Thoughts on the impact of RLHF research
paulfchristiano
25 Jan 2023 17:23 UTC
231
points
101
comments
9
min read
LW
link
Can we efficiently distinguish different mechanisms?
paulfchristiano
27 Dec 2022 0:20 UTC
86
points
30
comments
16
min read
LW
link
(ai-alignment.com)
Three reasons to cooperate
paulfchristiano
24 Dec 2022 17:40 UTC
78
points
14
comments
10
min read
LW
link
(sideways-view.com)
Can we efficiently explain model behaviors?
paulfchristiano
16 Dec 2022 19:40 UTC
64
points
3
comments
9
min read
LW
link
(ai-alignment.com)
AI alignment is distinct from its near-term applications
paulfchristiano
13 Dec 2022 7:10 UTC
253
points
21
comments
2
min read
LW
link
(ai-alignment.com)
Finding gliders in the game of life
paulfchristiano
1 Dec 2022 20:40 UTC
94
points
7
comments
16
min read
LW
link
(ai-alignment.com)
Mechanistic anomaly detection and ELK
paulfchristiano
25 Nov 2022 18:50 UTC
132
points
18
comments
21
min read
LW
link
(ai-alignment.com)
Decision theory and dynamic inconsistency
paulfchristiano
3 Jul 2022 22:20 UTC
71
points
33
comments
10
min read
LW
link
(sideways-view.com)
AI-Written Critiques Help Humans Notice Flaws
paulfchristiano
25 Jun 2022 17:22 UTC
137
points
5
comments
3
min read
LW
link
(openai.com)
Where I agree and disagree with Eliezer
paulfchristiano
19 Jun 2022 19:15 UTC
838
points
212
comments
20
min read
LW
link
What is causality to an evidential decision theorist?
paulfchristiano
17 Apr 2022 16:00 UTC
45
points
26
comments
5
min read
LW
link
(sideways-view.com)
ELK prize results
paulfchristiano
and
Mark Xu
9 Mar 2022 0:01 UTC
133
points
50
comments
21
min read
LW
link
IMO challenge bet with Eliezer
paulfchristiano
26 Feb 2022 4:50 UTC
163
points
25
comments
3
min read
LW
link
Better impossibility result for unbounded utilities
paulfchristiano
9 Feb 2022 6:10 UTC
30
points
22
comments
5
min read
LW
link
Impossibility results for unbounded utilities
paulfchristiano
2 Feb 2022 3:52 UTC
157
points
103
comments
8
min read
LW
link
ELK First Round Contest Winners
Mark Xu
and
paulfchristiano
26 Jan 2022 2:56 UTC
63
points
6
comments
1
min read
LW
link
Apply for research internships at ARC!
paulfchristiano
3 Jan 2022 20:26 UTC
61
points
0
comments
1
min read
LW
link
Back to top
Next