Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
paulfchristiano
(Paul Christiano)
Karma:
27,036
All
Posts
Comments
New
Top
Old
Page
1
Where I agree and disagree with Eliezer
paulfchristiano
19 Jun 2022 19:15 UTC
874
points
219
comments
18
min read
LW
link
2
reviews
What failure looks like
paulfchristiano
17 Mar 2019 20:18 UTC
401
points
54
comments
8
min read
LW
link
2
reviews
AI alignment is distinct from its near-term applications
paulfchristiano
13 Dec 2022 7:10 UTC
254
points
21
comments
2
min read
LW
link
(ai-alignment.com)
My views on “doom”
paulfchristiano
27 Apr 2023 17:50 UTC
242
points
33
comments
2
min read
LW
link
(ai-alignment.com)
Another (outer) alignment failure story
paulfchristiano
7 Apr 2021 20:12 UTC
241
points
38
comments
12
min read
LW
link
1
review
Thoughts on the impact of RLHF research
paulfchristiano
25 Jan 2023 17:23 UTC
236
points
101
comments
9
min read
LW
link
Self-driving car bets
paulfchristiano
29 Jul 2023 18:10 UTC
229
points
41
comments
5
min read
LW
link
(sideways-view.com)
ARC’s first technical report: Eliciting Latent Knowledge
paulfchristiano
,
Mark Xu
and
Ajeya Cotra
14 Dec 2021 20:09 UTC
225
points
90
comments
1
min read
LW
link
3
reviews
(docs.google.com)
Thoughts on responsible scaling policies and regulation
paulfchristiano
24 Oct 2023 22:21 UTC
214
points
33
comments
6
min read
LW
link
Hiring engineers and researchers to help align GPT-3
paulfchristiano
1 Oct 2020 18:54 UTC
206
points
13
comments
3
min read
LW
link
Thoughts on sharing information about language model capabilities
paulfchristiano
31 Jul 2023 16:04 UTC
191
points
34
comments
11
min read
LW
link
Announcing the Alignment Research Center
paulfchristiano
26 Apr 2021 23:30 UTC
178
points
6
comments
1
min read
LW
link
(ai-alignment.com)
Impossibility results for unbounded utilities
paulfchristiano
2 Feb 2022 3:52 UTC
166
points
109
comments
8
min read
LW
link
1
review
IMO challenge bet with Eliezer
paulfchristiano
26 Feb 2022 4:50 UTC
166
points
25
comments
3
min read
LW
link
Prizes for matrix completion problems
paulfchristiano
3 May 2023 23:30 UTC
163
points
51
comments
1
min read
LW
link
(www.alignment.org)
Secure homes for digital people
paulfchristiano
10 Oct 2021 15:50 UTC
161
points
37
comments
9
min read
LW
link
1
review
(sideways-view.com)
My research methodology
paulfchristiano
22 Mar 2021 21:20 UTC
159
points
38
comments
16
min read
LW
link
1
review
(ai-alignment.com)
Prizes for ELK proposals
paulfchristiano
3 Jan 2022 20:23 UTC
150
points
152
comments
7
min read
LW
link
Moral public goods
paulfchristiano
26 Jan 2020 0:10 UTC
147
points
74
comments
4
min read
LW
link
(sideways-view.com)
AI-Written Critiques Help Humans Notice Flaws
paulfchristiano
25 Jun 2022 17:22 UTC
137
points
5
comments
3
min read
LW
link
(openai.com)
Back to top
Next