paulfchristiano

Karma: 27,899

Matrix completion prize results

paulfchristianoDec 20, 2023, 3:40 PM

42 points

0 comments2 min readLW link

(www.alignment.org)

Thoughts on responsible scaling policies and regulation

paulfchristianoOct 24, 2023, 10:21 PM

221 points

33 comments6 min readLW link

Thoughts on sharing information about language model capabilities

paulfchristianoJul 31, 2023, 4:04 PM

210 points

44 comments11 min readLW link 1 review

Self-driving car bets

paulfchristianoJul 29, 2023, 6:10 PM

236 points

44 comments5 min readLW link

(sideways-view.com)

ARC is hiring theoretical researchers

paulfchristiano, Jacob_Hilton and Mark Xu

Jun 12, 2023, 6:50 PM

126 points

12 comments4 min readLW link

(www.alignment.org)

Prizes for matrix completion problems

paulfchristianoMay 3, 2023, 11:30 PM

164 points

52 comments1 min readLW link

(www.alignment.org)

My views on “doom”

paulfchristianoApr 27, 2023, 5:50 PM

250 points

37 comments2 min readLW link 1 review

(ai-alignment.com)

Christiano (ARC) and GA (Conjecture) Discuss Alignment Cruxes

Andrea_Miotti, paulfchristiano, Gabriel Alfour and OliviaJ

Feb 24, 2023, 11:03 PM

61 points

7 comments47 min readLW link

Thoughts on the impact of RLHF research

paulfchristianoJan 25, 2023, 5:23 PM

253 points

102 comments9 min readLW link

Can we efficiently distinguish different mechanisms?

paulfchristianoDec 27, 2022, 12:20 AM

91 points

30 comments16 min readLW link

(ai-alignment.com)

Three reasons to cooperate

paulfchristianoDec 24, 2022, 5:40 PM

86 points

14 comments10 min readLW link

(sideways-view.com)

Can we efficiently explain model behaviors?

paulfchristianoDec 16, 2022, 7:40 PM

64 points

3 comments9 min readLW link

(ai-alignment.com)

AI alignment is distinct from its near-term applications

paulfchristianoDec 13, 2022, 7:10 AM

255 points

21 comments2 min readLW link

(ai-alignment.com)

Finding gliders in the game of life

paulfchristianoDec 1, 2022, 8:40 PM

104 points

8 comments16 min readLW link

(ai-alignment.com)

Mechanistic anomaly detection and ELK

paulfchristianoNov 25, 2022, 6:50 PM

138 points

22 comments21 min readLW link

(ai-alignment.com)

Decision theory and dynamic inconsistency

paulfchristianoJul 3, 2022, 10:20 PM

80 points

33 comments10 min readLW link

(sideways-view.com)

AI-Written Critiques Help Humans Notice Flaws

paulfchristianoJun 25, 2022, 5:22 PM

137 points

5 comments3 min readLW link

(openai.com)

Where I agree and disagree with Eliezer

paulfchristianoJun 19, 2022, 7:15 PM

901 points

224 comments18 min readLW link 2 reviews

What is causality to an evidential decision theorist?

paulfchristianoApr 17, 2022, 4:00 PM

45 points

26 comments5 min readLW link

(sideways-view.com)

ELK prize results

paulfchristiano and Mark Xu

Mar 9, 2022, 12:01 AM

138 points

50 comments21 min readLW link