RSS

Chris_Leong

Karma: 6,714

De­cou­pling vs Con­tex­tu­al­is­ing Norms

Chris_Leong14 May 2018 22:44 UTC
155 points
51 comments2 min readLW link3 reviews

Don’t Dis­miss Sim­ple Align­ment Approaches

Chris_Leong7 Oct 2023 0:35 UTC
128 points
9 comments4 min readLW link

No­tice When Peo­ple Are Direc­tion­ally Correct

Chris_Leong14 Jan 2024 14:12 UTC
127 points
7 comments2 min readLW link

On De­stroy­ing the World

Chris_Leong28 Sep 2020 7:38 UTC
78 points
86 comments5 min readLW link

Challenges with Break­ing into MIRI-Style Research

Chris_Leong17 Jan 2022 9:23 UTC
75 points
15 comments3 min readLW link

The World Ac­cord­ing to Do­minic Cummings

Chris_Leong14 Apr 2020 5:05 UTC
69 points
14 comments7 min readLW link

Google “We Have No Moat, And Nei­ther Does OpenAI”

Chris_Leong4 May 2023 18:23 UTC
61 points
28 comments1 min readLW link
(www.semianalysis.com)

[Question] What’s the the­ory of im­pact for ac­ti­va­tion vec­tors?

Chris_Leong11 Feb 2024 7:34 UTC
57 points
12 comments1 min readLW link

In­ter­views on Im­prov­ing the AI Safety Pipeline

Chris_Leong7 Dec 2021 12:03 UTC
55 points
15 comments17 min readLW link

[Question] Can we get an AI to do our al­ign­ment home­work for us?

Chris_Leong26 Feb 2024 7:56 UTC
53 points
33 comments1 min readLW link

The Ham­mer and the Dance

Chris_Leong20 Mar 2020 16:09 UTC
48 points
5 comments1 min readLW link
(medium.com)

Should ra­tio­nal­ity be a move­ment?

Chris_Leong20 Jun 2019 23:09 UTC
48 points
13 comments3 min readLW link

Gen­eral Thoughts on Less Wrong

Chris_Leong3 Apr 2022 4:09 UTC
44 points
14 comments2 min readLW link

[Question] Does re­duc­ing the amount of RL for a given ca­pa­bil­ity level make AI safer?

Chris_Leong5 May 2024 17:04 UTC
43 points
21 comments1 min readLW link

The Sense-Mak­ing Web

Chris_Leong4 Jan 2021 6:17 UTC
41 points
21 comments6 min readLW link