RSS

Chris_Leong

Karma: 6,642

“You’re the most beau­tiful girl in the world” and Wittgen­stei­nian Lan­guage Games

Chris_Leong20 Apr 2024 14:54 UTC
6 points
15 comments1 min readLW link

The ar­gu­ment for near-term hu­man dis­em­pow­er­ment through AI

Chris_Leong16 Apr 2024 4:50 UTC
19 points
2 comments1 min readLW link
(link.springer.com)

Re­v­erse Reg­u­la­tory Capture

Chris_Leong11 Apr 2024 2:40 UTC
12 points
3 comments1 min readLW link

On the Con­fu­sion be­tween In­ner and Outer Misalignment

Chris_Leong25 Mar 2024 11:59 UTC
17 points
10 comments1 min readLW link

The Best Es­say (Paul Gra­ham)

Chris_Leong11 Mar 2024 19:25 UTC
25 points
2 comments1 min readLW link
(paulgraham.com)

[Question] Can we get an AI to do our al­ign­ment home­work for us?

Chris_Leong26 Feb 2024 7:56 UTC
53 points
33 comments1 min readLW link

[Question] What’s the the­ory of im­pact for ac­ti­va­tion vec­tors?

Chris_Leong11 Feb 2024 7:34 UTC
57 points
12 comments1 min readLW link

No­tice When Peo­ple Are Direc­tion­ally Correct

Chris_Leong14 Jan 2024 14:12 UTC
127 points
7 comments2 min readLW link

Are Me­tac­u­lus AI Timelines In­con­sis­tent?

Chris_Leong2 Jan 2024 6:47 UTC
16 points
7 comments2 min readLW link

Ran­dom Mus­ings on The­ory of Im­pact for Ac­ti­va­tion Vectors

Chris_Leong7 Dec 2023 13:07 UTC
8 points
0 comments1 min readLW link

Good­hart’s Law Ex­am­ple: Train­ing Ver­ifiers to Solve Math Word Problems

Chris_Leong25 Nov 2023 0:53 UTC
27 points
2 comments1 min readLW link
(arxiv.org)

Up­com­ing Feed­back Op­por­tu­nity on Dual-Use Foun­da­tion Models

Chris_Leong2 Nov 2023 4:28 UTC
3 points
0 comments1 min readLW link

On Hav­ing No Clue

Chris_Leong1 Nov 2023 1:36 UTC
20 points
11 comments1 min readLW link

Is Yann LeCun straw­man­ning AI x-risks?

Chris_Leong19 Oct 2023 11:35 UTC
25 points
4 comments1 min readLW link

Don’t Dis­miss Sim­ple Align­ment Approaches

Chris_Leong7 Oct 2023 0:35 UTC
127 points
8 comments4 min readLW link

[Question] What ev­i­dence is there of LLM’s con­tain­ing world mod­els?

Chris_Leong4 Oct 2023 14:33 UTC
17 points
17 comments1 min readLW link

The Role of Groups in the Pro­gres­sion of Hu­man Understanding

Chris_Leong27 Sep 2023 15:09 UTC
11 points
0 comments2 min readLW link

The Flow-Through Fallacy

Chris_Leong13 Sep 2023 4:28 UTC
20 points
7 comments1 min readLW link

Char­i­ots of Philo­soph­i­cal Fire

Chris_Leong26 Aug 2023 0:52 UTC
12 points
0 comments1 min readLW link
(l.facebook.com)

Call for Papers on Global AI Gover­nance from the UN

Chris_Leong20 Aug 2023 8:56 UTC
19 points
0 comments1 min readLW link
(www.linkedin.com)