Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Chris_Leong
Karma:
6,642
All
Posts
Comments
New
Top
Old
Page
1
“You’re the most beautiful girl in the world” and Wittgensteinian Language Games
Chris_Leong
20 Apr 2024 14:54 UTC
6
points
15
comments
1
min read
LW
link
The argument for near-term human disempowerment through AI
Chris_Leong
16 Apr 2024 4:50 UTC
19
points
2
comments
1
min read
LW
link
(link.springer.com)
Reverse Regulatory Capture
Chris_Leong
11 Apr 2024 2:40 UTC
12
points
3
comments
1
min read
LW
link
On the Confusion between Inner and Outer Misalignment
Chris_Leong
25 Mar 2024 11:59 UTC
17
points
10
comments
1
min read
LW
link
The Best Essay (Paul Graham)
Chris_Leong
11 Mar 2024 19:25 UTC
25
points
2
comments
1
min read
LW
link
(paulgraham.com)
[Question]
Can we get an AI to do our alignment homework for us?
Chris_Leong
26 Feb 2024 7:56 UTC
53
points
33
comments
1
min read
LW
link
[Question]
What’s the theory of impact for activation vectors?
Chris_Leong
11 Feb 2024 7:34 UTC
57
points
12
comments
1
min read
LW
link
Notice When People Are Directionally Correct
Chris_Leong
14 Jan 2024 14:12 UTC
127
points
7
comments
2
min read
LW
link
Are Metaculus AI Timelines Inconsistent?
Chris_Leong
2 Jan 2024 6:47 UTC
16
points
7
comments
2
min read
LW
link
Random Musings on Theory of Impact for Activation Vectors
Chris_Leong
7 Dec 2023 13:07 UTC
8
points
0
comments
1
min read
LW
link
Goodhart’s Law Example: Training Verifiers to Solve Math Word Problems
Chris_Leong
25 Nov 2023 0:53 UTC
27
points
2
comments
1
min read
LW
link
(arxiv.org)
Upcoming Feedback Opportunity on Dual-Use Foundation Models
Chris_Leong
2 Nov 2023 4:28 UTC
3
points
0
comments
1
min read
LW
link
On Having No Clue
Chris_Leong
1 Nov 2023 1:36 UTC
20
points
11
comments
1
min read
LW
link
Is Yann LeCun strawmanning AI x-risks?
Chris_Leong
19 Oct 2023 11:35 UTC
25
points
4
comments
1
min read
LW
link
Don’t Dismiss Simple Alignment Approaches
Chris_Leong
7 Oct 2023 0:35 UTC
127
points
8
comments
4
min read
LW
link
[Question]
What evidence is there of LLM’s containing world models?
Chris_Leong
4 Oct 2023 14:33 UTC
17
points
17
comments
1
min read
LW
link
The Role of Groups in the Progression of Human Understanding
Chris_Leong
27 Sep 2023 15:09 UTC
11
points
0
comments
2
min read
LW
link
The Flow-Through Fallacy
Chris_Leong
13 Sep 2023 4:28 UTC
20
points
7
comments
1
min read
LW
link
Chariots of Philosophical Fire
Chris_Leong
26 Aug 2023 0:52 UTC
12
points
0
comments
1
min read
LW
link
(l.facebook.com)
Call for Papers on Global AI Governance from the UN
Chris_Leong
20 Aug 2023 8:56 UTC
19
points
0
comments
1
min read
LW
link
(www.linkedin.com)
Back to top
Next