Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Thomas Kwa
Karma:
1,483
Doing alignment research
with Vivek Hebbar’s team at MIRI
.
All
Posts
Comments
New
Top
Old
Failure modes in a shard theory alignment plan
Thomas Kwa
27 Sep 2022 22:34 UTC
24
points
2
comments
7
min read
LW
link
Utility functions and probabilities are entangled
Thomas Kwa
26 Jul 2022 5:36 UTC
13
points
5
comments
1
min read
LW
link
Deriving Conditional Expected Utility from Pareto-Efficient Decisions
Thomas Kwa
5 May 2022 3:21 UTC
24
points
1
comment
6
min read
LW
link
Most problems don’t differ dramatically in tractability (under certain assumptions)
Thomas Kwa
4 May 2022 0:05 UTC
8
points
0
comments
3
min read
LW
link
The case for turning glowfic into Sequences
Thomas Kwa
27 Apr 2022 6:58 UTC
72
points
24
comments
5
min read
LW
link
Mesa-utility functions might not be purely proxy goals
Thomas Kwa
22 Apr 2022 22:16 UTC
12
points
17
comments
1
min read
LW
link
[Question]
(When) do high-dimensional spaces have linear paths down to local minima?
Thomas Kwa
22 Apr 2022 15:35 UTC
12
points
8
comments
1
min read
LW
link
How dath ilan coordinates around solving alignment
Thomas Kwa
13 Apr 2022 4:22 UTC
47
points
37
comments
5
min read
LW
link
5 Tips for Good Hearting
Thomas Kwa
1 Apr 2022 19:47 UTC
25
points
10
comments
1
min read
LW
link
Can we simulate human evolution to create a somewhat aligned AGI?
Thomas Kwa
28 Mar 2022 22:55 UTC
21
points
7
comments
7
min read
LW
link
Jetlag, Nausea, and Diarrhea are Largely Optional
Thomas Kwa
21 Mar 2022 22:40 UTC
83
points
27
comments
2
min read
LW
link
The Box Spread Trick: Get rich slightly faster
Thomas Kwa
1 Sep 2020 21:41 UTC
40
points
44
comments
6
min read
LW
link
Thomas Kwa’s Bounty List
Thomas Kwa
13 Jun 2020 0:03 UTC
12
points
15
comments
1
min read
LW
link
[Question]
What past highly-upvoted posts are overrated today?
Thomas Kwa
9 Jun 2020 21:25 UTC
14
points
9
comments
1
min read
LW
link
[Question]
How to learn from a stronger rationalist in daily life?
Thomas Kwa
20 May 2020 4:55 UTC
16
points
11
comments
1
min read
LW
link
My experience with the “rationalist uncanny valley”
Thomas Kwa
23 Apr 2020 20:27 UTC
65
points
18
comments
5
min read
LW
link
Thomas Kwa’s Shortform
Thomas Kwa
22 Mar 2020 23:19 UTC
2
points
68
comments
1
min read
LW
link
Back to top