Thomas Kwa

Karma: 3,988

Was on Vivek Hebbar’s team at MIRI, now working with Adrià Garriga-Alonso on various empirical alignment projects.

I’m looking for projects in interpretability, activation engineering, and control/oversight; DM me if you’re interested in working with me.

Thomas Kwa’s MIRI research experience

2 Oct 2023 16:42 UTC
169 points
52 comments
1 min read
LW link

Jetlag, Nausea, and Diarrhea are Largely Optional

Thomas Kwa
21 Mar 2022 22:40 UTC
89 points
32 comments
2 min read
LW link

The case for turning glowfic into Sequences

Thomas Kwa
27 Apr 2022 6:58 UTC
86 points
28 comments
5 min read
LW link

Thomas Kwa’s research journal

23 Nov 2023 5:11 UTC
79 points
1 comment
6 min read
LW link

My experience with the “rationalist uncanny valley”

Thomas Kwa
23 Apr 2020 20:27 UTC
66 points
18 comments
5 min read
LW link

How dath ilan coordinates around solving alignment

Thomas Kwa
13 Apr 2022 4:22 UTC
64 points
45 comments
5 min read
LW link

[Question] Will quantum randomness affect the 2028 election?

24 Jan 2024 22:54 UTC
63 points
48 comments
1 min read
LW link

The Box Spread Trick: Get rich slightly faster

Thomas Kwa
1 Sep 2020 21:41 UTC
43 points
44 comments
6 min read
LW link