RSS

Ansh Radhakrishnan

Karma: 560

An In­side View of AI Alignment

Ansh Radhakrishnan11 May 2022 2:16 UTC
32 points
2 comments2 min readLW link

RLHF

Ansh Radhakrishnan12 May 2022 21:18 UTC
18 points
5 comments5 min readLW link

The Bio An­chors Forecast

Ansh Radhakrishnan2 Jun 2022 1:32 UTC
12 points
0 comments3 min readLW link

Mea­sur­ing and Im­prov­ing the Faith­ful­ness of Model-Gen­er­ated Rea­son­ing

18 Jul 2023 16:36 UTC
109 points
13 comments6 min readLW link

An­thropic Fall 2023 De­bate Progress Update

Ansh Radhakrishnan28 Nov 2023 5:37 UTC
74 points
9 comments12 min readLW link

Scal­able Over­sight and Weak-to-Strong Gen­er­al­iza­tion: Com­pat­i­ble ap­proaches to the same problem

16 Dec 2023 5:49 UTC
71 points
3 comments6 min readLW link