RSS

viluon

Karma: 48

Ro­bust­ness of Model-Graded Eval­u­a­tions and Au­to­mated Interpretability

15 Jul 2023 19:12 UTC
43 points
5 comments9 min readLW link