Hi, thank you for pointing this out! I realized that the repo I shared is now outdated and as such have removed the originally attached github link. @Tim Hua open sourced his eval aware model organism code in https://github.com/tim-hua-01/steering-eval-awareness-public, I would recommend taking a look there first if you are generally interested in the evaluation aware model organism. If you are interested specifically in our experiments, please dm me.
Hi, thank you for pointing this out! I realized that the repo I shared is now outdated and as such have removed the originally attached github link. @Tim Hua open sourced his eval aware model organism code in https://github.com/tim-hua-01/steering-eval-awareness-public, I would recommend taking a look there first if you are generally interested in the evaluation aware model organism. If you are interested specifically in our experiments, please dm me.
still a 404 on that link—it has a full stop at the end which should be removed
Fixed the link, thanks for pointing this out.