We’re studying a net with the structure I commented below, trained via PPO. I’d be happy to discuss more at EAG.
We're not posting much publicly right now, so that we can (a) work on the research sprint and (b) let people preregister credences in various mechint / generalization propositions, so that they can calibrate and see how their opinions evolve over time.