RSS

peligrietzer

Karma: 459

Un­der­stand­ing and con­trol­ling a maze-solv­ing policy network

11 Mar 2023 18:59 UTC
284 points
13 comments22 min readLW link

Pre­dic­tions for shard the­ory mechanis­tic in­ter­pretabil­ity results

1 Mar 2023 5:16 UTC
94 points
9 comments5 min readLW link

[Si­mu­la­tors sem­i­nar se­quence] #2 Semiotic physics—revamped

27 Feb 2023 0:25 UTC
20 points
22 comments13 min readLW link

[Si­mu­la­tors sem­i­nar se­quence] #1 Back­ground & shared assumptions

2 Jan 2023 23:48 UTC
46 points
4 comments3 min readLW link

peli­gri­et­zer’s Shortform

peligrietzer1 Dec 2022 0:51 UTC
2 points
2 comments1 min readLW link

A Short Dialogue on the Mean­ing of Re­ward Functions

19 Nov 2022 21:04 UTC
42 points
0 comments3 min readLW link