RSS

Logan Riggs

Karma: 1,655

Pro­posal for In­duc­ing Steganog­ra­phy in LMs

Logan Riggs12 Jan 2023 22:15 UTC
18 points
2 comments2 min readLW link

[Si­mu­la­tors sem­i­nar se­quence] #2 Semiotic physics

3 Jan 2023 18:01 UTC
17 points
14 comments11 min readLW link

[Si­mu­la­tors sem­i­nar se­quence] #1 Back­ground & shared assumptions

2 Jan 2023 23:48 UTC
40 points
4 comments3 min readLW link

Re­sults from a sur­vey on tool use and work­flows in al­ign­ment research

19 Dec 2022 15:19 UTC
69 points
2 comments19 min readLW link