
ProgramCrafter

Karma: 88

User-inclination-guessing algorithms: registering a goal

ProgramCrafter · 20 Mar 2024 15:55 UTC
2 points
0 comments · 2 min read · LW link

ProgramCrafter’s Shortform

ProgramCrafter · 21 Jul 2023 5:26 UTC
2 points
16 comments · 1 min read · LW link

LLM misalignment can probably be found without manual prompt engineering

ProgramCrafter · 8 Jul 2023 14:35 UTC
1 point
0 comments · 1 min read · LW link

[Question] Does object permanence of simulacrum affect LLMs’ reasoning?

ProgramCrafter · 19 Apr 2023 16:28 UTC
1 point
1 comment · 1 min read · LW link

The frozen neutrality

ProgramCrafter · 1 Apr 2023 12:58 UTC
3 points
0 comments · 3 min read · LW link

Proposal on AI evaluation: false-proving

ProgramCrafter · 31 Mar 2023 12:12 UTC
1 point
2 comments · 1 min read · LW link

How AI could workaround goals if rated by people

ProgramCrafter · 19 Mar 2023 15:51 UTC
1 point
1 comment · 1 min read · LW link