Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
ProgramCrafter
Karma:
88
All
Posts
Comments
New
Top
Old
User-inclination-guessing algorithms: registering a goal
ProgramCrafter
20 Mar 2024 15:55 UTC
2
points
0
comments
2
min read
LW
link
ProgramCrafter’s Shortform
ProgramCrafter
21 Jul 2023 5:26 UTC
2
points
16
comments
1
min read
LW
link
LLM misalignment can probably be found without manual prompt engineering
ProgramCrafter
8 Jul 2023 14:35 UTC
1
point
0
comments
1
min read
LW
link
[Question]
Does object permanence of simulacrum affect LLMs’ reasoning?
ProgramCrafter
19 Apr 2023 16:28 UTC
1
point
1
comment
1
min read
LW
link
The frozen neutrality
ProgramCrafter
1 Apr 2023 12:58 UTC
3
points
0
comments
3
min read
LW
link
Proposal on AI evaluation: false-proving
ProgramCrafter
31 Mar 2023 12:12 UTC
1
point
2
comments
1
min read
LW
link
How AI could workaround goals if rated by people
ProgramCrafter
19 Mar 2023 15:51 UTC
1
point
1
comment
1
min read
LW
link
Back to top