Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Austin Meek
Karma:
59
All
Posts
Comments
New
Top
Old
Inducing human-like biases in moral reasoning LMs
Artyom Karpov
,
Austin Meek
,
Bogdan Ionut Cirstea
and
SCho
20 Feb 2024 16:28 UTC
18
points
3
comments
14
min read
LW
link
Paper: Understanding and Controlling a Maze-Solving Policy Network
TurnTrout
,
Ulisse Mini
,
peligrietzer
,
mrinank_sharma
,
Austin Meek
,
Monte M
and
lisathiergart
13 Oct 2023 1:38 UTC
69
points
0
comments
1
min read
LW
link
(arxiv.org)
Back to top