miles
Karma: 84
Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought
miles, 11 Mar 2024 23:46 UTC, 16 points, 0 comments, 1 min read, LW link (arxiv.org)
Some Quick Follow-Up Experiments to “Taken out of context: On measuring situational awareness in LLMs”
miles, 3 Oct 2023 2:22 UTC, 31 points, 0 comments, 9 min read, LW link
Unfaithful Explanations in Chain-of-Thought Prompting
miles, 3 Jun 2023 0:22 UTC, 38 points, 8 comments, 7 min read, LW link