RSS

Truth­ful AI

TagLast edit: 7 Apr 2022 16:40 UTC by Ruby

How do LLMs give truth­ful an­swers? A dis­cus­sion of LLM vs. hu­man rea­son­ing, en­sem­bles & parrots

Owain_Evans28 Mar 2024 2:34 UTC
26 points
0 comments9 min readLW link

Truth­ful­ness, stan­dards and credibility

Joe_Collman7 Apr 2022 10:31 UTC
12 points
2 comments32 min readLW link

A ten­sion be­tween two pro­saic al­ign­ment subgoals

Alex Lawsen 19 Mar 2023 14:07 UTC
31 points
8 comments1 min readLW link

Bench­mark Study #2: Truth­fulQA (Task, MCQ)

Bruce W. Lee6 Jan 2024 2:39 UTC
11 points
2 comments4 min readLW link
(arxiv.org)

Tall Tales at Differ­ent Scales: Eval­u­at­ing Scal­ing Trends For De­cep­tion In Lan­guage Models

8 Nov 2023 11:37 UTC
49 points
0 comments18 min readLW link
No comments.