RSS

Owain_Evans(Owain Evans)

Karma: 2,212

https://​​owainevans.github.io/​​

Model Mis-speci­fi­ca­tion and In­verse Re­in­force­ment Learning

9 Nov 2018 15:33 UTC
33 points
3 comments16 min readLW link

Ma­chine Learn­ing Pro­jects on IDA

24 Jun 2019 18:38 UTC
49 points
3 comments2 min readLW link

Neu­ral nets as a model for how hu­mans make and un­der­stand vi­sual art

Owain_Evans9 Nov 2019 16:53 UTC
28 points
7 comments2 min readLW link
(owainevans.github.io)

Up­date on Ought’s ex­per­i­ments on fac­tored eval­u­a­tion of arguments

Owain_Evans12 Jan 2020 21:20 UTC
29 points
1 comment1 min readLW link
(ought.org)

Quan­tify­ing House­hold Trans­mis­sion of COVID-19

Owain_Evans6 Jul 2020 11:19 UTC
35 points
4 comments4 min readLW link

AI Safety Re­search Pro­ject Ideas

Owain_Evans21 May 2021 13:39 UTC
58 points
2 comments3 min readLW link

How truth­ful is GPT-3? A bench­mark for lan­guage models

Owain_Evans16 Sep 2021 10:09 UTC
58 points
24 comments6 min readLW link

Truth­ful AI: Devel­op­ing and gov­ern­ing AI that does not lie

18 Oct 2021 18:37 UTC
82 points
9 comments10 min readLW link

AMA on Truth­ful AI: Owen Cot­ton-Bar­ratt, Owain Evans & co-authors

Owain_Evans22 Oct 2021 16:23 UTC
31 points
15 comments1 min readLW link

The Ra­tion­al­ists of the 1950s (and be­fore) also called them­selves “Ra­tion­al­ists”

Owain_Evans28 Nov 2021 20:17 UTC
187 points
32 comments3 min readLW link1 review

Lives of the Cam­bridge poly­math geniuses

Owain_Evans25 Jan 2022 4:45 UTC
107 points
40 comments3 min readLW link

How do new mod­els from OpenAI, Deep­Mind and An­thropic perform on Truth­fulQA?

Owain_Evans26 Feb 2022 12:46 UTC
44 points
3 comments11 min readLW link

Paper: Teach­ing GPT3 to ex­press un­cer­tainty in words

Owain_Evans31 May 2022 13:27 UTC
97 points
7 comments4 min readLW link

Paper: Fore­cast­ing world events with neu­ral nets

1 Jul 2022 19:40 UTC
39 points
3 comments4 min readLW link

Paper: On mea­sur­ing situ­a­tional aware­ness in LLMs

4 Sep 2023 12:54 UTC
106 points
16 comments5 min readLW link
(arxiv.org)

Paper: Tell, Don’t Show- Declar­a­tive facts in­fluence how LLMs generalize

19 Dec 2023 19:14 UTC
45 points
4 comments6 min readLW link
(arxiv.org)

How do LLMs give truth­ful an­swers? A dis­cus­sion of LLM vs. hu­man rea­son­ing, en­sem­bles & parrots

Owain_Evans28 Mar 2024 2:34 UTC
26 points
0 comments9 min readLW link