RSS

Olli Järviniemi

Karma: 544

Urg­ing an In­ter­na­tional AI Treaty: An Open Letter

Olli Järviniemi31 Oct 2023 11:26 UTC
48 points
2 comments1 min readLW link
(aitreaty.org)

In­stru­men­tal de­cep­tion and ma­nipu­la­tion in LLMs—a case study

Olli Järviniemi24 Feb 2024 2:07 UTC
39 points
13 comments12 min readLW link

Take­aways from cal­ibra­tion training

Olli Järviniemi29 Jan 2023 19:09 UTC
38 points
1 comment3 min readLW link

Lan­guage mod­els are not in­her­ently safe

Olli Järviniemi7 Mar 2023 21:15 UTC
11 points
1 comment3 min readLW link