Michael Tontchev

Karma: 175

GPT-4 can catch subtle cross-language translation mistakes

Michael Tontchev · 27 Jul 2023 1:39 UTC
7 points
1 comment · 1 min read

[Question] Do you speed up capabilities when you do AI integrations and consume overhangs?

Michael Tontchev · 20 Jul 2023 6:40 UTC
6 points
1 comment · 1 min read

[Question] Links to discussions on social equilibrium and human value after (aligned) super-AI?

Michael Tontchev · 8 Jul 2023 1:01 UTC
7 points
3 comments · 1 min read

Outreach success: Intro to AI risk that has been successful

Michael Tontchev · 1 Jun 2023 23:12 UTC
83 points
8 comments · 74 min read
(medium.com)

A rough model for P(AI doom)

Michael Tontchev · 31 May 2023 8:58 UTC
0 points
1 comment · 2 min read

Alignment solutions for weak AI don’t (necessarily) scale to strong AI

Michael Tontchev · 25 May 2023 8:26 UTC
6 points
0 comments · 5 min read

Unaligned stable loops emerge at scale

Michael Tontchev · 6 Apr 2023 2:15 UTC
9 points
8 comments · 4 min read

ChatGPT’s “fuzzy alignment” isn’t evidence of AGI alignment: the banana test

Michael Tontchev · 23 Mar 2023 7:12 UTC
23 points
6 comments · 4 min read

A method for empirical back-testing of AI’s ability to self-improve

Michael Tontchev · 21 Mar 2023 20:24 UTC
3 points
0 comments · 2 min read

PaperclipGPT(-4)

Michael Tontchev · 14 Mar 2023 22:03 UTC
7 points
0 comments · 11 min read