Michael Tontchev

Karma: 176

GPT-4 can catch subtle cross-language translation mistakes

Michael Tontchev · Jul 27, 2023, 1:39 AM

7 points

1 comment · 1 min read · LW link

[Question] Do you speed up capabilities when you do AI integrations and consume overhangs?

Michael Tontchev · Jul 20, 2023, 6:40 AM

6 points

1 comment · 1 min read · LW link

[Question] Links to discussions on social equilibrium and human value after (aligned) super-AI?

Michael Tontchev · Jul 8, 2023, 1:01 AM

7 points

3 comments · 1 min read · LW link

Outreach success: Intro to AI risk that has been successful

Michael Tontchev · Jun 1, 2023, 11:12 PM

83 points

8 comments · 74 min read · LW link

(medium.com)

A rough model for P(AI doom)

Michael Tontchev · May 31, 2023, 8:58 AM

0 points

1 comment · 2 min read · LW link

Alignment solutions for weak AI don’t (necessarily) scale to strong AI

Michael Tontchev · May 25, 2023, 8:26 AM

6 points

0 comments · 5 min read · LW link

Unaligned stable loops emerge at scale

Michael Tontchev · Apr 6, 2023, 2:15 AM

9 points

8 comments · 4 min read · LW link

ChatGPT’s “fuzzy alignment” isn’t evidence of AGI alignment: the banana test

Michael Tontchev · Mar 23, 2023, 7:12 AM

23 points

6 comments · 4 min read · LW link

A method for empirical back-testing of AI’s ability to self-improve

Michael Tontchev · Mar 21, 2023, 8:24 PM

3 points

0 comments · 2 min read · LW link

PaperclipGPT(-4)

Michael Tontchev · Mar 14, 2023, 10:03 PM

7 points

0 comments · 11 min read · LW link