Michael Tontchev
Karma: 159
GPT-4 can catch subtle cross-language translation mistakes
Michael Tontchev
27 Jul 2023 1:39 UTC · 7 points · 1 comment · 1 min read
[Question] Do you speed up capabilities when you do AI integrations and consume overhangs?
Michael Tontchev
20 Jul 2023 6:40 UTC · 6 points · 1 comment · 1 min read
[Question] Links to discussions on social equilibrium and human value after (aligned) super-AI?
Michael Tontchev
8 Jul 2023 1:01 UTC · 7 points · 3 comments · 1 min read
Outreach success: Intro to AI risk that has been successful
Michael Tontchev
1 Jun 2023 23:12 UTC · 82 points · 8 comments · 74 min read · (medium.com)
A rough model for P(AI doom)
Michael Tontchev
31 May 2023 8:58 UTC · 0 points · 1 comment · 2 min read
Alignment solutions for weak AI don’t (necessarily) scale to strong AI
Michael Tontchev
25 May 2023 8:26 UTC · 6 points · 0 comments · 5 min read
Unaligned stable loops emerge at scale
Michael Tontchev
6 Apr 2023 2:15 UTC · 9 points · 8 comments · 4 min read
ChatGPT’s “fuzzy alignment” isn’t evidence of AGI alignment: the banana test
Michael Tontchev
23 Mar 2023 7:12 UTC · 23 points · 6 comments · 4 min read
A method for empirical back-testing of AI’s ability to self-improve
Michael Tontchev
21 Mar 2023 20:24 UTC · 3 points · 0 comments · 2 min read
PaperclipGPT(-4)
Michael Tontchev
14 Mar 2023 22:03 UTC · 7 points · 0 comments · 11 min read