Robert_AIZI
Karma: 1,143
Article Review: Google’s AlphaTensor
Robert_AIZI
12 Oct 2022 18:04 UTC
8
points
4
comments
10
min read
LW
link
Testing Ways to Bypass ChatGPT’s Safety Features
Robert_AIZI
5 Dec 2022 18:50 UTC
7
points
4
comments
5
min read
LW
link
(aizi.substack.com)
Log-odds are better than Probabilities
Robert_AIZI
12 Dec 2022 20:10 UTC
22
points
4
comments
4
min read
LW
link
(aizi.substack.com)
Article Review: Discovering Latent Knowledge (Burns, Ye, et al)
Robert_AIZI
22 Dec 2022 18:16 UTC
13
points
4
comments
6
min read
LW
link
(aizi.substack.com)
My Reservations about Discovering Latent Knowledge (Burns, Ye, et al)
Robert_AIZI
27 Dec 2022 17:27 UTC
50
points
0
comments
4
min read
LW
link
(aizi.substack.com)
How does GPT-3 spend its 175B parameters?
Robert_AIZI
13 Jan 2023 19:21 UTC
40
points
13
comments
6
min read
LW
link
(aizi.substack.com)
The Gallery for Painting Transformations—A GPT-3 Analogy
Robert_AIZI
19 Jan 2023 23:32 UTC
1
point
0
comments
6
min read
LW
link
(aizi.substack.com)
No Really, Attention is ALL You Need—Attention can do feedforward networks
Robert_AIZI
31 Jan 2023 18:48 UTC
29
points
7
comments
6
min read
LW
link
(aizi.substack.com)
Addendum: More Efficient FFNs via Attention
Robert_AIZI
6 Feb 2023 18:55 UTC
10
points
2
comments
5
min read
LW
link
(aizi.substack.com)
Explaining SolidGoldMagikarp by looking at it from random directions
Robert_AIZI
14 Feb 2023 14:54 UTC
8
points
0
comments
8
min read
LW
link
(aizi.substack.com)
Why do we assume there is a “real” shoggoth behind the LLM? Why not masks all the way down?
Robert_AIZI
9 Mar 2023 17:28 UTC
61
points
48
comments
2
min read
LW
link
GPT-4: What we (I) know about it
Robert_AIZI
15 Mar 2023 20:12 UTC
40
points
29
comments
12
min read
LW
link
(aizi.substack.com)
Three of my beliefs about upcoming AGI
Robert_AIZI
27 Mar 2023 20:27 UTC
6
points
0
comments
3
min read
LW
link
(aizi.substack.com)
Corrigibility, Self-Deletion, and Identical Strawberries
Robert_AIZI
28 Mar 2023 16:54 UTC
8
points
2
comments
6
min read
LW
link
(aizi.substack.com)
Early Results: Do LLMs complete false equations with false equations?
Robert_AIZI
30 Mar 2023 20:14 UTC
14
points
0
comments
4
min read
LW
link
(aizi.substack.com)
Invocations: The Other Capabilities Overhang?
Robert_AIZI
4 Apr 2023 13:38 UTC
29
points
4
comments
4
min read
LW
link
(aizi.substack.com)
Pre-registering a study
Robert_AIZI
7 Apr 2023 15:46 UTC
10
points
0
comments
6
min read
LW
link
(aizi.substack.com)
Research Report: Incorrectness Cascades
Robert_AIZI
14 Apr 2023 12:49 UTC
19
points
0
comments
10
min read
LW
link
(aizi.substack.com)
Study 1b: This One Weird Trick does NOT cause incorrectness cascades
Robert_AIZI
20 Apr 2023 18:10 UTC
5
points
0
comments
6
min read
LW
link
(aizi.substack.com)
The Toxoplasma of AGI Doom and Capabilities?
Robert_AIZI
24 Apr 2023 18:11 UTC
68
points
12
comments
1
min read
LW
link