Garrett Baker
Karma:
3,170
Independent alignment researcher
On Complexity Science
Garrett Baker
5 Apr 2024 2:24 UTC
50
points
19
comments
4
min read
So You Created a Sociopath—New Book Announcement!
Garrett Baker
1 Apr 2024 18:02 UTC
46
points
3
comments
1
min read
Announcing Suffering For Good
Garrett Baker
1 Apr 2024 17:08 UTC
70
points
5
comments
1
min read
Neuroscience and Alignment
Garrett Baker
18 Mar 2024 21:09 UTC
39
points
25
comments
2
min read
Epoch wise critical periods, and singular learning theory
Garrett Baker
14 Dec 2023 20:55 UTC
9
points
1
comment
5
min read
A bet on critical periods in neural networks
kave
and
Garrett Baker
6 Nov 2023 23:21 UTC
24
points
1
comment
6
min read
When and why should you use the Kelly criterion?
Garrett Baker
,
philh
and
River
5 Nov 2023 23:26 UTC
26
points
25
comments
16
min read
Singular learning theory and bridging from ML to brain emulations
kave
and
Garrett Baker
1 Nov 2023 21:31 UTC
26
points
16
comments
29
min read
My hopes for alignment: Singular learning theory and whole brain emulation
Garrett Baker
25 Oct 2023 18:31 UTC
57
points
5
comments
12
min read
AI presidents discuss AI alignment agendas
TurnTrout
and
Garrett Baker
9 Sep 2023 18:55 UTC
216
points
22
comments
1
min read
LW
link
(www.youtube.com)
Activation additions in a small residual network
Garrett Baker
22 May 2023 20:28 UTC
22
points
4
comments
3
min read
Collective Identity
NicholasKees
,
ukc10014
and
Garrett Baker
18 May 2023 9:00 UTC
59
points
12
comments
8
min read
Activation additions in a simple MNIST network
Garrett Baker
18 May 2023 2:49 UTC
26
points
0
comments
2
min read
Value drift threat models
Garrett Baker
12 May 2023 23:03 UTC
27
points
4
comments
5
min read
[Question]
What constraints does deep learning place on alignment plans?
Garrett Baker
3 May 2023 20:40 UTC
9
points
0
comments
1
min read
Pessimistic Shard Theory
Garrett Baker
25 Jan 2023 0:59 UTC
72
points
13
comments
3
min read
Performing an SVD on a time-series matrix of gradient updates on an MNIST network produces 92.5 singular values
Garrett Baker
21 Dec 2022 0:44 UTC
9
points
10
comments
5
min read
Don’t design agents which exploit adversarial inputs
TurnTrout
and
Garrett Baker
18 Nov 2022 1:48 UTC
69
points
64
comments
12
min read
A framework and open questions for game theoretic shard modeling
Garrett Baker
21 Oct 2022 21:40 UTC
11
points
4
comments
4
min read
Taking the parameters which seem to matter and rotating them until they don’t
Garrett Baker
26 Aug 2022 18:26 UTC
120
points
48
comments
1
min read