abramdemski
Karma: 19,530
Alignment Proposal: Adversarially Robust Augmentation and Distillation
Cole Wyeth and abramdemski, 25 May 2025 12:58 UTC, 54 points, 47 comments, 13 min read

Events: Debate & Fiction Project
abramdemski, 16 May 2025 21:51 UTC, 39 points, 1 comment, 1 min read

Understanding Trust: Overview Presentations
abramdemski, 16 Apr 2025 18:08 UTC, 22 points, 0 comments, 1 min read

Understanding Trust—Overview Presentations
abramdemski, 16 Apr 2025 18:05 UTC, 13 points, 0 comments, 1 min read

Dream, Truth, & Good
abramdemski, 24 Feb 2025 16:59 UTC, 50 points, 11 comments, 4 min read

Judgements: Merging Prediction & Evidence
abramdemski, 23 Feb 2025 19:35 UTC, 103 points, 5 comments, 6 min read

[Question] Have LLMs Generated Novel Insights?
abramdemski and Cole Wyeth, 23 Feb 2025 18:22 UTC, 160 points, 41 comments, 2 min read

Anti-Slop Interventions?
abramdemski, 4 Feb 2025 19:50 UTC, 76 points, 33 comments, 6 min read

Lecture Series on Tiling Agents #2
abramdemski, 20 Jan 2025 21:02 UTC, 16 points, 0 comments, 1 min read

Lecture Series on Tiling Agents
abramdemski, 14 Jan 2025 21:34 UTC, 38 points, 14 comments, 1 min read

Why Don’t We Just… Shoggoth+Face+Paraphraser?
Daniel Kokotajlo and abramdemski, 19 Nov 2024 20:53 UTC, 152 points, 58 comments, 14 min read

AI Craftsmanship
abramdemski, 11 Nov 2024 22:17 UTC, 66 points, 7 comments, 4 min read

o1 is a bad idea
abramdemski, 11 Nov 2024 21:20 UTC, 162 points, 39 comments, 2 min read

Seeking Collaborators
abramdemski, 1 Nov 2024 17:13 UTC, 62 points, 15 comments, 7 min read

Complete Feedback
abramdemski, 1 Nov 2024 16:58 UTC, 25 points, 8 comments, 3 min read

[Question] Why is o1 so deceptive?
abramdemski, 27 Sep 2024 17:27 UTC, 183 points, 24 comments, 3 min read

Formalizing the Informal (event invite)
abramdemski, 10 Sep 2024 19:22 UTC, 42 points, 0 comments, 1 min read

In Defense of Open-Minded UDT
abramdemski, 12 Aug 2024 18:27 UTC, 79 points, 28 comments, 11 min read

Leaving MIRI, Seeking Funding
abramdemski, 8 Aug 2024 18:32 UTC, 264 points, 19 comments, 2 min read

Circular Reasoning
abramdemski, 5 Aug 2024 18:10 UTC, 91 points, 37 comments, 8 min read