RSS

abramdemski

Karma: 19,530

Align­ment Pro­posal: Ad­ver­sar­i­ally Ro­bust Aug­men­ta­tion and Distillation

25 May 2025 12:58 UTC
54 points
47 comments13 min readLW link

Events: De­bate & Fic­tion Project

abramdemski16 May 2025 21:51 UTC
39 points
1 comment1 min readLW link

Un­der­stand­ing Trust: Overview Presentations

abramdemski16 Apr 2025 18:08 UTC
22 points
0 comments1 min readLW link

Un­der­stand­ing Trust—Overview Presentations

abramdemski16 Apr 2025 18:05 UTC
13 points
0 comments1 min readLW link

Dream, Truth, & Good

abramdemski24 Feb 2025 16:59 UTC
50 points
11 comments4 min readLW link

Judge­ments: Merg­ing Pre­dic­tion & Evidence

abramdemski23 Feb 2025 19:35 UTC
103 points
5 comments6 min readLW link

[Question] Have LLMs Gen­er­ated Novel In­sights?

23 Feb 2025 18:22 UTC
160 points
41 comments2 min readLW link

Anti-Slop In­ter­ven­tions?

abramdemski4 Feb 2025 19:50 UTC
76 points
33 comments6 min readLW link

Lec­ture Series on Tiling Agents #2

abramdemski20 Jan 2025 21:02 UTC
16 points
0 comments1 min readLW link

Lec­ture Series on Tiling Agents

abramdemski14 Jan 2025 21:34 UTC
38 points
14 comments1 min readLW link

Why Don’t We Just… Shog­goth+Face+Para­phraser?

19 Nov 2024 20:53 UTC
152 points
58 comments14 min readLW link

AI Craftsmanship

abramdemski11 Nov 2024 22:17 UTC
66 points
7 comments4 min readLW link

o1 is a bad idea

abramdemski11 Nov 2024 21:20 UTC
162 points
39 comments2 min readLW link

Seek­ing Collaborators

abramdemski1 Nov 2024 17:13 UTC
62 points
15 comments7 min readLW link

Com­plete Feedback

abramdemski1 Nov 2024 16:58 UTC
25 points
8 comments3 min readLW link

[Question] Why is o1 so de­cep­tive?

abramdemski27 Sep 2024 17:27 UTC
183 points
24 comments3 min readLW link

For­mal­iz­ing the In­for­mal (event in­vite)

abramdemski10 Sep 2024 19:22 UTC
42 points
0 comments1 min readLW link

In Defense of Open-Minded UDT

abramdemski12 Aug 2024 18:27 UTC
79 points
28 comments11 min readLW link

Leav­ing MIRI, Seek­ing Funding

abramdemski8 Aug 2024 18:32 UTC
264 points
19 comments2 min readLW link

Cir­cu­lar Reasoning

abramdemski5 Aug 2024 18:10 UTC
91 points
37 comments8 min readLW link