[Linkpost] Towards a Theoretical Understanding of the ‘Reversal Curse’ via Training Dynamics

Bogdan Ionut Cirstea, 11 May 2024 22:59 UTC
1 point
0 comments, 1 min read, LW link
(arxiv.org)

How To Do Patching Fast

Joseph Miller, 11 May 2024 20:13 UTC
27 points
3 comments, 4 min read, LW link

Can we build a better Public Doublecrux?

Raemon, 11 May 2024 19:21 UTC
51 points
4 comments, 4 min read, LW link

[Question] How do I get better at D&D Sci?

FinalFormal2, 11 May 2024 18:48 UTC
9 points
5 comments, 1 min read, LW link

[Question] Resources for learning about poise / gracefulness?

David Gross, 11 May 2024 18:30 UTC
14 points
0 comments, 1 min read, LW link

New intro textbook on AIXI

Alex_Altair, 11 May 2024 18:18 UTC
39 points
3 comments, 1 min read, LW link

Questions are usually too cheap

Nathan Young, 11 May 2024 13:00 UTC
46 points
8 comments, 6 min read, LW link
(nathanpmyoung.substack.com)

[Question] Ethics and prospects of AI related jobs?

dr_s, 11 May 2024 9:31 UTC
10 points
8 comments, 1 min read, LW link

Applying refusal-vector ablation to a Llama 3 70B agent

Simon Lermen, 11 May 2024 0:08 UTC
39 points
6 comments, 7 min read, LW link

The Alignment Problem No One Is Talking About

James Stephen Brown, 10 May 2024 18:34 UTC
10 points
2 comments, 2 min read, LW link
(nonzerosum.games)

Pascal’s Mugging and the Order of Quantification

Mascal's Pugging, 10 May 2024 18:34 UTC
11 points
3 comments, 2 min read, LW link

Podcast with Yoshua Bengio on Why AI Labs are “Playing Dice with Humanity’s Future”

garrison, 10 May 2024 17:23 UTC
41 points
0 comments, 1 min read, LW link
(garrisonlovely.substack.com)

(Geometrically) Maximal Lottery-Lotteries Are Probably Not Unique

Lorxus, 10 May 2024 16:00 UTC
15 points
1 comment, 14 min read, LW link

What do you value?

Akram Choudhary, 10 May 2024 15:34 UTC
3 points
1 comment, 2 min read, LW link

[Question] Do you know of lists of p(doom)s/AI forecasts/AI quotes?

Nathan Young, 10 May 2024 11:47 UTC
7 points
2 comments, 1 min read, LW link

AI and Chemical, Biological, Radiological, & Nuclear Hazards: A Regulatory Review

10 May 2024 8:41 UTC
7 points
1 comment, 9 min read, LW link

shortest goddamn bayes guide ever

lukehmiles, 10 May 2024 7:06 UTC
33 points
8 comments, 1 min read, LW link

Linear infra-Bayesian Bandits

Vanessa Kosoy, 10 May 2024 6:41 UTC
29 points
2 comments, 1 min read, LW link
(arxiv.org)

Why Care About Natural Latents?

9 May 2024 23:14 UTC
50 points
3 comments, 5 min read, LW link

What I learned from doing Quiz Bowl

Jacob G-W, 9 May 2024 21:05 UTC
4 points
0 comments, 6 min read, LW link
(jacobgw.com)