[Linkpost] Towards a The­o­ret­i­cal Un­der­stand­ing of the ‘Rev­er­sal Curse’ via Train­ing Dynamics

Bogdan Ionut Cirstea11 May 2024 22:59 UTC
6 points
0 comments1 min readLW link
(arxiv.org)

How To Do Patch­ing Fast

Joseph Miller11 May 2024 20:13 UTC
44 points
8 comments4 min readLW link

Can we build a bet­ter Public Dou­ble­crux?

Raemon11 May 2024 19:21 UTC
53 points
6 comments4 min readLW link

[Question] How do I get bet­ter at D&D Sci?

FinalFormal211 May 2024 18:48 UTC
10 points
7 comments1 min readLW link

[Question] Re­sources for learn­ing about poise /​ grace­ful­ness?

David Gross11 May 2024 18:30 UTC
14 points
1 comment1 min readLW link

New in­tro text­book on AIXI

Alex_Altair11 May 2024 18:18 UTC
47 points
8 comments1 min readLW link

Ques­tions are usu­ally too cheap

Nathan Young11 May 2024 13:00 UTC
57 points
19 comments6 min readLW link
(nathanpmyoung.substack.com)

dead post 2

David Chapel11 May 2024 9:33 UTC
−6 points
4 comments1 min readLW link

[Question] Ethics and prospects of AI re­lated jobs?

dr_s11 May 2024 9:31 UTC
10 points
8 comments1 min readLW link

Should I Finish My Bach­e­lor’s De­gree?

Zack_M_Davis11 May 2024 5:17 UTC
27 points
14 comments6 min readLW link
(zackmdavis.net)

Cus­tom Au­dio Switch Box

jefftk11 May 2024 2:40 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

MATS Win­ter 2023-24 Retrospective

11 May 2024 0:09 UTC
87 points
28 comments49 min readLW link

Ap­ply­ing re­fusal-vec­tor ab­la­tion to a Llama 3 70B agent

Simon Lermen11 May 2024 0:08 UTC
51 points
14 comments7 min readLW link

Qualia

A*10 May 2024 19:11 UTC
3 points
0 comments1 min readLW link

The Align­ment Prob­lem No One Is Talk­ing About

James Stephen Brown10 May 2024 18:34 UTC
10 points
10 comments2 min readLW link
(nonzerosum.games)

Pas­cal’s Mug­ging and the Order of Quantification

SorenJ10 May 2024 18:34 UTC
11 points
3 comments2 min readLW link

New to this community

kjsisco10 May 2024 18:34 UTC
1 point
1 comment1 min readLW link

dead post 1

David Chapel10 May 2024 18:34 UTC
−14 points
1 comment1 min readLW link

Pod­cast with Yoshua Ben­gio on Why AI Labs are “Play­ing Dice with Hu­man­ity’s Fu­ture”

garrison10 May 2024 17:23 UTC
41 points
0 comments2 min readLW link
(garrisonlovely.substack.com)

(Geo­met­ri­cally) Max­i­mal Lot­tery-Lot­ter­ies Are Prob­a­bly Not Unique

Lorxus10 May 2024 16:00 UTC
16 points
1 comment14 min readLW link

What do you value ?

Akram Choudhary10 May 2024 15:34 UTC
3 points
1 comment2 min readLW link

[Question] Do you know of lists of p(doom)s/​AI fore­casts/​ AI quotes?

Nathan Young10 May 2024 11:47 UTC
8 points
2 comments1 min readLW link

[Question] Have any par­ties in the cur­rent Euro­pean Par­li­a­men­tary Elec­tion made pub­lic state­ments on AI?

MondSemmel10 May 2024 10:22 UTC
9 points
0 comments1 min readLW link

AI and Chem­i­cal, Biolog­i­cal, Ra­diolog­i­cal, & Nu­clear Hazards: A Reg­u­la­tory Review

10 May 2024 8:41 UTC
7 points
1 comment10 min readLW link

short­est god­damn bayes guide ever

lemonhope10 May 2024 7:06 UTC
54 points
8 comments1 min readLW link

Lin­ear in­fra-Bayesian Bandits

Vanessa Kosoy10 May 2024 6:41 UTC
40 points
5 comments1 min readLW link
(arxiv.org)

Why Care About Nat­u­ral La­tents?

9 May 2024 23:14 UTC
56 points
3 comments5 min readLW link

What I learned from do­ing Quiz Bowl

Jacob G-W9 May 2024 21:05 UTC
10 points
0 comments6 min readLW link
(jacobgw.com)

My the­sis (Al­gorith­mic Bayesian Episte­mol­ogy) ex­plained in more depth

Eric Neyman9 May 2024 19:43 UTC
82 points
4 comments27 min readLW link
(ericneyman.wordpress.com)

Dyslucksia

Shoshannah Tekofsky9 May 2024 19:21 UTC
156 points
45 comments6 min readLW link

Has Gen­er­a­tive AI Already Peaked? - Computerphile

daviddelauba9 May 2024 18:27 UTC
7 points
2 comments1 min readLW link
(www.youtube.com)

[Question] Thoughts on the rel­a­tive eco­nomic benefits of polyamorous re­la­tion­ships?

Oliver Kuperman9 May 2024 18:26 UTC
−1 points
4 comments1 min readLW link

AI Safety Strate­gies Landscape

Charbel-Raphaël9 May 2024 17:33 UTC
34 points
1 comment42 min readLW link

Knowl­edge Base 9: In­creas­ing intelligence

iwis9 May 2024 15:46 UTC
−5 points
0 comments1 min readLW link

We might be miss­ing some key fea­ture of AI take­off; it’ll prob­a­bly seem like “we could’ve seen this com­ing”

Lukas_Gloor9 May 2024 15:43 UTC
100 points
36 comments5 min readLW link

Four Un­re­lated Is Over

jefftk9 May 2024 14:50 UTC
14 points
2 comments1 min readLW link
(www.jefftk.com)

AI #63: In­tro­duc­ing Alpha Fold 3

Zvi9 May 2024 14:20 UTC
33 points
2 comments28 min readLW link
(thezvi.wordpress.com)

I Got 95 Th­e­ses But a Glitch Ain’t One

Zvi9 May 2024 14:10 UTC
16 points
0 comments16 min readLW link
(thezvi.wordpress.com)

The Hu­man’s Role in Mesa Optimization

silentbob9 May 2024 12:07 UTC
5 points
0 comments2 min readLW link

Vi­su­al­iz­ing neu­ral net­work planning

9 May 2024 6:40 UTC
4 points
0 comments5 min readLW link

Fore­cast­ing: the way I think about it

Molly9 May 2024 0:49 UTC
41 points
4 comments3 min readLW link
(cuttyshark.substack.com)

some thoughts on LessOnline

Raemon8 May 2024 23:17 UTC
58 points
5 comments5 min readLW link

Zero-Sum Defeats Nash Equilibrium

Richard Henage8 May 2024 19:49 UTC
−6 points
4 comments3 min readLW link

Se­man­tic Disagree­ment of Sleep­ing Beauty Problem

Ape in the coat8 May 2024 19:09 UTC
18 points
10 comments8 min readLW link

Is There a Power Play Over­hang?

crispweed8 May 2024 17:39 UTC
3 points
0 comments1 min readLW link
(upcoder.com)

Ex­pe­rience Switch­ing to Right Shoulder Round

jefftk8 May 2024 17:30 UTC
13 points
1 comment4 min readLW link
(www.jefftk.com)

[Question] How do top AI labs vet ar­chi­tec­ture/​al­gorithm changes?

Jemal Young8 May 2024 16:47 UTC
3 points
5 comments1 min readLW link

How to be an am­a­teur polyglot

arisAlexis8 May 2024 15:08 UTC
66 points
16 comments7 min readLW link

Dat­ing Roundup #3: Third Time’s the Charm

Zvi8 May 2024 13:30 UTC
45 points
28 comments39 min readLW link
(thezvi.wordpress.com)

Nav­i­gat­ing LLM em­bed­ding spaces us­ing archetype-based directions

mwatkins8 May 2024 5:54 UTC
16 points
4 comments28 min readLW link