[Linkpost] Towards a Theoretical Understanding of the ‘Reversal Curse’ via Training Dynamics

Bogdan Ionut Cirstea11 May 2024 22:59 UTC

1 point

0 comments1 min readLW link

(arxiv.org)

How To Do Patching Fast

Joseph Miller11 May 2024 20:13 UTC

27 points

3 comments4 min readLW link

Can we build a better Public Doublecrux?

Raemon11 May 2024 19:21 UTC

51 points

4 comments4 min readLW link

[Question] How do I get better at D&D Sci?

FinalFormal211 May 2024 18:48 UTC

9 points

5 comments1 min readLW link

[Question] Resources for learning about poise / gracefulness?

David Gross11 May 2024 18:30 UTC

14 points

0 comments1 min readLW link

New intro textbook on AIXI

Alex_Altair11 May 2024 18:18 UTC

39 points

3 comments1 min readLW link

Questions are usually too cheap

Nathan Young11 May 2024 13:00 UTC

46 points

8 comments6 min readLW link

(nathanpmyoung.substack.com)

[Question] Ethics and prospects of AI related jobs?

dr_s11 May 2024 9:31 UTC

10 points

8 comments1 min readLW link

Applying refusal-vector ablation to a Llama 3 70B agent

Simon Lermen11 May 2024 0:08 UTC

39 points

6 comments7 min readLW link

The Alignment Problem No One Is Talking About

James Stephen Brown10 May 2024 18:34 UTC

10 points

2 comments2 min readLW link

(nonzerosum.games)

Pascal’s Mugging and the Order of Quantification

Mascal's Pugging10 May 2024 18:34 UTC

11 points

3 comments2 min readLW link

Podcast with Yoshua Bengio on Why AI Labs are “Playing Dice with Humanity’s Future”

garrison10 May 2024 17:23 UTC

41 points

0 comments1 min readLW link

(garrisonlovely.substack.com)

(Geometrically) Maximal Lottery-Lotteries Are Probably Not Unique

Lorxus10 May 2024 16:00 UTC

15 points

1 comment14 min readLW link

What do you value ?

Akram Choudhary10 May 2024 15:34 UTC

3 points

1 comment2 min readLW link

[Question] Do you know of lists of p(doom)s/AI forecasts/ AI quotes?

Nathan Young10 May 2024 11:47 UTC

7 points

2 comments1 min readLW link

AI and Chemical, Biological, Radiological, & Nuclear Hazards: A Regulatory Review

Elliot_Mckernon and Deric Cheng

10 May 2024 8:41 UTC

7 points

1 comment9 min readLW link

shortest goddamn bayes guide ever

lukehmiles10 May 2024 7:06 UTC

33 points

8 comments1 min readLW link

Linear infra-Bayesian Bandits

Vanessa Kosoy10 May 2024 6:41 UTC

29 points

2 comments1 min readLW link

(arxiv.org)

Why Care About Natural Latents?

johnswentworth and David Lorell

9 May 2024 23:14 UTC

50 points

3 comments5 min readLW link

What I learned from doing Quiz Bowl

Jacob G-W9 May 2024 21:05 UTC

4 points

0 comments6 min readLW link

(jacobgw.com)