ProLU: A Pareto Improvement for Sparse Autoencoders

Glen Taggart23 Apr 2024 14:09 UTC

−3 points

0 comments6 min readLW link

Subjective Questions Require Subjective information

Ben23 Apr 2024 13:16 UTC

7 points

1 comment4 min readLW link

Rejecting Television

Declan Molony23 Apr 2024 4:59 UTC

36 points

3 comments6 min readLW link

Take the wheel, Shoggoth! (Lesswrong is trying out changes to the frontpage algorithm)

Ruby and RobertM

23 Apr 2024 3:58 UTC

50 points

3 comments4 min readLW link

Thoughts on Zero Points

depressurize23 Apr 2024 2:22 UTC

21 points

0 comments4 min readLW link

(sexandchicago.substack.com)

How LLMs Work, in the Style of The Economist

Rocket22 Apr 2024 19:06 UTC

1 point

0 comments2 min readLW link

Measuring Coherence and Goal-Directedness in RL Policies

dx2622 Apr 2024 18:26 UTC

2 points

0 comments7 min readLW link

AI Regulation is Unsafe

Maxwell Tabarrok22 Apr 2024 16:37 UTC

31 points

8 comments4 min readLW link

(www.maximum-progress.com)

Priors and Prejudice

MathiasKB22 Apr 2024 15:00 UTC

65 points

10 comments7 min readLW link

Forget Everything (Statistical Mechanics Part 1)

J Bostock22 Apr 2024 13:33 UTC

36 points

4 comments3 min readLW link

Should we break up Google DeepMind?

Hauke Hillebrandt22 Apr 2024 9:16 UTC

−6 points

0 comments1 min readLW link

What should our containers do?

Richard Henage22 Apr 2024 6:17 UTC

3 points

1 comment2 min readLW link

Goal oriented cognition in “a single forward pass”

dxu and habryka

22 Apr 2024 5:03 UTC

18 points

11 comments26 min readLW link

Time complexity for deterministic string machines

alcatal21 Apr 2024 22:35 UTC

14 points

0 comments21 min readLW link

Transfer Learning in Humans

niplav21 Apr 2024 20:49 UTC

53 points

1 comment13 min readLW link

I created an Asi Alignment Tier List

TimeGoat21 Apr 2024 18:44 UTC

−6 points

0 comments1 min readLW link

Fruits of our Labors Introduction: The Art of Weirdness

Bridgett Kay21 Apr 2024 17:34 UTC

2 points

2 comments4 min readLW link

(dxmrevealed.wordpress.com)

The losing identity of Twitter

Itay Dreyfus21 Apr 2024 13:43 UTC

8 points

1 comment12 min readLW link

(productidentity.co)

Good Bings copy, great Bings steal

dr_s21 Apr 2024 9:52 UTC

29 points

6 comments9 min readLW link

Paper: “The Ethics of Advanced AI Assistants” -Google DeepMind

Tristan Wegner21 Apr 2024 6:45 UTC

20 points

0 comments1 min readLW link

(storage.googleapis.com)