ProLU: A Pareto Improvement for Sparse Autoencoders
Glen Taggart
23 Apr 2024 14:09 UTC
−3
points
0
comments
6
min read
Subjective Questions Require Subjective information
Ben
23 Apr 2024 13:16 UTC
7
points
1
comment
4
min read
Rejecting Television
Declan Molony
23 Apr 2024 4:59 UTC
36
points
3
comments
6
min read
Take the wheel, Shoggoth! (Lesswrong is trying out changes to the frontpage algorithm)
Ruby
and
RobertM
23 Apr 2024 3:58 UTC
50
points
3
comments
4
min read
Thoughts on Zero Points
depressurize
23 Apr 2024 2:22 UTC
21
points
0
comments
4
min read
(sexandchicago.substack.com)
How LLMs Work, in the Style of The Economist
Rocket
22 Apr 2024 19:06 UTC
1
point
0
comments
2
min read
Measuring Coherence and Goal-Directedness in RL Policies
dx26
22 Apr 2024 18:26 UTC
2
points
0
comments
7
min read
AI Regulation is Unsafe
Maxwell Tabarrok
22 Apr 2024 16:37 UTC
31
points
8
comments
4
min read
(www.maximum-progress.com)
Priors and Prejudice
MathiasKB
22 Apr 2024 15:00 UTC
65
points
10
comments
7
min read
Forget Everything (Statistical Mechanics Part 1)
J Bostock
22 Apr 2024 13:33 UTC
36
points
4
comments
3
min read
Should we break up Google DeepMind?
Hauke Hillebrandt
22 Apr 2024 9:16 UTC
−6
points
0
comments
1
min read
What should our containers do?
Richard Henage
22 Apr 2024 6:17 UTC
3
points
1
comment
2
min read
Goal oriented cognition in “a single forward pass”
dxu
and
habryka
22 Apr 2024 5:03 UTC
18
points
11
comments
26
min read
Time complexity for deterministic string machines
alcatal
21 Apr 2024 22:35 UTC
14
points
0
comments
21
min read
Transfer Learning in Humans
niplav
21 Apr 2024 20:49 UTC
53
points
1
comment
13
min read
I created an Asi Alignment Tier List
TimeGoat
21 Apr 2024 18:44 UTC
−6
points
0
comments
1
min read
Fruits of our Labors Introduction: The Art of Weirdness
Bridgett Kay
21 Apr 2024 17:34 UTC
2
points
2
comments
4
min read
(dxmrevealed.wordpress.com)
The losing identity of Twitter
Itay Dreyfus
21 Apr 2024 13:43 UTC
8
points
1
comment
12
min read
(productidentity.co)
Good Bings copy, great Bings steal
dr_s
21 Apr 2024 9:52 UTC
29
points
6
comments
9
min read
Paper: “The Ethics of Advanced AI Assistants” -Google DeepMind
Tristan Wegner
21 Apr 2024 6:45 UTC
20
points
0
comments
1
min read
(storage.googleapis.com)