RSS

[Question] Ex­am­ples of Highly Coun­ter­fac­tual Dis­cov­er­ies?

johnswentworth23 Apr 2024 22:19 UTC
89 points
23 comments1 min readLW link

[Question] Is there soft­ware to prac­tice read­ing ex­pres­sions?

lsusr23 Apr 2024 21:53 UTC
30 points
3 comments1 min readLW link

Let’s De­sign A School, Part 1

Sable23 Apr 2024 21:50 UTC
47 points
1 comment11 min readLW link
(affablyevil.substack.com)

WSJ: In­side Ama­zon’s Se­cret Oper­a­tion to Gather In­tel on Rivals

trevor23 Apr 2024 21:33 UTC
22 points
4 comments5 min readLW link
(www.wsj.com)

On Minicircle

Metacelsus23 Apr 2024 21:28 UTC
10 points
0 comments1 min readLW link
(docs.google.com)

Sim­ple probes can catch sleeper agents

23 Apr 2024 21:10 UTC
100 points
7 comments1 min readLW link
(www.anthropic.com)

[Question] (When) Should you work through the night when in­spira­tion strikes you?

Chi Nguyen23 Apr 2024 21:07 UTC
14 points
2 comments1 min readLW link

Book re­view: Deep Utopia

PeterMcCluskey23 Apr 2024 19:55 UTC
24 points
2 comments4 min readLW link
(bayesianinvestor.com)

On what re­search poli­cy­mak­ers ac­tu­ally need

MondSemmel23 Apr 2024 19:50 UTC
35 points
0 comments3 min readLW link
(www.slowboring.com)

De­quan­tify­ing first-or­der theories

jessicata23 Apr 2024 19:04 UTC
35 points
3 comments8 min readLW link
(unstableontology.com)

Plan­ning in a Lat­tice Graph

23 Apr 2024 16:58 UTC
11 points
1 comment2 min readLW link

ProLU: A Pareto Im­prove­ment for Sparse Autoencoders

Glen Taggart23 Apr 2024 14:09 UTC
6 points
0 comments7 min readLW link

Sub­jec­tive Ques­tions Re­quire Sub­jec­tive information

Ben23 Apr 2024 13:16 UTC
7 points
3 comments4 min readLW link

Re­ject­ing Television

Declan Molony23 Apr 2024 4:59 UTC
52 points
6 comments6 min readLW link

Take the wheel, Shog­goth! (Less­wrong is try­ing out changes to the front­page al­gorithm)

23 Apr 2024 3:58 UTC
61 points
3 comments4 min readLW link

Thoughts on Zero Points

depressurize23 Apr 2024 2:22 UTC
27 points
0 comments4 min readLW link
(sexandchicago.substack.com)

How LLMs Work, in the Style of The Economist

Rocket22 Apr 2024 19:06 UTC
1 point
0 comments2 min readLW link

Mea­sur­ing Co­her­ence and Goal-Direct­ed­ness in RL Policies

dx2622 Apr 2024 18:26 UTC
2 points
0 comments7 min readLW link

AI Reg­u­la­tion is Unsafe

Maxwell Tabarrok22 Apr 2024 16:37 UTC
20 points
15 comments4 min readLW link
(www.maximum-progress.com)

Pri­ors and Prejudice

MathiasKB22 Apr 2024 15:00 UTC
74 points
13 comments7 min readLW link