AI Safety proposal—Influencing the superintelligence explosion

Morgan22 May 2024 23:31 UTC

0 points

1 comment7 min readLW link

The Button (Short Comic)

milanrosko22 May 2024 23:28 UTC

3 points

0 comments1 min readLW link

Implementing Asimov’s Laws of Robotics—How I imagine alignment working.

Joshua Clancy22 May 2024 23:15 UTC

2 points

0 comments11 min readLW link

Higher-Order Forecasts

ozziegooen22 May 2024 21:49 UTC

30 points

0 comments1 min readLW link

A Positive Double Standard—Self-Help Principles Work For Individuals Not Populations

James Stephen Brown22 May 2024 21:37 UTC

2 points

2 comments5 min readLW link

A Bi-Modal Brain Model

Johannes C. Mayer22 May 2024 20:10 UTC

9 points

1 comment2 min readLW link

[Question] Should we be concerned about eating too much soy?

ChristianKl22 May 2024 12:53 UTC

20 points

2 comments1 min readLW link

Procedural Executive Function, Part 3

DaystarEld22 May 2024 11:58 UTC

15 points

2 comments1 min readLW link

Cicadas, Anthropic, and the bilateral alignment problem

kromem22 May 2024 11:09 UTC

17 points

0 comments5 min readLW link

“Which chains-of-thought was that faster than?”

Emrik22 May 2024 8:21 UTC

31 points

1 comment4 min readLW link

ARIA’s Safeguarded AI grant program is accepting applications for Technical Area 1.1 until May 28th

Brendon_Wong22 May 2024 6:54 UTC

10 points

0 comments1 min readLW link

(www.aria.org.uk)

Anthropic announces interpretability advances. How much does this advance alignment?

Seth Herd21 May 2024 22:30 UTC

48 points

4 comments3 min readLW link

(www.anthropic.com)

EIS XIII: Reflections on Anthropic’s SAE Research Circa May 2024

scasper21 May 2024 20:15 UTC

114 points

9 comments3 min readLW link

Mitigating extreme AI risks amid rapid progress [Linkpost]

Akash21 May 2024 19:59 UTC

18 points

5 comments4 min readLW link

Helping loved ones with their finances: the why and how of an unusually impactful opportunity

Sam Anschell21 May 2024 18:48 UTC

0 points

1 comment1 min readLW link

(forum.effectivealtruism.org)

rough draft on what happens in the brain when you have an insight

Emrik21 May 2024 18:02 UTC

9 points

2 comments1 min readLW link

On Dwarkesh’s Podcast with OpenAI’s John Schulman

Zvi21 May 2024 17:30 UTC

65 points

3 comments20 min readLW link

(thezvi.wordpress.com)

[Question] Is deleting capabilities still a relevant research question?

tailcalled21 May 2024 13:24 UTC

15 points

1 comment1 min readLW link

My Dating Heuristic

Declan Molony21 May 2024 5:28 UTC

13 points

4 comments2 min readLW link

Scorable Functions: A Format for Algorithmic Forecasting

ozziegooen21 May 2024 4:14 UTC

26 points

0 comments1 min readLW link