All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 293031

What is Morality?

Zero Contradictions29 Jul 2024 19:19 UTC

−1 points

0 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Arch-anarchism and immortality

Peter lawless 29 Jul 2024 18:10 UTC

−5 points

1 comment2 min readLW link

AI Safety Newsletter #39: Implications of a Trump Administration for AI Policy Plus, Safety Engineering

Corin Katzke, Alexa Pan, Julius and Dan H

29 Jul 2024 17:50 UTC

17 points

1 comment6 min readLW link

(newsletter.safe.ai)

New Blog Post Against AI Doom

Noah Birnbaum29 Jul 2024 17:21 UTC

2 points

5 comments1 min readLW link

(substack.com)

An Interpretability Illusion from Population Statistics in Causal Analysis

Daniel Tan29 Jul 2024 14:50 UTC

9 points

3 comments1 min readLW link

[Question] How tokenization influences prompting?

Boris Kashirin29 Jul 2024 10:28 UTC

9 points

4 comments1 min readLW link

Understanding Positional Features in Layer 0 SAEs

bilalchughtai and Yeu-Tong Lau

29 Jul 2024 9:36 UTC

43 points

0 comments5 min readLW link

Prediction Markets Explained

Benjamin_Sturisky29 Jul 2024 8:02 UTC

8 points

0 comments9 min readLW link

Relativity Theory for What the Future ‘You’ Is and Isn’t

FlorianH29 Jul 2024 2:01 UTC

7 points

50 comments4 min readLW link

Wittgenstein and Word2vec: Capturing Relational Meaning in Language and Thought

cleanwhiteroom28 Jul 2024 19:55 UTC

2 points

2 comments2 min readLW link

Making Beliefs Pay Rent

Screwtape and NoSignalNoNoise

28 Jul 2024 17:59 UTC

7 points

2 comments1 min readLW link

This is already your second chance

Malmesbury28 Jul 2024 17:13 UTC

201 points

13 comments8 min readLW link

[Question] Has Eliezer publicly and satisfactorily responded to attempted rebuttals of the analogy to evolution?

kaler28 Jul 2024 12:23 UTC

10 points

14 comments1 min readLW link

Family and Society

Zero Contradictions28 Jul 2024 7:05 UTC

1 point

0 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

[Question] What is AI Safety’s line of retreat?

Remmelt28 Jul 2024 5:43 UTC

12 points

12 comments1 min readLW link

AXRP Episode 34 - AI Evaluations with Beth Barnes

DanielFilan28 Jul 2024 3:30 UTC

23 points

0 comments69 min readLW link

Rats, Back a Candidate

Blake28 Jul 2024 3:19 UTC

−36 points

19 comments1 min readLW link

AI existential risk probabilities are too unreliable to inform policy

Oleg Trott28 Jul 2024 0:59 UTC

18 points

5 comments1 min readLW link

(www.aisnakeoil.com)

Idle Speculations on Pipeline Parallelism

DaemonicSigil27 Jul 2024 22:40 UTC

2 points

0 comments4 min readLW link

(pbement.com)

Re: Anthropic’s suggested SB-1047 amendments

RobertM27 Jul 2024 22:32 UTC

87 points

13 comments9 min readLW link

(www.documentcloud.org)

The problem with psychology is that it has no theory.

Nicholas D.27 Jul 2024 19:36 UTC

2 points

7 comments4 min readLW link

(nicholasdecker.substack.com)

Bryan Johnson and a search for healthy longevity

NancyLebovitz27 Jul 2024 15:28 UTC

18 points

17 comments1 min readLW link

What are matching markets?

ohmurphy27 Jul 2024 15:05 UTC

12 points

0 comments8 min readLW link

(ohmurphy.substack.com)

Safety consultations for AI lab employees

Zach Stein-Perlman27 Jul 2024 15:00 UTC

183 points

6 comments1 min readLW link

The Case Against UBI

Zero Contradictions27 Jul 2024 6:36 UTC

−1 points

2 comments2 min readLW link

(thewaywardaxolotl.blogspot.com)

Unlocking Solutions—by understanding coordination problems

James Stephen Brown27 Jul 2024 4:52 UTC

58 points

4 comments5 min readLW link

(nonzerosum.games)

Utilitarianism and the replaceability of desires and attachments

MichaelStJules27 Jul 2024 1:57 UTC

5 points

2 comments12 min readLW link

Inspired by: Failures in Kindness

X4vier27 Jul 2024 1:21 UTC

60 points

2 comments3 min readLW link

My Experience Using Gamification

Ari S26 Jul 2024 23:06 UTC

13 points

4 comments4 min readLW link

How the AI safety technical landscape has changed in the last year, according to some practitioners

tlevin26 Jul 2024 19:06 UTC

57 points

6 comments2 min readLW link

A Visual Task that’s Hard for GPT-4o, but Doable for Primary Schoolers

Lennart Finke26 Jul 2024 17:51 UTC

25 points

6 comments2 min readLW link

Unaligned AI is coming regardless.

verbalshadow26 Jul 2024 16:41 UTC

−15 points

3 comments2 min readLW link

Index of rationalist groups in the Bay Area June 2025

Lucie Philippon, Czynski and Screwtape

26 Jul 2024 16:32 UTC

41 points

14 comments2 min readLW link

End Single Family Zoning by Overturning Euclid V Ambler

Maxwell Tabarrok26 Jul 2024 14:08 UTC

33 points

1 comment7 min readLW link

(www.maximum-progress.com)

Common Uses of “Acceptance”

Yi-Yang26 Jul 2024 11:18 UTC

14 points

5 comments24 min readLW link

Universal Basic Income and Poverty

Eliezer Yudkowsky26 Jul 2024 7:23 UTC

357 points

150 comments9 min readLW link 1 review

A Solomonoff Inductor Walks Into a Bar: Schelling Points for Communication

johnswentworth and David Lorell

26 Jul 2024 0:33 UTC

106 points

8 comments13 min readLW link 1 review

What does a Gambler’s Verity world look like?

ErioirE25 Jul 2024 22:03 UTC

7 points

6 comments1 min readLW link

Pacing Outside the Box: RNNs Learn to Plan in Sokoban

Adrià Garriga-alonso, taufeeque, AdamGleave and ChengCheng

25 Jul 2024 22:00 UTC

59 points

8 comments2 min readLW link

(arxiv.org)

Sex, Death, and Complexity

Zero Contradictions25 Jul 2024 21:22 UTC

0 points

0 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Does robustness improve with scale?

ChengCheng, niki.h, Ian McKenzie, Oskar Hollinsworth, Tom Tseng and AdamGleave

25 Jul 2024 20:55 UTC

14 points

0 comments1 min readLW link

(far.ai)

Organisation for Program Equilibrium reading group

Smaug12325 Jul 2024 19:11 UTC

11 points

14 comments1 min readLW link

In Text

Valerii Kremnev25 Jul 2024 18:22 UTC

−3 points

0 comments5 min readLW link

“AI achieves silver-medal standard solving International Mathematical Olympiad problems”

gjm25 Jul 2024 15:58 UTC

133 points

38 comments2 min readLW link

(deepmind.google)

[Talk transcript] What “structure” is and why it matters

Alex_Altair25 Jul 2024 15:49 UTC

23 points

0 comments5 min readLW link

(www.youtube.com)

AI #74: GPT-4o Mini Me and Llama 3

Zvi25 Jul 2024 13:50 UTC

30 points

6 comments36 min readLW link

(thezvi.wordpress.com)

AI Constitutions are a tool to reduce societal scale risk

Sammy Martin25 Jul 2024 11:18 UTC

30 points

2 comments18 min readLW link

Determining the power of investors over Frontier AI Labs is strategically important to reduce x-risk

Lucie Philippon25 Jul 2024 1:12 UTC

18 points

7 comments2 min readLW link

FLI is hiring across Comms and Ops

beisenpress25 Jul 2024 0:06 UTC

1 point

0 comments1 min readLW link

A framework for thinking about AI power-seeking

Joe Carlsmith24 Jul 2024 22:41 UTC

70 points

15 comments16 min readLW link