A short calculation about a Twitter poll

Ege Erdil

14 Aug 2023 19:48 UTC

64 points

64 comments, 11 min read, LW link

Decomposing independent generalizations in neural networks via Hessian analysis

Dmitry Vaintrob and Nina Panickssery

14 Aug 2023 17:04 UTC

86 points

4 comments, 1 min read, LW link

Memetic Judo #2: Incorporal Switches and Levers Compendium

Max TK

14 Aug 2023 16:53 UTC

19 points

6 comments, 17 min read, LW link

Existentially relevant thought experiment: To kill or not to kill, a sniper, a man and a button.

AlexFromSafeTransition

14 Aug 2023 10:53 UTC

−18 points

6 comments, 4 min read, LW link

Stepping down as moderator on LW

Kaj_Sotala

14 Aug 2023 10:46 UTC

82 points

1 comment, 1 min read, LW link

Announcing Manifest 2023 (Sep 22-24 in Berkeley)

Saul Munn and Austin Chen

14 Aug 2023 5:13 UTC

31 points

0 comments, 2 min read, LW link

Listen For What You Don’t Hear: The Case for Contrarianism

Yashvardhan Sharma

14 Aug 2023 2:53 UTC

1 point

1 comment, 5 min read, LW link

Recipe: Hessian eigenvector computation for PyTorch models

Nina Panickssery

14 Aug 2023 2:48 UTC

32 points

5 comments, 5 min read, LW link

[Question] Assuming LK99 or similar: how to accelerate commercialization?

ryan_b

13 Aug 2023 21:34 UTC

7 points

5 comments, 1 min read, LW link

Twin Cities ACX Meetup September 2023

Timothy M.

13 Aug 2023 20:10 UTC

1 point

4 comments, 1 min read, LW link

Fundamental Uncertainty: Chapter 1 - How can we know what’s true?

Gordon Seidoh Worley

13 Aug 2023 18:55 UTC

19 points

4 comments, 12 min read, LW link

We Should Prepare for a Larger Representation of Academia in AI Safety

Leon Lang

13 Aug 2023 18:03 UTC

90 points

14 comments, 5 min read, LW link

AGI is easier than robotaxis

Daniel Kokotajlo

13 Aug 2023 17:00 UTC

41 points

30 comments, 4 min read, LW link

[Question] If we’re alive in 5 years, do you think the funding situation will be much better by then? (With large amounts of government funding, for example)

kuira

13 Aug 2023 16:32 UTC

−2 points

6 comments, 1 min read, LW link

Abstract Theories of Everything

Philosophistry

13 Aug 2023 6:06 UTC

−17 points

0 comments, 1 min read, LW link

[Linkpost] Personal and Psychological Dimensions of AI Researchers Confronting AI Catastrophic Risks

Bogdan Ionut Cirstea

12 Aug 2023 22:02 UTC

42 points

0 comments, 1 min read, LW link

The Empathy Engine: A Deconstruction of the Societal Metamorphosis through Technological Empathy Augmentation

bigdickproblems

12 Aug 2023 18:23 UTC

−30 points

3 comments, 2 min read, LW link

The Benevolent Ruler’s Handbook (Part 2): Morality Rules

FCCC

12 Aug 2023 14:25 UTC

5 points

0 comments, 4 min read, LW link

Learning as you play: anthropic shadow in deadly games

dr_s

12 Aug 2023 7:34 UTC

37 points

28 comments, 35 min read, LW link

Biological Anchors: The Trick that Might or Might Not Work

Scott Alexander

12 Aug 2023 0:53 UTC

91 points

3 comments, 33 min read, LW link

(astralcodexten.substack.com)

Simulate the CEO

robotelvis

12 Aug 2023 0:09 UTC

23 points

5 comments, 5 min read, LW link

(messyprogress.substack.com)

How to decide under low-stakes uncertainty

dkl9

11 Aug 2023 18:07 UTC

11 points

4 comments, 1 min read, LW link

(dkl9.net)

The Pandemic is Only Beginning: The Long COVID Disaster

salvatore mattera

11 Aug 2023 17:36 UTC

−6 points

15 comments, 8 min read, LW link

When discussing AI risks, talk about capabilities, not intelligence

Vika

11 Aug 2023 13:38 UTC

124 points

7 comments, 3 min read, LW link

(vkrakovna.wordpress.com)

What are the flaws in this AGI argument?

William the Kiwi

11 Aug 2023 11:31 UTC

5 points

14 comments, 1 min read, LW link

Google DeepMind’s RT-2

SandXbox

11 Aug 2023 11:26 UTC

9 points

1 comment, 1 min read, LW link

(robotics-transformer2.github.io)

Linkpost: We need another Expert Survey on Progress in AI, urgently

David Mears

11 Aug 2023 8:22 UTC

25 points

2 comments, 2 min read, LW link

(open.substack.com)

What Does a Marginal Grant at LTFF Look Like? Funding Priorities and Grantmaking Thresholds at the Long-Term Future Fund

Linch, calebp99 and Daniel_Eth

11 Aug 2023 3:59 UTC

64 points

0 comments, 1 min read, LW link

(forum.effectivealtruism.org)

[Question] Will posting any thread on LW guarantee that a LLM will index all my content, and if questions people ask to the LLM after my name will surface up all my LW content?

Alex K. Chen (StochasticCockatoo)

11 Aug 2023 1:40 UTC

0 points

0 comments, 1 min read, LW link

AI Safety Concepts Writeup: WebGPT

JustisMills

11 Aug 2023 1:35 UTC

9 points

1 comment, 7 min read, LW link

[Question] What is science?

Adam Zerner

11 Aug 2023 0:00 UTC

6 points

4 comments, 1 min read, LW link

Three configurable prettyprinters

philh

10 Aug 2023 23:10 UTC

9 points

0 comments, 22 min read, LW link

(reasonableapproximation.net)

Ilya Sutskever’s thoughts on AI safety (July 2023): a transcript with my comments

mishka

10 Aug 2023 19:07 UTC

22 points

3 comments, 5 min read, LW link

Seeking Input to AI Safety Book for non-technical audience

Darren McKee

10 Aug 2023 17:58 UTC

10 points

4 comments, 1 min read, LW link

Evaluating GPT-4 Theory of Mind Capabilities

gcmac and Nathan

10 Aug 2023 17:57 UTC

15 points

2 comments, 14 min read, LW link

Some alignment ideas

SelonNerias

10 Aug 2023 17:51 UTC

1 point

0 comments, 11 min read, LW link

Self Supervised Learning (SSL)

Varshul Gupta

10 Aug 2023 17:43 UTC

5 points

1 comment, 2 min read, LW link

(dubverseblack.substack.com)

Predicting Virus Relative Abundance in Wastewater

jefftk

10 Aug 2023 15:46 UTC

33 points

2 comments, 1 min read, LW link

(naobservatory.org)

AI #24: Week of the Podcast

Zvi

10 Aug 2023 15:00 UTC

49 points

5 comments, 44 min read, LW link

(thezvi.wordpress.com)

Could We Automate AI Alignment Research?

Stephen McAleese

10 Aug 2023 12:17 UTC

34 points

10 comments, 21 min read, LW link

The positional embedding matrix and previous-token heads: how do they actually work?

AdamYedidia

10 Aug 2023 1:58 UTC

27 points

4 comments, 13 min read, LW link

LLMs are (mostly) not helped by filler tokens

Kshitij Sachan

10 Aug 2023 0:48 UTC

68 points

36 comments, 6 min read, LW link

2023 ACX Meetups Everywhere—Newton, MA

duck_master

9 Aug 2023 22:47 UTC

6 points

2 comments, 1 min read, LW link

Progress links digest, 2023-08-09: US adds new nuclear, Katalin Karikó interview, and more

jasoncrawford

9 Aug 2023 19:22 UTC

18 points

0 comments, 3 min read, LW link

(rootsofprogress.org)

Mech Interp Challenge: August—Deciphering the First Unique Character Model

CallumMcDougall

9 Aug 2023 19:14 UTC

36 points

1 comment, 3 min read, LW link

Real Meaning of life has been found. Eliezer discovered it in 2000′s.

Jorterder

9 Aug 2023 18:13 UTC

−15 points

1 comment, 1 min read, LW link

(docs.google.com)

Marginal Revolution unofficial birthday party

Derek M. Jones

9 Aug 2023 14:35 UTC

4 points

0 comments, 1 min read, LW link

A content analysis of the SQ-R questionnaire and a proposal for testing EQ-SQ theory

tailcalled

9 Aug 2023 13:51 UTC

10 points

2 comments, 13 min read, LW link

[Question] Does LessWrong allow exempting posts from being scraped by GPTBot?

mic

9 Aug 2023 13:02 UTC

29 points

3 comments, 1 min read, LW link

If I Was An Eccentric Trillionaire

niplav

9 Aug 2023 7:56 UTC

9 points

8 comments, 26 min read, LW link