All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 91011 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Is P(Doom) Meaningful? Bayesian vs. Popperian Epistemology Debate

Liron9 Nov 2024 23:39 UTC

5 points

1 comment124 min readLW link

(www.youtube.com)

Bellevue Library Meetup—Nov 23

Cedar9 Nov 2024 23:05 UTC

5 points

3 comments1 min readLW link

LifeKeeper Diaries: Exploring Misaligned AI Through Interactive Fiction

Tristan Tran, stijn and Mose Wintner

9 Nov 2024 20:58 UTC

15 points

5 comments2 min readLW link

[Question] Poll: what’s your impression of altruism?

David Gross9 Nov 2024 20:28 UTC

3 points

4 comments1 min readLW link

Chaos Theory in Ecology

Elizabeth9 Nov 2024 17:50 UTC

15 points

4 comments20 min readLW link

(acesounderglass.com)

Some Comments on Recent AI Safety Developments

testingthewaters9 Nov 2024 16:44 UTC

13 points

1 comment9 min readLW link

Formalize the Hashiness Model of AGI Uncontainability

Remmelt9 Nov 2024 16:10 UTC

3 points

0 comments5 min readLW link

(docs.google.com)

Agenda Manipulation

Pazzaz9 Nov 2024 14:13 UTC

2 points

0 comments3 min readLW link

Force Sequential Output with SCP?

jefftk9 Nov 2024 12:40 UTC

9 points

4 comments1 min readLW link

(www.jefftk.com)

Anthropic teams up with Palantir and AWS to sell AI to defense customers

Matrice Jacobine9 Nov 2024 11:50 UTC

9 points

0 comments2 min readLW link

(techcrunch.com)

GPT-4o Can In Some Cases Solve Moderately Complicated Captchas

dirk9 Nov 2024 4:04 UTC

12 points

2 comments1 min readLW link

LLMs Look Increasingly Like General Reasoners

eggsyntax8 Nov 2024 23:47 UTC

95 points

45 comments3 min readLW link

overengineered air filter shelving

bhauth8 Nov 2024 22:04 UTC

26 points

2 comments5 min readLW link

(bhauth.com)

Bigger Livers?

sarahconstantin8 Nov 2024 21:50 UTC

99 points

17 comments6 min readLW link

(sarahconstantin.substack.com)

New UChicago Rationality Group

Noah Birnbaum8 Nov 2024 21:20 UTC

11 points

0 comments1 min readLW link

Active Recall and Spaced Repetition are Different Things

Saul Munn8 Nov 2024 20:14 UTC

51 points

2 comments3 min readLW link

(www.brasstacks.blog)

The King and the Golem—The Animation

Writer8 Nov 2024 18:23 UTC

72 points

1 comment1 min readLW link

Boring & straightforward trauma explanation

lemonhope8 Nov 2024 9:45 UTC

24 points

7 comments1 min readLW link

Curriculum of Ascension

andrew sauer7 Nov 2024 23:54 UTC

12 points

0 comments18 min readLW link

Analyzing how SAE features evolve across a forward pass

bensenberner, danibalcells, Michael Oesterle, Ediz Ucar and StefanHex

7 Nov 2024 22:07 UTC

47 points

0 comments1 min readLW link

(arxiv.org)

Markets Are Information—Beating the Sportsbooks at Their Own Game

JJXW7 Nov 2024 20:58 UTC

9 points

1 comment2 min readLW link

(thehobbyist.substack.com)

Signaling with Small Orange Diamonds

jefftk7 Nov 2024 20:20 UTC

41 points

1 comment1 min readLW link

(www.jefftk.com)

Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty?

Gordon Seidoh Worley7 Nov 2024 18:15 UTC

11 points

2 comments15 min readLW link

AI #89: Trump Card

Zvi7 Nov 2024 16:30 UTC

42 points

12 comments42 min readLW link

(thezvi.wordpress.com)

Quantum Immortality: A Perspective if AI Doomers are Probably Right

avturchin and James_Miller

7 Nov 2024 16:06 UTC

16 points

55 comments14 min readLW link

On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback

Marcus Williams, micahcarroll, Adhyyan Narang, Constantin Weisser and Brendan Murphy

7 Nov 2024 15:39 UTC

51 points

7 comments11 min readLW link

In the Name of All That Needs Saving

pleiotroth7 Nov 2024 15:26 UTC

18 points

3 comments22 min readLW link

The Case Against Moral Realism

Zero Contradictions7 Nov 2024 10:14 UTC

−5 points

10 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

[Question] What are the primary drivers that caused selection pressure for intelligence in humans?

Towards_Keeperhood7 Nov 2024 9:40 UTC

8 points

15 comments1 min readLW link

The Logistics of Distribution of Meaning: Against Epistemic Bureaucratization

Sahil7 Nov 2024 5:27 UTC

33 points

7 comments12 min readLW link

SAEs are highly dataset dependent: a case study on the refusal direction

Connor Kissane, robertzk, Neel Nanda and Arthur Conmy

7 Nov 2024 5:22 UTC

67 points

4 comments14 min readLW link

Should CA, TX, OK, and LA merge into a giant swing state, just for elections?

Thomas Kwa6 Nov 2024 23:01 UTC

116 points

35 comments1 min readLW link

New Funding Category Open in Foresight’s AI Safety Grants

Allison Duettmann6 Nov 2024 22:59 UTC

15 points

0 comments1 min readLW link

Scattered thoughts on what it means for an LLM to believe

TheManxLoiner6 Nov 2024 22:10 UTC

5 points

4 comments5 min readLW link

The Bayesian Conspiracy Live Recording

Eneasz6 Nov 2024 16:25 UTC

9 points

0 comments1 min readLW link

Anthropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-Perlman6 Nov 2024 16:00 UTC

96 points

35 comments1 min readLW link 1 review

(alignment.anthropic.com)

Meme Talking Points

ymeskhout6 Nov 2024 15:27 UTC

33 points

0 comments3 min readLW link

Advisors for Smaller Major Donors?

jefftk6 Nov 2024 14:30 UTC

18 points

2 comments3 min readLW link

(www.jefftk.com)

Scissors Statements for President?

AnnaSalamon6 Nov 2024 10:38 UTC

122 points

34 comments1 min readLW link 1 review

[Question] How to cite LessWrong as an academic source?

PhilosophicalSoul6 Nov 2024 8:28 UTC

10 points

6 comments1 min readLW link

How to put California and Texas on the campaign trail!

Yair Halberstadt6 Nov 2024 6:08 UTC

25 points

4 comments1 min readLW link

LDT (and everything else) can be irrational

Christopher King6 Nov 2024 4:05 UTC

14 points

18 comments2 min readLW link

Join my new subscriber chat

sarahconstantin6 Nov 2024 2:30 UTC

7 points

0 comments1 min readLW link

(sarahconstantin.substack.com)

Graceful Degradation

Screwtape5 Nov 2024 23:57 UTC

84 points

8 comments4 min readLW link

An alternative approach to superbabies

Towards_Keeperhood5 Nov 2024 22:56 UTC

48 points

19 comments3 min readLW link

Apply to be a mentor in SPAR!

agucova5 Nov 2024 21:32 UTC

5 points

0 comments1 min readLW link

Going Beyond “immaturity”

moisentinel5 Nov 2024 20:51 UTC

−3 points

2 comments2 min readLW link

Intent alignment as a stepping-stone to value alignment

Seth Herd5 Nov 2024 20:43 UTC

37 points

8 comments3 min readLW link

Why Recursion Pharmaceuticals abandoned cell painting for brightfield imaging

Abhishaike Mahajan5 Nov 2024 14:51 UTC

29 points

1 comment18 min readLW link

(www.owlposting.com)

Winning isn’t enough

JesseClifton and Anthony DiGiovanni

5 Nov 2024 11:37 UTC

44 points

35 comments9 min readLW link