[Question] A Coordination Cookbook?

azergante Nov 10, 2024, 11:20 PM

2 points

0 comments · 1 min read · LW link

Towards a Clever Hans Test: Unmasking Sentience Biases in Chatbot Interactions

glykokalyx Nov 10, 2024, 10:34 PM

4 points

0 comments · 1 min read · LW link

Urbit New England Meetup

Conquerer Cohen Nov 10, 2024, 5:56 PM

−4 points

0 comments · 1 min read · LW link

Personal AI Planning

jefftk Nov 10, 2024, 2:00 PM

68 points

11 comments · 2 min read · LW link

(www.jefftk.com)

AI alignment via civilizational cognitive updates

AtillaYasar Nov 10, 2024, 9:33 AM

1 point

10 comments · 6 min read · LW link

[Question] How should vegans think about Methionine needs?

ChristianKl Nov 10, 2024, 9:28 AM

32 points

3 comments · 1 min read · LW link

Is P(Doom) Meaningful? Bayesian vs. Popperian Epistemology Debate

Liron Nov 9, 2024, 11:39 PM

5 points

0 comments · 124 min read · LW link

(www.youtube.com)

Bellevue Library Meetup—Nov 23

Cedar Nov 9, 2024, 11:05 PM

5 points

3 comments · 1 min read · LW link

LifeKeeper Diaries: Exploring Misaligned AI Through Interactive Fiction

Tristan Tran, stijn and Mose Wintner

Nov 9, 2024, 8:58 PM

15 points

5 comments · 2 min read · LW link

[Question] Poll: what’s your impression of altruism?

David Gross Nov 9, 2024, 8:28 PM

2 points

4 comments · 1 min read · LW link

Chaos Theory in Ecology

Elizabeth Nov 9, 2024, 5:50 PM

15 points

4 comments · 20 min read · LW link

(acesounderglass.com)

Some Comments on Recent AI Safety Developments

testingthewaters Nov 9, 2024, 4:44 PM

4 points

0 comments · 8 min read · LW link

Formalize the Hashiness Model of AGI Uncontainability

Remmelt Nov 9, 2024, 4:10 PM

3 points

0 comments · LW link

(docs.google.com)

Agenda Manipulation

Pazzaz Nov 9, 2024, 2:13 PM

2 points

0 comments · 3 min read · LW link

Force Sequential Output with SCP?

jefftk Nov 9, 2024, 12:40 PM

9 points

4 comments · 1 min read · LW link

(www.jefftk.com)

Anthropic teams up with Palantir and AWS to sell AI to defense customers

Matrice Jacobine Nov 9, 2024, 11:50 AM

9 points

0 comments · 2 min read · LW link

(techcrunch.com)

GPT-4o Can In Some Cases Solve Moderately Complicated Captchas

dirk Nov 9, 2024, 4:04 AM

12 points

2 comments · 1 min read · LW link

Stone Age Herbalist’s notes on ant warfare and slavery

trevor Nov 9, 2024, 2:40 AM

32 points

0 comments · 3 min read · LW link

(x.com)

LLMs Look Increasingly Like General Reasoners

eggsyntax Nov 8, 2024, 11:47 PM

94 points

45 comments · 3 min read · LW link

overengineered air filter shelving

bhauth Nov 8, 2024, 10:04 PM

26 points

2 comments · 5 min read · LW link

(bhauth.com)

Bigger Livers?

sarahconstantin Nov 8, 2024, 9:50 PM

98 points

17 comments · 6 min read · LW link

(sarahconstantin.substack.com)

New UChicago Rationality Group

Noah Birnbaum Nov 8, 2024, 9:20 PM

9 points

0 comments · 1 min read · LW link

Active Recall and Spaced Repetition are Different Things

Saul Munn Nov 8, 2024, 8:14 PM

49 points

2 comments · 3 min read · LW link

(www.brasstacks.blog)

The King and the Golem—The Animation

Writer Nov 8, 2024, 6:23 PM

70 points

0 comments · 1 min read · LW link

Boring & straightforward trauma explanation

lemonhope Nov 8, 2024, 9:45 AM

24 points

7 comments · 2 min read · LW link

Curriculum of Ascension

andrew sauer Nov 7, 2024, 11:54 PM

13 points

0 comments · 18 min read · LW link

Analyzing how SAE features evolve across a forward pass

bensenberner, danibalcells, Michael Oesterle, Ediz Ucar and StefanHex

Nov 7, 2024, 10:07 PM

47 points

0 comments · 1 min read · LW link

(arxiv.org)

Markets Are Information—Beating the Sportsbooks at Their Own Game

JJXW Nov 7, 2024, 8:58 PM

9 points

1 comment · 2 min read · LW link

(thehobbyist.substack.com)

Signaling with Small Orange Diamonds

jefftk Nov 7, 2024, 8:20 PM

40 points

1 comment · 1 min read · LW link

(www.jefftk.com)

Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty?

Gordon Seidoh Worley Nov 7, 2024, 6:15 PM

11 points

2 comments · 15 min read · LW link

AI #89: Trump Card

Zvi Nov 7, 2024, 4:30 PM

42 points

12 comments · 42 min read · LW link

(thezvi.wordpress.com)

Quantum Immortality: A Perspective if AI Doomers are Probably Right

avturchin and James_Miller

Nov 7, 2024, 4:06 PM

12 points

55 comments · 14 min read · LW link

On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback

Marcus Williams, micahcarroll, Adhyyan Narang, Constantin Weisser and Brendan Murphy

Nov 7, 2024, 3:39 PM

51 points

7 comments · 11 min read · LW link

In the Name of All That Needs Saving

pleiotroth Nov 7, 2024, 3:26 PM

18 points

3 comments · 22 min read · LW link

Agency overhang as a proxy for Sharp left turn

Eris and Iuliia Levin

Nov 7, 2024, 12:14 PM

6 points

0 comments · 5 min read · LW link

The Case Against Moral Realism

Zero Contradictions Nov 7, 2024, 10:14 AM

−5 points

10 comments · 1 min read · LW link

(thewaywardaxolotl.blogspot.com)

[Question] What are the primary drivers that caused selection pressure for intelligence in humans?

Towards_Keeperhood Nov 7, 2024, 9:40 AM

8 points

15 comments · 1 min read · LW link

The Logistics of Distribution of Meaning: Against Epistemic Bureaucratization

Sahil Nov 7, 2024, 5:27 AM

27 points

7 comments · 12 min read · LW link

SAEs are highly dataset dependent: a case study on the refusal direction

Connor Kissane, robertzk, Neel Nanda and Arthur Conmy

Nov 7, 2024, 5:22 AM

66 points

4 comments · 14 min read · LW link

Should CA, TX, OK, and LA merge into a giant swing state, just for elections?

Thomas Kwa Nov 6, 2024, 11:01 PM

115 points

35 comments · 1 min read · LW link

New Funding Category Open in Foresight’s AI Safety Grants

Allison Duettmann Nov 6, 2024, 10:59 PM

15 points

0 comments · 1 min read · LW link

Scattered thoughts on what it means for an LLM to believe

TheManxLoiner Nov 6, 2024, 10:10 PM

5 points

4 comments · 5 min read · LW link

The Bayesian Conspiracy Live Recording

Eneasz Nov 6, 2024, 4:25 PM

9 points

0 comments · 1 min read · LW link

Anthropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-Perlman Nov 6, 2024, 4:00 PM

95 points

33 comments · 1 min read · LW link

(alignment.anthropic.com)

Meme Talking Points

ymeskhout Nov 6, 2024, 3:27 PM

34 points

0 comments · 3 min read · LW link

Advisors for Smaller Major Donors?

jefftk Nov 6, 2024, 2:30 PM

18 points

2 comments · 3 min read · LW link

(www.jefftk.com)

Scissors Statements for President?

AnnaSalamon Nov 6, 2024, 10:38 AM

118 points

32 comments · 1 min read · LW link

[Question] How to cite LessWrong as an academic source?

PhilosophicalSoul Nov 6, 2024, 8:28 AM

6 points

6 comments · 1 min read · LW link

How to put California and Texas on the campaign trail!

Yair Halberstadt Nov 6, 2024, 6:08 AM

25 points

4 comments · 1 min read · LW link

LDT (and everything else) can be irrational

Christopher King Nov 6, 2024, 4:05 AM

10 points

15 comments · 2 min read · LW link