A car journey with conservative evangelicals—Understanding some British political-religious beliefs

Nathan Young

Dec 6, 2024, 11:22 AM

41 points

8 comments 6 min read LW link

(nathanpmyoung.substack.com)

Frontier Models are Capable of In-context Scheming

Marius Hobbhahn, AlexMeinke, Bronson Schoen, rusheb, Jérémy Scheurer and Mikita Balesni

Dec 5, 2024, 10:11 PM

203 points

24 comments 7 min read LW link

Should you be worried about H5N1?

gw

Dec 5, 2024, 9:11 PM

89 points

2 comments 5 min read LW link

(www.georgeyw.com)

o1 tried to avoid being shut down

Raelifin

Dec 5, 2024, 7:52 PM

10 points

5 comments 1 min read LW link

(www.transformernews.ai)

More Growth, Melancholy, and MindCraft @3QD [revised and updated]

Bill Benzon

Dec 5, 2024, 7:36 PM

4 points

0 comments 4 min read LW link

Expevolu, a laissez-faire approach to country creation

Fernando

Dec 5, 2024, 7:29 PM

4 points

4 comments 44 min read LW link

(expevolu.substack.com)

Are SAE features from the Base Model still meaningful to LLaVA?

Shan23Chen

Dec 5, 2024, 7:24 PM

5 points

2 comments 10 min read LW link

OpenAI o1 + ChatGPT Pro release

anaguma

Dec 5, 2024, 7:13 PM

5 points

0 comments 1 min read LW link

(openai.com)

Smart people should do biology

Haotian

Dec 5, 2024, 7:11 PM

11 points

2 comments 3 min read LW link

Announcement: AI for Math Fund

sarahconstantin

Dec 5, 2024, 6:33 PM

20 points

9 comments 2 min read LW link

(renaissancephilanthropy.org)

Detection of Asymptomatically Spreading Pathogens

jefftk

Dec 5, 2024, 6:20 PM

45 points

8 comments 7 min read LW link

(www.jefftk.com)

Model Integrity: MAI on Value Alignment

Jonas Hallgren

Dec 5, 2024, 5:11 PM

6 points

11 comments 1 min read LW link

(meaningalignment.substack.com)

Social Science in its epistemological context

Arturo Macias

Dec 5, 2024, 4:12 PM

3 points

0 comments 1 min read LW link

(www.theseedsofscience.pub)

Higher and lower pleasures

Chris_Leong

Dec 5, 2024, 1:13 PM

19 points

3 comments 1 min read LW link

Sam Harris’s Argument For Objective Morality

Zero Contradictions

Dec 5, 2024, 10:19 AM

7 points

5 comments 1 min read LW link

(thewaywardaxolotl.blogspot.com)

Morality as Cooperation Part III: Failure Modes

DeLesley Hutchins

Dec 5, 2024, 9:39 AM

4 points

0 comments 20 min read LW link

Morality as Cooperation Part II: Theory and Experiment

DeLesley Hutchins

Dec 5, 2024, 9:04 AM

2 points

0 comments 17 min read LW link

Morality as Cooperation Part I: Humans

DeLesley Hutchins

Dec 5, 2024, 8:16 AM

5 points

0 comments 19 min read LW link

I Finally Worked Through Bayes’ Theorem (Personal Achievement)

keltan

Dec 5, 2024, 2:04 AM

53 points

7 comments 9 min read LW link

The Dream Machine

sarahconstantin

Dec 5, 2024, 12:00 AM

117 points

6 comments 12 min read LW link

(sarahconstantin.substack.com)

Should you have children? A decision framework for a crucial life choice that affects yourself, your child and the world

Sherrinford

Dec 4, 2024, 11:14 PM

0 points

1 comment 20 min read LW link

CCing Mailing Lists on External Communication

jefftk

Dec 4, 2024, 10:00 PM

9 points

0 comments 1 min read LW link

(www.jefftk.com)

Picking favourites is hard

dkl9

Dec 4, 2024, 8:46 PM

11 points

3 comments 1 min read LW link

(dkl9.net)

[Question] How can I convince my cryptobro friend that S&P500 is efficient?

AhmedNeedsATherapist

Dec 4, 2024, 8:04 PM

−7 points

10 comments 1 min read LW link

The 2023 LessWrong Review: The Basic Ask

Raemon

Dec 4, 2024, 7:52 PM

77 points

25 comments 9 min read LW link

Is the AI Doomsday Narrative the Product of a Big Tech Conspiracy?

garrison

Dec 4, 2024, 7:20 PM

35 points

1 comment LW link

(garrisonlovely.substack.com)

[Question] AI box question

KvmanThinking

Dec 4, 2024, 7:03 PM

2 points

2 comments 1 min read LW link

The Polite Coup

Charlie Sanders

Dec 4, 2024, 2:03 PM

3 points

0 comments 3 min read LW link

(www.dailymicrofiction.com)

Analysis of Global AI Governance Strategies

Sammy Martin, Justin Bullock and Corin Katzke

Dec 4, 2024, 10:45 AM

49 points

10 comments 36 min read LW link

[Question] Cryonics considerations: how big of a problem is ischemia?

kman

Dec 4, 2024, 4:45 AM

8 points

1 comment 1 min read LW link

AI #93: Happy Tuesday

Zvi

Dec 4, 2024, 12:30 AM

26 points

2 comments 23 min read LW link

(thezvi.wordpress.com)

A Qualitative Case for LTFF: Filling Critical Ecosystem Gaps

Linch

Dec 3, 2024, 9:57 PM

64 points

2 comments LW link

Deep Causal Transcoding: A Framework for Mechanistically Eliciting Latent Behaviors in Language Models

Andrew Mack and TurnTrout

Dec 3, 2024, 9:19 PM

106 points

8 comments 41 min read LW link

“Alignment at Large”: Bending the Arc of History Towards Life-Affirming Futures

welfvh

Dec 3, 2024, 9:17 PM

5 points

0 comments 4 min read LW link

Roots of Progress is hiring an event manager

jasoncrawford

Dec 3, 2024, 8:46 PM

10 points

0 comments 7 min read LW link

(rootsofprogress.notion.site)

Do simulacra dream of digital sheep?

EuanMcLean

Dec 3, 2024, 8:25 PM

16 points

36 comments 10 min read LW link

Orca communication project—seeking feedback (and collaborators)

Towards_Keeperhood

Dec 3, 2024, 5:29 PM

38 points

16 comments 2 min read LW link

Book a Time to Chat about Interp Research

Logan Riggs

Dec 3, 2024, 5:27 PM

47 points

3 comments 1 min read LW link

Balsa Research 2024 Update

Zvi

Dec 3, 2024, 12:30 PM

21 points

0 comments 5 min read LW link

(thezvi.wordpress.com)

First Solo Bus Ride

jefftk

Dec 3, 2024, 12:20 PM

28 points

1 comment 1 min read LW link

(www.jefftk.com)

How to make evals for the AISI evals bounty

TheManxLoiner

Dec 3, 2024, 10:44 AM

9 points

0 comments 5 min read LW link

Should there be just one western AGI project?

rosehadshar and Tom Davidson

Dec 3, 2024, 10:11 AM

78 points

75 comments 15 min read LW link

(www.forethought.org)

Cognitive Biases Contributing to AI X-risk — a deleted excerpt from my 2018 ARCHES draft

Andrew_Critch

Dec 3, 2024, 9:29 AM

48 points

2 comments 5 min read LW link

[Question] What is your opinion of Dr. Angelo Dilullo (meditation)?

Suh_Prance_Alot

Dec 3, 2024, 5:54 AM

0 points

2 comments 1 min read LW link

Chemical Turing Machines

Yudhister Kumar

Dec 3, 2024, 5:26 AM

10 points

2 comments 4 min read LW link

(www.yudhister.me)

MIRI’s 2024 End-of-Year Update

Rob Bensinger

Dec 3, 2024, 4:33 AM

98 points

2 comments 4 min read LW link

Linkpost: Rat Traps by Sheon Han in Asterisk Mag

Chris_Leong

Dec 3, 2024, 3:22 AM

12 points

7 comments 1 min read LW link

(asteriskmag.com)

[Question] Who are the worthwhile non-European pre-Industrial thinkers?

Lorec

Dec 3, 2024, 1:45 AM

12 points

4 comments 1 min read LW link

A Paradox of Simulated Suffering

arusarda

Dec 2, 2024, 11:44 PM

−3 points

3 comments 1 min read LW link

Levels of Thought: from Points to Fields

HNX

Dec 2, 2024, 8:25 PM

4 points

2 comments 23 min read LW link