All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 161718 19 20 21 22 23 24 25 26 27 28 29 30 31

Balsa Update and General Thank You

ZviDec 12, 2023, 8:30 PM

61 points

8 comments8 min readLW link

(thezvi.wordpress.com)

Towards an Ethics Calculator for Use by an AGI

sweenesmDec 12, 2023, 6:37 PM

3 points

2 comments11 min readLW link

Why Psychologists Are Wrong About The Illusion Of Explanatory Depth

moses onyedikachukwuDec 12, 2023, 6:32 PM

1 point

0 comments4 min readLW link

A design concept for superintelligent machines (and Popper’s critique of induction)

tiplur-bilrexDec 12, 2023, 6:31 PM

−7 points

6 comments1 min readLW link

(tiplur-bilrex.tlon.network)

Significantly Enhancing Adult Intelligence With Gene Editing May Be Possible

GeneSmith and kman

Dec 12, 2023, 6:14 PM

459 points

206 comments33 min readLW link 2 reviews

[Question] Why No Automated Plagerism Detection For Past Papers?

Lao MeinDec 12, 2023, 5:24 PM

7 points

10 comments1 min readLW link

OpenAI: Leaks Confirm the Story

ZviDec 12, 2023, 2:00 PM

77 points

9 comments16 min readLW link

(thezvi.wordpress.com)

Navigating the Attackspace

Jonas KgomoDec 12, 2023, 1:59 PM

1 point

0 comments2 min readLW link

Nonlinear’s Evidence: Debunking False and Misleading Claims

KatWoodsDec 12, 2023, 1:16 PM

104 points

171 comments LW link

AI Institution Design Hackathon (EAG Bay Area Satellite Event)

beatrice@foresight.org and Allison Duettmann

Dec 12, 2023, 1:10 PM

1 point

0 comments1 min readLW link

Funding case: AI Safety Camp 10

Remmelt and Linda Linsefors

Dec 12, 2023, 9:08 AM

66 points

5 comments6 min readLW link

(manifund.org)

What is the next level of rationality?

lsusr and Yoav Ravid

Dec 12, 2023, 8:14 AM

48 points

24 comments7 min readLW link

Embedded Agents are Quines

lsusr and DaemonicSigil

Dec 12, 2023, 4:57 AM

11 points

7 comments8 min readLW link

Predict the future! Earn fake internet points! Get a (free) gambling addiction!

Robert CousineauDec 12, 2023, 4:39 AM

3 points

0 comments1 min readLW link

The likely first longevity drug is based on sketchy science. This is bad for science and bad for longevity.

BobBurgersDec 12, 2023, 2:42 AM

161 points

34 comments5 min readLW link

When will GPT-5 come out? Prediction markets vs. Extrapolation

MalteDec 12, 2023, 2:41 AM

12 points

9 comments3 min readLW link

On plans for a functional society

kave and Vaniver

Dec 12, 2023, 12:07 AM

41 points

8 comments13 min readLW link

Secondary Risk Markets

VaniverDec 11, 2023, 9:52 PM

35 points

4 comments4 min readLW link

Has anyone experimented with Dodrio, a tool for exploring transformer models through interactive visualization?

Bill BenzonDec 11, 2023, 8:34 PM

4 points

0 comments1 min readLW link

[Valence series] 3. Valence & Beliefs

Steven ByrnesDec 11, 2023, 8:21 PM

77 points

12 comments21 min readLW link 1 review

[Question] Am I ethically obligated to extend the life of my dog with life-extension treatments about to hit the market?

TrudosKudosDec 11, 2023, 7:41 PM

−3 points

2 comments1 min readLW link

Adversarial Robustness Could Help Prevent Catastrophic Misuse

aogDec 11, 2023, 7:12 PM

30 points

18 comments9 min readLW link

The Consciousness Box

GradualImprovementDec 11, 2023, 4:45 PM

33 points

24 comments4 min readLW link

Empirical work that might shed light on scheming (Section 6 of “Scheming AIs”)

Joe CarlsmithDec 11, 2023, 4:30 PM

8 points

0 comments21 min readLW link

Into AI Safety: Episode 3

jacobhaimesDec 11, 2023, 4:30 PM

6 points

0 comments1 min readLW link

(into-ai-safety.github.io)

Implicitly Typed C

jefftkDec 11, 2023, 4:10 PM

16 points

0 comments1 min readLW link

(www.jefftk.com)

37C3 Hacker x Rationalist Meetup

Kiboneu and ctrltab

Dec 11, 2023, 4:02 PM

5 points

5 comments1 min readLW link

re: Yudkowsky on biological materials

bhauthDec 11, 2023, 1:28 PM

182 points

30 comments5 min readLW link

Ideoculture

elvDec 11, 2023, 10:29 AM

8 points

2 comments6 min readLW link

Quick thoughts on the implications of multi-agent views of mind on AI takeover

Kaj_SotalaDec 11, 2023, 6:34 AM

47 points

14 comments4 min readLW link

Auditing failures vs concentrated failures

ryan_greenblatt and Fabien Roger

Dec 11, 2023, 2:47 AM

47 points

1 comment7 min readLW link 1 review

Deeply Cover Car Crashes?

jefftkDec 10, 2023, 10:20 PM

36 points

32 comments1 min readLW link

(www.jefftk.com)

Principles For Product Liability (With Application To AI)

johnswentworthDec 10, 2023, 9:27 PM

37 points

55 comments10 min readLW link

[Question] What do you do to remember and reference the LessWrong posts that were most personally significant to you, in terms of intellectual development or general usefulness?

lillybaeumDec 10, 2023, 5:52 PM

5 points

7 comments1 min readLW link

[Question] Do websites and apps actually generally get worse after updates, or is it just an effect of the fear of change?

lillybaeumDec 10, 2023, 5:26 PM

36 points

35 comments2 min readLW link 1 review

How LDT helps reduce the AI arms race

Tamsin LeakeDec 10, 2023, 4:21 PM

65 points

13 comments4 min readLW link

(carado.moe)

Understanding Subjective Probabilities

Isaac KingDec 10, 2023, 6:03 AM

31 points

16 comments10 min readLW link

Send us example gnarly bugs

Beth Barnes, Megan Kinniment and Tao Lin

Dec 10, 2023, 5:23 AM

77 points

10 comments2 min readLW link

Conceptual coherence for concrete categories in humans and LLMs

Bill BenzonDec 9, 2023, 11:49 PM

13 points

1 comment2 min readLW link

2d ai-partners as a comprehensive motivation tool

AiresJLDec 9, 2023, 9:59 PM

3 points

0 comments1 min readLW link

Without—MicroFiction 250 words

Carissa CassielDec 9, 2023, 9:49 PM

20 points

1 comment1 min readLW link

Some negative steganography results

Fabien RogerDec 9, 2023, 8:22 PM

60 points

5 comments2 min readLW link

Summing up “Scheming AIs” (Section 5)

Joe Carlsmith9 Dec 2023 15:48 UTC

2 points

1 comment11 min readLW link

The Offense-Defense Balance Rarely Changes

Maxwell Tabarrok9 Dec 2023 15:21 UTC

77 points

23 comments3 min readLW link

(maximumprogress.substack.com)

A Philosophical Tautology

Nox ML9 Dec 2023 14:06 UTC

−2 points

45 comments2 min readLW link

Unpicking Extinction

ukc100149 Dec 2023 9:15 UTC

35 points

10 comments10 min readLW link

Finding Sparse Linear Connections between Features in LLMs

Logan Riggs, Sam Mitchell and Adam Kaufman

9 Dec 2023 2:27 UTC

70 points

5 comments10 min readLW link

[Question] Option Space Nomenclature

SilverFlame8 Dec 2023 23:14 UTC

1 point

0 comments1 min readLW link

“Model UN Solutions”

Arjun Panickssery8 Dec 2023 23:06 UTC

36 points

5 comments1 min readLW link

(open.substack.com)

Speed arguments against scheming (Section 4.4-4.7 of “Scheming AIs”)

Joe Carlsmith8 Dec 2023 21:09 UTC

9 points

0 comments15 min readLW link