All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Project ideas: Governance during explosive technological growth

Lukas Finnveden4 Jan 2024 23:51 UTC

20 points

0 comments16 min readLW link

(www.forethought.org)

Hello

S Benfield4 Jan 2024 23:35 UTC

6 points

0 comments2 min readLW link

Using Threats to Achieve Socially Optimal Outcomes

StrivingForLegibility4 Jan 2024 23:30 UTC

8 points

0 comments3 min readLW link

Best-Responding Is Not Always the Best Response

StrivingForLegibility4 Jan 2024 23:30 UTC

10 points

0 comments3 min readLW link

Safety Data Sheets for Optimization Processes

StrivingForLegibility4 Jan 2024 23:30 UTC

15 points

1 comment4 min readLW link

The Gears of Argmax

StrivingForLegibility4 Jan 2024 23:30 UTC

12 points

1 comment3 min readLW link

Cellular reprogramming, pneumatic launch systems, and terraforming Mars: Some things I learned about at Foresight Vision Weekend

jasoncrawford4 Jan 2024 19:33 UTC

28 points

0 comments8 min readLW link

(rootsofprogress.org)

Deep atheism and AI risk

Joe Carlsmith4 Jan 2024 18:58 UTC

155 points

24 comments27 min readLW link 2 reviews

Some Vacation Photos

johnswentworth4 Jan 2024 17:15 UTC

83 points

0 comments1 min readLW link

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Dan H and Corin Katzke

4 Jan 2024 16:09 UTC

8 points

0 comments6 min readLW link

(newsletter.safe.ai)

EAG Bay Area Satellite event: AI Institution Design Hackathon 2024

elte4 Jan 2024 15:02 UTC

1 point

0 comments1 min readLW link

AI #45: To Be Determined

Zvi4 Jan 2024 15:00 UTC

52 points

4 comments31 min readLW link

(thezvi.wordpress.com)

Screen-supported Portable Monitor

jefftk4 Jan 2024 13:50 UTC

16 points

10 comments1 min readLW link

(www.jefftk.com)

[Question] Which investments for aligned-AI outcomes?

tailcalled4 Jan 2024 13:28 UTC

9 points

9 comments2 min readLW link

Non-alignment project ideas for making transformative AI go well

Lukas Finnveden4 Jan 2024 7:23 UTC

44 points

1 comment3 min readLW link

(www.forethought.org)

Fact Checking and Retaliation Against Sources

jefftk4 Jan 2024 0:41 UTC

7 points

2 comments4 min readLW link

(www.jefftk.com)

Investigating Alternative Futures: Human and Superintelligence Interaction Scenarios

Hiroshi Yamakawa3 Jan 2024 23:46 UTC

2 points

0 comments17 min readLW link

“Attitudes Toward Artificial General Intelligence: Results from American Adults 2021 and 2023”—call for reviewers (Seeds of Science)

rogersbacon3 Jan 2024 20:11 UTC

4 points

0 comments1 min readLW link

What’s up with LLMs representing XORs of arbitrary features?

Sam Marks3 Jan 2024 19:44 UTC

159 points

64 comments16 min readLW link

Spirit Airlines Merger Play

sapphire3 Jan 2024 19:25 UTC

5 points

12 comments1 min readLW link

$300 for the best sci-fi prompt: the results

RomanS3 Jan 2024 19:10 UTC

16 points

19 comments7 min readLW link

Agent membranes/boundaries and formalizing “safety”

Chris Lakin3 Jan 2024 17:55 UTC

26 points

46 comments3 min readLW link

Safety First: safety before full alignment. The deontic sufficiency hypothesis.

Chris Lakin3 Jan 2024 17:55 UTC

48 points

3 comments3 min readLW link

Practically A Book Review: Appendix to “Nonlinear’s Evidence: Debunking False and Misleading Claims” (ThingOfThings)

tailcalled3 Jan 2024 17:07 UTC

111 points

25 comments2 min readLW link

(thingofthings.substack.com)

Trivial Mathematics as a Path Forward

ACrackedPot3 Jan 2024 16:41 UTC

−4 points

2 comments2 min readLW link

Copyright Confrontation #1

Zvi3 Jan 2024 15:50 UTC

34 points

7 comments18 min readLW link

(thezvi.wordpress.com)

[Question] Theoretically, could we balance the budget painlessly?

Logan Zoellner3 Jan 2024 14:46 UTC

4 points

12 comments1 min readLW link

Johannes’ Biography

Johannes C. Mayer3 Jan 2024 13:27 UTC

28 points

0 comments10 min readLW link

What Helped Me—Kale, Blood, CPAP, X-tiamine, Methylphenidate

Johannes C. Mayer3 Jan 2024 13:22 UTC

38 points

12 comments2 min readLW link

[Question] Does LessWrong make a difference when it comes to AI alignment?

PhilosophicalSoul3 Jan 2024 12:21 UTC

18 points

13 comments1 min readLW link

[Question] Terminology: <something>-ware for ML?

Oliver Sourbut3 Jan 2024 11:42 UTC

17 points

27 comments1 min readLW link

Trading off Lives

jefftk3 Jan 2024 3:40 UTC

55 points

12 comments2 min readLW link

(www.jefftk.com)

MonoPoly Restricted Trust

ymeskhout2 Jan 2024 23:02 UTC

31 points

37 comments9 min readLW link

Agent membranes and causal distance

Chris Lakin2 Jan 2024 22:43 UTC

20 points

3 comments3 min readLW link

Focusing on Mal-Alignment

John Fisher2 Jan 2024 19:51 UTC

1 point

0 comments1 min readLW link

Gentleness and the artificial Other

Joe Carlsmith2 Jan 2024 18:21 UTC

322 points

34 comments11 min readLW link 1 review

Otherness and control in the age of AGI

Joe Carlsmith2 Jan 2024 18:15 UTC

51 points

1 comment7 min readLW link 1 review

Apologizing is a Core Rationalist Skill

johnswentworth2 Jan 2024 17:47 UTC

157 points

42 comments5 min readLW link

Cortés, AI Risk, and the Dynamics of Competing Conquerors

James_Miller2 Jan 2024 16:37 UTC

14 points

3 comments3 min readLW link

OpenAI’s Preparedness Framework: Praise & Recommendations

Orpheus162 Jan 2024 16:20 UTC

66 points

1 comment7 min readLW link

Dating Roundup #2: If At First You Don’t Succeed

Zvi2 Jan 2024 16:00 UTC

54 points

29 comments47 min readLW link

(thezvi.wordpress.com)

Looking for Reading Recommendations: Content Moderation, Power & Censorship

Joerg Weiss2 Jan 2024 11:37 UTC

2 points

7 comments1 min readLW link

AI Is Not Software

Davidmanheim2 Jan 2024 7:58 UTC

63 points

29 comments5 min readLW link

Are Metaculus AI Timelines Inconsistent?

Chris_Leong2 Jan 2024 6:47 UTC

17 points

7 comments2 min readLW link

Boston Solstice 2023 Retrospective

jefftk2 Jan 2024 3:10 UTC

33 points

0 comments6 min readLW link

(www.jefftk.com)

Steering Llama-2 with contrastive activation additions

Nina Panickssery, Wuschel Schulz, NickGabs, Meg, evhub and TurnTrout

2 Jan 2024 0:47 UTC

125 points

29 comments8 min readLW link

(arxiv.org)

Twin Cities ACX Meetup—January 2024

Timothy M.1 Jan 2024 21:13 UTC

1 point

2 comments1 min readLW link

San Francisco ACX Meetup “First Saturday”

guenael1 Jan 2024 20:58 UTC

1 point

1 comment1 min readLW link

Mech Interp Challenge: January—Deciphering the Caesar Cipher Model

CallumMcDougall1 Jan 2024 18:03 UTC

17 points

0 comments3 min readLW link

Aldix and the Book of Life

ville1 Jan 2024 17:23 UTC

1 point

0 comments4 min readLW link

(medium.com)