AI Risk and the US Presidential Candidates

Zane

6 Jan 2024 20:18 UTC

41 points

22 comments 6 min read LW link

A Challenge to Effective Altruism’s Premises

False Name

6 Jan 2024 18:46 UTC

−26 points

3 comments 3 min read LW link

Lack of Spider-Man is evidence against the simulation hypothesis

RamblinDash

6 Jan 2024 18:17 UTC

7 points

23 comments 1 min read LW link

A Land Tax For Britain

A.H.

6 Jan 2024 15:52 UTC

6 points

9 comments 4 min read LW link

Book review: Trick or treatment (2008)

Fleece Minutia

6 Jan 2024 15:40 UTC

1 point

0 comments 2 min read LW link

Are we inside a black hole?

Jay

6 Jan 2024 13:30 UTC

2 points

5 comments 1 min read LW link

Survey of 2,778 AI authors: six parts in pictures

KatjaGrace

6 Jan 2024 4:43 UTC

80 points

1 comment 2 min read LW link

Project ideas: Epistemics

Lukas Finnveden

5 Jan 2024 23:41 UTC

43 points

4 comments 17 min read LW link

(www.forethought.org)

Almost everyone I’ve met would be well-served thinking more about what to focus on

Henrik Karlsson

5 Jan 2024 21:01 UTC

98 points

9 comments 11 min read LW link 1 review

(www.henrikkarlsson.xyz)

The Next ChatGPT Moment: AI Avatars

kolmplex and southpaw

5 Jan 2024 20:14 UTC

43 points

10 comments 1 min read LW link

AI Impacts 2023 Expert Survey on Progress in AI

habryka

5 Jan 2024 19:42 UTC

28 points

2 comments 7 min read LW link

(wiki.aiimpacts.org)

Technology path dependence and evaluating expertise

bhauth and Muireall

5 Jan 2024 19:21 UTC

25 points

2 comments 15 min read LW link

The Hippie Rabbit Hole -Nuggets of Gold in Rivers of Bullshit

Jonathan Moregård

5 Jan 2024 18:27 UTC

41 points

20 comments 8 min read LW link

(honestliving.substack.com)

[Question] What technical topics could help with boundaries/membranes?

Chris Lakin

5 Jan 2024 18:14 UTC

15 points

25 comments 1 min read LW link

Catching AIs red-handed

ryan_greenblatt and Buck

5 Jan 2024 17:43 UTC

116 points

28 comments 17 min read LW link 1 review

AI Impacts Survey: December 2023 Edition

Zvi

5 Jan 2024 14:40 UTC

34 points

6 comments 10 min read LW link

(thezvi.wordpress.com)

Forecast your 2024 with Fatebook

Sage Future

5 Jan 2024 14:07 UTC

19 points

0 comments 1 min read LW link

(fatebook.io)

Predictive model agents are sort of corrigible

Raymond Douglas

5 Jan 2024 14:05 UTC

35 points

6 comments 3 min read LW link

Striking Implications for Learning Theory, Interpretability — and Safety?

RogerDearnaley

5 Jan 2024 8:46 UTC

37 points

4 comments 2 min read LW link

If I ran the zoo

Optimization Process

5 Jan 2024 5:14 UTC

18 points

1 comment 2 min read LW link

Does AI care about reality or just its own perception?

RedFishBlueFish

5 Jan 2024 4:05 UTC

−6 points

8 comments 1 min read LW link

MIRI 2024 Mission and Strategy Update

Malo

5 Jan 2024 0:20 UTC

223 points

44 comments 8 min read LW link

Project ideas: Governance during explosive technological growth

Lukas Finnveden

4 Jan 2024 23:51 UTC

20 points

0 comments 16 min read LW link

(www.forethought.org)

Hello

S Benfield

4 Jan 2024 23:35 UTC

6 points

0 comments 2 min read LW link

Using Threats to Achieve Socially Optimal Outcomes

StrivingForLegibility

4 Jan 2024 23:30 UTC

8 points

0 comments 3 min read LW link

Best-Responding Is Not Always the Best Response

StrivingForLegibility

4 Jan 2024 23:30 UTC

10 points

0 comments 3 min read LW link

Safety Data Sheets for Optimization Processes

StrivingForLegibility

4 Jan 2024 23:30 UTC

15 points

1 comment 4 min read LW link

The Gears of Argmax

StrivingForLegibility

4 Jan 2024 23:30 UTC

12 points

1 comment 3 min read LW link

Cellular reprogramming, pneumatic launch systems, and terraforming Mars: Some things I learned about at Foresight Vision Weekend

jasoncrawford

4 Jan 2024 19:33 UTC

28 points

0 comments 8 min read LW link

(rootsofprogress.org)

Deep atheism and AI risk

Joe Carlsmith

4 Jan 2024 18:58 UTC

155 points

24 comments 27 min read LW link 2 reviews

Some Vacation Photos

johnswentworth

4 Jan 2024 17:15 UTC

83 points

0 comments 1 min read LW link

AISN #29: Progress on the EU AI Act Plus, the NY Times sues OpenAI for Copyright Infringement, and Congressional Questions about Research Standards in AI Safety

Dan H and Corin Katzke

4 Jan 2024 16:09 UTC

8 points

0 comments 6 min read LW link

(newsletter.safe.ai)

EAG Bay Area Satellite event: AI Institution Design Hackathon 2024

elte

4 Jan 2024 15:02 UTC

1 point

0 comments 1 min read LW link

AI #45: To Be Determined

Zvi

4 Jan 2024 15:00 UTC

52 points

4 comments 31 min read LW link

(thezvi.wordpress.com)

Screen-supported Portable Monitor

jefftk

4 Jan 2024 13:50 UTC

16 points

10 comments 1 min read LW link

(www.jefftk.com)

[Question] Which investments for aligned-AI outcomes?

tailcalled

4 Jan 2024 13:28 UTC

9 points

9 comments 2 min read LW link

Non-alignment project ideas for making transformative AI go well

Lukas Finnveden

4 Jan 2024 7:23 UTC

44 points

1 comment 3 min read LW link

(www.forethought.org)

Fact Checking and Retaliation Against Sources

jefftk

4 Jan 2024 0:41 UTC

7 points

2 comments 4 min read LW link

(www.jefftk.com)

Investigating Alternative Futures: Human and Superintelligence Interaction Scenarios

Hiroshi Yamakawa

3 Jan 2024 23:46 UTC

2 points

0 comments 17 min read LW link

“Attitudes Toward Artificial General Intelligence: Results from American Adults 2021 and 2023”—call for reviewers (Seeds of Science)

rogersbacon

3 Jan 2024 20:11 UTC

4 points

0 comments 1 min read LW link

What’s up with LLMs representing XORs of arbitrary features?

Sam Marks

3 Jan 2024 19:44 UTC

159 points

64 comments 16 min read LW link

Spirit Airlines Merger Play

sapphire

3 Jan 2024 19:25 UTC

5 points

12 comments 1 min read LW link

$300 for the best sci-fi prompt: the results

RomanS

3 Jan 2024 19:10 UTC

16 points

19 comments 7 min read LW link

Agent membranes/boundaries and formalizing “safety”

Chris Lakin

3 Jan 2024 17:55 UTC

26 points

46 comments 3 min read LW link

Safety First: safety before full alignment. The deontic sufficiency hypothesis.

Chris Lakin

3 Jan 2024 17:55 UTC

48 points

3 comments 3 min read LW link

Practically A Book Review: Appendix to “Nonlinear’s Evidence: Debunking False and Misleading Claims” (ThingOfThings)

tailcalled

3 Jan 2024 17:07 UTC

111 points

25 comments 2 min read LW link

(thingofthings.substack.com)

Trivial Mathematics as a Path Forward

ACrackedPot

3 Jan 2024 16:41 UTC

−4 points

2 comments 2 min read LW link

Copyright Confrontation #1

Zvi

3 Jan 2024 15:50 UTC

34 points

7 comments 18 min read LW link

(thezvi.wordpress.com)

[Question] Theoretically, could we balance the budget painlessly?

Logan Zoellner

3 Jan 2024 14:46 UTC

4 points

12 comments 1 min read LW link

Johannes’ Biography

Johannes C. Mayer

3 Jan 2024 13:27 UTC

28 points

0 comments 10 min read LW link