Hiring decisions are not suitable for prediction markets

SimonM · Jan 8, 2024, 9:11 PM
12 points
6 comments · 1 min read · LW link

Better Anomia

jefftk · Jan 8, 2024, 6:40 PM
8 points
0 comments · 1 min read · LW link
(www.jefftk.com)

A starter guide for evals

Jan 8, 2024, 6:24 PM
54 points
2 comments · 12 min read · LW link
(www.apolloresearch.ai)

Is it justifiable for non-experts to have strong opinions about Gaza?

Jan 8, 2024, 5:31 PM
23 points
12 comments · 30 min read · LW link

Project ideas: Backup plans & Cooperative AI

Lukas Finnveden · Jan 8, 2024, 5:19 PM
18 points
0 comments · LW link
(www.forethought.org)

Hackathon and Staying Up-to-Date in AI

jacobhaimes · Jan 8, 2024, 5:10 PM
11 points
0 comments · 1 min read · LW link
(into-ai-safety.github.io)

When “yang” goes wrong

Joe Carlsmith · Jan 8, 2024, 4:35 PM
73 points
6 comments · 13 min read · LW link

Task vectors & analogy making in LLMs

Sergii · Jan 8, 2024, 3:17 PM
9 points
1 comment · 4 min read · LW link
(grgv.xyz)

[Question] How to find translations of a book?

Viliam · Jan 8, 2024, 2:57 PM
9 points
8 comments · 1 min read · LW link

[Question] Why aren’t Yudkowsky & Bostrom getting more attention now?

JoshuaFox · Jan 8, 2024, 2:42 PM
14 points
8 comments · 1 min read · LW link

2023 Prediction Evaluations

Zvi · Jan 8, 2024, 2:40 PM
47 points
0 comments · 28 min read · LW link
(thezvi.wordpress.com)

There is no sharp boundary between deontology and consequentialism

quetzal_rainbow · Jan 8, 2024, 11:01 AM
8 points
2 comments · 1 min read · LW link

Reflections on my first year of AI safety research

Jay Bailey · Jan 8, 2024, 7:49 AM
53 points
3 comments · LW link

Why There Is Hope For An Alignment Solution

Darklight · Jan 8, 2024, 6:58 AM
10 points
0 comments · 12 min read · LW link

Sledding Among Hazards

jefftk · Jan 8, 2024, 3:30 AM
19 points
5 comments · 1 min read · LW link
(www.jefftk.com)

Utility is relative

CrimsonChin · Jan 8, 2024, 2:31 AM
2 points
4 comments · 2 min read · LW link

A model of research skill

L Rudolf L · Jan 8, 2024, 12:13 AM
60 points
6 comments · 12 min read · LW link
(www.strataoftheworld.com)

We shouldn’t fear superintelligence because it already exists

Spencer Chubb · Jan 7, 2024, 5:59 PM
−22 points
14 comments · 1 min read · LW link

(Partial) failure in replicating deceptive alignment experiment

claudia.biancotti · Jan 7, 2024, 5:56 PM
1 point
0 comments · 1 min read · LW link

Project ideas: Sentience and rights of digital minds

Lukas Finnveden · Jan 7, 2024, 5:34 PM
20 points
0 comments · LW link
(www.forethought.org)

Deceptive AI ≠ Deceptively-aligned AI

Steven Byrnes · Jan 7, 2024, 4:55 PM
96 points
19 comments · 6 min read · LW link

Bayesians Commit the Gambler’s Fallacy

Kevin Dorst · Jan 7, 2024, 12:54 PM
49 points
30 comments · 8 min read · LW link
(kevindorst.substack.com)

Towards AI Safety Infrastructure: Talk & Outline

Paul Bricman · Jan 7, 2024, 9:31 AM
11 points
0 comments · 2 min read · LW link
(www.youtube.com)

Defending against hypothetical moon life during Apollo 11

eukaryote · Jan 7, 2024, 4:49 AM
57 points
9 comments · 32 min read · LW link
(eukaryotewritesblog.com)

The Sequences on YouTube

Neil · Jan 7, 2024, 1:44 AM
26 points
9 comments · 2 min read · LW link

AI Risk and the US Presidential Candidates

Zane · Jan 6, 2024, 8:18 PM
41 points
22 comments · 6 min read · LW link

A Challenge to Effective Altruism’s Premises

False Name · Jan 6, 2024, 6:46 PM
−26 points
3 comments · 3 min read · LW link

Lack of Spider-Man is evidence against the simulation hypothesis

RamblinDash · Jan 6, 2024, 6:17 PM
7 points
23 comments · 1 min read · LW link

A Land Tax For Britain

A.H. · Jan 6, 2024, 3:52 PM
6 points
9 comments · 4 min read · LW link

Book review: Trick or treatment (2008)

Fleece Minutia · Jan 6, 2024, 3:40 PM
1 point
0 comments · 2 min read · LW link

Are we inside a black hole?

Jay · Jan 6, 2024, 1:30 PM
2 points
5 comments · 1 min read · LW link

Survey of 2,778 AI authors: six parts in pictures

KatjaGrace · Jan 6, 2024, 4:43 AM
80 points
1 comment · 2 min read · LW link

Project ideas: Epistemics

Lukas Finnveden · Jan 5, 2024, 11:41 PM
43 points
4 comments · LW link
(www.forethought.org)

Almost everyone I’ve met would be well-served thinking more about what to focus on

Henrik Karlsson · Jan 5, 2024, 9:01 PM
96 points
8 comments · 11 min read · LW link
(www.henrikkarlsson.xyz)

The Next ChatGPT Moment: AI Avatars

Jan 5, 2024, 8:14 PM
43 points
10 comments · 1 min read · LW link

AI Impacts 2023 Expert Survey on Progress in AI

habryka · Jan 5, 2024, 7:42 PM
28 points
2 comments · 7 min read · LW link
(wiki.aiimpacts.org)

Technology path dependence and evaluating expertise

Jan 5, 2024, 7:21 PM
25 points
2 comments · 15 min read · LW link

The Hippie Rabbit Hole - Nuggets of Gold in Rivers of Bullshit

Jonathan Moregård · Jan 5, 2024, 6:27 PM
39 points
20 comments · 8 min read · LW link
(honestliving.substack.com)

[Question] What technical topics could help with boundaries/membranes?

Chipmonk · Jan 5, 2024, 6:14 PM
15 points
25 comments · 1 min read · LW link

Catching AIs red-handed

Jan 5, 2024, 5:43 PM
111 points
27 comments · 17 min read · LW link

AI Impacts Survey: December 2023 Edition

Zvi · Jan 5, 2024, 2:40 PM
34 points
6 comments · 10 min read · LW link
(thezvi.wordpress.com)

Forecast your 2024 with Fatebook

Sage Future · Jan 5, 2024, 2:07 PM
19 points
0 comments · 1 min read · LW link
(fatebook.io)

Predictive model agents are sort of corrigible

Raymond Douglas · Jan 5, 2024, 2:05 PM
35 points
6 comments · 3 min read · LW link

Striking Implications for Learning Theory, Interpretability — and Safety?

RogerDearnaley · Jan 5, 2024, 8:46 AM
37 points
4 comments · 2 min read · LW link

If I ran the zoo

Optimization Process · Jan 5, 2024, 5:14 AM
18 points
1 comment · 2 min read · LW link

Does AI care about reality or just its own perception?

RedFishBlueFish · Jan 5, 2024, 4:05 AM
−6 points
8 comments · 1 min read · LW link

MIRI 2024 Mission and Strategy Update

Malo · Jan 5, 2024, 12:20 AM
223 points
44 comments · 8 min read · LW link

Project ideas: Governance during explosive technological growth

Lukas Finnveden · Jan 4, 2024, 11:51 PM
14 points
0 comments · LW link
(www.forethought.org)

Hello

S Benfield · Jan 4, 2024, 11:35 PM
6 points
0 comments · 2 min read · LW link

Using Threats to Achieve Socially Optimal Outcomes

StrivingForLegibility · Jan 4, 2024, 11:30 PM
8 points
0 comments · 3 min read · LW link