All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan Feb Mar Apr May Jun Jul Aug Sep Oct NovDec

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Best-of-N Jailbreaking

John Hughes, saraprice, Aengus Lynch, Rylan Schaeffer, Fazl, Henry Sleight, Ethan Perez and mrinank_sharma

Dec 14, 2024, 4:58 AM

78 points

5 comments2 min readLW link

(arxiv.org)

The 2023 LessWrong Review: The Basic Ask

RaemonDec 4, 2024, 7:52 PM

77 points

25 comments9 min readLW link

2025 Prediction Thread

habrykaDec 30, 2024, 1:50 AM

77 points

21 comments1 min readLW link

When AI 10x’s AI R&D, What Do We Do?

Logan RiggsDec 21, 2024, 11:56 PM

72 points

16 comments4 min readLW link

Intricacies of Feature Geometry in Large Language Models

7vik, Lucius Bushnaq and Nandi

Dec 7, 2024, 6:10 PM

71 points

0 comments12 min readLW link

An Illustrated Summary of “Robust Agents Learn Causal World Model”

DalcyDec 14, 2024, 3:02 PM

67 points

2 comments10 min readLW link

Learn to write well BEFORE you have something worth saying

eukaryoteDec 29, 2024, 11:42 PM

67 points

18 comments3 min readLW link

(eukaryotewritesblog.com)

Drexler’s Nanotech Software

PeterMcCluskeyDec 2, 2024, 4:55 AM

67 points

9 comments4 min readLW link

(bayesianinvestor.com)

Anthropic leadership conversation

Zach Stein-PerlmanDec 20, 2024, 10:00 PM

67 points

17 comments6 min readLW link

(www.youtube.com)

Checking in on Scott’s composition image bet with imagen 3

Dave OrrDec 22, 2024, 7:04 PM

65 points

0 comments1 min readLW link

Retrospective: PIBBSS Fellowship 2024

DusanDNesic, clem_acs and Lucas Teixeira

Dec 20, 2024, 3:55 PM

64 points

1 comment4 min readLW link

A Qualitative Case for LTFF: Filling Critical Ecosystem Gaps

LinchDec 3, 2024, 9:57 PM

64 points

2 comments LW link

Zen and The Art of Semiconductor Manufacturing

RecurrentedDec 9, 2024, 5:19 PM

64 points

2 comments9 min readLW link

(futuring.substack.com)

RL, but don’t do anything I wouldn’t do

Gunnar_ZarnckeDec 7, 2024, 10:54 PM

63 points

5 comments1 min readLW link

(arxiv.org)

o3, Oh My

ZviDec 30, 2024, 2:10 PM

63 points

17 comments36 min readLW link

(thezvi.wordpress.com)

Measuring whether AIs can statelessly strategize to subvert security measures

Alex Mallen and Buck

Dec 19, 2024, 9:25 PM

62 points

0 comments11 min readLW link

ReSolsticed vol I: “We’re Not Going Quietly”

RaemonDec 26, 2024, 5:52 PM

61 points

4 comments19 min readLW link

Cognitive Work and AI Safety: A Thermodynamic Perspective

Daniel MurfetDec 8, 2024, 9:42 PM

61 points

9 comments4 min readLW link

A case for donating to AI risk reduction (including if you work in AI)

tlevinDec 2, 2024, 7:05 PM

61 points

2 comments LW link

Ideas for benchmarking LLM creativity

gwernDec 16, 2024, 5:18 AM

60 points

11 comments1 min readLW link

(gwern.net)

Funding Case: AI Safety Camp 11

Remmelt, Robert Kralisch and Linda Linsefors

Dec 23, 2024, 8:51 AM

60 points

4 comments6 min readLW link

(manifund.org)

o1 Turns Pro

ZviDec 10, 2024, 5:00 PM

59 points

3 comments14 min readLW link

(thezvi.wordpress.com)

AI #95: o1 Joins the API

ZviDec 19, 2024, 3:10 PM

58 points

1 comment41 min readLW link

(thezvi.wordpress.com)

AI #96: o3 But Not Yet For Thee

ZviDec 26, 2024, 8:30 PM

58 points

8 comments36 min readLW link

(thezvi.wordpress.com)

AI Assistants Should Have a Direct Line to Their Developers

Jan_KulveitDec 28, 2024, 5:01 PM

57 points

6 comments2 min readLW link

Luck Based Medicine: No Good Very Bad Winter Cured My Hypothyroidism

ElizabethDec 8, 2024, 8:10 PM

55 points

3 comments2 min readLW link

(acesounderglass.com)

Vegans need to eat just enough Meat—emperically evaluate the minimum ammount of meat that maximizes utility

Johannes C. MayerDec 22, 2024, 10:08 PM

55 points

35 comments3 min readLW link

[Question] What Have Been Your Most Valuable Casual Conversations At Conferences?

johnswentworthDec 25, 2024, 5:49 AM

54 points

21 comments1 min readLW link

I Finally Worked Through Bayes’ Theorem (Personal Achievement)

keltanDec 5, 2024, 2:04 AM

53 points

7 comments9 min readLW link

A toy evaluation of inference code tampering

Fabien RogerDec 9, 2024, 5:43 PM

52 points

0 comments9 min readLW link

(alignment.anthropic.com)

Just one more exposure bro

ChipmonkDec 12, 2024, 9:37 PM

52 points

6 comments2 min readLW link

(chrislakin.blog)

Correct my H5N1 research

ElizabethDec 9, 2024, 7:07 PM

52 points

24 comments2 min readLW link

Considerations on orca intelligence

Towards_KeeperhoodDec 29, 2024, 2:35 PM

51 points

14 comments9 min readLW link

A Solution for AGI/ASI Safety

Weibing WangDec 18, 2024, 7:44 PM

50 points

29 comments1 min readLW link

D&D.Sci Dungeonbuilding: the Dungeon Tournament

aphyerDec 14, 2024, 4:30 AM

49 points

16 comments3 min readLW link

AI #94: Not Now, Google

ZviDec 12, 2024, 3:40 PM

49 points

3 comments64 min readLW link

(thezvi.wordpress.com)

A dataset of questions on decision-theoretic reasoning in Newcomb-like problems

Caspar Oesterheld, Ethan Perez and Chi Nguyen

Dec 16, 2024, 10:42 PM

49 points

1 comment2 min readLW link

(arxiv.org)

Careless thinking: A theory of bad thinking

Nathan YoungDec 17, 2024, 6:23 PM

49 points

17 comments9 min readLW link

(nathanpmyoung.substack.com)

Analysis of Global AI Governance Strategies

Sammy Martin, Justin Bullock and Corin Katzke

Dec 4, 2024, 10:45 AM

49 points

10 comments36 min readLW link

Greedy-Advantage-Aware RLHF

sej2020Dec 27, 2024, 7:47 PM

48 points

15 comments13 min readLW link

Cognitive Biases Contributing to AI X-risk — a deleted excerpt from my 2018 ARCHES draft

Andrew_CritchDec 3, 2024, 9:29 AM

48 points

2 comments5 min readLW link

Book a Time to Chat about Interp Research

Logan RiggsDec 3, 2024, 5:27 PM

47 points

3 comments1 min readLW link

Review: Breaking Free with Dr. Stone

TurnTroutDec 18, 2024, 1:26 AM

47 points

5 comments1 min readLW link

(turntrout.com)

Deep Learning is cheap Solomonoff induction?

Lucius Bushnaq, Kaarel and Dmitry Vaintrob

Dec 7, 2024, 11:00 AM

45 points

1 comment17 min readLW link

Detection of Asymptomatically Spreading Pathogens

jefftkDec 5, 2024, 6:20 PM

45 points

8 comments7 min readLW link

(www.jefftk.com)

The Deep Lore of LightHaven, with Oliver Habryka (TBC episode 228)

Eneasz and habryka

Dec 24, 2024, 10:45 PM

45 points

4 comments91 min readLW link

(thebayesianconspiracy.substack.com)

Conjecture: A Roadmap for Cognitive Software and A Humanist Future of AI

Connor Leahy and Gabriel Alfour

Dec 2, 2024, 1:28 PM

44 points

10 comments29 min readLW link

(www.conjecture.dev)

Preppers Are Too Negative on Objects

jefftkDec 18, 2024, 2:30 AM

44 points

2 comments1 min readLW link

(www.jefftk.com)

Began a pay-on-results coaching experiment, made $40,300 since July

ChipmonkDec 29, 2024, 9:12 PM

43 points

15 comments1 min readLW link

(chrislakin.blog)

Claude’s Constitutional Consequentialism?

1a3ornDec 19, 2024, 7:53 PM

43 points

6 comments6 min readLW link

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer