[Question] How could AIs ‘see’ each other’s source code?

Kenny · 2 Jun 2023 22:41 UTC
29 points
45 comments · 1 min read · LW link

Proposal: labs should precommit to pausing if an AI argues for itself to be improved

NickGabs · 2 Jun 2023 22:31 UTC
3 points
3 comments · 4 min read · LW link

Inference from a Mathematical Description of an Existing Alignment Research: a proposal for an outer alignment research program

Christopher King · 2 Jun 2023 21:54 UTC
7 points
4 comments · 16 min read · LW link

Thoughts on Dancing the Whole Dance: Positional Calling for Contra

jefftk · 2 Jun 2023 20:50 UTC
10 points
0 comments · 5 min read · LW link
(www.jefftk.com)

Advice for Entering AI Safety Research

scasper · 2 Jun 2023 20:46 UTC
25 points
2 comments · 5 min read · LW link

AI should be used to find better morality

Jorterder · 2 Jun 2023 20:38 UTC
−20 points
1 comment · 1 min read · LW link

A mind needn’t be curious to reap the benefits of curiosity

So8res · 2 Jun 2023 18:00 UTC
78 points
14 comments · 1 min read · LW link

[Question] Are computationally complex algorithms expensive to have, expensive to operate, or both?

Noosphere89 · 2 Jun 2023 17:50 UTC
7 points
5 comments · 1 min read · LW link

[Replication] Conjecture’s Sparse Coding in Toy Models

2 Jun 2023 17:34 UTC
23 points
0 comments · 1 min read · LW link

Limits to Learning: Rethinking AGI’s Path to Dominance

tangerine · 2 Jun 2023 16:43 UTC
3 points
4 comments · 15 min read · LW link

The Control Problem: Unsolved or Unsolvable?

Remmelt · 2 Jun 2023 15:42 UTC
49 points
46 comments · 14 min read · LW link

Hallucinating Suction

Johannes C. Mayer · 2 Jun 2023 14:16 UTC
6 points
0 comments · 2 min read · LW link

Winning doesn’t need to flow through increases in rationality

MichelJusten · 2 Jun 2023 12:05 UTC
13 points
3 comments · 1 min read · LW link

Product Recommendation: LessWrong dialogues with Recast

Bart Bussmann · 2 Jun 2023 8:05 UTC
5 points
0 comments · 1 min read · LW link

Think carefully before calling RL policies “agents”

TurnTrout · 2 Jun 2023 3:46 UTC
124 points
35 comments · 4 min read · LW link

Dreams of “Mathopedia”

NicholasKross · 2 Jun 2023 1:30 UTC
40 points
16 comments · 2 min read · LW link
(www.thinkingmuchbetter.com)

Outreach success: Intro to AI risk that has been successful

Michael Tontchev · 1 Jun 2023 23:12 UTC
83 points
8 comments · 74 min read · LW link
(medium.com)

Open Source LLMs Can Now Actively Lie

Josh Levy · 1 Jun 2023 22:03 UTC
6 points
0 comments · 3 min read · LW link

Safe AI and moral AI

William D'Alessandro · 1 Jun 2023 21:36 UTC
−2 points
0 comments · 10 min read · LW link

AI #14: A Very Good Sentence

Zvi · 1 Jun 2023 21:30 UTC
118 points
30 comments · 65 min read · LW link
(thezvi.wordpress.com)

Four levels of understanding decision theory

Max H · 1 Jun 2023 20:55 UTC
12 points
11 comments · 4 min read · LW link

Things I Learned by Spending Five Thousand Hours In Non-EA Charities

jenn · 1 Jun 2023 20:48 UTC
387 points
34 comments · 8 min read · LW link
(jenn.site)

self-improvement-executors are not goal-maximizers

bhauth · 1 Jun 2023 20:46 UTC
14 points
0 comments · 1 min read · LW link

Experimental Fat Loss

johnlawrenceaspden · 1 Jun 2023 20:26 UTC
23 points
5 comments · 1 min read · LW link

Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?

1a3orn · 1 Jun 2023 19:36 UTC
132 points
73 comments · 24 min read · LW link

Progress links and tweets, 2023-06-01

jasoncrawford · 1 Jun 2023 19:03 UTC
10 points
3 comments · 1 min read · LW link
(rootsofprogress.org)

[Question] When does an AI become intelligent enough to become self-aware and power-seeking?

FinalFormal2 · 1 Jun 2023 18:09 UTC
1 point
1 comment · 1 min read · LW link

Uncertainty about the future does not imply that AGI will go well

Lauro Langosco · 1 Jun 2023 17:38 UTC
62 points
11 comments · 7 min read · LW link

[Question] What are the arguments for/against FOOM?

FinalFormal2 · 1 Jun 2023 17:23 UTC
6 points
0 comments · 1 min read · LW link

Change my mind: Veganism entails trade-offs, and health is one of the axes

Elizabeth · 1 Jun 2023 17:10 UTC
147 points
82 comments · 19 min read · LW link
(acesounderglass.com)

The unspoken but ridiculous assumption of AI doom: the hidden doom assumption

Christopher King · 1 Jun 2023 17:01 UTC
−9 points
1 comment · 3 min read · LW link

Don’t waste your time meditating on meditation retreats!

Anton Rodenhauser · 1 Jun 2023 16:56 UTC
23 points
7 comments · 11 min read · LW link

[Request]: Use “Epilogenics” instead of “Eugenics” in most circumstances

GeneSmith · 1 Jun 2023 15:36 UTC
39 points
49 comments · 1 min read · LW link

Book Club: Thomas Schelling’s “The Strategy of Conflict”

Optimization Process · 1 Jun 2023 15:29 UTC
6 points
1 comment · 1 min read · LW link

Probably tell your friends when they make big mistakes

Chi Nguyen · 1 Jun 2023 14:30 UTC
13 points
1 comment · 1 min read · LW link

Yes, avoiding extinction from AI *is* an urgent priority: a response to Seth Lazar, Jeremy Howard, and Arvind Narayanan.

Soroush Pour · 1 Jun 2023 13:38 UTC
17 points
0 comments · 5 min read · LW link
(www.soroushjp.com)

Work dumber not smarter

lukehmiles · 1 Jun 2023 12:40 UTC
95 points
17 comments · 3 min read · LW link

Short Remark on the (subjective) mathematical ‘naturalness’ of the Nanda–Lieberum addition modulo 113 algorithm

Spencer Becker-Kahn · 1 Jun 2023 11:31 UTC
104 points
12 comments · 2 min read · LW link

How will they feed us

meijer1973 · 1 Jun 2023 8:49 UTC
4 points
3 comments · 5 min read · LW link

“LLMs Don’t Have a Coherent Model of the World”—What it Means, Why it Matters

Davidmanheim · 1 Jun 2023 7:46 UTC
31 points
2 comments · 7 min read · LW link

General intelligence: what is it, what makes it hard, and will we have it soon?

homeopathicsyzygy · 1 Jun 2023 6:46 UTC
2 points
0 comments · 21 min read · LW link

Maximal Sentience: A Sentience Spectrum and Test Foundation

Snowyiu · 1 Jun 2023 6:45 UTC
1 point
2 comments · 4 min read · LW link

Re: The Crux List

Logan Zoellner · 1 Jun 2023 4:48 UTC
11 points
0 comments · 2 min read · LW link

An explanation of decision theories

metachirality · 1 Jun 2023 3:42 UTC
20 points
4 comments · 5 min read · LW link

Dancing to Positional Calling

jefftk · 1 Jun 2023 2:40 UTC
11 points
2 comments · 2 min read · LW link
(www.jefftk.com)

Intrinsic vs. Extrinsic Alignment

Alfonso Pérez Escudero · 1 Jun 2023 1:06 UTC
1 point
1 comment · 3 min read · LW link

Limiting factors to predict AI take-off speed

Alfonso Pérez Escudero · 31 May 2023 23:19 UTC
1 point
0 comments · 6 min read · LW link

The challenge of articulating tacit knowledge

Nina Rimsky · 31 May 2023 23:10 UTC
48 points
4 comments · 5 min read · LW link
(ninarimsky.substack.com)

Unpredictability and the Increasing Difficulty of AI Alignment for Increasingly Intelligent AI

Max_He-Ho · 31 May 2023 22:25 UTC
5 points
2 comments · 20 min read · LW link

Shutdown-Seeking AI

Simon Goldstein · 31 May 2023 22:19 UTC
48 points
31 comments · 15 min read · LW link