All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

The ants and the grasshopper

Richard_Ngo4 Jun 2023 22:00 UTC

506 points

45 comments5 min readLW link 4 reviews

(www.narrativeark.xyz)

[Question] implications of NN design for education

bhauth4 Jun 2023 20:50 UTC

9 points

3 comments1 min readLW link

Nature < Nurture for AIs

scottviteri4 Jun 2023 20:38 UTC

14 points

23 comments7 min readLW link

One implementation of regulatory GPU restrictions

porby4 Jun 2023 20:34 UTC

42 points

6 comments5 min readLW link

How to embark on a journey of self-discovery (and potentially succeed)

Ester Dobiášová4 Jun 2023 18:46 UTC

7 points

0 comments14 min readLW link

(ladyesik.wordpress.com)

AI Safety Fundamentals: An Informal Cohort Starting Soon!

Tiago de Vassal4 Jun 2023 17:15 UTC

4 points

0 comments1 min readLW link

How to Think About Activation Patching

Neel Nanda4 Jun 2023 14:17 UTC

50 points

5 comments20 min readLW link

(www.neelnanda.io)

A Disneyland Without Children

L Rudolf L4 Jun 2023 13:06 UTC

137 points

12 comments20 min readLW link 4 reviews

(nosetgauge.substack.com)

I bet everyone 1000€ that I can make them dramatically happier & cure their depression in 3 months!

EternallyBlissful4 Jun 2023 12:30 UTC

4 points

11 comments9 min readLW link

Do You Really Want Effective Altruism?

williamsae4 Jun 2023 8:06 UTC

−7 points

3 comments7 min readLW link

“What if everyone died except me and the superintelligent AI?”

sjeffh4 Jun 2023 5:08 UTC

−19 points

0 comments1 min readLW link

[Link Post] Bytes Are All You Need: Transformers Operating Directly On File Bytes

Capybasilisk3 Jun 2023 22:45 UTC

18 points

2 comments1 min readLW link

Humanity and science are incompatible.

archeon3 Jun 2023 22:15 UTC

−18 points

2 comments1 min readLW link

Optimization happens inside the mind, not in the world

azsantosk3 Jun 2023 21:36 UTC

17 points

10 comments5 min readLW link

[Question] What would a post that argues against the Orthogonality Thesis that LessWrong users approve of look like?

Thoth Hermes3 Jun 2023 21:21 UTC

3 points

3 comments1 min readLW link

A Double-Feature on The Extropians

Maxwell Tabarrok3 Jun 2023 18:27 UTC

60 points

4 comments1 min readLW link

What exactly does ‘Slow Down’ look like?

Steve M3 Jun 2023 18:11 UTC

7 points

0 comments1 min readLW link

Announcing AISafety.info’s Write-a-thon (June 16-18) and Second Distillation Fellowship (July 3-October 2)

steven04613 Jun 2023 2:03 UTC

33 points

1 comment2 min readLW link

Terry Tao is hosting an “AI to Assist Mathematical Reasoning” workshop

junk heap homotopy3 Jun 2023 1:19 UTC

12 points

1 comment1 min readLW link

(terrytao.wordpress.com)

Upcoming AI regulations are likely to make for an unsafer world

Shmi3 Jun 2023 1:07 UTC

18 points

14 comments1 min readLW link

The AGI Race Between the US and China Doesn’t Exist.

Eva_B3 Jun 2023 0:22 UTC

33 points

15 comments7 min readLW link

(evabehrens.substack.com)

Unfaithful Explanations in Chain-of-Thought Prompting

Miles Turpin3 Jun 2023 0:22 UTC

43 points

8 comments7 min readLW link

[Question] How could AIs ‘see’ each other’s source code?

Kenny2 Jun 2023 22:41 UTC

29 points

45 comments1 min readLW link

Proposal: labs should precommit to pausing if an AI argues for itself to be improved

NickGabs2 Jun 2023 22:31 UTC

3 points

3 comments4 min readLW link

Inference from a Mathematical Description of an Existing Alignment Research: a proposal for an outer alignment research program

Christopher King2 Jun 2023 21:54 UTC

7 points

4 comments16 min readLW link

Thoughts on Dancing the Whole Dance: Positional Calling for Contra

jefftk2 Jun 2023 20:50 UTC

10 points

0 comments5 min readLW link

(www.jefftk.com)

Advice for Entering AI Safety Research

scasper2 Jun 2023 20:46 UTC

27 points

2 comments5 min readLW link

AI should be used to find better morality

Jorterder2 Jun 2023 20:38 UTC

−21 points

1 comment1 min readLW link

A mind needn’t be curious to reap the benefits of curiosity

So8res2 Jun 2023 18:00 UTC

79 points

14 comments1 min readLW link

[Question] Are computationally complex algorithms expensive to have, expensive to operate, or both?

Noosphere892 Jun 2023 17:50 UTC

7 points

5 comments1 min readLW link

[Replication] Conjecture’s Sparse Coding in Toy Models

Hoagy and Logan Riggs

2 Jun 2023 17:34 UTC

25 points

0 comments1 min readLW link

Limits to Learning: Rethinking AGI’s Path to Dominance

tangerine2 Jun 2023 16:43 UTC

10 points

4 comments15 min readLW link

The Control Problem: Unsolved or Unsolvable?

Remmelt2 Jun 2023 15:42 UTC

61 points

46 comments13 min readLW link

Hallucinating Suction

Johannes C. Mayer2 Jun 2023 14:16 UTC

6 points

0 comments2 min readLW link

Winning doesn’t need to flow through increases in rationality

Michel2 Jun 2023 12:05 UTC

11 points

5 comments1 min readLW link

Product Recommendation: LessWrong dialogues with Recast

Bart Bussmann2 Jun 2023 8:05 UTC

5 points

0 comments1 min readLW link

Think carefully before calling RL policies “agents”

TurnTrout2 Jun 2023 3:46 UTC

135 points

38 comments4 min readLW link 1 review

Outreach success: Intro to AI risk that has been successful

Michael Tontchev1 Jun 2023 23:12 UTC

84 points

8 comments74 min readLW link

(medium.com)

Open Source LLMs Can Now Actively Lie

Josh Levy1 Jun 2023 22:03 UTC

6 points

0 comments3 min readLW link

Safe AI and moral AI

William D'Alessandro1 Jun 2023 21:36 UTC

−3 points

0 comments10 min readLW link

AI #14: A Very Good Sentence

Zvi1 Jun 2023 21:30 UTC

118 points

30 comments65 min readLW link

(thezvi.wordpress.com)

Four levels of understanding decision theory

Max H1 Jun 2023 20:55 UTC

12 points

11 comments4 min readLW link

Things I Learned by Spending Five Thousand Hours In Non-EA Charities

jenn1 Jun 2023 20:48 UTC

451 points

37 comments8 min readLW link 1 review

(jenn.site)

self-improvement-executors are not goal-maximizers

bhauth1 Jun 2023 20:46 UTC

14 points

0 comments1 min readLW link

Experimental Fat Loss

johnlawrenceaspden1 Jun 2023 20:26 UTC

23 points

5 comments1 min readLW link

Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?

1a3orn1 Jun 2023 19:36 UTC

145 points

76 comments24 min readLW link 2 reviews

Progress links and tweets, 2023-06-01

jasoncrawford1 Jun 2023 19:03 UTC

10 points

3 comments1 min readLW link

(rootsofprogress.org)

[Question] When does an AI become intelligent enough to become self-aware and power-seeking?

FinalFormal21 Jun 2023 18:09 UTC

1 point

1 comment1 min readLW link

Uncertainty about the future does not imply that AGI will go well

Lauro Langosco1 Jun 2023 17:38 UTC

62 points

11 comments7 min readLW link

[Question] What are the arguments for/against FOOM?

FinalFormal21 Jun 2023 17:23 UTC

8 points

0 comments1 min readLW link