The ants and the grasshopper

Richard_Ngo · 4 Jun 2023 22:00 UTC
417 points
35 comments · 5 min read
(www.narrativeark.xyz)

[Question] implications of NN design for education

bhauth · 4 Jun 2023 20:50 UTC
9 points
3 comments · 1 min read

Nature < Nurture for AIs

scottviteri · 4 Jun 2023 20:38 UTC
14 points
22 comments · 7 min read

One implementation of regulatory GPU restrictions

porby · 4 Jun 2023 20:34 UTC
32 points
6 comments · 5 min read

How to embark on a journey of self-discovery (and potentially succeed)

Ester Dobiášová · 4 Jun 2023 18:46 UTC
6 points
0 comments · 14 min read
(ladyesik.wordpress.com)

AI Safety Fundamentals: An Informal Cohort Starting Soon!

Tiago de Vassal · 4 Jun 2023 17:15 UTC
4 points
0 comments · 1 min read

How to Think About Activation Patching

Neel Nanda · 4 Jun 2023 14:17 UTC
47 points
5 comments · 20 min read
(www.neelnanda.io)

[Fiction] A Disneyland Without Children

L Rudolf L · 4 Jun 2023 13:06 UTC
67 points
3 comments · 1 min read

I bet everyone 1000€ that I can make them dramatically happier & cure their depression in 3 months!

Anton Rodenhauser · 4 Jun 2023 12:30 UTC
4 points
11 comments · 9 min read

Do You Really Want Effective Altruism?

williamsae · 4 Jun 2023 8:06 UTC
−7 points
3 comments · 7 min read

“What if everyone died except me and the superintelligent AI?”

sjeffh · 4 Jun 2023 5:08 UTC
−19 points
0 comments · 1 min read

[Link Post] Bytes Are All You Need: Transformers Operating Directly On File Bytes

Capybasilisk · 3 Jun 2023 22:45 UTC
18 points
2 comments · 1 min read

Humanity and science are incompatible.

archeon · 3 Jun 2023 22:15 UTC
−18 points
2 comments · 1 min read

Optimization happens inside the mind, not in the world

azsantosk · 3 Jun 2023 21:36 UTC
17 points
10 comments · 5 min read

[Question] What would a post that argues against the Orthogonality Thesis that LessWrong users approve of look like?

Thoth Hermes · 3 Jun 2023 21:21 UTC
3 points
3 comments · 1 min read

A Double-Feature on The Extropians

Maxwell Tabarrok · 3 Jun 2023 18:27 UTC
58 points
4 comments · 1 min read

What exactly does ‘Slow Down’ look like?

Steve M · 3 Jun 2023 18:11 UTC
7 points
0 comments · 1 min read

Announcing AISafety.info’s Write-a-thon (June 16-18) and Second Distillation Fellowship (July 3-October 2)

steven0461 · 3 Jun 2023 2:03 UTC
33 points
1 comment · 2 min read

Terry Tao is hosting an “AI to Assist Mathematical Reasoning” workshop

junk heap homotopy · 3 Jun 2023 1:19 UTC
12 points
1 comment · 1 min read
(terrytao.wordpress.com)

Upcoming AI regulations are likely to make for an unsafer world

shminux · 3 Jun 2023 1:07 UTC
18 points
14 comments · 1 min read

The AGI Race Between the US and China Doesn’t Exist.

Eva_B · 3 Jun 2023 0:22 UTC
24 points
14 comments · 7 min read
(evabehrens.substack.com)

Unfaithful Explanations in Chain-of-Thought Prompting

miles · 3 Jun 2023 0:22 UTC
38 points
8 comments · 7 min read

[Question] How could AIs ‘see’ each other’s source code?

Kenny · 2 Jun 2023 22:41 UTC
29 points
45 comments · 1 min read

Proposal: labs should precommit to pausing if an AI argues for itself to be improved

NickGabs · 2 Jun 2023 22:31 UTC
3 points
3 comments · 4 min read

Inference from a Mathematical Description of an Existing Alignment Research: a proposal for an outer alignment research program

Christopher King · 2 Jun 2023 21:54 UTC
7 points
4 comments · 16 min read

Thoughts on Dancing the Whole Dance: Positional Calling for Contra

jefftk · 2 Jun 2023 20:50 UTC
10 points
0 comments · 5 min read
(www.jefftk.com)

Advice for Entering AI Safety Research

scasper · 2 Jun 2023 20:46 UTC
25 points
2 comments · 5 min read

AI should be used to find better morality

Jorterder · 2 Jun 2023 20:38 UTC
−20 points
1 comment · 1 min read

A mind needn’t be curious to reap the benefits of curiosity

So8res · 2 Jun 2023 18:00 UTC
78 points
14 comments · 1 min read

[Question] Are computationally complex algorithms expensive to have, expensive to operate, or both?

Noosphere89 · 2 Jun 2023 17:50 UTC
7 points
5 comments · 1 min read

[Replication] Conjecture’s Sparse Coding in Toy Models

2 Jun 2023 17:34 UTC
23 points
0 comments · 1 min read

Limits to Learning: Rethinking AGI’s Path to Dominance

tangerine · 2 Jun 2023 16:43 UTC
3 points
4 comments · 15 min read

The Control Problem: Unsolved or Unsolvable?

Remmelt · 2 Jun 2023 15:42 UTC
49 points
46 comments · 14 min read

Hallucinating Suction

Johannes C. Mayer · 2 Jun 2023 14:16 UTC
6 points
0 comments · 2 min read

Winning doesn’t need to flow through increases in rationality

MichelJusten · 2 Jun 2023 12:05 UTC
13 points
3 comments · 1 min read

Product Recommendation: LessWrong dialogues with Recast

Bart Bussmann · 2 Jun 2023 8:05 UTC
5 points
0 comments · 1 min read

Think carefully before calling RL policies “agents”

TurnTrout · 2 Jun 2023 3:46 UTC
124 points
35 comments · 4 min read

Dreams of “Mathopedia”

NicholasKross · 2 Jun 2023 1:30 UTC
40 points
16 comments · 2 min read
(www.thinkingmuchbetter.com)

Outreach success: Intro to AI risk that has been successful

Michael Tontchev · 1 Jun 2023 23:12 UTC
83 points
8 comments · 74 min read
(medium.com)

Open Source LLMs Can Now Actively Lie

Josh Levy · 1 Jun 2023 22:03 UTC
6 points
0 comments · 3 min read

Safe AI and moral AI

William D'Alessandro · 1 Jun 2023 21:36 UTC
−2 points
0 comments · 10 min read

AI #14: A Very Good Sentence

Zvi · 1 Jun 2023 21:30 UTC
118 points
30 comments · 65 min read
(thezvi.wordpress.com)

Four levels of understanding decision theory

Max H · 1 Jun 2023 20:55 UTC
12 points
11 comments · 4 min read

Things I Learned by Spending Five Thousand Hours In Non-EA Charities

jenn · 1 Jun 2023 20:48 UTC
387 points
34 comments · 8 min read
(jenn.site)

self-improvement-executors are not goal-maximizers

bhauth · 1 Jun 2023 20:46 UTC
14 points
0 comments · 1 min read

Experimental Fat Loss

johnlawrenceaspden · 1 Jun 2023 20:26 UTC
23 points
5 comments · 1 min read

Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?

1a3orn · 1 Jun 2023 19:36 UTC
132 points
73 comments · 24 min read

Progress links and tweets, 2023-06-01

jasoncrawford · 1 Jun 2023 19:03 UTC
10 points
3 comments · 1 min read
(rootsofprogress.org)

[Question] When does an AI become intelligent enough to become self-aware and power-seeking?

FinalFormal2 · 1 Jun 2023 18:09 UTC
1 point
1 comment · 1 min read

Uncertainty about the future does not imply that AGI will go well

Lauro Langosco · 1 Jun 2023 17:38 UTC
62 points
11 comments · 7 min read