The ants and the grasshopper

Richard_Ngo
Jun 4, 2023, 10:00 PM
465 points
44 comments, 5 min read, LW link, 4 reviews
(www.narrativeark.xyz)

[Question] implications of NN design for education

bhauth
Jun 4, 2023, 8:50 PM
9 points
3 comments, 1 min read, LW link

Nature < Nurture for AIs

scottviteri
Jun 4, 2023, 8:38 PM
14 points
22 comments, 7 min read, LW link

One implementation of regulatory GPU restrictions

porby
Jun 4, 2023, 8:34 PM
42 points
6 comments, 5 min read, LW link

How to embark on a journey of self-discovery (and potentially succeed)

Ester Dobiášová
Jun 4, 2023, 6:46 PM
6 points
0 comments, 14 min read, LW link
(ladyesik.wordpress.com)

AI Safety Fundamentals: An Informal Cohort Starting Soon!

Tiago de Vassal
Jun 4, 2023, 5:15 PM
4 points
0 comments, 1 min read, LW link

How to Think About Activation Patching

Neel Nanda
Jun 4, 2023, 2:17 PM
50 points
5 comments, 20 min read, LW link
(www.neelnanda.io)

A Disneyland Without Children

L Rudolf L
Jun 4, 2023, 1:06 PM
126 points
11 comments, LW link, 4 reviews
(nosetgauge.substack.com)

I bet everyone 1000€ that I can make them dramatically happier & cure their depression in 3 months!

EternallyBlissful
Jun 4, 2023, 12:30 PM
4 points
11 comments, 9 min read, LW link

Do You Really Want Effective Altruism?

williamsae
Jun 4, 2023, 8:06 AM
−7 points
3 comments, 7 min read, LW link

“What if everyone died except me and the superintelligent AI?”

sjeffh
Jun 4, 2023, 5:08 AM
−19 points
0 comments, 1 min read, LW link

[Link Post] Bytes Are All You Need: Transformers Operating Directly On File Bytes

Capybasilisk
Jun 3, 2023, 10:45 PM
18 points
2 comments, 1 min read, LW link

Humanity and science are incompatible.

archeon
Jun 3, 2023, 10:15 PM
−18 points
2 comments, 1 min read, LW link

Optimization happens inside the mind, not in the world

azsantosk
Jun 3, 2023, 9:36 PM
17 points
10 comments, 5 min read, LW link

[Question] What would a post that argues against the Orthogonality Thesis that LessWrong users approve of look like?

Thoth Hermes
Jun 3, 2023, 9:21 PM
3 points
3 comments, 1 min read, LW link

A Double-Feature on The Extropians

Maxwell Tabarrok
Jun 3, 2023, 6:27 PM
59 points
4 comments, 1 min read, LW link

What exactly does ‘Slow Down’ look like?

Steve M
Jun 3, 2023, 6:11 PM
7 points
0 comments, 1 min read, LW link

Announcing AISafety.info’s Write-a-thon (June 16-18) and Second Distillation Fellowship (July 3-October 2)

steven0461
Jun 3, 2023, 2:03 AM
33 points
1 comment, 2 min read, LW link

Terry Tao is hosting an “AI to Assist Mathematical Reasoning” workshop

junk heap homotopy
Jun 3, 2023, 1:19 AM
12 points
1 comment, 1 min read, LW link
(terrytao.wordpress.com)

Upcoming AI regulations are likely to make for an unsafer world

Shmi
Jun 3, 2023, 1:07 AM
18 points
14 comments, 1 min read, LW link

The AGI Race Between the US and China Doesn’t Exist.

Eva_B
Jun 3, 2023, 12:22 AM
33 points
15 comments, 7 min read, LW link
(evabehrens.substack.com)

Unfaithful Explanations in Chain-of-Thought Prompting

Miles Turpin
Jun 3, 2023, 12:22 AM
42 points
8 comments, 7 min read, LW link

[Question] How could AIs ‘see’ each other’s source code?

Kenny
Jun 2, 2023, 10:41 PM
29 points
45 comments, 1 min read, LW link

Proposal: labs should precommit to pausing if an AI argues for itself to be improved

NickGabs
Jun 2, 2023, 10:31 PM
3 points
3 comments, 4 min read, LW link

Inference from a Mathematical Description of an Existing Alignment Research: a proposal for an outer alignment research program

Christopher King
Jun 2, 2023, 9:54 PM
7 points
4 comments, 16 min read, LW link

Thoughts on Dancing the Whole Dance: Positional Calling for Contra

jefftk
Jun 2, 2023, 8:50 PM
10 points
0 comments, 5 min read, LW link
(www.jefftk.com)

Advice for Entering AI Safety Research

scasper
Jun 2, 2023, 8:46 PM
26 points
2 comments, 5 min read, LW link

AI should be used to find better morality

Jorterder
Jun 2, 2023, 8:38 PM
−21 points
1 comment, 1 min read, LW link

A mind needn’t be curious to reap the benefits of curiosity

So8res
Jun 2, 2023, 6:00 PM
78 points
14 comments, 1 min read, LW link

[Question] Are computationally complex algorithms expensive to have, expensive to operate, or both?

Noosphere89
Jun 2, 2023, 5:50 PM
7 points
5 comments, 1 min read, LW link

[Replication] Conjecture’s Sparse Coding in Toy Models

Jun 2, 2023, 5:34 PM
24 points
0 comments, 1 min read, LW link

Limits to Learning: Rethinking AGI’s Path to Dominance

tangerine
Jun 2, 2023, 4:43 PM
10 points
4 comments, 15 min read, LW link

The Control Problem: Unsolved or Unsolvable?

Remmelt
Jun 2, 2023, 3:42 PM
55 points
46 comments, 14 min read, LW link

Hallucinating Suction

Johannes C. Mayer
Jun 2, 2023, 2:16 PM
6 points
0 comments, 2 min read, LW link

Winning doesn’t need to flow through increases in rationality

Michel
Jun 2, 2023, 12:05 PM
11 points
5 comments, 1 min read, LW link

Product Recommendation: LessWrong dialogues with Recast

Bart Bussmann
Jun 2, 2023, 8:05 AM
5 points
0 comments, 1 min read, LW link

Think carefully before calling RL policies “agents”

TurnTrout
Jun 2, 2023, 3:46 AM
134 points
38 comments, 4 min read, LW link, 1 review

Dreams of “Mathopedia”

Nicholas / Heather Kross
Jun 2, 2023, 1:30 AM
40 points
16 comments, 2 min read, LW link
(www.thinkingmuchbetter.com)

Outreach success: Intro to AI risk that has been successful

Michael Tontchev
Jun 1, 2023, 11:12 PM
83 points
8 comments, 74 min read, LW link
(medium.com)

Open Source LLMs Can Now Actively Lie

Josh Levy
Jun 1, 2023, 10:03 PM
6 points
0 comments, 3 min read, LW link

Safe AI and moral AI

William D'Alessandro
Jun 1, 2023, 9:36 PM
−3 points
0 comments, 10 min read, LW link

AI #14: A Very Good Sentence

Zvi
Jun 1, 2023, 9:30 PM
118 points
30 comments, 65 min read, LW link
(thezvi.wordpress.com)

Four levels of understanding decision theory

Max H
Jun 1, 2023, 8:55 PM
12 points
11 comments, 4 min read, LW link

Things I Learned by Spending Five Thousand Hours In Non-EA Charities

jenn
Jun 1, 2023, 8:48 PM
430 points
35 comments, 8 min read, LW link, 1 review
(jenn.site)

self-improvement-executors are not goal-maximizers

bhauth
Jun 1, 2023, 8:46 PM
14 points
0 comments, 1 min read, LW link

Experimental Fat Loss

johnlawrenceaspden
Jun 1, 2023, 8:26 PM
23 points
5 comments, 1 min read, LW link

Yudkowsky vs Hanson on FOOM: Whose Predictions Were Better?

1a3orn
Jun 1, 2023, 7:36 PM
137 points
76 comments, 24 min read, LW link, 2 reviews

Progress links and tweets, 2023-06-01

jasoncrawford
Jun 1, 2023, 7:03 PM
10 points
3 comments, 1 min read, LW link
(rootsofprogress.org)

[Question] When does an AI become intelligent enough to become self-aware and power-seeking?

FinalFormal2
Jun 1, 2023, 6:09 PM
1 point
1 comment, 1 min read, LW link

Uncertainty about the future does not imply that AGI will go well

Lauro Langosco
Jun 1, 2023, 5:38 PM
62 points
11 comments, 7 min read, LW link