An­drew Ng wants to have a con­ver­sa­tion about ex­tinc­tion risk from AI

Leon Lang5 Jun 2023 22:29 UTC
32 points
2 comments1 min readLW link
(twitter.com)

True Re­jec­tion Challenges

Screwtape5 Jun 2023 22:17 UTC
20 points
11 comments5 min readLW link

AISafety.info “How can I help?” FAQ

5 Jun 2023 22:09 UTC
58 points
0 comments2 min readLW link

An­swer to a ques­tion: what do I think about God’s com­mu­ni­ca­tion pat­terns?

Jim Pivarski5 Jun 2023 21:40 UTC
1 point
16 comments8 min readLW link

The In­trin­sic In­ter­play of Hu­man Values and Ar­tifi­cial In­tel­li­gence: Nav­i­gat­ing the Op­ti­miza­tion Challenge

Joe Kwon5 Jun 2023 20:41 UTC
2 points
1 comment18 min readLW link

The (lo­cal) unit of in­tel­li­gence is FLOPs

boazbarak5 Jun 2023 18:23 UTC
40 points
7 comments5 min readLW link

Tu­tor-GPT & Ped­a­gog­i­cal Reasoning

courtlandleer5 Jun 2023 17:53 UTC
26 points
3 comments4 min readLW link

Not an­other bias!

Lionel5 Jun 2023 17:50 UTC
3 points
0 comments1 min readLW link
(lionelpage.substack.com)

What I’ve been read­ing, June 2023

jasoncrawford5 Jun 2023 17:08 UTC
16 points
0 comments7 min readLW link
(rootsofprogress.org)

Hu­mans don’t un­der­stand how we do most things

Nathan11235 Jun 2023 14:35 UTC
2 points
2 comments2 min readLW link

Wild­fire of strategicness

TsviBT5 Jun 2023 13:59 UTC
36 points
19 comments1 min readLW link

Speak­ing off-meta

Epirito5 Jun 2023 13:56 UTC
4 points
0 comments1 min readLW link

Some Thoughts on Con­di­tional Fore­casts – Les­sons from the 2020 Election

Javier5 Jun 2023 11:58 UTC
14 points
2 comments4 min readLW link

5/​23

Celer5 Jun 2023 5:50 UTC
10 points
0 comments1 min readLW link
(keller.substack.com)

We Are Less Wrong than E. T. Jaynes on Loss Func­tions in Hu­man Society

Zack_M_Davis5 Jun 2023 5:34 UTC
45 points
14 comments2 min readLW link

Monthly Shorts 8/​21

Celer5 Jun 2023 5:30 UTC
13 points
2 comments3 min readLW link
(keller.substack.com)

Ages Sur­vey: Results

jefftk5 Jun 2023 2:10 UTC
57 points
10 comments5 min readLW link
(www.jefftk.com)

Meta-con­ver­sa­tion shouldn’t be taboo

Adam Zerner5 Jun 2023 0:19 UTC
34 points
36 comments4 min readLW link

The ants and the grasshopper

Richard_Ngo4 Jun 2023 22:00 UTC
417 points
35 comments5 min readLW link
(www.narrativeark.xyz)

[Question] im­pli­ca­tions of NN de­sign for education

bhauth4 Jun 2023 20:50 UTC
9 points
3 comments1 min readLW link

Na­ture < Nur­ture for AIs

scottviteri4 Jun 2023 20:38 UTC
14 points
22 comments7 min readLW link

One im­ple­men­ta­tion of reg­u­la­tory GPU restrictions

porby4 Jun 2023 20:34 UTC
32 points
6 comments5 min readLW link

How to em­bark on a jour­ney of self-dis­cov­ery (and po­ten­tially suc­ceed)

Ester Dobiášová4 Jun 2023 18:46 UTC
6 points
0 comments14 min readLW link
(ladyesik.wordpress.com)

AI Safety Fun­da­men­tals: An In­for­mal Co­hort Start­ing Soon!

Tiago de Vassal4 Jun 2023 17:15 UTC
4 points
0 comments1 min readLW link

How to Think About Ac­ti­va­tion Patching

Neel Nanda4 Jun 2023 14:17 UTC
47 points
5 comments20 min readLW link
(www.neelnanda.io)

[Fic­tion] A Dis­ney­land Without Children

L Rudolf L4 Jun 2023 13:06 UTC
67 points
3 comments1 min readLW link

I bet ev­ery­one 1000€ that I can make them dra­mat­i­cally hap­pier & cure their de­pres­sion in 3 months!

Anton Rodenhauser4 Jun 2023 12:30 UTC
4 points
11 comments9 min readLW link

Do You Really Want Effec­tive Altru­ism?

williamsae4 Jun 2023 8:06 UTC
−7 points
3 comments7 min readLW link

“What if ev­ery­one died ex­cept me and the su­per­in­tel­li­gent AI?”

sjeffh4 Jun 2023 5:08 UTC
−19 points
0 comments1 min readLW link

[Link Post] Bytes Are All You Need: Trans­form­ers Oper­at­ing Directly On File Bytes

Capybasilisk3 Jun 2023 22:45 UTC
18 points
2 comments1 min readLW link

Hu­man­ity and sci­ence are in­com­pat­i­ble.

archeon3 Jun 2023 22:15 UTC
−18 points
2 comments1 min readLW link

Op­ti­miza­tion hap­pens in­side the mind, not in the world

azsantosk3 Jun 2023 21:36 UTC
17 points
10 comments5 min readLW link

[Question] What would a post that ar­gues against the Orthog­o­nal­ity Th­e­sis that LessWrong users ap­prove of look like?

Thoth Hermes3 Jun 2023 21:21 UTC
3 points
3 comments1 min readLW link

A Dou­ble-Fea­ture on The Extropians

Maxwell Tabarrok3 Jun 2023 18:27 UTC
58 points
4 comments1 min readLW link

What ex­actly does ‘Slow Down’ look like?

Steve M3 Jun 2023 18:11 UTC
7 points
0 comments1 min readLW link

An­nounc­ing AISafety.info’s Write-a-thon (June 16-18) and Se­cond Distil­la­tion Fel­low­ship (July 3-Oc­to­ber 2)

steven04613 Jun 2023 2:03 UTC
33 points
1 comment2 min readLW link

Terry Tao is host­ing an “AI to As­sist Math­e­mat­i­cal Rea­son­ing” workshop

junk heap homotopy3 Jun 2023 1:19 UTC
12 points
1 comment1 min readLW link
(terrytao.wordpress.com)

Up­com­ing AI reg­u­la­tions are likely to make for an un­safer world

shminux3 Jun 2023 1:07 UTC
18 points
14 comments1 min readLW link

The AGI Race Between the US and China Doesn’t Ex­ist.

Eva_B3 Jun 2023 0:22 UTC
24 points
14 comments7 min readLW link
(evabehrens.substack.com)

Un­faith­ful Ex­pla­na­tions in Chain-of-Thought Prompting

miles3 Jun 2023 0:22 UTC
38 points
8 comments7 min readLW link

[Question] How could AIs ‘see’ each other’s source code?

Kenny2 Jun 2023 22:41 UTC
29 points
45 comments1 min readLW link

Pro­posal: labs should pre­com­mit to paus­ing if an AI ar­gues for it­self to be improved

NickGabs2 Jun 2023 22:31 UTC
3 points
3 comments4 min readLW link

In­fer­ence from a Math­e­mat­i­cal De­scrip­tion of an Ex­ist­ing Align­ment Re­search: a pro­posal for an outer al­ign­ment re­search program

Christopher King2 Jun 2023 21:54 UTC
7 points
4 comments16 min readLW link

Thoughts on Danc­ing the Whole Dance: Po­si­tional Cal­ling for Contra

jefftk2 Jun 2023 20:50 UTC
10 points
0 comments5 min readLW link
(www.jefftk.com)

Ad­vice for En­ter­ing AI Safety Research

scasper2 Jun 2023 20:46 UTC
25 points
2 comments5 min readLW link

AI should be used to find bet­ter morality

Jorterder2 Jun 2023 20:38 UTC
−20 points
1 comment1 min readLW link

A mind needn’t be cu­ri­ous to reap the benefits of curiosity

So8res2 Jun 2023 18:00 UTC
78 points
14 comments1 min readLW link

[Question] Are com­pu­ta­tion­ally com­plex al­gorithms ex­pen­sive to have, ex­pen­sive to op­er­ate, or both?

Noosphere892 Jun 2023 17:50 UTC
7 points
5 comments1 min readLW link

[Repli­ca­tion] Con­jec­ture’s Sparse Cod­ing in Toy Models

2 Jun 2023 17:34 UTC
23 points
0 comments1 min readLW link

Limits to Learn­ing: Re­think­ing AGI’s Path to Dominance

tangerine2 Jun 2023 16:43 UTC
3 points
4 comments15 min readLW link