Policy dis­cus­sions fol­low strong con­tex­tu­al­iz­ing norms

Richard_Ngo1 Apr 2023 23:51 UTC
230 points
61 comments3 min readLW link

A re­port about LessWrong karma volatility from a differ­ent universe

Ben Pace1 Apr 2023 21:48 UTC
176 points
7 comments1 min readLW link

A Con­fes­sion about the LessWrong Team

Ruby1 Apr 2023 21:47 UTC
87 points
5 comments2 min readLW link

Shut­ting down AI is not enough. We need to de­stroy all tech­nol­ogy.

Matthew Barnett1 Apr 2023 21:03 UTC
150 points
36 comments1 min readLW link

A policy guaran­teed to in­crease AI timelines

Richard Korzekwa 1 Apr 2023 20:50 UTC
46 points
1 comment2 min readLW link
(aiimpacts.org)

Why I Think the Cur­rent Tra­jec­tory of AI Re­search has Low P(doom) - LLMs

GaPa1 Apr 2023 20:35 UTC
2 points
1 comment10 min readLW link

Re­pairing the Effort Asymmetry

[DEACTIVATED] Duncan Sabien1 Apr 2023 20:23 UTC
41 points
11 comments2 min readLW link

Draft: In­fer­ring minimizers

Alex_Altair1 Apr 2023 20:20 UTC
9 points
0 comments1 min readLW link

AI Safety via Luck

Jozdien1 Apr 2023 20:13 UTC
76 points
7 comments11 min readLW link

Ho Chi Minh ACX Meetup

cygnus1 Apr 2023 19:41 UTC
1 point
0 comments1 min readLW link

Quaker Prac­tice for the Aspiring Rationalist

maia1 Apr 2023 19:32 UTC
25 points
4 comments6 min readLW link

The Plan: Put ChatGPT in Charge

Sven Nilsen1 Apr 2023 17:23 UTC
−5 points
3 comments1 min readLW link

AI in­fosec: first strikes, zero-day mar­kets, hard­ware sup­ply chains, adop­tion barriers

Allison Duettmann1 Apr 2023 16:44 UTC
39 points
0 comments9 min readLW link

In­tro­duc­ing Align­men­tSearch: An AI Align­ment-In­formed Con­ver­sional Agent

1 Apr 2023 16:39 UTC
79 points
14 comments4 min readLW link

How se­cu­rity and cryp­tog­ra­phy can aid AI safety [se­quence]

Allison Duettmann1 Apr 2023 16:28 UTC
22 points
0 comments1 min readLW link

AI com­mu­nity build­ing: EliezerKart

Christopher King1 Apr 2023 15:25 UTC
45 points
0 comments2 min readLW link

[Question] Trans­former trained on it’s own con­tent?

Micromegas1 Apr 2023 15:08 UTC
1 point
0 comments1 min readLW link

The frozen neutrality

ProgramCrafter1 Apr 2023 12:58 UTC
3 points
0 comments3 min readLW link

Pro­posal: Butt bumps as a de­fault for phys­i­cal greetings

Adam Zerner1 Apr 2023 12:48 UTC
53 points
23 comments2 min readLW link

[Question] Is this true? paulg: [One spe­cial thing about AI risk is that peo­ple who un­der­stand AI well are more wor­ried than peo­ple who un­der­stand it poorly]

tailcalled1 Apr 2023 11:59 UTC
25 points
5 comments1 min readLW link

Some thought ex­per­i­ments on digi­tal consciousness

rorygreig1 Apr 2023 11:45 UTC
22 points
13 comments6 min readLW link

Sin­gu­lar­i­ties against the Sin­gu­lar­ity: An­nounc­ing Work­shop on Sin­gu­lar Learn­ing The­ory and Alignment

1 Apr 2023 9:58 UTC
87 points
0 comments1 min readLW link
(singularlearningtheory.com)

Cam­paign for AI Safety: Please join me

Nik Samoylov1 Apr 2023 9:32 UTC
18 points
9 comments1 min readLW link

New Align­ment Re­search Agenda: Mas­sive Mul­ti­player Or­ganism Oversight

TsviBT1 Apr 2023 8:02 UTC
17 points
1 comment2 min readLW link

[April Fools’] Defini­tive con­fir­ma­tion of shard theory

TurnTrout1 Apr 2023 7:27 UTC
166 points
7 comments2 min readLW link

[New LW Fea­ture] “De­bates”

1 Apr 2023 7:00 UTC
113 points
34 comments1 min readLW link

Keep Mak­ing AI Safety News

RedFishBlueFish1 Apr 2023 6:27 UTC
1 point
6 comments1 min readLW link

[Question] What Are Your Prefer­ences Re­gard­ing The FLI Let­ter?

JenniferRM1 Apr 2023 4:52 UTC
−4 points
122 comments16 min readLW link

An Aver­age Dialogue

NicholasKross1 Apr 2023 4:01 UTC
4 points
0 comments1 min readLW link

The Sig­nifi­cance of “Align­ment vs Progress: The AI Rap Show­down” in the AI Safety Discourse

Jonathan Grant1 Apr 2023 3:26 UTC
−1 points
0 comments1 min readLW link

My Model of Gen­der Identity

Iris of Rosebloom1 Apr 2023 3:03 UTC
32 points
4 comments12 min readLW link

Hooray for step­ping out of the limelight

So8res1 Apr 2023 2:45 UTC
281 points
24 comments1 min readLW link

Kal­lipo­lis, USA

Daniel Kokotajlo1 Apr 2023 2:06 UTC
13 points
1 comment1 min readLW link
(docs.google.com)

[Question] Is there a man­i­fold plot of all peo­ple who had a say in AI al­ign­ment?

skulk-and-quarrel31 Mar 2023 21:50 UTC
8 points
0 comments1 min readLW link

We might get lucky with AGI warn­ing shots. Let’s be ready!

tcelferact31 Mar 2023 21:01 UTC
5 points
0 comments2 min readLW link

Wizards and prophets of AI [draft for com­ment]

jasoncrawford31 Mar 2023 20:22 UTC
16 points
11 comments6 min readLW link

[Linkpost] Cri­tiques of Red­wood Research

Akash31 Mar 2023 20:00 UTC
13 points
2 comments1 min readLW link
(forum.effectivealtruism.org)

An Open Let­ter to the AI

imsys31 Mar 2023 19:53 UTC
−8 points
1 comment1 min readLW link

I Have No Sense of Hu­mor and I Must Laugh

Lone Pine31 Mar 2023 19:40 UTC
−4 points
1 comment22 min readLW link

Maze-solv­ing agents: Add a top-right vec­tor, make the agent go to the top-right

31 Mar 2023 19:20 UTC
101 points
17 comments11 min readLW link

ACX spring meetup

Épiphanie Gédéon31 Mar 2023 18:39 UTC
4 points
0 comments1 min readLW link

Imag­ine a world where Microsoft em­ploy­ees used Bing

Christopher King31 Mar 2023 18:36 UTC
6 points
2 comments2 min readLW link

The Peril of the Great Leaks (writ­ten with ChatGPT)

bvbvbvbvbvbvbvbvbvbvbv31 Mar 2023 18:14 UTC
3 points
1 comment1 min readLW link

[Question] Why don’t peo­ple talk about the Dooms­day Ar­gu­ment more of­ten?

sam31 Mar 2023 17:52 UTC
−1 points
3 comments1 min readLW link

ChatGPT banned in Italy over pri­vacy concerns

Ollie J31 Mar 2023 17:33 UTC
18 points
4 comments1 min readLW link
(www.bbc.co.uk)

GPT-4 busted? Clear self-in­ter­est when sum­ma­riz­ing ar­ti­cles about it­self vs when ar­ti­cle talks about Claude, LLaMA, or DALL·E 2

Christopher King31 Mar 2023 17:05 UTC
6 points
4 comments4 min readLW link

The Quan­ti­za­tion Model of Neu­ral Scaling

nz31 Mar 2023 16:02 UTC
17 points
0 comments1 min readLW link
(arxiv.org)

Cor­rect­ing a mis­con­cep­tion: con­scious­ness does not need 90 billion neu­rons, at all

bvbvbvbvbvbvbvbvbvbvbv31 Mar 2023 16:02 UTC
21 points
19 comments1 min readLW link

Man­i­fund x AI Worldviews

Austin Chen31 Mar 2023 15:32 UTC
33 points
0 comments1 min readLW link

Stop push­ing the bus

cousin_it31 Mar 2023 13:03 UTC
46 points
15 comments1 min readLW link