Pro­ject Lawful Au­dio­book: An Unoffi­cial Fan Pro­duc­tion with ElevenLabs AI

Askwho19 Jul 2023 23:34 UTC
17 points
1 comment1 min readLW link
(askwhocastsai.substack.com)

Us­ing pre­dic­tors in cor­rigible systems

porby19 Jul 2023 22:29 UTC
19 points
6 comments27 min readLW link

men­tal num­ber lines

bhauth19 Jul 2023 21:01 UTC
10 points
5 comments1 min readLW link

[Question] Any sug­ges­tions for an im­pact­ful mas­ter’s the­sis in Poli­ti­cal Science?

Klara Helene Nielsen19 Jul 2023 17:44 UTC
1 point
0 comments1 min readLW link

In­ci­dent re­port­ing for AI safety

Zach Stein-Perlman19 Jul 2023 17:00 UTC
22 points
0 comments1 min readLW link

Align­ment Grant­mak­ing is Fund­ing-Limited Right Now

johnswentworth19 Jul 2023 16:49 UTC
307 points
67 comments1 min readLW link

Zener Science

Screwtape19 Jul 2023 16:40 UTC
16 points
11 comments6 min readLW link

Tal­linn, Es­to­nia ACX Sum­mer Meetup

Andrew19 Jul 2023 16:22 UTC
1 point
1 comment1 min readLW link

Desider­ata for an AI

Nathan Helm-Burger19 Jul 2023 16:18 UTC
8 points
0 comments4 min readLW link

Valuism—an ap­proach to life for you to consider

spencerg19 Jul 2023 15:23 UTC
17 points
2 comments1 min readLW link

He­donic Loops and Tam­ing RL

beren19 Jul 2023 15:12 UTC
20 points
14 comments9 min readLW link

[Question] What Caused the Puz­zling De­cline in Ac­tivism Against Policy Violence Towards Black Peo­ple?

ChristianKl19 Jul 2023 14:40 UTC
12 points
2 comments1 min readLW link

Lisa Feld­man Bar­rett ver­sus Paul Ek­man on fa­cial ex­pres­sions & ba­sic emotions

Steven Byrnes19 Jul 2023 14:26 UTC
29 points
15 comments15 min readLW link

AISN#15: China and the US take ac­tion to reg­u­late AI, re­sults from a tour­na­ment fore­cast­ing AI risk, up­dates on xAI’s plan, and Meta re­leases its open-source and com­mer­cially available Llama 2

19 Jul 2023 13:01 UTC
16 points
0 comments6 min readLW link
(newsletter.safe.ai)

Tech­nolog­i­cal solu­tions to the cli­mate crisis

dominicq19 Jul 2023 12:39 UTC
6 points
5 comments3 min readLW link
(sundaystopwatch.eu)

Se­cret Cos­mos: Introduction

Al Link19 Jul 2023 11:51 UTC
−35 points
3 comments14 min readLW link
(allink.substack.com)

Cri­tiques of promi­nent AI safety or­ga­ni­za­tions: Introduction

Omega.19 Jul 2023 6:54 UTC
7 points
0 comments5 min readLW link
(forum.effectivealtruism.org)

Dig­ging From The Pit­falls of Rationality

UtilityMonster19 Jul 2023 4:52 UTC
6 points
2 comments2 min readLW link
(utilitymonster.substack.com)

House Gro­cery Spending

jefftk19 Jul 2023 3:00 UTC
13 points
0 comments5 min readLW link
(www.jefftk.com)

A brief his­tory of computers

Adam Zerner19 Jul 2023 2:59 UTC
71 points
18 comments33 min readLW link

Sim­ple al­ign­ment plan that maybe works

Iknownothing18 Jul 2023 22:48 UTC
4 points
8 comments1 min readLW link

Pros­pera-dump

tailcalled18 Jul 2023 21:36 UTC
10 points
16 comments1 min readLW link

Tiny Mech In­terp Pro­jects: Emer­gent Po­si­tional Embed­dings of Words

Neel Nanda18 Jul 2023 21:24 UTC
48 points
1 comment9 min readLW link

Quick Thoughts on Lan­guage Models

RohanS18 Jul 2023 20:38 UTC
6 points
0 comments4 min readLW link

Still no Lie De­tec­tor for LLMs

18 Jul 2023 19:56 UTC
47 points
2 comments21 min readLW link

Meta an­nounces Llama 2; “open sources” it for com­mer­cial use

LawrenceC18 Jul 2023 19:28 UTC
46 points
12 comments1 min readLW link
(about.fb.com)

The Rope Man­age­ment The­ory: A Com­pre­hen­sive Ap­proach to Mo­du­lat­ing Re­ward Per­cep­tion and Miti­gat­ing He­donic Adaptation

Oren Montano18 Jul 2023 17:45 UTC
−23 points
2 comments3 min readLW link

AI Im­pacts Quar­terly Newslet­ter, Apr-Jun 2023

18 Jul 2023 17:14 UTC
6 points
0 comments3 min readLW link
(blog.aiimpacts.org)

Clever ar­guers give weak ev­i­dence, not zero

dkl918 Jul 2023 17:07 UTC
7 points
2 comments1 min readLW link
(dkl9.net)

Mea­sur­ing and Im­prov­ing the Faith­ful­ness of Model-Gen­er­ated Rea­son­ing

18 Jul 2023 16:36 UTC
109 points
13 comments6 min readLW link

[Question] Least-prob­le­matic Re­source for learn­ing RL?

Dalcy18 Jul 2023 16:30 UTC
6 points
3 comments1 min readLW link

Char­ter Cities: why they’re ex­cit­ing & how they might work

Jackson Wagner18 Jul 2023 13:57 UTC
19 points
7 comments1 min readLW link

Nar­ra­tive The­ory. Part 6. Ar­tifi­cial Neu­ral Networks

Eris18 Jul 2023 9:22 UTC
3 points
0 comments2 min readLW link

Train for in­cor­rigi­bil­ity, then re­verse it (Shut­down Prob­lem Con­test Sub­mis­sion)

Daniel_Eth18 Jul 2023 8:26 UTC
9 points
1 comment1 min readLW link

The shape of AGI: Car­toons and back of envelope

boazbarak17 Jul 2023 20:57 UTC
27 points
18 comments6 min readLW link

Pre­dic­tive his­tory classes

dkl917 Jul 2023 20:48 UTC
67 points
17 comments2 min readLW link
(dkl9.net)

High­lights from The In­dus­trial Revolu­tion, by T. S. Ashton

jasoncrawford17 Jul 2023 19:02 UTC
17 points
0 comments10 min readLW link
(rootsofprogress.org)

Ex­is­ten­tial Risk Per­sua­sion Tournament

PeterMcCluskey17 Jul 2023 18:04 UTC
71 points
1 comment8 min readLW link
(bayesianinvestor.com)

[In­ter­view w/​ Rob Miles] The case for tak­ing AI Safety seriously

fowlertm17 Jul 2023 17:08 UTC
17 points
1 comment1 min readLW link

An­nounc­ing the Ex­is­ten­tial In­foSec Forum

calebp9917 Jul 2023 17:05 UTC
10 points
0 comments2 min readLW link

Nar­ra­tive The­ory. Part 4. Neu­ral Darwinism

Eris17 Jul 2023 16:45 UTC
3 points
0 comments2 min readLW link

Sapi­ent Algorithms

Valentine17 Jul 2023 16:30 UTC
80 points
15 comments5 min readLW link

New ca­reer re­view: AI safety tech­ni­cal research

Benjamin Hilton17 Jul 2023 15:34 UTC
14 points
0 comments1 min readLW link

[Question] Con­di­tional on liv­ing in a AI safety/​al­ign­ment by de­fault uni­verse, what are the im­pli­ca­tions of this as­sump­tion be­ing true?

Noosphere8917 Jul 2023 14:44 UTC
26 points
10 comments1 min readLW link

Thoughts on “Pro­cess-Based Su­per­vi­sion”

Steven Byrnes17 Jul 2023 14:08 UTC
74 points
4 comments23 min readLW link

Proof of pos­te­ri­or­ity: a defense against AI-gen­er­ated misinformation

jchan17 Jul 2023 12:04 UTC
32 points
3 comments5 min readLW link

An Overview of AI risks—the Flyer

17 Jul 2023 12:03 UTC
20 points
0 comments1 min readLW link
(docs.google.com)

[Question] Build knowl­edge base first, or backchain?

NicholasKross17 Jul 2023 3:44 UTC
11 points
5 comments1 min readLW link

A fic­tional AI law laced w/​ al­ign­ment theory

MiguelDev17 Jul 2023 1:42 UTC
6 points
0 comments2 min readLW link

Au­toIn­ter­pre­ta­tion Finds Sparse Cod­ing Beats Alternatives

Hoagy17 Jul 2023 1:41 UTC
54 points
1 comment7 min readLW link