Man­i­fold x CSPI $25k Fore­cast­ing Tournament

David Chee9 Aug 2022 21:13 UTC
5 points
0 comments1 min readLW link
(www.cspicenter.com)

Pro­posal: Con­sider not us­ing dis­tance-di­rec­tion-di­men­sion words in ab­stract discussions

moridinamael9 Aug 2022 20:44 UTC
45 points
18 comments5 min readLW link

[Question] How would two su­per­in­tel­li­gent AIs in­ter­act, if they are un­al­igned with each other?

Nathan11239 Aug 2022 18:58 UTC
4 points
6 comments1 min readLW link

Disagree­ments about Align­ment: Why, and how, we should try to solve them

ojorgensen9 Aug 2022 18:49 UTC
11 points
2 comments16 min readLW link

Progress links and tweets, 2022-08-09

jasoncrawford9 Aug 2022 17:35 UTC
11 points
3 comments1 min readLW link
(rootsofprogress.org)

[Question] Is it pos­si­ble to find ven­ture cap­i­tal for AI re­search org with strong safety fo­cus?

AnonResearch9 Aug 2022 16:12 UTC
6 points
1 comment1 min readLW link

[Question] Many Gods re­fu­ta­tion and In­stru­men­tal Goals. (Proper one)

aditya malik9 Aug 2022 11:59 UTC
0 points
15 comments1 min readLW link

Con­tent gen­er­a­tion. Where do we draw the line?

Q Home9 Aug 2022 10:51 UTC
6 points
7 comments2 min readLW link

[Question] What are some al­ter­na­tives to Shap­ley val­ues which drop ad­di­tivity?

eapi9 Aug 2022 9:16 UTC
11 points
6 comments1 min readLW link
(math.stackexchange.com)

Ra­dio Bostrom: Au­dio nar­ra­tions of pa­pers by Nick Bostrom

PeterH9 Aug 2022 8:56 UTC
12 points
0 comments2 min readLW link
(forum.effectivealtruism.org)

Team Shard Sta­tus Report

David Udell9 Aug 2022 5:33 UTC
38 points
8 comments3 min readLW link

An­nounc­ing: Mechanism De­sign for AI Safety—Read­ing Group

Rubi J. Hudson9 Aug 2022 4:21 UTC
18 points
3 comments4 min readLW link

[Question] What are some Works that might be use­ful but are difficult, so for­got­ten?

TekhneMakre9 Aug 2022 2:22 UTC
10 points
5 comments1 min readLW link

Pro­ject pro­posal: Test­ing the IBP defi­ni­tion of agent

9 Aug 2022 1:09 UTC
21 points
4 comments2 min readLW link

How (not) to choose a re­search project

9 Aug 2022 0:26 UTC
78 points
11 comments7 min readLW link

[Question] Are ya win­ning, son?

Nathan11239 Aug 2022 0:06 UTC
14 points
13 comments2 min readLW link

Gen­eral al­ign­ment properties

TurnTrout8 Aug 2022 23:40 UTC
50 points
2 comments1 min readLW link

Ex­per­i­ment: Be my math tu­tor?

sudo8 Aug 2022 22:50 UTC
12 points
5 comments1 min readLW link

En­cul­tured AI, Part 1 Ap­pendix: Rele­vant Re­search Examples

8 Aug 2022 22:44 UTC
11 points
1 comment7 min readLW link

En­cul­tured AI Pre-plan­ning, Part 1: En­abling New Benchmarks

8 Aug 2022 22:44 UTC
63 points
2 comments6 min readLW link

Broad Bas­ins and Data Compression

8 Aug 2022 20:33 UTC
33 points
6 comments7 min readLW link

In­ter­pretabil­ity/​Tool-ness/​Align­ment/​Cor­rigi­bil­ity are not Composable

johnswentworth8 Aug 2022 18:05 UTC
129 points
12 comments3 min readLW link

LW Meetup @ DEFCON (Las Ve­gas) − 5-7pm Thu. Aug. 11 at Fo­rum Food Court (Cae­sars)

jchan8 Aug 2022 14:57 UTC
6 points
0 comments1 min readLW link

A suffi­ciently para­noid pa­per­clip maximizer

RomanS8 Aug 2022 11:17 UTC
17 points
10 comments2 min readLW link

[Question] In­stru­men­tal Goals and Many Gods Re­fu­ta­tion

aditya malik8 Aug 2022 10:46 UTC
−10 points
4 comments1 min readLW link

Area un­der the curve, Eat Dirt, Broc­coli Er­rors, Coper­ni­cus & Chaos

CFAR!Duncan8 Aug 2022 8:17 UTC
36 points
0 comments7 min readLW link

Steganog­ra­phy in Chain of Thought Reasoning

A Ray8 Aug 2022 3:47 UTC
61 points
13 comments6 min readLW link

How Deadly Will Roughly-Hu­man-Level AGI Be?

David Udell8 Aug 2022 1:59 UTC
12 points
6 comments1 min readLW link

[Question] Can we get full au­dio for Eliezer’s con­ver­sa­tion with Sam Har­ris?

JakubK7 Aug 2022 20:35 UTC
30 points
8 comments1 min readLW link

Com­plex­ity No Bar to AI (Or, why Com­pu­ta­tional Com­plex­ity mat­ters less than you think for real life prob­lems)

Noosphere897 Aug 2022 19:55 UTC
17 points
14 comments3 min readLW link
(www.gwern.net)

The les­sons of Xanadu

jasoncrawford7 Aug 2022 17:59 UTC
109 points
20 comments8 min readLW link
(jasoncrawford.org)

Care­ful with Caching

jefftk7 Aug 2022 15:20 UTC
15 points
3 comments1 min readLW link
(www.jefftk.com)

[Question] How would Log­i­cal De­ci­sion The­o­ries ad­dress the Psy­chopath But­ton?

Nathan11237 Aug 2022 15:19 UTC
5 points
33 comments1 min readLW link

Jack Clark on the re­al­ities of AI policy

Kaj_Sotala7 Aug 2022 8:44 UTC
68 points
3 comments3 min readLW link
(threadreaderapp.com)

Ex­pected (So­cial) Value

algrthms7 Aug 2022 8:16 UTC
5 points
2 comments3 min readLW link

La­men­ta­tions, Gaza and Empathy

Yair Halberstadt7 Aug 2022 7:55 UTC
19 points
2 comments3 min readLW link

Paper read­ing as a Cargo Cult

jem-mosig7 Aug 2022 7:50 UTC
70 points
10 comments5 min readLW link

Most Ivy-smart stu­dents aren’t at Ivy-tier schools

Aaron Bergman7 Aug 2022 3:18 UTC
82 points
7 comments8 min readLW link
(www.aaronbergman.net)

Seat­tle Septem­ber meetup: Ab­sur­dity Bias

Nikita Sokolsky7 Aug 2022 1:37 UTC
3 points
0 comments1 min readLW link

Do meta-memes and meta-an­timemes ex­ist? e.g. ‘The map is not the ter­ri­tory’ is also a map

M. Y. Zuo7 Aug 2022 1:17 UTC
4 points
31 comments1 min readLW link

New­comb­ness of the Din­ing Philoso­phers Problem

Nathan11236 Aug 2022 21:58 UTC
10 points
2 comments2 min readLW link

[AMA] An­nounc­ing Open Phil’s Univer­sity Group Or­ga­nizer and Cen­tury Fel­low­ships [x-post]

6 Aug 2022 21:48 UTC
14 points
0 comments13 min readLW link
(forum.effectivealtruism.org)

Bos­ton Rents Over Time II

jefftk6 Aug 2022 21:20 UTC
23 points
0 comments2 min readLW link
(www.jefftk.com)

Dwarves & D.Sci: Data Fortress

aphyer6 Aug 2022 18:24 UTC
35 points
26 comments3 min readLW link

A De­cep­tively Sim­ple Ar­gu­ment in fa­vor of Prob­lem Factorization

Logan Zoellner6 Aug 2022 17:32 UTC
3 points
4 comments1 min readLW link

A Data limited future

Donald Hobson6 Aug 2022 14:56 UTC
52 points
25 comments2 min readLW link

Six weeks doesn’t make a habit

lynettebye6 Aug 2022 8:54 UTC
47 points
1 comment3 min readLW link

Why I Am Skep­ti­cal of AI Reg­u­la­tion as an X-Risk Miti­ga­tion Strategy

A Ray6 Aug 2022 5:46 UTC
31 points
14 comments2 min readLW link

My ad­vice on find­ing your own path

A Ray6 Aug 2022 4:57 UTC
35 points
3 comments3 min readLW link

Pre­dic­tIt is clos­ing due to CFTC chang­ing its mind

eigen6 Aug 2022 3:34 UTC
20 points
4 comments1 min readLW link