Don’t be a Maxi

Cole Killian 31 Jul 2022 23:59 UTC
15 points
7 comments 2 min read LW link
(colekillian.com)

Abstraction sacrifices causal clarity

Marv K 31 Jul 2022 19:24 UTC
2 points
0 comments 3 min read LW link

Time-logging programs and/or spreadsheets (2022)

mikbp 31 Jul 2022 18:18 UTC
3 points
3 comments 1 min read LW link

Conservatism is a rational response to epistemic uncertainty

contrarianbrit 31 Jul 2022 18:04 UTC
2 points
11 comments 9 min read LW link
(thomasprosser.substack.com)

South Bay ACX/LW Meetup

IS 31 Jul 2022 15:30 UTC
2 points
0 comments 1 min read LW link

Perverse Independence Incentives

jefftk 31 Jul 2022 14:40 UTC
58 points
3 comments 1 min read LW link
(www.jefftk.com)

Wolfram Research v Cook

Kenny 31 Jul 2022 13:35 UTC
7 points
2 comments 8 min read LW link

Wanted: Notation for credal resilience

PeterH 31 Jul 2022 7:35 UTC
21 points
12 comments 1 min read LW link

Anatomy of a Dating Document

squidious 31 Jul 2022 2:40 UTC
26 points
24 comments 4 min read LW link
(opalsandbonobos.blogspot.com)

chinchilla’s wild implications

nostalgebraist 31 Jul 2022 1:18 UTC
415 points
128 comments 10 min read LW link 1 review

AGI-level reasoner will appear sooner than an agent; what the humanity will do with this reasoner is critical

Roman Leventov 30 Jul 2022 20:56 UTC
24 points
10 comments 1 min read LW link

[Question] What job should I do?

Tom Paine 30 Jul 2022 9:15 UTC
2 points
8 comments 1 min read LW link

How transparency changed over time

ViktoriaMalyasova 30 Jul 2022 4:36 UTC
21 points
0 comments 6 min read LW link

Translating between Latent Spaces

30 Jul 2022 3:25 UTC
27 points
2 comments 8 min read LW link

Drexler’s Nanotech Forecast

PeterMcCluskey 30 Jul 2022 0:45 UTC
25 points
28 comments 3 min read LW link
(www.bayesianinvestor.com)

Humans Reflecting on HRH

leogao 29 Jul 2022 21:56 UTC
26 points
4 comments 2 min read LW link

Comparing Four Approaches to Inner Alignment

Lucas Teixeira 29 Jul 2022 21:06 UTC
35 points
1 comment 9 min read LW link

Questions for a Theory of Narratives

Marv K 29 Jul 2022 19:31 UTC
5 points
4 comments 4 min read LW link

Focusing

CFAR!Duncan 29 Jul 2022 19:15 UTC
107 points
23 comments 14 min read LW link

Conjecture: Internal Infohazard Policy

29 Jul 2022 19:07 UTC
131 points
6 comments 19 min read LW link

Abstracting The Hardness of Alignment: Unbounded Atomic Optimization

adamShimi 29 Jul 2022 18:59 UTC
66 points
3 comments 16 min read LW link

Bucket Errors

CFAR!Duncan 29 Jul 2022 18:50 UTC
40 points
7 comments 11 min read LW link

Distillation Contest—Results and Recap

Aris 29 Jul 2022 17:40 UTC
34 points
0 comments 7 min read LW link

The generalized Sierpinski-Mazurkiewicz theorem.

Donald Hobson 29 Jul 2022 0:12 UTC
11 points
4 comments 1 min read LW link

The Conversations We Make Space For

Severin T. Seehrich 28 Jul 2022 21:37 UTC
21 points
0 comments 3 min read LW link

Announcing the AI Safety Field Building Hub, a new effort to provide AISFB projects, mentorship, and funding

Vael Gates 28 Jul 2022 21:29 UTC
49 points
3 comments 6 min read LW link

Defining Optimization in a Deeper Way Part 4

J Bostock 28 Jul 2022 17:02 UTC
7 points
0 comments 5 min read LW link

Covid 7/28/22: Ruining It For Everyone

Zvi 28 Jul 2022 15:10 UTC
32 points
8 comments 12 min read LW link
(thezvi.wordpress.com)

Monkeypox Post #2

Zvi 28 Jul 2022 13:20 UTC
36 points
3 comments 6 min read LW link
(thezvi.wordpress.com)

For Better Commenting, Stop Out Loud

DirectedEvolution 28 Jul 2022 1:39 UTC
18 points
30 comments 1 min read LW link

Seeking beta readers who are ignorant of biology but knowledgeable about AI safety

Holly_Elmore 27 Jul 2022 23:02 UTC
11 points
6 comments 1 min read LW link

Principles of Privacy for Alignment Research

johnswentworth 27 Jul 2022 19:53 UTC
72 points
30 comments 7 min read LW link

Moral strategies at different capability levels

Richard_Ngo 27 Jul 2022 18:50 UTC
112 points
14 comments 5 min read LW link
(thinkingcomplete.blogspot.com)

Progress links and tweets, 2022-07-27

jasoncrawford 27 Jul 2022 17:20 UTC
18 points
0 comments 1 min read LW link
(rootsofprogress.org)

Quantum Advantage in Learning from Experiments

Dennis Towne 27 Jul 2022 15:49 UTC
5 points
5 comments 1 min read LW link
(ai.googleblog.com)

Levels of Pluralism

adamShimi 27 Jul 2022 9:35 UTC
34 points
0 comments 14 min read LW link

Human trials for the Marburg vaccine: funding opportunity?

americanwalrus 27 Jul 2022 5:53 UTC
3 points
0 comments 1 min read LW link
(www.independent.co.uk)

[Question] “Fanatical” Longtermists: Why is Pascal’s Wager wrong?

Yitz 27 Jul 2022 4:16 UTC
3 points
7 comments 1 min read LW link

Unifying Bargaining Notions (2/2)

Diffractor 27 Jul 2022 3:40 UTC
116 points
19 comments 21 min read LW link

AGI ruin scenarios are likely (and disjunctive)

So8res 27 Jul 2022 3:21 UTC
170 points
38 comments 6 min read LW link

Technocracy and the Space Age

jasoncrawford 26 Jul 2022 23:14 UTC
25 points
5 comments 2 min read LW link
(rootsofprogress.org)

«Boundaries», Part 1: a key missing concept from utility theory

Andrew_Critch 26 Jul 2022 23:03 UTC
158 points
32 comments 7 min read LW link

Incoherence of unbounded selfishness

emmab 26 Jul 2022 22:27 UTC
−6 points
2 comments 1 min read LW link

«Boundaries» Sequence (Index Post)

Andrew_Critch 26 Jul 2022 19:12 UTC
25 points
1 comment 1 min read LW link

Active Inference as a formalisation of instrumental convergence

Roman Leventov 26 Jul 2022 17:55 UTC
12 points
2 comments 3 min read LW link
(direct.mit.edu)

NeurIPS ML Safety Workshop 2022

Dan H 26 Jul 2022 15:28 UTC
72 points
2 comments 1 min read LW link
(neurips2022.mlsafety.org)

AI ethics vs AI alignment

Wei Dai 26 Jul 2022 13:08 UTC
5 points
1 comment 1 min read LW link

Utility functions and probabilities are entangled

Thomas Kwa 26 Jul 2022 5:36 UTC
15 points
5 comments 1 min read LW link

How Promising is Theoretical Research on Rationality? Seeking Career Advice

Aspirant223 26 Jul 2022 1:08 UTC
3 points
3 comments 3 min read LW link

Prediction markets meetup/coworking (hosted by Manifold Markets)

26 Jul 2022 0:14 UTC
2 points
0 comments 1 min read LW link