Kel­sey Piper’s re­cent in­ter­view of SBF

agucova16 Nov 2022 20:30 UTC
51 points
29 comments2 min readLW link
(www.vox.com)

The Echo Principle

Jonathan Moregård16 Nov 2022 20:09 UTC
4 points
0 comments3 min readLW link
(honestliving.substack.com)

[Question] Is there some rea­son LLMs haven’t seen broader use?

tailcalled16 Nov 2022 20:04 UTC
25 points
27 comments1 min readLW link

When should we be sur­prised that an in­ven­tion took “so long”?

jasoncrawford16 Nov 2022 20:04 UTC
32 points
11 comments4 min readLW link
(rootsofprogress.org)

Ques­tions about Value Lock-in, Pa­ter­nal­ism, and Empowerment

Sam F. Brown16 Nov 2022 15:33 UTC
13 points
2 comments12 min readLW link
(sambrown.eu)

If Pro­fes­sional In­vestors Missed This...

jefftk16 Nov 2022 15:00 UTC
37 points
18 comments3 min readLW link
(www.jefftk.com)

Disagree­ment with bio an­chors that lead to shorter timelines

Marius Hobbhahn16 Nov 2022 14:40 UTC
75 points
17 comments7 min readLW link1 review

Cur­rent themes in mechanis­tic in­ter­pretabil­ity research

16 Nov 2022 14:14 UTC
89 points
2 comments12 min readLW link

Un­pack­ing “Shard The­ory” as Hunch, Ques­tion, The­ory, and Insight

Jacy Reese Anthis16 Nov 2022 13:54 UTC
31 points
9 comments2 min readLW link

Mir­a­cles and why not to be­lieve them

mruwnik16 Nov 2022 12:07 UTC
4 points
0 comments2 min readLW link

[Question] How do peo­ple do re­mote re­search col­lab­o­ra­tions effec­tively?

Krieger16 Nov 2022 11:51 UTC
8 points
0 comments1 min readLW link

Method of state­ments: an al­ter­na­tive to taboo

Q Home16 Nov 2022 10:57 UTC
7 points
0 comments41 min readLW link

The two con­cep­tions of Ac­tive In­fer­ence: an in­tel­li­gence ar­chi­tec­ture and a the­ory of agency

Roman Leventov16 Nov 2022 9:30 UTC
18 points
0 comments4 min readLW link

Devel­oper ex­pe­rience for the motivation

Adam Zerner16 Nov 2022 7:12 UTC
49 points
7 comments4 min readLW link

Progress links and tweets, 2022-11-15

jasoncrawford16 Nov 2022 3:21 UTC
9 points
0 comments2 min readLW link
(rootsofprogress.org)

EA & LW Fo­rums Weekly Sum­mary (7th Nov − 13th Nov 22′)

Zoe Williams16 Nov 2022 3:04 UTC
19 points
0 comments14 min readLW link

The FTX Saga—Simplified

Annapurna16 Nov 2022 2:42 UTC
44 points
10 comments7 min readLW link
(jorgevelez.substack.com)

Utili­tar­i­anism and the idea of a “ra­tio­nal agent” are fun­da­men­tally in­con­sis­tent with reality

banev16 Nov 2022 0:19 UTC
−4 points
1 comment1 min readLW link

[Question] Is the speed of train­ing large mod­els go­ing to in­crease sig­nifi­cantly in the near fu­ture due to Cere­bras An­dromeda?

Amal 15 Nov 2022 22:50 UTC
13 points
11 comments1 min readLW link

[Question] What is our cur­rent best in­fo­haz­ard policy for AGI (safety) re­search?

Roman Leventov15 Nov 2022 22:33 UTC
12 points
2 comments1 min readLW link

ACX/​SSC Meetup 1 pm Sun­day Nov 20

svfritz15 Nov 2022 20:39 UTC
2 points
0 comments1 min readLW link

SBF x LoL

Nicholas Kross15 Nov 2022 20:24 UTC
17 points
6 comments4 min readLW link

Some re­search ideas in forecasting

Jsevillamol15 Nov 2022 19:47 UTC
35 points
2 comments6 min readLW link

Strat­egy of In­ner Conflict

Jonathan Moregård15 Nov 2022 19:38 UTC
9 points
4 comments6 min readLW link
(honestliving.substack.com)

The limited up­side of interpretability

Peter S. Park15 Nov 2022 18:46 UTC
13 points
11 comments10 min readLW link

Why bet Kelly?

AlexMennen15 Nov 2022 18:12 UTC
32 points
14 comments5 min readLW link

En­tropy Scal­ing And In­trin­sic Me­mory

15 Nov 2022 18:11 UTC
20 points
5 comments5 min readLW link

[Question] Will nan­otech/​biotech be what leads to AI doom?

tailcalled15 Nov 2022 17:38 UTC
4 points
9 comments2 min readLW link

Value For­ma­tion: An Over­ar­ch­ing Model

Thane Ruthenis15 Nov 2022 17:16 UTC
34 points
20 comments34 min readLW link

In­ter­nal com­mu­ni­ca­tion framework

15 Nov 2022 12:41 UTC
38 points
13 comments12 min readLW link

Bet­ter Mastodon Aliases

jefftk15 Nov 2022 12:10 UTC
14 points
3 comments1 min readLW link
(www.jefftk.com)

The econ­omy as an anal­ogy for ad­vanced AI systems

15 Nov 2022 11:16 UTC
28 points
0 comments5 min readLW link

We need bet­ter pre­dic­tion markets

eigen15 Nov 2022 4:54 UTC
9 points
8 comments1 min readLW link

Prevent­ing, re­vers­ing, and ad­dress­ing data leak­age: some thoughts

VipulNaik15 Nov 2022 2:09 UTC
14 points
4 comments25 min readLW link

Win­ners of the AI Safety Nudge Competition

Marc Carauleanu15 Nov 2022 1:06 UTC
4 points
0 comments1 min readLW link

Ly­ing to Save Humanity

cebsuvx14 Nov 2022 23:04 UTC
−1 points
4 comments1 min readLW link

Mo­ral con­ta­gion heuristic

Mvolz14 Nov 2022 21:17 UTC
14 points
3 comments2 min readLW link

Will we run out of ML data? Ev­i­dence from pro­ject­ing dataset size trends

Pablo Villalobos14 Nov 2022 16:42 UTC
75 points
12 comments2 min readLW link
(epochai.org)

I (with the help of a few more peo­ple) am plan­ning to cre­ate an in­tro­duc­tion to AI Safety that a smart teenager can un­der­stand. What am I miss­ing?

Tapatakt14 Nov 2022 16:12 UTC
3 points
5 comments1 min readLW link

Two New New­comb Variants

eva_14 Nov 2022 14:01 UTC
26 points
24 comments3 min readLW link

Im­prov­ing Emer­gency Ve­hi­cle Utilization

jefftk14 Nov 2022 14:00 UTC
15 points
10 comments1 min readLW link
(www.jefftk.com)

X-risk Miti­ga­tion Does Ac­tu­ally Re­quire Longter­mism

DragonGod14 Nov 2022 12:54 UTC
6 points
1 comment1 min readLW link

[Question] Why don’t we have self driv­ing cars yet?

Linda Linsefors14 Nov 2022 12:19 UTC
22 points
16 comments1 min readLW link

Ei­gen­val­ues for Dis­tance from The Bud­dhist Pre­cepts And The Ten Commandments

benjamin.j.campbell14 Nov 2022 5:50 UTC
−3 points
2 comments1 min readLW link

AI Safety Micro­grant Round

Chris_Leong14 Nov 2022 4:25 UTC
22 points
1 comment3 min readLW link

Es­ti­mat­ing the prob­a­bil­ity that FTX Fu­ture Fund grant money gets clawed back

spencerg14 Nov 2022 3:33 UTC
28 points
6 comments1 min readLW link
(manifold.markets)

Ra­tional over­con­fi­dence in the tens of billions: re­cent example

banev13 Nov 2022 22:48 UTC
−20 points
3 comments2 min readLW link

In Defence of Tem­po­ral Dis­count­ing in Longter­mist Ethics

DragonGod13 Nov 2022 21:54 UTC
25 points
4 comments3 min readLW link

An­nounc­ing Non­lin­ear Emer­gency Funding

KatWoods13 Nov 2022 19:02 UTC
54 points
0 comments1 min readLW link

The Align­ment Com­mu­nity Is Cul­turally Broken

sudo13 Nov 2022 18:53 UTC
137 points
68 comments2 min readLW link