What is the solu­tion to the Align­ment prob­lem?

Algon30 Apr 2022 23:19 UTC
24 points
2 comments1 min readLW link

[Question] Why hasn’t deep learn­ing gen­er­ated sig­nifi­cant eco­nomic value yet?

Alex_Altair30 Apr 2022 20:27 UTC
115 points
89 comments2 min readLW link

Nu­clear En­ergy—Good but not the silver bul­let we were hop­ing for

Marius Hobbhahn30 Apr 2022 15:41 UTC
64 points
33 comments15 min readLW link1 review

Quick Thoughts on A.I. Governance

Nicholas Kross30 Apr 2022 14:49 UTC
70 points
8 comments2 min readLW link
(www.thinkingmuchbetter.com)

Dis­cus­sion on Thomas Philip­pon’s pa­per on TFP growth be­ing linear

Arjun Yadav30 Apr 2022 14:25 UTC
2 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

[un­ti­tled post]

superads9130 Apr 2022 13:01 UTC
0 points
49 comments1 min readLW link

Note-Tak­ing with­out Hid­den Messages

Hoagy30 Apr 2022 11:15 UTC
17 points
2 comments4 min readLW link

[Question] How good is spend­ing?

tryactions30 Apr 2022 7:27 UTC
5 points
11 comments1 min readLW link

[Linkpost] New multi-modal Deep­mind model fus­ing Chin­chilla with images and videos

p.b.30 Apr 2022 3:47 UTC
53 points
18 comments1 min readLW link

Sal­vage Epistemology

jimrandomh30 Apr 2022 2:10 UTC
102 points
119 comments1 min readLW link

Learn­ing the smooth prior

29 Apr 2022 21:10 UTC
35 points
0 comments12 min readLW link

[Question] Do FDT (or similar) recom­mend repa­ra­tions?

David Scott Krueger (formerly: capybaralet)29 Apr 2022 17:34 UTC
13 points
3 comments1 min readLW link

Say­ing no to the Appleman

Johannes C. Mayer29 Apr 2022 10:39 UTC
47 points
12 comments3 min readLW link

Prize for Align­ment Re­search Tasks

29 Apr 2022 8:57 UTC
64 points
38 comments10 min readLW link

In­creas­ing De­mand­ing­ness in EA

jefftk29 Apr 2022 1:20 UTC
61 points
22 comments3 min readLW link
(www.jefftk.com)

[Question] What is a train­ing “step” vs. “epi­sode” in ma­chine learn­ing?

Evan R. Murphy28 Apr 2022 21:53 UTC
10 points
4 comments1 min readLW link

Facts Matter

mrdlm28 Apr 2022 21:19 UTC
20 points
2 comments3 min readLW link

[Question] Is al­ign­ment pos­si­ble?

Shay28 Apr 2022 21:18 UTC
0 points
5 comments1 min readLW link

Two Proso­cial Re­jec­tion Norms

Emrik28 Apr 2022 20:53 UTC
54 points
21 comments3 min readLW link

Dath Ilan vs. Sid Meier’s Alpha Cen­tauri: Pareto Improvements

David Udell28 Apr 2022 19:26 UTC
35 points
16 comments2 min readLW link

A Parable Of Explainability

George3d628 Apr 2022 16:46 UTC
10 points
5 comments5 min readLW link
(www.epistem.ink)

Keep your pro­tos in one repo

RobertM28 Apr 2022 15:53 UTC
5 points
4 comments5 min readLW link
(docs.protocall.dev)

Covid 4/​28/​22: Take My Paxlovid, Please

Zvi28 Apr 2022 15:20 UTC
35 points
14 comments8 min readLW link
(thezvi.wordpress.com)

3-bit filters

iivonen28 Apr 2022 11:55 UTC
8 points
0 comments2 min readLW link

Jaan Tal­linn’s 2021 Philan­thropy Overview

jaan28 Apr 2022 9:55 UTC
71 points
2 comments1 min readLW link
(jaan.online)

Doom sooner

Flaglandbase28 Apr 2022 7:24 UTC
1 point
0 comments3 min readLW link

How Might an Align­ment At­trac­tor Look like?

Shmi28 Apr 2022 6:46 UTC
47 points
15 comments2 min readLW link

Virtue sig­nal­ing is some­times the best or the only met­ric we have

Holly_Elmore28 Apr 2022 4:52 UTC
41 points
43 comments5 min readLW link

The Gospel of Martin Luther

lsusr28 Apr 2022 4:29 UTC
9 points
2 comments1 min readLW link

Let­ter to my Squire

lsusr28 Apr 2022 4:16 UTC
9 points
0 comments1 min readLW link

Slides: Po­ten­tial Risks From Ad­vanced AI

Aryeh Englander28 Apr 2022 2:15 UTC
7 points
0 comments1 min readLW link

Naive com­ments on AGIlignment

Ericf28 Apr 2022 1:08 UTC
−8 points
4 comments1 min readLW link

AI Alter­na­tive Fu­tures: Sce­nario Map­ping Ar­tifi­cial In­tel­li­gence Risk—Re­quest for Par­ti­ci­pa­tion (*Closed*)

Kakili27 Apr 2022 22:07 UTC
10 points
2 comments8 min readLW link

The Speed + Sim­plic­ity Prior is prob­a­bly anti-deceptive

Yonadav Shavit27 Apr 2022 19:30 UTC
30 points
28 comments12 min readLW link

If you’re very op­ti­mistic about ELK then you should be op­ti­mistic about outer alignment

Sam Marks27 Apr 2022 19:30 UTC
17 points
8 comments3 min readLW link

The Game of Masks

Slimepriestess27 Apr 2022 18:03 UTC
50 points
18 comments11 min readLW link
(hivewired.wordpress.com)

Law-Fol­low­ing AI 3: Lawless AI Agents Un­der­mine Sta­bi­liz­ing Agreements

Cullen27 Apr 2022 17:30 UTC
2 points
2 comments3 min readLW link

Law-Fol­low­ing AI 2: In­tent Align­ment + Su­per­in­tel­li­gence → Lawless AI (By De­fault)

Cullen27 Apr 2022 17:27 UTC
5 points
2 comments6 min readLW link

Law-Fol­low­ing AI 1: Se­quence In­tro­duc­tion and Structure

Cullen27 Apr 2022 17:26 UTC
18 points
10 comments9 min readLW link

[In­tro to brain-like-AGI safety] 13. Sym­bol ground­ing & hu­man so­cial instincts

Steven Byrnes27 Apr 2022 13:30 UTC
73 points
15 comments15 min readLW link

The case for turn­ing glowfic into Sequences

Thomas Kwa27 Apr 2022 6:58 UTC
88 points
29 comments5 min readLW link

[Link] Ev­i­dence of Fabri­cated Data in a Vi­tamin C trial by Paul E Marik et al in CHEST

Kenny27 Apr 2022 6:48 UTC
6 points
1 comment1 min readLW link

SERI ML Align­ment The­ory Schol­ars Pro­gram 2022

27 Apr 2022 0:43 UTC
69 points
6 comments3 min readLW link

EU Max­i­miz­ing in a Gloomy World

David Udell27 Apr 2022 0:28 UTC
6 points
2 comments1 min readLW link

Why Copi­lot Ac­cel­er­ates Timelines

Michaël Trazzi26 Apr 2022 22:06 UTC
35 points
14 comments7 min readLW link

Univer­sals of Mo­ral­ity: Toward Hu­man-Cen­tric Com­mu­ni­ca­tion Platforms

scafaria26 Apr 2022 21:15 UTC
−3 points
3 comments5 min readLW link
(scafaria.com)

[$20K in Prizes] AI Safety Ar­gu­ments Competition

26 Apr 2022 16:13 UTC
75 points
518 comments3 min readLW link

Con­ti­nen­tal Philos­o­phy as Un­der­grad­u­ate Mathematics

Jan26 Apr 2022 8:05 UTC
17 points
3 comments9 min readLW link
(universalprior.substack.com)

dalle2 comments

nostalgebraist26 Apr 2022 5:30 UTC
183 points
14 comments13 min readLW link
(nostalgebraist.tumblr.com)

Make a neu­ral net­work in ~10 minutes

Arjun Yadav26 Apr 2022 5:24 UTC
8 points
0 comments4 min readLW link
(arjunyadav.net)