Archive
ELK Proposal—Make the Reporter care about the Predictor’s beliefs · Adam Jermyn and Nicholas Schiefer · Jun 11, 2022, 10:53 PM · 8 points · 0 comments · 6 min read
[Question] Why has no person / group ever taken over the world? · Aryeh Englander · Jun 11, 2022, 8:51 PM · 25 points · 19 comments · 1 min read
[Question] Are there English-speaking meetups in Frankfurt/Munich/Zurich? · Grant Demaree · Jun 11, 2022, 8:02 PM · 6 points · 2 comments · 1 min read
Beauty and the Beast · Tomás B. · Jun 11, 2022, 6:59 PM · 38 points · 8 comments · 6 min read
Poorly-Aimed Death Rays · Thane Ruthenis · Jun 11, 2022, 6:29 PM · 48 points · 5 comments · 4 min read
AGI Safety Communications Initiative · ines · Jun 11, 2022, 5:34 PM · 7 points · 0 comments · 1 min read
A gaming group for rationality-aware people · dhatas · Jun 11, 2022, 4:04 PM · 7 points · 0 comments · 1 min read
[Question] Why don’t you introduce really impressive people you personally know to AI alignment (more often)? · Verden · Jun 11, 2022, 3:59 PM · 33 points · 14 comments · 1 min read
Godzilla Strategies · johnswentworth · Jun 11, 2022, 3:44 PM · 159 points · 72 comments · 3 min read
Steganography and the CycleGAN—alignment failure case study · Jan Czechowski · Jun 11, 2022, 9:41 AM · 34 points · 0 comments · 4 min read
The Mountain Troll · lsusr · Jun 11, 2022, 9:14 AM · 103 points · 26 comments · 2 min read
Show LW: YodaTimer.com · Adam Zerner · Jun 11, 2022, 8:52 AM · 27 points · 4 comments · 1 min read
How fast can we perform a forward pass? · jsteinhardt · Jun 10, 2022, 11:30 PM · 53 points · 9 comments · 15 min read · (bounded-regret.ghost.io)
Summary of “AGI Ruin: A List of Lethalities” · Stephen McAleese · Jun 10, 2022, 10:35 PM · 45 points · 2 comments · 8 min read
How dangerous is human-level AI? · Alex_Altair · Jun 10, 2022, 5:38 PM · 21 points · 4 comments · 8 min read
Another plausible scenario of AI risk: AI builds military infrastructure while collaborating with humans, defects later. · avturchin · Jun 10, 2022, 5:24 PM · 10 points · 2 comments · 1 min read
Leaving Google, Joining the Nucleic Acid Observatory · jefftk · Jun 10, 2022, 5:00 PM · 114 points · 4 comments · 3 min read · (www.jefftk.com)
On The Spectrum, On The Guest List: (v) The Fleur Room · party girl · Jun 10, 2022, 2:50 PM · 8 points · 1 comment · 14 min read · (onthespectrumontheguestlist.substack.com)
Progress Report 6: get the tool working · Nathan Helm-Burger · Jun 10, 2022, 11:18 AM · 4 points · 0 comments · 2 min read
[Question] Is AI Alignment Impossible? · Heighn · Jun 10, 2022, 10:08 AM · 3 points · 3 comments · 1 min read
I No Longer Believe Intelligence to be “Magical” · DragonGod · Jun 10, 2022, 8:58 AM · 28 points · 34 comments · 6 min read
[linkpost] The final AI benchmark: BIG-bench · RomanS · Jun 10, 2022, 8:53 AM · 25 points · 21 comments · 1 min read
[Question] Could Patent-Trolling delay AI timelines? · Pablo Repetto · Jun 10, 2022, 2:53 AM · 1 point · 3 comments · 1 min read
[Question] Kolmogorov’s AI Forecast · interstice · Jun 10, 2022, 2:36 AM · 9 points · 1 comment · 1 min read
Tao, Kontsevich & others on HLAI in Math · interstice · Jun 10, 2022, 2:25 AM · 41 points · 5 comments · 2 min read · (www.youtube.com)
A plausible story about AI risk. · DeLesley Hutchins · Jun 10, 2022, 2:08 AM · 16 points · 2 comments · 4 min read
Open Problems in AI X-Risk [PAIS #5] · Dan H and TW123 · Jun 10, 2022, 2:08 AM · 61 points · 6 comments · 36 min read
[Question] why assume AGIs will optimize for fixed goals? · nostalgebraist · Jun 10, 2022, 1:28 AM · 147 points · 60 comments · 4 min read · 2 reviews
Bureaucracy of AIs · Logan Zoellner · Jun 9, 2022, 11:03 PM · 17 points · 6 comments · 14 min read
You Only Get One Shot: an Intuition Pump for Embedded Agency · Oliver Sourbut · Jun 9, 2022, 9:38 PM · 24 points · 4 comments · 2 min read
[Question] Forestalling Atmospheric Ignition · Lone Pine · Jun 9, 2022, 8:49 PM · 11 points · 9 comments · 1 min read
How Do Selection Theorems Relate To Interpretability? · johnswentworth · Jun 9, 2022, 7:39 PM · 60 points · 14 comments · 3 min read
Progress links and tweets, 2022-06-08 · jasoncrawford · Jun 9, 2022, 7:13 PM · 11 points · 0 comments · 1 min read · (rootsofprogress.org)
If no near-term alignment strategy, research should aim for the long-term · harsimony · Jun 9, 2022, 7:10 PM · 7 points · 1 comment · 1 min read
Operationalizing two tasks in Gary Marcus’s AGI challenge · Bill Benzon · Jun 9, 2022, 6:31 PM · 12 points · 3 comments · 8 min read
Why it’s bad to kill Grandma · dynomight · Jun 9, 2022, 6:12 PM · 29 points · 14 comments · 8 min read · (dynomight.substack.com)
[Question] Modeling humanity’s robustness to GCRs? · T431 · Jun 9, 2022, 5:34 PM · 2 points · 2 comments · 2 min read
[Question] If there was a millennium equivalent prize for AI alignment, what would the problems be? · Yair Halberstadt · Jun 9, 2022, 4:56 PM · 17 points · 4 comments · 1 min read
Book Review: How the World Became Rich · Davis Kedrosky · Jun 9, 2022, 4:55 PM · 14 points · 0 comments · 10 min read · (daviskedrosky.substack.com)
Covid 6/9/22: Nice · Zvi · Jun 9, 2022, 4:30 PM · 26 points · 2 comments · 12 min read · (thezvi.wordpress.com)
Website For Yoda Timers · Adam Zerner · Jun 9, 2022, 4:28 PM · 16 points · 1 comment · 1 min read
AI Could Defeat All Of Us Combined · HoldenKarnofsky · Jun 9, 2022, 3:50 PM · 170 points · 42 comments · 17 min read · (www.cold-takes.com)
The “mind-body vicious cycle” model of RSI & back pain · Steven Byrnes · Jun 9, 2022, 12:30 PM · 91 points · 32 comments · 12 min read
[Linkpost & Discussion] AI Trained on 4Chan Becomes ‘Hate Speech Machine’ [and outperforms GPT-3 on TruthfulQA Benchmark?!] · Yitz · Jun 9, 2022, 10:59 AM · 16 points · 5 comments · 2 min read · (www.vice.com)
Comment reply: my low-quality thoughts on why CFAR didn’t get farther with a “real/efficacious art of rationality” · AnnaSalamon · Jun 9, 2022, 2:12 AM · 263 points · 63 comments · 17 min read · 1 review
Today in AI Risk History: The Terminator (1984 film) was released. · Impassionata · Jun 9, 2022, 1:32 AM · −3 points · 6 comments · 1 min read
There’s probably a tradeoff between AI capability and safety, and we should act like it · David Johnston · Jun 9, 2022, 12:17 AM · 3 points · 3 comments · 1 min read
[Question] Has anyone actually tried to convince Terry Tao or other top mathematicians to work on alignment? · P. · Jun 8, 2022, 10:26 PM UTC · 64 points · 51 comments · 4 min read
Entitlement as a major amplifier of unhappiness · VipulNaik · Jun 8, 2022, 10:08 PM UTC · 29 points · 6 comments · 7 min read
[Question] Silly Online Rules · Gunnar_Zarncke · Jun 8, 2022, 8:40 PM UTC · 8 points · 12 comments · 1 min read