All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 121314 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

EA & LW Forums Weekly Summary (5 − 11 Sep 22′)

Zoe Williams12 Sep 2022 23:24 UTC

24 points

0 comments13 min readLW link

Time is not the bottleneck (on making progress thinking about difficult things)

kman12 Sep 2022 20:45 UTC

30 points

10 comments1 min readLW link

[Linkpost] A survey on over 300 works about interpretability in deep networks

scasper12 Sep 2022 19:07 UTC

97 points

7 comments2 min readLW link

(arxiv.org)

[Question] Why do People Think Intelligence Will be “Easy”?

DragonGod12 Sep 2022 17:32 UTC

15 points

32 comments2 min readLW link

Alignment via prosocial brain algorithms

Cameron Berg12 Sep 2022 13:48 UTC

45 points

30 comments6 min readLW link

I’ve written a Fantasy Novel to Promote Effective Altruism

Timothy Underwood12 Sep 2022 12:14 UTC

23 points

21 comments13 min readLW link

Ideological Inference Engines: Making Deontology Differentiable*

Paul Bricman12 Sep 2022 12:00 UTC

6 points

0 comments14 min readLW link

Freeloading?

jefftk12 Sep 2022 11:20 UTC

28 points

24 comments3 min readLW link

(www.jefftk.com)

Can you force a neural network to keep generalizing?

Q Home12 Sep 2022 10:14 UTC

2 points

10 comments5 min readLW link

Black Box Investigation Research Hackathon

Esben Kran and Jonas Hallgren

12 Sep 2022 7:20 UTC

9 points

4 comments2 min readLW link

Argument against 20% GDP growth from AI within 10 years [Linkpost]

aog12 Sep 2022 4:08 UTC

59 points

20 comments5 min readLW link

(twitter.com)

Fermi Paradox: Iron Age Milky Way

Rofel Wodring11 Sep 2022 20:32 UTC

−10 points

9 comments3 min readLW link

You Don’t Have To Click The Links

Simon Berens11 Sep 2022 18:13 UTC

28 points

7 comments1 min readLW link

The Ultimate Step-by-Step Hiring Playbook

intellectronica11 Sep 2022 14:39 UTC

8 points

2 comments4 min readLW link

(www.intellectronica.net)

[Question] In forecasting, how do accuracy, calibration and reliability relate to each other?

amarai11 Sep 2022 12:04 UTC

3 points

4 comments1 min readLW link

Briefly thinking through some analogs of debate

Eli Tyre11 Sep 2022 12:02 UTC

20 points

3 comments4 min readLW link

Making a New Table Leaf

jefftk11 Sep 2022 11:40 UTC

19 points

0 comments1 min readLW link

(www.jefftk.com)

AI Risk Intro 1: Advanced AI Might Be Very Bad

CallumMcDougall and L Rudolf L

11 Sep 2022 10:57 UTC

46 points

13 comments30 min readLW link

A Pin and a Balloon: Anthropic Fragility Increases Chances of Runaway Global Warming

avturchin11 Sep 2022 10:25 UTC

33 points

23 comments52 min readLW link

[Question] Is there an Ultimate text editor?

Johannes C. Mayer11 Sep 2022 9:19 UTC

4 points

10 comments1 min readLW link

Pascal: The Greatness and Littleness of Man, A Thinking Reed

NoBadCake10 Sep 2022 20:05 UTC

9 points

0 comments1 min readLW link

[Job] Project Manager: Community Health (CEA)

Xodarap10 Sep 2022 18:40 UTC

3 points

0 comments1 min readLW link

(www.centreforeffectivealtruism.org)

Unbounded utility functions and precommitment

MichaelStJules10 Sep 2022 16:16 UTC

4 points

3 comments1 min readLW link

[Question] What is the “Less Wrong” approved acronym for 1984-risk?

Logan Zoellner10 Sep 2022 14:38 UTC

5 points

8 comments1 min readLW link

Find out how utilitarian you are—a mega thread of philosophy polls

spencerg10 Sep 2022 14:05 UTC

8 points

3 comments1 min readLW link

(twitter.com)

Put Dirty Dishes in the Dishwasher

jefftk10 Sep 2022 13:10 UTC

37 points

16 comments1 min readLW link

(www.jefftk.com)

Quintin’s alignment papers roundup—week 1

Quintin Pope10 Sep 2022 6:39 UTC

122 points

6 comments9 min readLW link

Path dependence in ML inductive biases

Vivek Hebbar and evhub

10 Sep 2022 1:38 UTC

68 points

13 comments10 min readLW link

Keeping Time in Epoch Seconds

Gordon Seidoh Worley10 Sep 2022 0:28 UTC

11 points

2 comments2 min readLW link

Ought will host a factored cognition “Lab Meeting”

jungofthewon and stuhlmueller

9 Sep 2022 23:46 UTC

35 points

1 comment1 min readLW link

Web4/Heaven—The Simulation

Dunning K.9 Sep 2022 22:58 UTC

26 points

2 comments1 min readLW link

Evaluations project @ ARC is hiring a researcher and a webdev/engineer

Beth Barnes9 Sep 2022 22:46 UTC

99 points

7 comments10 min readLW link

Swap and Scale

Stephen Fowler9 Sep 2022 22:41 UTC

17 points

3 comments1 min readLW link

My emotional reaction to the current funding situation

Sam F. Brown9 Sep 2022 22:02 UTC

108 points

36 comments5 min readLW link

(sambrown.eu)

AlexaTM − 20 Billion Parameter Model With Impressive Performance

MrThink9 Sep 2022 21:46 UTC

5 points

0 comments1 min readLW link

[Fun][Link] Alignment SMBC Comic

Gunnar_Zarncke9 Sep 2022 21:38 UTC

8 points

2 comments1 min readLW link

(www.smbc-comics.com)

Gatekeeper Victory: AI Box Reflection

Double and DaemonicSigil

9 Sep 2022 21:38 UTC

7 points

6 comments9 min readLW link

Interpreting Affordable Housing

jefftk9 Sep 2022 19:40 UTC

16 points

0 comments1 min readLW link

(www.jefftk.com)

London Rationalish Meetup 2022-09-11

calmiguana9 Sep 2022 18:39 UTC

1 point

0 comments1 min readLW link

AI alignment with humans… but with which humans?

geoffreymiller9 Sep 2022 18:21 UTC

12 points

33 comments3 min readLW link

[Question] Should you refrain from having children because of the risk posed by artificial intelligence?

Mientras9 Sep 2022 17:39 UTC

18 points

31 comments1 min readLW link

Notes on Resolve

David Gross9 Sep 2022 16:42 UTC

10 points

3 comments31 min readLW link

Oversight Leagues: The Training Game as a Feature

Paul Bricman9 Sep 2022 10:08 UTC

20 points

6 comments10 min readLW link

Understanding and avoiding value drift

TurnTrout9 Sep 2022 4:16 UTC

48 points

14 comments6 min readLW link

Samotsvety’s AI risk forecasts

elifland9 Sep 2022 4:01 UTC

44 points

0 comments4 min readLW link

Most People Start With The Same Few Bad Ideas

johnswentworth9 Sep 2022 0:29 UTC

177 points

31 comments3 min readLW link

Monitoring for deceptive alignment

evhub8 Sep 2022 23:07 UTC

130 points

8 comments9 min readLW link

[An email with a bunch of links I sent an experienced ML researcher interested in learning about Alignment / x-safety.]

David Scott Krueger8 Sep 2022 22:28 UTC

47 points

1 comment5 min readLW link

Progress links & tweets, 2022-09-08

jasoncrawford8 Sep 2022 20:43 UTC

13 points

3 comments1 min readLW link

(rootsofprogress.org)

Postmortem: Trying out for Manifold Markets

Milli | Martin and Austin Chen

8 Sep 2022 17:54 UTC

24 points

0 comments3 min readLW link