Toy Models of Superposition

evhub 21 Sep 2022 23:48 UTC

69 points

4 comments 5 min read LW link 1 review

(transformer-circuits.pub)

How to Train Your AGI Dragon

Eris Discordia 21 Sep 2022 22:28 UTC

−1 points

3 comments 5 min read LW link

An issue with MacAskill’s Evidentialist’s Wager

Martín Soto 21 Sep 2022 22:02 UTC

6 points

9 comments 4 min read LW link

Announcing AISIC 2022 - the AI Safety Israel Conference, October 19-20

Davidmanheim 21 Sep 2022 19:32 UTC

13 points

0 comments 1 min read LW link

Nearcast-based “deployment problem” analysis

HoldenKarnofsky 21 Sep 2022 18:52 UTC

87 points

2 comments 26 min read LW link

Scraping training data for your mind

Henrik Karlsson 21 Sep 2022 16:27 UTC

47 points

4 comments 8 min read LW link

(escapingflatland.substack.com)

Trends in Training Dataset Sizes

Pablo Villalobos 21 Sep 2022 15:47 UTC

25 points

2 comments 5 min read LW link

(epochai.org)

[Question] Can you define “utility” in utilitarianism without using words for specific human emotions?

SurvivalBias 21 Sep 2022 3:29 UTC

13 points

46 comments 1 min read LW link

“Infohazards” The ML Field’s Greatest Excuse.

Puffy Bird 21 Sep 2022 3:19 UTC

−3 points

1 comment 3 min read LW link

Case Rates to Sequencing Reads

jefftk 21 Sep 2022 2:00 UTC

15 points

4 comments 4 min read LW link

(www.jefftk.com)

Towards deconfusing wireheading and reward maximization

leogao 21 Sep 2022 0:36 UTC

81 points

7 comments 4 min read LW link

[Question] What key nutrients are required for daily energy?

trevor 20 Sep 2022 23:30 UTC

7 points

4 comments 1 min read LW link

Quantified Intuitions: An epistemics training website including a new EA-themed calibration app

Sage Future and elifland

20 Sep 2022 22:25 UTC

28 points

2 comments 2 min read LW link

The Redaction Machine

Ben 20 Sep 2022 22:03 UTC

530 points

48 comments 27 min read LW link 1 review

You Are Not Measuring What You Think You Are Measuring

johnswentworth 20 Sep 2022 20:04 UTC

441 points

45 comments 8 min read LW link 2 reviews

What happened to the idea of progress?

jasoncrawford 20 Sep 2022 19:56 UTC

8 points

1 comment 1 min read LW link

(bigthink.com)

Features and Antifeatures

Davis_Kingsley 20 Sep 2022 17:54 UTC

26 points

8 comments 1 min read LW link

Cryptocurrency Exploits Show the Importance of Proactive Policies for AI X-Risk

eSpencer 20 Sep 2022 17:53 UTC

1 point

0 comments 4 min read LW link

Alignment Org Cheat Sheet

Orpheus16 and Thomas Larsen

20 Sep 2022 17:36 UTC

65 points

8 comments 4 min read LW link

Doing oversight from the very start of training seems hard

peterbarnett 20 Sep 2022 17:21 UTC

14 points

3 comments 3 min read LW link

$13,000 of prizes for changing our mind about who to fund (Clearer Thinking Regrants Forecasting Tournament)

spencerg 20 Sep 2022 16:06 UTC

14 points

3 comments 1 min read LW link

(manifold.markets)

Progress links and tweets, 2022-09-20

jasoncrawford 20 Sep 2022 14:07 UTC

7 points

1 comment 1 min read LW link

(rootsofprogress.org)

[Question] If we have Human-level chatbots, won’t we end up being ruled by possible people?

Erlja Jkdf. 20 Sep 2022 13:59 UTC

5 points

13 comments 1 min read LW link

Twitter Polls: Evidence is Evidence

Zvi 20 Sep 2022 12:30 UTC

34 points

8 comments 7 min read LW link

(thezvi.wordpress.com)

Some of the most important entrepreneurship skills are tacit knowledge

Ruhul 20 Sep 2022 12:06 UTC

20 points

0 comments 7 min read LW link

Character alignment

p.b. 20 Sep 2022 8:27 UTC

24 points

0 comments 2 min read LW link

Losing the root for the tree

Adam Zerner 20 Sep 2022 4:53 UTC

509 points

31 comments 9 min read LW link 1 review

Failed Adventures in Delay

jefftk 20 Sep 2022 2:20 UTC

8 points

0 comments 2 min read LW link

(www.jefftk.com)

Gene drives: why the wait?

Metacelsus 19 Sep 2022 23:37 UTC

125 points

50 comments 3 min read LW link

(denovo.substack.com)

Prize idea: Transmit MIRI and Eliezer’s worldviews

elifland 19 Sep 2022 21:21 UTC

47 points

18 comments 2 min read LW link

Rationality Dojo Berlin Handout

UnplannedCauliflower 19 Sep 2022 20:11 UTC

19 points

0 comments 7 min read LW link

A noob goes to the SERI MATS presentations

Lowell Dennings 19 Sep 2022 17:35 UTC

27 points

0 comments 5 min read LW link

Do bamboos set themselves on fire?

Malmesbury 19 Sep 2022 15:34 UTC

173 points

14 comments 6 min read LW link 1 review

Cambridge LW Meetup: Authentic Relating Games

Tony Wang 19 Sep 2022 14:51 UTC

1 point

0 comments 1 min read LW link

PIBBSS (AI alignment) is hiring for a Project Manager

Nora_Ammann 19 Sep 2022 13:54 UTC

9 points

0 comments 1 min read LW link

Quintin’s alignment papers roundup—week 2

Quintin Pope 19 Sep 2022 13:41 UTC

67 points

2 comments 10 min read LW link

Some notes on solving hard problems

Joe Rocca 19 Sep 2022 12:58 UTC

52 points

8 comments 29 min read LW link

Safety timelines: How long will it take to solve alignment?

Esben Kran, JonathanRystroem and Steinthal

19 Sep 2022 12:53 UTC

38 points

7 comments 6 min read LW link

(forum.effectivealtruism.org)

Belgrade, Serbia—LW Meetup

игорь тимофеев 19 Sep 2022 12:47 UTC

3 points

0 comments 1 min read LW link

The ELK Framing I’ve Used

sudo 19 Sep 2022 10:28 UTC

5 points

1 comment 1 min read LW link

Quick Book Review: Crucial Conversations

Gordon Seidoh Worley 19 Sep 2022 6:25 UTC

28 points

2 comments 2 min read LW link

How my team at Lightcone sometimes gets stuff done

Bird Concept 19 Sep 2022 5:47 UTC

201 points

43 comments 7 min read LW link 1 review

EA & LW Forums Weekly Summary (12 − 18 Sep ’22)

Zoe Williams 19 Sep 2022 5:08 UTC

11 points

0 comments 13 min read LW link

Book Swap

Screwtape 19 Sep 2022 2:33 UTC

12 points

0 comments 2 min read LW link

Pretending not to Notice

jefftk 19 Sep 2022 2:30 UTC

46 points

12 comments 2 min read LW link

(www.jefftk.com)

[To Be Revised]Perhaps the Meaning of Life, An Adventure in Pluralistic Morality

NoBadCake 18 Sep 2022 22:37 UTC

−5 points

3 comments 4 min read LW link

Leveraging Legal Informatics to Align AI

John Nay 18 Sep 2022 20:39 UTC

11 points

0 comments 3 min read LW link

(forum.effectivealtruism.org)

The Inter-Agent Facet of AI Alignment

Michael Oesterle 18 Sep 2022 20:39 UTC

12 points

1 comment 5 min read LW link

Biden should be applauded for appointing Renee Wegrzyn for ARPA-H

ChristianKl 18 Sep 2022 19:57 UTC

34 points

0 comments 2 min read LW link

Summaries: Alignment Fundamentals Curriculum

Leon Lang 18 Sep 2022 13:08 UTC

44 points

3 comments 1 min read LW link

(docs.google.com)