All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr MayJunJul Aug Sep Oct Nov Dec

All 1 234 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Adversarial training, importance sampling, and anti-adversarial training for AI whistleblowing

Buck2 Jun 2022 23:48 UTC

42 points

0 comments3 min readLW link

The prototypical catastrophic AI action is getting root access to its datacenter

Buck2 Jun 2022 23:46 UTC

189 points

13 comments2 min readLW link 1 review

Confused why a “capabilities research is good for alignment progress” position isn’t discussed more

Kaj_Sotala2 Jun 2022 21:41 UTC

132 points

27 comments4 min readLW link

Announcing a contest: EA Criticism and Red Teaming

fin2 Jun 2022 20:27 UTC

17 points

1 comment14 min readLW link

(forum.effectivealtruism.org)

Fact post: project-based learning

dominicq2 Jun 2022 20:18 UTC

12 points

4 comments3 min readLW link

The case for using the term ‘steelmanning’ instead of ‘principle of charity’

ChristianKl2 Jun 2022 19:24 UTC

26 points

7 comments3 min readLW link

Covid 6/2/22: Declining to Respond

Zvi2 Jun 2022 13:50 UTC

55 points

10 comments7 min readLW link

(thezvi.wordpress.com)

The horror of what must, yet cannot, be true

Kaj_Sotala2 Jun 2022 10:20 UTC

56 points

18 comments2 min readLW link

(kajsotala.fi)

Paradigms of AI alignment: components and enablers

Vika2 Jun 2022 6:19 UTC

54 points

4 comments8 min readLW link

The Bio Anchors Forecast

Ansh Radhakrishnan2 Jun 2022 1:32 UTC

13 points

0 comments3 min readLW link

Venue Changed ACX Montreal Meetup Jun 18 2022

E2 Jun 2022 0:43 UTC

10 points

0 comments1 min readLW link

Public beliefs vs. Private beliefs

Eli Tyre1 Jun 2022 21:33 UTC

147 points

30 comments5 min readLW link

[Question] Probability that the President would win election against a random adult citizen?

Daniel Kokotajlo1 Jun 2022 20:38 UTC

15 points

26 comments1 min readLW link

Revisiting “Why Global Poverty”

jefftk1 Jun 2022 20:20 UTC

20 points

2 comments3 min readLW link

(www.jefftk.com)

[Question] What will happen when an all-reaching AGI starts attempting to fix human character flaws?

Michael Bright1 Jun 2022 18:45 UTC

1 point

6 comments1 min readLW link

[Question] Any prior work on mutiagent dynamics for continuous distributions over agents?

Quintin Pope1 Jun 2022 18:12 UTC

15 points

2 comments1 min readLW link

[Question] Formation via nucleation of boltzmann brains

Zeruel0171 Jun 2022 18:05 UTC

0 points

9 comments1 min readLW link

Halifax Rationality / EA Coworking Day

Ideopunk and interstice

1 Jun 2022 17:47 UTC

9 points

0 comments1 min readLW link

Machines vs Memes Part 3: Imitation and Memes

ceru231 Jun 2022 13:36 UTC

7 points

0 comments7 min readLW link

Rationalism in an Age of Egregores

David Udell1 Jun 2022 7:29 UTC

14 points

11 comments2 min readLW link

Wielding civilization

dominicq1 Jun 2022 7:11 UTC

29 points

2 comments2 min readLW link

Machines vs. Memes 2: Memetically-Motivated Model Extensions

naterush31 May 2022 22:03 UTC

6 points

0 comments4 min readLW link

Machines vs Memes Part 1: AI Alignment and Memetics

Harriet Farlow31 May 2022 22:03 UTC

19 points

1 comment6 min readLW link

The Hard Intelligence Hypothesis and Its Bearing on Succession Induced Foom

DragonGod31 May 2022 19:04 UTC

10 points

7 comments4 min readLW link

Paper: Teaching GPT3 to express uncertainty in words

Owain_Evans31 May 2022 13:27 UTC

97 points

7 comments4 min readLW link

Effective Altruism Virtual Programs Jul-Aug 2022

Yve Nichols-Evans31 May 2022 12:56 UTC

1 point

0 comments1 min readLW link

[Question] What is the state of Chinese AI research?

Ratios31 May 2022 10:05 UTC

34 points

16 comments1 min readLW link

The Brain That Builds Itself

Jan31 May 2022 9:42 UTC

57 points

6 comments8 min readLW link

(universalprior.substack.com)

[Question] Is there any formal argument that climate change needs to more extreme weather events?

ChristianKl31 May 2022 9:01 UTC

8 points

8 comments1 min readLW link

Progress links and tweets, 2022-05-30

jasoncrawford30 May 2022 23:20 UTC

18 points

0 comments1 min readLW link

(rootsofprogress.org)

The Reverse Basilisk

Dunning K.30 May 2022 23:10 UTC

17 points

23 comments2 min readLW link

Deliberate Grieving

Raemon30 May 2022 20:49 UTC

191 points

16 comments9 min readLW link 2 reviews

Perform Tractable Research While Avoiding Capabilities Externalities [Pragmatic AI Safety #4]

Dan H and TW123

30 May 2022 20:25 UTC

51 points

3 comments25 min readLW link

[Question] A terrifying variant of Boltzmann’s brains problem

Zeruel01730 May 2022 20:08 UTC

5 points

12 comments4 min readLW link

Ceiling Air Purifier

jefftk30 May 2022 19:20 UTC

88 points

11 comments2 min readLW link

(www.jefftk.com)

Notion template for personal predictions

Arjun Yadav30 May 2022 17:47 UTC

1 point

0 comments1 min readLW link

Six Dimensions of Operational Adequacy in AGI Projects

Eliezer Yudkowsky30 May 2022 17:00 UTC

323 points

66 comments13 min readLW link 1 review

My SERI MATS Application

Daniel Paleka30 May 2022 2:04 UTC

16 points

0 comments8 min readLW link

Reshaping the AI Industry

Thane Ruthenis29 May 2022 22:54 UTC

148 points

35 comments21 min readLW link

The Unbearable Lightness of Web Vulnerabilities

aiiixiii29 May 2022 21:13 UTC

29 points

2 comments1 min readLW link

(www.theoreticalstructures.io)

Finding the Right Problem

tobot29 May 2022 17:52 UTC

8 points

0 comments2 min readLW link

The impact you might have working on AI safety

Fabien Roger29 May 2022 16:31 UTC

5 points

1 comment4 min readLW link

The Problem With The Current State of AGI Definitions

Yitz29 May 2022 13:58 UTC

40 points

22 comments8 min readLW link

[Question] Request for nice questions to think about while trying to sleep

oh5432129 May 2022 13:47 UTC

9 points

2 comments1 min readLW link

Will working here advance AGI? Help us not destroy the world!

Yonatan Cale29 May 2022 11:42 UTC

30 points

46 comments1 min readLW link

Passable Puppet

burmesetheater29 May 2022 11:07 UTC

6 points

1 comment3 min readLW link

Multiple AIs in boxes, evaluating each other’s alignment

Moebius31429 May 2022 8:36 UTC

8 points

0 comments14 min readLW link

[Question] How would you build Dath Ilan on earth?

Yair Halberstadt29 May 2022 7:26 UTC

35 points

29 comments1 min readLW link

Distributed Decisions

johnswentworth29 May 2022 2:43 UTC

66 points

6 comments6 min readLW link

Distilled—AGI Safety from First Principles

Harrison G29 May 2022 0:57 UTC

11 points

1 comment14 min readLW link