[Question] Updates on FLI’s Value Alignment Map?

Fer32dwt34r3dfsz · 17 Sep 2022 22:27 UTC
17 points · 4 comments · 1 min read · LW link

Most sensible abstraction & feature set for a systems language?

Jasen Qin · 17 Sep 2022 19:49 UTC
−1 points · 5 comments · 10 min read · LW link

Sparse trinary weighted RNNs as a path to better language model interpretability

Am8ryllis · 17 Sep 2022 19:48 UTC
19 points · 13 comments · 3 min read · LW link

Apply for mentorship in AI Safety field-building

Akash · 17 Sep 2022 19:06 UTC
9 points · 0 comments · 1 min read · LW link
(forum.effectivealtruism.org)

Refine’s Third Blog Post Day/Week

adamShimi · 17 Sep 2022 17:03 UTC
18 points · 0 comments · 1 min read · LW link

[Closed] Prize and fast track to alignment research at ALTER

Vanessa Kosoy · 17 Sep 2022 16:58 UTC
63 points · 6 comments · 3 min read · LW link

Remote Login For Turnkey Devices?

jefftk · 17 Sep 2022 15:40 UTC
9 points · 2 comments · 2 min read · LW link
(www.jefftk.com)

Many therapy schools work with inner multiplicity (not just IFS)

17 Sep 2022 10:27 UTC
51 points · 15 comments · 18 min read · LW link

Should AI learn human values, human norms or something else?

Q Home · 17 Sep 2022 6:19 UTC
5 points · 1 comment · 4 min read · LW link

Takeaways from our robust injury classifier project [Redwood Research]

dmz · 17 Sep 2022 3:55 UTC
143 points · 12 comments · 6 min read · LW link · 1 review

[Question] Why doesn’t China (or didn’t anyone) encourage/mandate elastomeric respirators to control COVID?

Wei Dai · 17 Sep 2022 3:07 UTC
34 points · 15 comments · 1 min read · LW link

Emergency Residential Solar Jury-Rigging

jefftk · 17 Sep 2022 2:30 UTC
34 points · 0 comments · 3 min read · LW link
(www.jefftk.com)

A Bite Sized Introduction to ELK

Luk27182 · 17 Sep 2022 0:28 UTC
5 points · 0 comments · 6 min read · LW link

D&D.Sci September 2022: The Allocation Helm

abstractapplic · 16 Sep 2022 23:10 UTC
32 points · 33 comments · 1 min read · LW link

Towards a philosophy of safety

jasoncrawford · 16 Sep 2022 21:10 UTC
12 points · 2 comments · 8 min read · LW link
(rootsofprogress.org)

Refine Blogpost Day #3: The shortforms I did write

Alexander Gietelink Oldenziel · 16 Sep 2022 21:03 UTC
23 points · 0 comments · 1 min read · LW link

[Question] Why are we sure that AI will “want” something?

shminux · 16 Sep 2022 20:35 UTC
31 points · 57 comments · 1 min read · LW link

Katja Grace on Slowing Down AI, AI Expert Surveys And Estimating AI Risk

Michaël Trazzi · 16 Sep 2022 17:45 UTC
40 points · 2 comments · 3 min read · LW link
(theinsideview.ai)

Levels of goals and alignment

zeshen · 16 Sep 2022 16:44 UTC
27 points · 4 comments · 6 min read · LW link

ordering capability thresholds

Tamsin Leake · 16 Sep 2022 16:36 UTC
27 points · 0 comments · 4 min read · LW link
(carado.moe)

Representational Tethers: Tying AI Latents To Human Ones

Paul Bricman · 16 Sep 2022 14:45 UTC
30 points · 0 comments · 16 min read · LW link

I wrote a fantasy novel to promote EA: More Chapters

Timothy Underwood · 16 Sep 2022 9:47 UTC
18 points · 0 comments · 47 min read · LW link

Guidelines for Mad Entrepreneurs

David Udell · 16 Sep 2022 6:33 UTC
26 points · 0 comments · 11 min read · LW link

Affordable Housing Investment Fund

jefftk · 16 Sep 2022 2:30 UTC
18 points · 2 comments · 1 min read · LW link
(www.jefftk.com)

In a world without AI, we need gene-editing to protect Nature. (Not how you think)

Erlja Jkdf. · 16 Sep 2022 1:24 UTC
−11 points · 2 comments · 1 min read · LW link

AstralCodexTen and Rationality Meetup Organisers’ Retreat — Europe, Middle East, and Africa 2023

Sam F. Brown · 15 Sep 2022 22:38 UTC
25 points · 2 comments · 2 min read · LW link
(www.rationalitymeetups.org)

A market is a neural network

David Hugh-Jones · 15 Sep 2022 21:53 UTC
6 points · 4 comments · 8 min read · LW link

Understanding Conjecture: Notes from Connor Leahy interview

Akash · 15 Sep 2022 18:37 UTC
107 points · 23 comments · 15 min read · LW link

How should DeepMind’s Chinchilla revise our AI forecasts?

Cleo Nardo · 15 Sep 2022 17:54 UTC
35 points · 12 comments · 13 min read · LW link

Rational Animations’ Script Writing Contest

Writer · 15 Sep 2022 16:56 UTC
23 points · 1 comment · 3 min read · LW link

Covid 9/15/22: Permanent Normal

Zvi · 15 Sep 2022 16:00 UTC
32 points · 9 comments · 20 min read · LW link
(thezvi.wordpress.com)

[Question] Are Human Brains Universal?

DragonGod · 15 Sep 2022 15:15 UTC
16 points · 28 comments · 5 min read · LW link

Intelligence failures and a theory of change for forecasting

NathanBarnard · 15 Sep 2022 15:02 UTC
5 points · 0 comments · 10 min read · LW link

Why deceptive alignment matters for AGI safety

Marius Hobbhahn · 15 Sep 2022 13:38 UTC
57 points · 13 comments · 13 min read · LW link

FDT defects in a realistic Twin Prisoners’ Dilemma

Sylvester Kollin · 15 Sep 2022 8:55 UTC
37 points · 1 comment · 26 min read · LW link

[Question] What’s the longest a sentient observer could survive in the Dark Era?

Raemon · 15 Sep 2022 8:43 UTC
33 points · 15 comments · 1 min read · LW link

The Value of Not Being an Imposter

sudo · 15 Sep 2022 8:32 UTC
5 points · 0 comments · 1 min read · LW link

Capability and Agency as Cornerstones of AI risk — My current model

wilm · 15 Sep 2022 8:25 UTC
10 points · 4 comments · 12 min read · LW link

General advice for transitioning into Theoretical AI Safety

Martín Soto · 15 Sep 2022 5:23 UTC
11 points · 0 comments · 10 min read · LW link

Sequencing Intro II: Adapters

jefftk · 15 Sep 2022 3:30 UTC
12 points · 0 comments · 2 min read · LW link
(www.jefftk.com)

[Question] How do I find tutors for obscure skills/subjects (i.e. fermi estimation tutors)

joraine · 15 Sep 2022 1:15 UTC
11 points · 2 comments · 1 min read · LW link

[Question] Forecasting thread: How does AI risk level vary based on timelines?

elifland · 14 Sep 2022 23:56 UTC
34 points · 7 comments · 1 min read · LW link

Coordinate-Free Interpretability Theory

johnswentworth · 14 Sep 2022 23:33 UTC
50 points · 16 comments · 5 min read · LW link

Progress links and tweets, 2022-09-14

jasoncrawford · 14 Sep 2022 23:21 UTC
9 points · 2 comments · 1 min read · LW link
(rootsofprogress.org)

Effective altruism in the garden of ends

Tyler Alterman · 14 Sep 2022 22:02 UTC
24 points · 1 comment · 27 min read · LW link

The problem with the media presentation of “believing in AI”

Roman Leventov · 14 Sep 2022 21:05 UTC
3 points · 0 comments · 1 min read · LW link

Seeing the Schema

vitaliya · 14 Sep 2022 20:45 UTC
23 points · 6 comments · 1 min read · LW link

Responding to ‘Beyond Hyperanthropomorphism’

ukc10014 · 14 Sep 2022 20:37 UTC
8 points · 0 comments · 16 min read · LW link

When is intent alignment sufficient or necessary to reduce AGI conflict?

14 Sep 2022 19:39 UTC
40 points · 0 comments · 9 min read · LW link

When would AGIs engage in conflict?

14 Sep 2022 19:38 UTC
52 points · 5 comments · 13 min read · LW link