All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 141516 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Lying to Save Humanity

cebsuvx14 Nov 2022 23:04 UTC

−1 points

4 comments1 min readLW link

Moral contagion heuristic

Mvolz14 Nov 2022 21:17 UTC

14 points

3 comments2 min readLW link

Will we run out of ML data? Evidence from projecting dataset size trends

Pablo Villalobos14 Nov 2022 16:42 UTC

75 points

12 comments2 min readLW link

(epochai.org)

I (with the help of a few more people) am planning to create an introduction to AI Safety that a smart teenager can understand. What am I missing?

Tapatakt14 Nov 2022 16:12 UTC

3 points

5 comments1 min readLW link

Two New Newcomb Variants

eva_14 Nov 2022 14:01 UTC

26 points

24 comments3 min readLW link

Improving Emergency Vehicle Utilization

jefftk14 Nov 2022 14:00 UTC

15 points

10 comments1 min readLW link

(www.jefftk.com)

X-risk Mitigation Does Actually Require Longtermism

DragonGod14 Nov 2022 12:54 UTC

6 points

1 comment1 min readLW link

[Question] Why don’t we have self driving cars yet?

Linda Linsefors14 Nov 2022 12:19 UTC

22 points

16 comments1 min readLW link

Eigenvalues for Distance from The Buddhist Precepts And The Ten Commandments

benjamin.j.campbell14 Nov 2022 5:50 UTC

−3 points

2 comments1 min readLW link

AI Safety Microgrant Round

Chris_Leong14 Nov 2022 4:25 UTC

22 points

1 comment3 min readLW link

Estimating the probability that FTX Future Fund grant money gets clawed back

spencerg14 Nov 2022 3:33 UTC

28 points

6 comments1 min readLW link

(manifold.markets)

Rational overconfidence in the tens of billions: recent example

banev13 Nov 2022 22:48 UTC

−20 points

3 comments2 min readLW link

In Defence of Temporal Discounting in Longtermist Ethics

DragonGod13 Nov 2022 21:54 UTC

25 points

4 comments3 min readLW link

Announcing Nonlinear Emergency Funding

KatWoods13 Nov 2022 19:02 UTC

54 points

0 comments1 min readLW link

The Alignment Community Is Culturally Broken

sudo13 Nov 2022 18:53 UTC

142 points

68 comments2 min readLW link

The Futility of Status and Signalling

Ape in the coat13 Nov 2022 17:14 UTC

20 points

4 comments3 min readLW link

A short critique of Vanessa Kosoy’s PreDCA

Martín Soto13 Nov 2022 16:00 UTC

28 points

8 comments4 min readLW link

What’s the Alternative to Independence?

jefftk13 Nov 2022 15:30 UTC

50 points

3 comments1 min readLW link

(www.jefftk.com)

Decision making under model ambiguity, moral uncertainty, and other agents with free will?

Jobst Heitzig13 Nov 2022 12:50 UTC

4 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

The sky is not blue (pardon the obviousness)

banev13 Nov 2022 10:49 UTC

−13 points

6 comments1 min readLW link

Characterizing Intrinsic Compositionality in Transformers with Tree Projections

Ulisse Mini13 Nov 2022 9:46 UTC

12 points

2 comments1 min readLW link

(arxiv.org)

Noting an unsubstantiated belief about the FTX disaster

Yitz13 Nov 2022 5:37 UTC

50 points

52 comments1 min readLW link

Women and Effective Altruism

P. G. Keerthana Gopalakrishnan12 Nov 2022 20:57 UTC

−30 points

15 comments2 min readLW link

(keerthanapg.com)

A Poem for S.B.F.

AnthonyRepetto12 Nov 2022 20:41 UTC

−30 points

21 comments1 min readLW link

Musings on the appropriate targets for standards

tailcalled12 Nov 2022 20:19 UTC

11 points

13 comments1 min readLW link

Ways to buy time

Orpheus16, Olive Branch and Thomas Larsen

12 Nov 2022 19:31 UTC

34 points

23 comments12 min readLW link

User-Controlled Algorithmic Feeds

jefftk12 Nov 2022 15:20 UTC

35 points

7 comments2 min readLW link

(www.jefftk.com)

Vanessa Kosoy’s PreDCA, distilled

Martín Soto12 Nov 2022 11:38 UTC

17 points

19 comments5 min readLW link

Poster Session on AI Safety

Neil Crawford12 Nov 2022 3:50 UTC

7 points

8 comments4 min readLW link

Is AI Gain-of-Function research a thing?

MadHatter12 Nov 2022 2:33 UTC

9 points

2 comments2 min readLW link

Why don’t organizations have a CREAMO?

Shmi12 Nov 2022 2:19 UTC

0 points

8 comments1 min readLW link

“Rudeness”, a useful coordination mechanic

Raemon11 Nov 2022 22:27 UTC

52 points

20 comments2 min readLW link

Internalizing the damage of bad-acting partners creates incentives for due diligence

tailcalled11 Nov 2022 20:57 UTC

17 points

7 comments1 min readLW link

Speculation on Current Opportunities for Unusually High Impact in Global Health

johnswentworth11 Nov 2022 20:47 UTC

114 points

31 comments4 min readLW link

[Question] Is acausal extortion possible?

sisyphus11 Nov 2022 19:48 UTC

−20 points

35 comments3 min readLW link

Catharsis in Bb

jefftk11 Nov 2022 17:40 UTC

6 points

0 comments1 min readLW link

(www.jefftk.com)

Instrumental convergence is what makes general intelligence possible

tailcalled11 Nov 2022 16:38 UTC

105 points

11 comments4 min readLW link

Weekly Roundup #5

Zvi11 Nov 2022 16:20 UTC

33 points

0 comments6 min readLW link

(thezvi.wordpress.com)

Charging for the Dharma

jchan11 Nov 2022 14:02 UTC

32 points

18 comments5 min readLW link

[Question] EA (& AI Safety) has overestimated its projected funding — which decisions must be revised?

Cleo Nardo11 Nov 2022 13:50 UTC

22 points

7 comments1 min readLW link

(forum.effectivealtruism.org)

Where the logical fallacy is not (Generalization From Fictional Evidence)

banev11 Nov 2022 10:41 UTC

−12 points

14 comments1 min readLW link

Why I’m Working On Model Agnostic Interpretability

Jessica Rumbelow11 Nov 2022 9:24 UTC

27 points

9 comments2 min readLW link

How likely are malign priors over objectives? [aborted WIP]

David Johnston11 Nov 2022 5:36 UTC

−1 points

0 comments8 min readLW link

Do Timeless Decision Theorists reject all blackmail from other Timeless Decision Theorists?

myren11 Nov 2022 0:38 UTC

7 points

8 comments3 min readLW link

We must be very clear: fraud in the service of effective altruism is unacceptable

evhub10 Nov 2022 23:31 UTC

42 points

56 comments3 min readLW link

[simulation] 4chan user claiming to be the attorney hired by Google’s sentient chatbot LaMDA shares wild details of encounter

janus10 Nov 2022 21:39 UTC

19 points

1 comment13 min readLW link

(generative.ink)

divine carrot

Alok Singh10 Nov 2022 20:50 UTC

18 points

2 comments1 min readLW link

(alok.github.io)

Metaculus Announces The Million Predictions Hackathon

ChristianWilliams10 Nov 2022 20:00 UTC

7 points

0 comments1 min readLW link

(metaculus.medium.com)

The harnessing of complexity

geduardo10 Nov 2022 18:44 UTC

6 points

2 comments3 min readLW link

[Question] I there a demo of “You can’t fetch the coffee if you’re dead”?

Ram Rachum10 Nov 2022 18:41 UTC

8 points

9 comments1 min readLW link