In Defense of Attempting Hard Things, and my story of the Leverage ecosystem

Cathleen · 17 Dec 2021 23:08 UTC
115 points
43 comments · 1 min read · LW link · 2 reviews
(cathleensdiscoveries.com)

[Question] Getting diagnosed for ADHD if I don’t plan on taking meds?

vroomerify · 17 Dec 2021 19:27 UTC
6 points
6 comments · 1 min read · LW link

Venture Granters, The VCs of public goods, incentivizing good dreams

mako yass · 17 Dec 2021 8:57 UTC
12 points
9 comments · 12 min read · LW link

Understand the exponential function: R0 of the COVID

Yandong Zhang · 17 Dec 2021 6:44 UTC
−6 points
17 comments · 1 min read · LW link

Some motivations to gradient hack

peterbarnett · 17 Dec 2021 3:06 UTC
8 points
0 comments · 6 min read · LW link

Blog Respectably

lsusr · 17 Dec 2021 1:23 UTC
13 points
4 comments · 1 min read · LW link

The Case for Radical Optimism about Interpretability

Quintin Pope · 16 Dec 2021 23:38 UTC
66 points
16 comments · 8 min read · LW link · 1 review

-

Alice K · 16 Dec 2021 23:03 UTC
2 points
2 comments · 1 min read · LW link

Evidence Sets: Towards Inductive-Biases based Analysis of Prosaic AGI

bayesian_kitten · 16 Dec 2021 22:41 UTC
22 points
10 comments · 21 min read · LW link

Housing Markets, Satisficers, and One-Track Goodhart

J Bostock · 16 Dec 2021 21:38 UTC
2 points
2 comments · 2 min read · LW link

Covid 12/16: On Your Marks

Zvi · 16 Dec 2021 21:00 UTC
53 points
36 comments · 9 min read · LW link
(thezvi.wordpress.com)

Reviews of “Is power-seeking AI an existential risk?”

Joe Carlsmith · 16 Dec 2021 20:48 UTC
79 points
20 comments · 1 min read · LW link

The “Other” Option

jsteinhardt · 16 Dec 2021 20:20 UTC
24 points
1 comment · 7 min read · LW link
(bounded-regret.ghost.io)

What Caplan’s “Missing Mood” Heuristic Is Really For

DirectedEvolution · 16 Dec 2021 19:47 UTC
32 points
7 comments · 4 min read · LW link

Subway Slides

jefftk · 16 Dec 2021 19:30 UTC
11 points
2 comments · 1 min read · LW link
(www.jefftk.com)

Virulence Management

harsimony · 16 Dec 2021 19:25 UTC
4 points
0 comments · 3 min read · LW link
(harsimony.wordpress.com)

Omicron Post #7

Zvi · 16 Dec 2021 17:30 UTC
155 points
41 comments · 12 min read · LW link
(thezvi.wordpress.com)

[Question] Where can one learn deep intuitions about information theory?

Valentine · 16 Dec 2021 15:47 UTC
67 points
27 comments · 2 min read · LW link

Elicitation for Modeling Transformative AI Risks

Davidmanheim · 16 Dec 2021 15:24 UTC
30 points
2 comments · 9 min read · LW link

An Open Letter to the Monastic Academy and community members

HS2021 · 16 Dec 2021 9:04 UTC
44 points
46 comments · 1 min read · LW link

Five Missing Moods

mike_hawke · 16 Dec 2021 1:25 UTC
14 points
3 comments · 3 min read · LW link

Motivations, Natural Selection, and Curriculum Engineering

Oliver Sourbut · 16 Dec 2021 1:07 UTC
16 points
0 comments · 42 min read · LW link

Universality and the “Filter”

maggiehayes · 16 Dec 2021 0:47 UTC
10 points
2 comments · 11 min read · LW link

More power to you

jasoncrawford · 15 Dec 2021 23:50 UTC
16 points
14 comments · 1 min read · LW link
(rootsofprogress.org)

My Overview of the AI Alignment Landscape: A Bird’s Eye View

Neel Nanda · 15 Dec 2021 23:44 UTC
127 points
9 comments · 15 min read · LW link

SmartPoop 1.0: An AI Safety Science-Fiction

Lê Nguyên Hoang · 15 Dec 2021 22:28 UTC
7 points
1 comment · 1 min read · LW link

Bay Area Rationalist Field Day

Raj Thimmiah · 15 Dec 2021 19:57 UTC
7 points
1 comment · 1 min read · LW link

Framing approaches to alignment and the hard problem of AI cognition

ryan_greenblatt · 15 Dec 2021 19:06 UTC
16 points
15 comments · 27 min read · LW link

South Bay ACX/LW Pre-Holiday Get-Together

IS · 15 Dec 2021 16:58 UTC
5 points
0 comments · 1 min read · LW link

Leverage

lsusr · 15 Dec 2021 5:20 UTC
23 points
2 comments · 1 min read · LW link

We’ll Always Have Crazy

[DEACTIVATED] Duncan Sabien · 15 Dec 2021 2:55 UTC
36 points
22 comments · 13 min read · LW link

2020 Review: The Discussion Phase

Vaniver · 15 Dec 2021 1:12 UTC
55 points
14 comments · 2 min read · LW link

The Natural Abstraction Hypothesis: Implications and Evidence

CallumMcDougall · 14 Dec 2021 23:14 UTC
37 points
8 comments · 19 min read · LW link

Robin Hanson’s “Humans are Early”

Raemon · 14 Dec 2021 22:07 UTC
11 points
0 comments · 2 min read · LW link
(www.overcomingbias.com)

Ngo’s view on alignment difficulty

14 Dec 2021 21:34 UTC
63 points
7 comments · 17 min read · LW link

A proposed system for ideas jumpstart

Just Learning · 14 Dec 2021 21:01 UTC
4 points
2 comments · 3 min read · LW link

Should we rely on the speed prior for safety?

Marc Carauleanu · 14 Dec 2021 20:45 UTC
14 points
5 comments · 5 min read · LW link

ARC’s first technical report: Eliciting Latent Knowledge

14 Dec 2021 20:09 UTC
225 points
90 comments · 1 min read · LW link · 3 reviews
(docs.google.com)

ARC is hiring!

14 Dec 2021 20:09 UTC
63 points
2 comments · 1 min read · LW link

Interlude: Agents as Automobiles

Daniel Kokotajlo · 14 Dec 2021 18:49 UTC
26 points
6 comments · 5 min read · LW link

Zvi’s Thoughts on the Survival and Flourishing Fund (SFF)

Zvi · 14 Dec 2021 14:30 UTC
186 points
65 comments · 64 min read · LW link · 1 review
(thezvi.wordpress.com)

Consequentialism & corrigibility

Steven Byrnes · 14 Dec 2021 13:23 UTC
66 points
27 comments · 7 min read · LW link

Decision Theory Breakdown—Personal Attempt at a Review

Jake Arft-Guatelli · 14 Dec 2021 0:40 UTC
4 points
1 comment · 8 min read · LW link

Mystery Hunt 2022

Scott Garrabrant · 13 Dec 2021 21:57 UTC
30 points
5 comments · 1 min read · LW link

Enabling More Feedback for AI Safety Researchers

frances_lorenz · 13 Dec 2021 20:10 UTC
17 points
0 comments · 3 min read · LW link

Language Model Alignment Research Internships

Ethan Perez · 13 Dec 2021 19:53 UTC
74 points
1 comment · 1 min read · LW link

Omicron Post #6

Zvi · 13 Dec 2021 18:00 UTC
89 points
30 comments · 8 min read · LW link
(thezvi.wordpress.com)

Analysis of Bird Box (2018)

TekhneMakre · 13 Dec 2021 17:30 UTC
11 points
3 comments · 5 min read · LW link

Solving Interpretability Week

Logan Riggs · 13 Dec 2021 17:09 UTC
11 points
5 comments · 1 min read · LW link

Understanding and controlling auto-induced distributional shift

L Rudolf L · 13 Dec 2021 14:59 UTC
32 points
4 comments · 16 min read · LW link