All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 131415 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Exploring toy neural nets under node removal. Section 1.

Donald Hobson13 Apr 2022 23:30 UTC

12 points

7 comments8 min readLW link

Make a Movie Showing Alignment Failures

Logan Riggs13 Apr 2022 21:54 UTC

76 points

11 comments2 min readLW link

Summary: “How to Do Research” by OSP’s Red

Pablo Repetto13 Apr 2022 19:46 UTC

9 points

0 comments3 min readLW link

(pabloernesto.github.io)

A Quick Guide to Confronting Doom

Ruby13 Apr 2022 19:30 UTC

246 points

33 comments2 min readLW link

Design, Implement and Verify

rwallace13 Apr 2022 18:14 UTC

32 points

13 comments4 min readLW link

Takeoff speeds have a huge effect on what it means to work on AI x-risk

Buck13 Apr 2022 17:38 UTC

140 points

27 comments2 min readLW link 2 reviews

Budapest Meetup

Richard Horvath13 Apr 2022 17:23 UTC

2 points

0 comments1 min readLW link

[Question] What to include in a guest lecture on existential risks from AI?

Aryeh Englander13 Apr 2022 17:03 UTC

20 points

9 comments1 min readLW link

Common Knowledge is a Circle Game for Toddlers

ryan_b13 Apr 2022 15:24 UTC

60 points

1 comment1 min readLW link

Another list of theories of impact for interpretability

Beth Barnes13 Apr 2022 13:29 UTC

33 points

1 comment5 min readLW link

The Cage of the Language

Martin Sustrik13 Apr 2022 5:20 UTC

54 points

19 comments2 min readLW link

[Question] What’s a good probability distribution family (e.g. “log-normal”) to use for AGI timelines?

David Scott Krueger (formerly: capybaralet)13 Apr 2022 4:45 UTC

9 points

11 comments1 min readLW link

How dath ilan coordinates around solving alignment

Thomas Kwa13 Apr 2022 4:22 UTC

67 points

46 comments5 min readLW link

What more compute does for brain-like models: response to Rohin

Nathan Helm-Burger13 Apr 2022 3:40 UTC

24 points

14 comments12 min readLW link

[Question] “Fragility of Value” vs. LLMs

Not Relevant13 Apr 2022 2:02 UTC

34 points

33 comments1 min readLW link

Commensurable Scientific Paradigms; or, computable induction

samshap13 Apr 2022 0:01 UTC

14 points

0 comments5 min readLW link

Convincing People of Alignment with Street Epistemology

Logan Riggs12 Apr 2022 23:43 UTC

54 points

4 comments3 min readLW link

Useful Vices for Wicked Problems

HoldenKarnofsky12 Apr 2022 19:30 UTC

86 points

2 comments17 min readLW link 1 review

(www.cold-takes.com)

SSC/ACX, San Diego, Schelling Point, Meetups Everywhere

CitizenTen12 Apr 2022 18:50 UTC

2 points

0 comments1 min readLW link

SSC/ACX San Diego Rock Climbing

CitizenTen12 Apr 2022 18:46 UTC

2 points

0 comments1 min readLW link

[Question] Does the rationalist community have a membership funnel?

Alex_Altair12 Apr 2022 18:44 UTC

38 points

17 comments1 min readLW link

A Small Negative Result on Debate

Sam Bowman12 Apr 2022 18:19 UTC

42 points

11 comments1 min readLW link

US Taxes: Adjust Withholding When Donating?

jefftk12 Apr 2022 15:50 UTC

15 points

1 comment1 min readLW link

(www.jefftk.com)

Introducing Effective Self-Help

Ben Williamson12 Apr 2022 15:01 UTC

19 points

0 comments16 min readLW link

Ukraine Post #10: Next Phase

Zvi12 Apr 2022 13:40 UTC

47 points

14 comments14 min readLW link

(thezvi.wordpress.com)

Is technical AI alignment research a net positive?

cranberry_bear12 Apr 2022 13:07 UTC

6 points

2 comments2 min readLW link

[Question] What is your advice for elder care, particularly taking care of dementia patients?

RasmusHB12 Apr 2022 11:33 UTC

4 points

6 comments1 min readLW link

Reward model hacking as a challenge for reward learning

Erik Jenner12 Apr 2022 9:39 UTC

25 points

1 comment9 min readLW link

How I use Anki: expanding the scope of SRS

CallumMcDougall12 Apr 2022 8:28 UTC

37 points

8 comments19 min readLW link

[Question] What do you think will most probably happen to our consciousness when our simulation ends?

ArtMi12 Apr 2022 8:23 UTC

1 point

5 comments1 min readLW link

Favorites & Performers

Soma12 Apr 2022 5:50 UTC

9 points

0 comments1 min readLW link

A broad basin of attraction around human values?

Wei Dai12 Apr 2022 5:15 UTC

120 points

19 comments2 min readLW link

AI governance student hackathon on Saturday, April 23: register now!

mic12 Apr 2022 4:48 UTC

14 points

0 comments1 min readLW link

The Platonist’s Dilemma: A Remix on the Prisoner’s.

James Camacho12 Apr 2022 3:49 UTC

7 points

2 comments5 min readLW link

[Question] Three questions about mesa-optimizers

Eric Neyman12 Apr 2022 2:58 UTC

26 points

5 comments3 min readLW link

The Amish

PeterMcCluskey12 Apr 2022 2:54 UTC

50 points

5 comments6 min readLW link

(www.bayesianinvestor.com)

Rationalist Should Win. Not Dying with Dignity and Funding WBE.

CitizenTen12 Apr 2022 2:14 UTC

32 points

15 comments5 min readLW link

[Question] How can I determine that Elicit is not some weak AGI’s attempt at taking over the world ?

Lucie Philippon12 Apr 2022 0:54 UTC

5 points

3 comments1 min readLW link

Summary: “How to Write Quickly...” by John Wentworth

Pablo Repetto11 Apr 2022 23:26 UTC

4 points

0 comments2 min readLW link

(pabloernesto.github.io)

Rambling thoughts on having multiple selves

cranberry_bear11 Apr 2022 22:43 UTC

15 points

1 comment3 min readLW link

An AI-in-a-box success model

azsantosk11 Apr 2022 22:28 UTC

16 points

1 comment10 min readLW link

The Regulatory Option: A response to near 0% survival odds

Matthew Lowenstein11 Apr 2022 22:00 UTC

46 points

21 comments6 min readLW link

The Efficient LessWrong Hypothesis—Stock Investing Competition

MrThink11 Apr 2022 20:43 UTC

30 points

35 comments2 min readLW link

Review: Structure and Interpretation of Computer Programs

L Rudolf L11 Apr 2022 20:27 UTC

17 points

9 comments10 min readLW link

(www.strataoftheworld.com)

[Question] Underappreciated content on LessWrong

Ege Erdil11 Apr 2022 17:40 UTC

22 points

5 comments1 min readLW link

Editing Advice for LessWrong Users

JustisMills11 Apr 2022 16:32 UTC

238 points

16 comments6 min readLW link 1 review

Post-history is written by the martyrs

Veedrac11 Apr 2022 15:45 UTC

51 points

2 comments19 min readLW link

(www.royalroad.com)

What Chords Do You Need?

jefftk11 Apr 2022 15:00 UTC

11 points

0 comments3 min readLW link

(www.jefftk.com)

What can people not smart/technical/”competent” enough for AI research/AI risk work do to reduce AI-risk/maximize AI safety? (which is most people?)

Alex K. Chen (StochasticCockatoo)11 Apr 2022 14:05 UTC

7 points

3 comments3 min readLW link

Goodhart’s Law Causal Diagrams

JustinShovelain and Jeremy Gillen

11 Apr 2022 13:52 UTC

35 points

6 comments6 min readLW link