All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

AI Governance Needs Technical Work

Mau5 Sep 2022 22:28 UTC

41 points

1 comment8 min readLW link

Overton Gymnastics: An Exercise in Discomfort

Shoshannah Tekofsky and omark

5 Sep 2022 19:20 UTC

40 points

15 comments4 min readLW link

The Good King

GregorDeVillain5 Sep 2022 19:17 UTC

−6 points

0 comments13 min readLW link

Beta Readers are Great

HoldenKarnofsky5 Sep 2022 19:10 UTC

28 points

0 comments1 min readLW link

(www.cold-takes.com)

Impact Shares For Speculative Projects

Elizabeth5 Sep 2022 18:00 UTC

30 points

8 comments7 min readLW link

(acesounderglass.com)

An unofficial “Highlights from the Sequences” tier list

Orpheus165 Sep 2022 14:07 UTC

29 points

1 comment5 min readLW link

A Game About AI Alignment (& Meta-Ethics): What Are the Must Haves?

JonathanErhardt5 Sep 2022 7:55 UTC

18 points

15 comments2 min readLW link

[Exploratory] What does it mean that an experiment is high bit?

Johannes C. Mayer5 Sep 2022 3:13 UTC

5 points

0 comments2 min readLW link

(Link) I’m Missing a Chunk of My Brain

mukashi5 Sep 2022 2:10 UTC

13 points

2 comments1 min readLW link

(www.nytimes.com)

THE 3 WILLPOWER KEYS

GregorDeVillain4 Sep 2022 22:57 UTC

−11 points

0 comments4 min readLW link

What’s your Mission?

GregorDeVillain4 Sep 2022 18:52 UTC

−4 points

1 comment6 min readLW link

EA, Veganism and Negative Animal Utilitarianism

Yair Halberstadt4 Sep 2022 18:30 UTC

10 points

12 comments1 min readLW link

The ethics of reclining airplane seats

braces4 Sep 2022 17:59 UTC

93 points

70 comments1 min readLW link

Russian Food for Petrov Day

weft4 Sep 2022 17:57 UTC

17 points

9 comments1 min readLW link

Prototyping in C

jefftk4 Sep 2022 17:50 UTC

19 points

11 comments2 min readLW link

(www.jefftk.com)

Turn your flashcards into Art

Heye Groß4 Sep 2022 17:31 UTC

16 points

2 comments1 min readLW link

Let’s Terraform West Texas

blackstampede4 Sep 2022 16:24 UTC

88 points

33 comments5 min readLW link

[Question] Help me find a good Hackathon subject

Charbel-Raphaël4 Sep 2022 8:40 UTC

6 points

18 comments1 min readLW link

Bay Solstice 2022 Call For Volunteers

Scott Alexander4 Sep 2022 6:44 UTC

43 points

2 comments1 min readLW link

The shard theory of human values

Quintin Pope and TurnTrout

4 Sep 2022 4:28 UTC

261 points

67 comments24 min readLW link 2 reviews

Breaking Newcomb’s Problem with Non-Halting states

Slimepriestess4 Sep 2022 4:01 UTC

16 points

9 comments5 min readLW link

Monthly Shorts 8/22

Celer4 Sep 2022 2:30 UTC

3 points

0 comments7 min readLW link

(keller.substack.com)

Fully Live Electronic Contra

jefftk4 Sep 2022 1:30 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

How To Know What the AI Knows—An ELK Distillation

Fabien Roger4 Sep 2022 0:46 UTC

7 points

0 comments5 min readLW link

Private alignment research sharing and coordination

porby4 Sep 2022 0:01 UTC

63 points

13 comments5 min readLW link

AXRP Episode 18 - Concept Extrapolation with Stuart Armstrong

DanielFilan3 Sep 2022 23:12 UTC

12 points

1 comment39 min readLW link

An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger (formerly: capybaralet)3 Sep 2022 20:43 UTC

122 points

18 comments4 min readLW link

[Question] Request for Alignment Research Project Recommendations

Rauno Arike3 Sep 2022 15:29 UTC

10 points

2 comments1 min readLW link

Three scenarios of pseudo-alignment

Eleni Angelou3 Sep 2022 12:47 UTC

9 points

0 comments3 min readLW link

Bugs or Features?

qbolec3 Sep 2022 7:04 UTC

73 points

9 comments2 min readLW link

[Exploratory] Seperate exploratory writing from public writing

Johannes C. Mayer3 Sep 2022 2:57 UTC

6 points

2 comments1 min readLW link

We may be able to see sharp left turns coming

Ethan Perez and Neel Nanda

3 Sep 2022 2:55 UTC

54 points

29 comments1 min readLW link

[Exploratory] Exploratory Writing Info

Johannes C. Mayer3 Sep 2022 2:50 UTC

3 points

3 comments1 min readLW link

[Question] Can someone explain to me why most researchers think alignment is probably something that is humanly tractable?

iamthouthouarti3 Sep 2022 1:12 UTC

32 points

11 comments1 min readLW link

Behaviour Manifolds and the Hessian of the Total Loss—Notes and Criticism

carboniferous_umbraculum 3 Sep 2022 0:15 UTC

35 points

5 comments6 min readLW link

Sticky goals: a concrete experiment for understanding deceptive alignment

evhub2 Sep 2022 21:57 UTC

39 points

13 comments3 min readLW link

Agency engineering: is AI-alignment “to human intent” enough?

catubc2 Sep 2022 18:14 UTC

9 points

10 comments6 min readLW link

Hanover, Germany—ACX Meetups Everywhere 2022

eikowagenknecht2 Sep 2022 17:31 UTC

2 points

0 comments1 min readLW link

Laziness in AI

Richard Henage2 Sep 2022 17:04 UTC

13 points

5 comments1 min readLW link

Exporting Hangouts History

jefftk2 Sep 2022 15:00 UTC

27 points

0 comments2 min readLW link

(www.jefftk.com)

Simulators

janus2 Sep 2022 12:45 UTC

689 points

169 comments41 min readLW link 8 reviews

(generative.ink)

Levelling Up in AI Safety Research Engineering

GMM2 Sep 2022 4:59 UTC

58 points

9 comments15 min readLW link

Stop Discouraging Microwave Formula Preparation

jefftk2 Sep 2022 2:10 UTC

68 points

12 comments2 min readLW link

(www.jefftk.com)

A Richly Interactive AGI Alignment Chart

lisperati2 Sep 2022 0:44 UTC

14 points

6 comments1 min readLW link

Appendix: How to run a successful Hamming circle

CFAR!Duncan2 Sep 2022 0:22 UTC

47 points

6 comments7 min readLW link

Replacement for PONR concept

Daniel Kokotajlo2 Sep 2022 0:09 UTC

59 points

6 comments2 min readLW link

AI coordination needs clear wins

evhub1 Sep 2022 23:41 UTC

148 points

16 comments2 min readLW link 1 review

Short story speculating on possible ramifications of AI on the art world

Yitz1 Sep 2022 21:15 UTC

30 points

8 comments3 min readLW link

(archiveofourown.org)

Why was progress so slow in the past?

jasoncrawford1 Sep 2022 20:26 UTC

54 points

31 comments6 min readLW link

(rootsofprogress.org)

AI Safety and Neighboring Communities: A Quick-Start Guide, as of Summer 2022

Sam Bowman1 Sep 2022 19:15 UTC

76 points

2 comments7 min readLW link