All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 202120222023 2024 2025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 456 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

THE 3 WILLPOWER KEYS

GregorDeVillain4 Sep 2022 22:57 UTC

−11 points

0 comments4 min readLW link

What’s your Mission?

GregorDeVillain4 Sep 2022 18:52 UTC

−4 points

1 comment6 min readLW link

EA, Veganism and Negative Animal Utilitarianism

Yair Halberstadt4 Sep 2022 18:30 UTC

10 points

12 comments1 min readLW link

The ethics of reclining airplane seats

braces4 Sep 2022 17:59 UTC

95 points

72 comments1 min readLW link

Russian Food for Petrov Day

weft4 Sep 2022 17:57 UTC

17 points

9 comments1 min readLW link

Prototyping in C

jefftk4 Sep 2022 17:50 UTC

19 points

11 comments2 min readLW link

(www.jefftk.com)

Turn your flashcards into Art

Heye Groß4 Sep 2022 17:31 UTC

16 points

2 comments1 min readLW link

Let’s Terraform West Texas

blackstampede4 Sep 2022 16:24 UTC

88 points

33 comments5 min readLW link

[Question] Help me find a good Hackathon subject

Charbel-Raphaël4 Sep 2022 8:40 UTC

6 points

18 comments1 min readLW link

Bay Solstice 2022 Call For Volunteers

Scott Alexander4 Sep 2022 6:44 UTC

43 points

2 comments1 min readLW link

The shard theory of human values

Quintin Pope and TurnTrout

4 Sep 2022 4:28 UTC

262 points

67 comments24 min readLW link 2 reviews

Breaking Newcomb’s Problem with Non-Halting states

Slimepriestess4 Sep 2022 4:01 UTC

16 points

9 comments5 min readLW link

Monthly Shorts 8/22

Celer4 Sep 2022 2:30 UTC

3 points

0 comments7 min readLW link

(keller.substack.com)

Fully Live Electronic Contra

jefftk4 Sep 2022 1:30 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

How To Know What the AI Knows—An ELK Distillation

Fabien Roger4 Sep 2022 0:46 UTC

7 points

0 comments5 min readLW link

Private alignment research sharing and coordination

porby4 Sep 2022 0:01 UTC

63 points

13 comments5 min readLW link

AXRP Episode 18 - Concept Extrapolation with Stuart Armstrong

DanielFilan3 Sep 2022 23:12 UTC

12 points

1 comment39 min readLW link

An Update on Academia vs. Industry (one year into my faculty job)

David Scott Krueger3 Sep 2022 20:43 UTC

122 points

18 comments4 min readLW link

[Question] Request for Alignment Research Project Recommendations

Rauno Arike3 Sep 2022 15:29 UTC

10 points

2 comments1 min readLW link

Three scenarios of pseudo-alignment

Eleni Angelou3 Sep 2022 12:47 UTC

9 points

0 comments3 min readLW link

Bugs or Features?

qbolec3 Sep 2022 7:04 UTC

73 points

9 comments2 min readLW link

[Exploratory] Seperate exploratory writing from public writing

Johannes C. Mayer3 Sep 2022 2:57 UTC

6 points

2 comments1 min readLW link

We may be able to see sharp left turns coming

Ethan Perez and Neel Nanda

3 Sep 2022 2:55 UTC

54 points

29 comments1 min readLW link

[Exploratory] Exploratory Writing Info

Johannes C. Mayer3 Sep 2022 2:50 UTC

3 points

3 comments1 min readLW link

[Question] Can someone explain to me why most researchers think alignment is probably something that is humanly tractable?

iamthouthouarti3 Sep 2022 1:12 UTC

32 points

11 comments1 min readLW link

Behaviour Manifolds and the Hessian of the Total Loss—Notes and Criticism

carboniferous_umbraculum 3 Sep 2022 0:15 UTC

35 points

5 comments6 min readLW link

Sticky goals: a concrete experiment for understanding deceptive alignment

evhub2 Sep 2022 21:57 UTC

39 points

13 comments3 min readLW link

Agency engineering: is AI-alignment “to human intent” enough?

catubc2 Sep 2022 18:14 UTC

9 points

10 comments6 min readLW link

Hanover, Germany—ACX Meetups Everywhere 2022

eikowagenknecht2 Sep 2022 17:31 UTC

2 points

0 comments1 min readLW link

Laziness in AI

Richard Henage2 Sep 2022 17:04 UTC

13 points

5 comments1 min readLW link

Exporting Hangouts History

jefftk2 Sep 2022 15:00 UTC

27 points

0 comments2 min readLW link

(www.jefftk.com)

Simulators

janus2 Sep 2022 12:45 UTC

713 points

170 comments41 min readLW link 8 reviews

(generative.ink)

Levelling Up in AI Safety Research Engineering

GMM2 Sep 2022 4:59 UTC

59 points

9 comments15 min readLW link

Stop Discouraging Microwave Formula Preparation

jefftk2 Sep 2022 2:10 UTC

69 points

12 comments2 min readLW link

(www.jefftk.com)

A Richly Interactive AGI Alignment Chart

lisperati2 Sep 2022 0:44 UTC

14 points

6 comments1 min readLW link

Appendix: How to run a successful Hamming circle

CFAR!Duncan2 Sep 2022 0:22 UTC

47 points

6 comments7 min readLW link

Replacement for PONR concept

Daniel Kokotajlo2 Sep 2022 0:09 UTC

59 points

6 comments2 min readLW link

AI coordination needs clear wins

evhub1 Sep 2022 23:41 UTC

148 points

16 comments2 min readLW link 1 review

Short story speculating on possible ramifications of AI on the art world

Yitz1 Sep 2022 21:15 UTC

30 points

8 comments3 min readLW link

(archiveofourown.org)

Why was progress so slow in the past?

jasoncrawford1 Sep 2022 20:26 UTC

54 points

31 comments6 min readLW link

(rootsofprogress.org)

AI Safety and Neighboring Communities: A Quick-Start Guide, as of Summer 2022

Sam Bowman1 Sep 2022 19:15 UTC

76 points

2 comments7 min readLW link

Gradient Hacker Design Principles From Biology

johnswentworth1 Sep 2022 19:03 UTC

60 points

13 comments3 min readLW link

Book review: Put Your Ass Where Your Heart Wants to Be

Ruhul1 Sep 2022 18:21 UTC

1 point

2 comments10 min readLW link

A Survey of Foundational Methods in Inverse Reinforcement Learning

adamk1 Sep 2022 18:21 UTC

27 points

0 comments12 min readLW link

I Tripped and Became GPT! (And How This Updated My Timelines)

Frankophone1 Sep 2022 17:56 UTC

31 points

0 comments4 min readLW link

[Question] Fixed point theory (locally (α,β,ψ) dominated contractive condition)

muzammil1 Sep 2022 17:56 UTC

0 points

3 comments1 min readLW link

Alignment is hard. Communicating that, might be harder

Eleni Angelou1 Sep 2022 16:57 UTC

7 points

8 comments3 min readLW link

Covid 9/1/22: Meet the New Booster

Zvi1 Sep 2022 14:00 UTC

41 points

6 comments14 min readLW link

(thezvi.wordpress.com)

A Starter-kit for Rationality Space

Jesse Hoogland1 Sep 2022 13:04 UTC

43 points

0 comments1 min readLW link

(github.com)

Pondering the paucity of volcanic profanity post Pompeii perusal

CraigMichael1 Sep 2022 9:29 UTC

21 points

2 comments15 min readLW link