Pascal: The Greatness and Littleness of Man, A Thinking Reed

NoBadCake 10 Sep 2022 20:05 UTC

9 points

0 comments · 1 min read · LW link

[Job] Project Manager: Community Health (CEA)

Xodarap 10 Sep 2022 18:40 UTC

3 points

0 comments · 1 min read · LW link

(www.centreforeffectivealtruism.org)

Unbounded utility functions and precommitment

MichaelStJules 10 Sep 2022 16:16 UTC

4 points

3 comments · 1 min read · LW link

[Question] What is the “Less Wrong” approved acronym for 1984-risk?

Logan Zoellner 10 Sep 2022 14:38 UTC

5 points

8 comments · 1 min read · LW link

Find out how utilitarian you are—a mega thread of philosophy polls

spencerg 10 Sep 2022 14:05 UTC

8 points

3 comments · 1 min read · LW link

(twitter.com)

Put Dirty Dishes in the Dishwasher

jefftk 10 Sep 2022 13:10 UTC

37 points

16 comments · 1 min read · LW link

(www.jefftk.com)

Quintin’s alignment papers roundup—week 1

Quintin Pope 10 Sep 2022 6:39 UTC

122 points

6 comments · 9 min read · LW link

Path dependence in ML inductive biases

Vivek Hebbar and evhub 10 Sep 2022 1:38 UTC

68 points

13 comments · 10 min read · LW link

Keeping Time in Epoch Seconds

Gordon Seidoh Worley 10 Sep 2022 0:28 UTC

11 points

2 comments · 2 min read · LW link

Ought will host a factored cognition “Lab Meeting”

jungofthewon and stuhlmueller 9 Sep 2022 23:46 UTC

35 points

1 comment · 1 min read · LW link

Web4/Heaven—The Simulation

Dunning K. 9 Sep 2022 22:58 UTC

26 points

2 comments · 1 min read · LW link

Evaluations project @ ARC is hiring a researcher and a webdev/engineer

Beth Barnes 9 Sep 2022 22:46 UTC

99 points

7 comments · 10 min read · LW link

Swap and Scale

Stephen Fowler 9 Sep 2022 22:41 UTC

17 points

3 comments · 1 min read · LW link

My emotional reaction to the current funding situation

Sam F. Brown 9 Sep 2022 22:02 UTC

108 points

36 comments · 5 min read · LW link

(sambrown.eu)

AlexaTM − 20 Billion Parameter Model With Impressive Performance

MrThink 9 Sep 2022 21:46 UTC

5 points

0 comments · 1 min read · LW link

[Fun][Link] Alignment SMBC Comic

Gunnar_Zarncke 9 Sep 2022 21:38 UTC

8 points

2 comments · 1 min read · LW link

(www.smbc-comics.com)

Gatekeeper Victory: AI Box Reflection

Double and DaemonicSigil 9 Sep 2022 21:38 UTC

7 points

6 comments · 9 min read · LW link

Interpreting Affordable Housing

jefftk 9 Sep 2022 19:40 UTC

16 points

0 comments · 1 min read · LW link

(www.jefftk.com)

London Rationalish Meetup 2022-09-11

calmiguana 9 Sep 2022 18:39 UTC

1 point

0 comments · 1 min read · LW link

AI alignment with humans… but with which humans?

geoffreymiller 9 Sep 2022 18:21 UTC

12 points

33 comments · 3 min read · LW link

[Question] Should you refrain from having children because of the risk posed by artificial intelligence?

Mientras 9 Sep 2022 17:39 UTC

18 points

31 comments · 1 min read · LW link

Notes on Resolve

David Gross 9 Sep 2022 16:42 UTC

10 points

3 comments · 31 min read · LW link

Oversight Leagues: The Training Game as a Feature

Paul Bricman 9 Sep 2022 10:08 UTC

20 points

6 comments · 10 min read · LW link

Understanding and avoiding value drift

TurnTrout 9 Sep 2022 4:16 UTC

48 points

14 comments · 6 min read · LW link

Samotsvety’s AI risk forecasts

elifland 9 Sep 2022 4:01 UTC

44 points

0 comments · 4 min read · LW link

Most People Start With The Same Few Bad Ideas

johnswentworth 9 Sep 2022 0:29 UTC

177 points

31 comments · 3 min read · LW link

Monitoring for deceptive alignment

evhub 8 Sep 2022 23:07 UTC

130 points

8 comments · 9 min read · LW link

[An email with a bunch of links I sent an experienced ML researcher interested in learning about Alignment / x-safety.]

David Scott Krueger 8 Sep 2022 22:28 UTC

47 points

1 comment · 5 min read · LW link

Progress links & tweets, 2022-09-08

jasoncrawford 8 Sep 2022 20:43 UTC

13 points

3 comments · 1 min read · LW link

(rootsofprogress.org)

Postmortem: Trying out for Manifold Markets

Milli | Martin and Austin Chen 8 Sep 2022 17:54 UTC

24 points

0 comments · 3 min read · LW link

Thoughts on AGI consciousness / sentience

Steven Byrnes 8 Sep 2022 16:40 UTC

45 points

37 comments · 6 min read · LW link

A rough idea for solving ELK: An approach for training generalist agents like GATO to make plans and describe them to humans clearly and honestly.

Michael Soareverix 8 Sep 2022 15:20 UTC

2 points

2 comments · 2 min read · LW link

What Should AI Owe To Us? Accountable and Aligned AI Systems via Contractualist AI Alignment

xuan 8 Sep 2022 15:04 UTC

27 points

16 comments · 25 min read · LW link

ACX Book Review Discussion

Screwtape 8 Sep 2022 14:22 UTC

5 points

0 comments · 1 min read · LW link

Covid 9/8/22: Booster Boosting

Zvi 8 Sep 2022 13:50 UTC

34 points

9 comments · 24 min read · LW link

(thezvi.wordpress.com)

Solar Blackout Resistance

jefftk 8 Sep 2022 13:30 UTC

69 points

32 comments · 3 min read · LW link

(www.jefftk.com)

All AGI safety questions welcome (especially basic ones) [Sept 2022]

plex 8 Sep 2022 11:56 UTC

22 points

48 comments · 3 min read · LW link

[Question] Sequences/Eliezer essays beyond those in AI to Zombies?

Domenic 8 Sep 2022 5:05 UTC

4 points

4 comments · 1 min read · LW link

Linkpost: Github Copilot productivity experiment

Daniel Kokotajlo 8 Sep 2022 4:41 UTC

88 points

4 comments · 1 min read · LW link

(github.blog)

OpenPrinciples Bootcamp (Free) -- Reflect & Act on your Rationality Principles.

ti_guo 8 Sep 2022 3:06 UTC

6 points

3 comments · 4 min read · LW link

Searching for Modularity in Large Language Models

NickyP and Stephen Fowler 8 Sep 2022 2:25 UTC

44 points

3 comments · 14 min read · LW link

90% of anything should be bad (& the precision-recall tradeoff)

cartografie 8 Sep 2022 1:20 UTC

34 points

22 comments · 6 min read · LW link

How to Do Research. v1

Pablo Repetto 8 Sep 2022 1:08 UTC

29 points

4 comments · 41 min read · LW link

(pabloernesto.github.io)

Galaxy Trucker Needs a New Second Half

jefftk 7 Sep 2022 20:10 UTC

13 points

7 comments · 1 min read · LW link

(www.jefftk.com)

[Question] In a lack of data, how should you weigh credences in theoretical physics’s Theories of Everything, or TOEs?

Noosphere89 7 Sep 2022 18:25 UTC

7 points

11 comments · 1 min read · LW link

Generators Of Disagreement With AI Alignment

George3d6 7 Sep 2022 18:15 UTC

27 points

9 comments · 9 min read · LW link

(www.epistem.ink)

Shrödinger’s lottery or: Why you are going to live forever

Chase Dowdell 7 Sep 2022 18:13 UTC

1 point

2 comments · 4 min read · LW link

Is training data going to be diluted by AI-generated content?

Hannes Thurnherr 7 Sep 2022 18:13 UTC

10 points

7 comments · 1 min read · LW link

It’s (not) how you use it

Eleni Angelou 7 Sep 2022 17:15 UTC

8 points

1 comment · 2 min read · LW link

First we shape our social graph; then it shapes us

Henrik Karlsson 7 Sep 2022 15:50 UTC

53 points

6 comments · 8 min read · LW link

(escapingflatland.substack.com)