All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 212223 24 25 26 27 28 29 30 31

Rock, Paper and Scissors: A Game Theory View

Edward P. Könings21 Jan 2023 21:00 UTC

18 points

3 comments4 min readLW link

(edwardknings.substack.com)

A new Heuristic to Update on the Credences of Others

aaron_mai21 Jan 2023 21:00 UTC

6 points

0 comments20 min readLW link

AI Safety “Textbook”. Test chapter. Orthogonality Thesis, Goodhart Law and Instrumental Convergency

Tapatakt and LacrimalBird

21 Jan 2023 18:13 UTC

4 points

1 comment12 min readLW link

[Linkpost] TIME article: DeepMind’s CEO Helped Take AI Mainstream. Now He’s Urging Caution

Orpheus1621 Jan 2023 16:51 UTC

58 points

2 comments3 min readLW link

(time.com)

Small Go Boards

jefftk21 Jan 2023 14:50 UTC

19 points

6 comments2 min readLW link

(www.jefftk.com)

[Question] Why are we so illogical?

Program Den21 Jan 2023 8:28 UTC

−25 points

0 comments1 min readLW link

Announcing aisafety.training

JJ Hepburn21 Jan 2023 1:01 UTC

61 points

4 comments1 min readLW link

Why real estate is the only investment that matters in AI dominated future

G20 Jan 2023 19:40 UTC

7 points

10 comments1 min readLW link

Transcript of Sam Altman’s interview touching on AI safety

Andy_McKenzie20 Jan 2023 16:14 UTC

121 points

42 comments10 min readLW link

[Question] COVID contagiousness after negative tests?

wunan20 Jan 2023 15:02 UTC

10 points

2 comments1 min readLW link

Critique of some recent philosophy of LLMs’ minds

Roman Leventov20 Jan 2023 12:53 UTC

52 points

8 comments20 min readLW link

Preface

iy3d20 Jan 2023 12:38 UTC

4 points

1 comment2 min readLW link

Lost in Innovation: The Case of Phlogiston

adamShimi20 Jan 2023 12:19 UTC

20 points

8 comments4 min readLW link

(epistemologicalvigilance.substack.com)

finite, actual infinity, potential infinity

Alok Singh20 Jan 2023 11:00 UTC

3 points

15 comments1 min readLW link

(alok.github.io)

Generalizability & Hope for AI [MLAISU W03]

Esben Kran20 Jan 2023 10:06 UTC

5 points

2 comments2 min readLW link

(newsletter.apartresearch.com)

What’s going on with ‘crunch time’?

rosehadshar20 Jan 2023 9:42 UTC

54 points

6 comments4 min readLW link

Shard theory alignment has important, often-overlooked free parameters.

Charlie Steiner20 Jan 2023 9:30 UTC

37 points

10 comments3 min readLW link

Solving For Meta-Ethics By Inducing From The Self

VisionaryHera20 Jan 2023 7:21 UTC

4 points

1 comment9 min readLW link

Vegan Nutrition Testing Project: Interim Report

Elizabeth20 Jan 2023 5:50 UTC

105 points

37 comments8 min readLW link

(acesounderglass.com)

Maybe you can learn exotic experiences via analytical thought

Q Home20 Jan 2023 1:50 UTC

2 points

6 comments15 min readLW link

The Gallery for Painting Transformations—A GPT-3 Analogy

Robert_AIZI19 Jan 2023 23:32 UTC

1 point

0 comments6 min readLW link

(aizi.substack.com)

AGI safety field building projects I’d like to see

Severin T. Seehrich19 Jan 2023 22:40 UTC

68 points

28 comments9 min readLW link

Extensionality and the univalence axiom of type theory

Thomas Kehrenberg19 Jan 2023 22:36 UTC

6 points

2 comments16 min readLW link

The spiritual benefits of material progress

jasoncrawford19 Jan 2023 21:35 UTC

24 points

15 comments7 min readLW link

(rootsofprogress.org)

Announcing Cavendish Labs

derikk and agg

19 Jan 2023 20:15 UTC

59 points

5 comments2 min readLW link

(forum.effectivealtruism.org)

Thoughts on refusing harmful requests to large language models

William_S19 Jan 2023 19:49 UTC

32 points

4 comments2 min readLW link

MA RMV Overloaded

jefftk19 Jan 2023 16:40 UTC

16 points

0 comments2 min readLW link

(www.jefftk.com)

“Heretical Thoughts on AI” by Eli Dourado

DragonGod19 Jan 2023 16:11 UTC

146 points

38 comments3 min readLW link

(www.elidourado.com)

Covid 1/19/23: Flipped Numbers

Zvi19 Jan 2023 13:30 UTC

19 points

4 comments4 min readLW link

(thezvi.wordpress.com)

List of technical AI safety exercises and projects

JakubK19 Jan 2023 9:35 UTC

41 points

5 comments1 min readLW link

(docs.google.com)

Group-level Consequences of Psychological Problems

adamShimi and Gabriel Alfour

19 Jan 2023 9:27 UTC

28 points

3 comments2 min readLW link

6-paragraph AI risk intro for MAISI

JakubK19 Jan 2023 9:22 UTC

11 points

0 comments2 min readLW link

(www.maisi.club)

200 COP in MI: Studying Learned Features in Language Models

Neel Nanda19 Jan 2023 3:48 UTC

24 points

2 comments30 min readLW link

Amazon closing AmazonSmile to focus its philanthropic giving to programs with greater impact

Gordon Seidoh Worley19 Jan 2023 1:15 UTC

10 points

8 comments2 min readLW link

Gradient Filtering

Jozdien and janus

18 Jan 2023 20:09 UTC

56 points

16 comments13 min readLW link

[Cross-post] Is the Fermi Paradox due to the Flaw of Averages?

Aryeh Englander, Lonnie Chrisman and Yaakov T

18 Jan 2023 19:22 UTC

42 points

27 comments15 min readLW link

(lumina.com)

First Three Episodes of The Filan Cabinet

DanielFilan18 Jan 2023 19:20 UTC

17 points

1 comment1 min readLW link

[Question] Best Questions To Vet Potential Ai-Safety Applicants

jacksonjezion18 Jan 2023 19:01 UTC

6 points

1 comment1 min readLW link

[Question] Looking for a specific group of people

FriggenRedChickenMan18 Jan 2023 19:00 UTC

15 points

21 comments1 min readLW link

A problem with group epistemics

Mckay Jensen18 Jan 2023 17:06 UTC

4 points

4 comments3 min readLW link

(quevivasbien.github.io)

Why you should learn sign language

Noah Topper18 Jan 2023 17:03 UTC

54 points

23 comments7 min readLW link

(naivebayes.substack.com)

Flying With Covid

jefftk18 Jan 2023 17:00 UTC

44 points

29 comments3 min readLW link

(www.jefftk.com)

Prototype of Using GPT-3 to Generate Textbook-length Content

Rafael Cosman18 Jan 2023 14:25 UTC

2 points

8 comments40 min readLW link

(github.com)

How many people are working (directly) on reducing existential risk from AI?

Benjamin Hilton18 Jan 2023 8:46 UTC

20 points

1 comment4 min readLW link

(80000hours.org)

EA & LW Forum Summaries (9th Jan to 15th Jan 23′)

Zoe Williams18 Jan 2023 7:29 UTC

17 points

0 comments13 min readLW link

OpenAI’s Alignment Plan is not S.M.A.R.T.

Søren Elverlin18 Jan 2023 6:39 UTC

9 points

19 comments4 min readLW link

[Question] Formal definition of Ontology Mismatch?

NathanBarnard18 Jan 2023 5:52 UTC

6 points

0 comments1 min readLW link

[Question] Transformer Mech Interp: Any visualizations?

Joyee Chen18 Jan 2023 4:32 UTC

3 points

0 comments1 min readLW link

Neural networks generalize because of this one weird trick

Jesse Hoogland18 Jan 2023 0:10 UTC

215 points

35 comments15 min readLW link 1 review

(www.jessehoogland.com)

Progress links and tweets, 2023-01-17

jasoncrawford17 Jan 2023 21:31 UTC

13 points

3 comments2 min readLW link

(rootsofprogress.org)