All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 2 3 4 5 6 789 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

How can I get over my fear of becoming an emulated consciousness?

James Dowdell7 Jul 2024 22:02 UTC

6 points

8 comments5 min readLW link

An Extremely Opinionated Annotated List of My Favourite Mechanistic Interpretability Papers v2

Neel Nanda7 Jul 2024 17:39 UTC

146 points

17 comments25 min readLW link 1 review

Joint mandatory donation as a way to increase the number of donations

Crazy philosopher7 Jul 2024 10:56 UTC

3 points

3 comments2 min readLW link

Rationality vs Alignment

Donatas Lučiūnas7 Jul 2024 10:12 UTC

−14 points

14 comments2 min readLW link

Beyond Biomarkers: Understanding Multiscale Causality

Matěj Nekoranec7 Jul 2024 9:56 UTC

13 points

0 comments7 min readLW link

Goodhart’s Law and Emotions

Zero Contradictions7 Jul 2024 8:32 UTC

0 points

5 comments1 min readLW link

(expandingrationality.substack.com)

Reflections on Less Online

Error7 Jul 2024 3:49 UTC

92 points

15 comments18 min readLW link

LK-99 in retrospect

bhauth7 Jul 2024 2:06 UTC

74 points

21 comments3 min readLW link

(www.bhauth.com)

Thresholding

Duncan Sabien (Inactive) and Screwtape

6 Jul 2024 18:53 UTC

48 points

8 comments2 min readLW link 1 review

(homosabiens.substack.com)

NYU Debate Training Update: Methods, Baselines, Preliminary Results

samarnesen6 Jul 2024 18:28 UTC

9 points

0 comments20 min readLW link

Scalable oversight as a quantitative rather than qualitative problem

Buck6 Jul 2024 17:42 UTC

86 points

11 comments3 min readLW link

An AI Manhattan Project is Not Inevitable

Maxwell Tabarrok6 Jul 2024 16:42 UTC

39 points

25 comments4 min readLW link

(www.maximum-progress.com)

[Linkpost] A Case for AI Consciousness

cdkg and Simon Goldstein

6 Jul 2024 14:52 UTC

22 points

2 comments1 min readLW link

(philpapers.org)

[Question] Can agents coordinate on randomness without outside sources?

Mikhail Samin6 Jul 2024 13:43 UTC

11 points

16 comments1 min readLW link

AI Alignment Research Engineer Accelerator (ARENA): Call for applicants v4.0

James Fox, Chloe Li, JamesH, Gracie Green and CallumMcDougall

6 Jul 2024 11:34 UTC

57 points

7 comments6 min readLW link

Links and brief musings for June

Kaj_Sotala6 Jul 2024 10:10 UTC

26 points

0 comments10 min readLW link

(kajsotala.fi)

Indecision and internalized authority figures

Kaj_Sotala6 Jul 2024 10:10 UTC

69 points

1 comment2 min readLW link

(kajsotala.fi)

Free Will, Determinism, And Choice

Zero Contradictions6 Jul 2024 6:34 UTC

8 points

3 comments1 min readLW link

(thewaywardaxolotl.blogspot.com)

Travel Buffer

jefftk6 Jul 2024 2:20 UTC

17 points

3 comments1 min readLW link

(www.jefftk.com)

[Question] What progress have we made on automated auditing?

LawrenceC6 Jul 2024 1:49 UTC

38 points

1 comment1 min readLW link

A “Bitter Lesson” Approach to Aligning AGI and ASI

RogerDearnaley6 Jul 2024 1:23 UTC

64 points

43 comments24 min readLW link

D&D.Sci: Whom Shall You Call?

abstractapplic5 Jul 2024 20:53 UTC

40 points

6 comments2 min readLW link

[Interim research report] Activation plateaus & sensitive directions in GPT2

StefanHex and jake_mendel

5 Jul 2024 17:05 UTC

66 points

2 comments5 min readLW link

Minimalist And Maximalist Type Systems

adamShimi5 Jul 2024 16:25 UTC

17 points

6 comments3 min readLW link

(epistemologicalfascinations.substack.com)

ML4Good Summer Bootcamps—Applications Open [deadline extended]

YM5 Jul 2024 13:59 UTC

12 points

0 comments1 min readLW link

[Question] Are there any plans to launch a paperback version of “Rationality: From AI to Zombies”?

m_arj5 Jul 2024 11:14 UTC

2 points

1 comment1 min readLW link

Doomsday Argument and the False Dilemma of Anthropic Reasoning

Ape in the coat5 Jul 2024 5:38 UTC

51 points

66 comments7 min readLW link

Finding the Wisdom to Build Safe AI

Gordon Seidoh Worley4 Jul 2024 19:04 UTC

36 points

10 comments9 min readLW link

Libs vs Frameworks, Middle-Level Regularities vs Theories

adamShimi4 Jul 2024 19:01 UTC

23 points

0 comments2 min readLW link

(epistemologicalfascinations.substack.com)

The Potential Impossibility of Subjective Death

VictorLJZ4 Jul 2024 18:17 UTC

3 points

35 comments1 min readLW link

Consider the humble rock (or: why the dumb thing kills you)

pleiotroth4 Jul 2024 13:54 UTC

79 points

12 comments4 min readLW link 1 review

AI #71: Farewell to Chevron

Zvi4 Jul 2024 13:40 UTC

53 points

9 comments36 min readLW link

(thezvi.wordpress.com)

The Dumbification of our smart screens

Itay Dreyfus4 Jul 2024 6:32 UTC

18 points

0 comments5 min readLW link

(productidentity.co)

Introduction to French AI Policy

Lucie Philippon4 Jul 2024 3:39 UTC

112 points

12 comments6 min readLW link

How predictive processing solved my wrist pain

max_shen4 Jul 2024 1:56 UTC

38 points

8 comments8 min readLW link

80,000 hours should remove OpenAI from the Job Board (and similar EA orgs should do similarly)

Raemon3 Jul 2024 20:34 UTC

274 points

71 comments3 min readLW link

Notes on Tuning Metacognition

Jo Jiao3 Jul 2024 19:54 UTC

10 points

0 comments5 min readLW link

When Are Results from Computational Complexity Not Too Coarse?

Dalcy3 Jul 2024 19:06 UTC

42 points

8 comments3 min readLW link

Musings on LLM Scale (Jul 2024)

Vladimir_Nesov3 Jul 2024 18:35 UTC

34 points

0 comments3 min readLW link

Static Analysis As A Lifestyle

adamShimi3 Jul 2024 18:29 UTC

65 points

11 comments3 min readLW link

(epistemologicalfascinations.substack.com)

AI development is an act of social revolution

artemiocobb3 Jul 2024 18:00 UTC

3 points

0 comments3 min readLW link

[Question] What percent of the sun would a Dyson Sphere cover?

Raemon3 Jul 2024 17:27 UTC

24 points

26 comments1 min readLW link

[Question] Isomorphisms don’t preserve subjective experience… right?

Terence Coelho3 Jul 2024 14:22 UTC

5 points

26 comments1 min readLW link

3C’s: A Recipe For Mathing Concepts

johnswentworth and David Lorell

3 Jul 2024 1:06 UTC

82 points

5 comments7 min readLW link

Announcing the AI Forecasting Benchmark Series | July 8, $120k in Prizes

ChristianWilliams2 Jul 2024 22:33 UTC

15 points

0 comments5 min readLW link

(www.metaculus.com)

Open Sourcing Metaculus

ChristianWilliams2 Jul 2024 22:30 UTC

44 points

0 comments2 min readLW link

(www.metaculus.com)

[Question] Why Can’t Sub-AGI Solve AI Alignment? Or: Why Would Sub-AGI AI Not be Aligned?

MrThink2 Jul 2024 20:13 UTC

4 points

23 comments1 min readLW link

[Question] Why haven’t there been assassination attempts against high profile AI accelerationists like sam altman yet?

louisTrem2 Jul 2024 18:16 UTC

−13 points

4 comments2 min readLW link

How ARENA course material gets made

CallumMcDougall2 Jul 2024 18:04 UTC

41 points

2 comments7 min readLW link

An AI Race With China Can Be Better Than Not Racing

niplav2 Jul 2024 17:57 UTC

68 points

36 comments11 min readLW link