Seattle Winter Solstice

a7x

20 Dec 2023 20:30 UTC

6 points

1 comment

1 min read

How Would an Utopia-Maximizer Look Like?

Thane Ruthenis

20 Dec 2023 20:01 UTC

32 points

23 comments

10 min read

Succession

Richard_Ngo

20 Dec 2023 19:25 UTC

175 points

48 comments

11 min read

(www.narrativeark.xyz)

Metaculus Introduces Multiple Choice Questions

ChristianWilliams

20 Dec 2023 19:00 UTC

4 points

0 comments

1 min read

(www.metaculus.com)

Brighter Than Today Versions

jefftk

20 Dec 2023 18:20 UTC

16 points

2 comments

2 min read

(www.jefftk.com)

Gaia Network: a practical, incremental pathway to Open Agency Architecture

Roman Leventov and Rafael Kaufmann Nedal

20 Dec 2023 17:11 UTC

23 points

8 comments

16 min read

On the future of language models

owencb

20 Dec 2023 16:58 UTC

105 points

17 comments

36 min read

[Valence series] Appendix A: Hedonic tone / (dis)pleasure / (dis)liking

Steven Byrnes

20 Dec 2023 15:54 UTC

23 points

3 comments

13 min read

Matrix completion prize results

paulfchristiano

20 Dec 2023 15:40 UTC

45 points

0 comments

2 min read

(www.alignment.org)

[Question] What’s the minimal additive constant for Kolmogorov Complexity that a programming language can achieve?

Noosphere89

20 Dec 2023 15:36 UTC

11 points

15 comments

1 min read

Legalize butanol?

bhauth

20 Dec 2023 14:24 UTC

39 points

20 comments

5 min read

(www.bhauth.com)

A short dialogue on comparability of values

cousin_it

20 Dec 2023 14:08 UTC

28 points

7 comments

1 min read

Inside View, Outside View… And Opposing View

chaosmage

20 Dec 2023 12:35 UTC

21 points

1 comment

5 min read

Heuristics for preventing major life mistakes

SK2

20 Dec 2023 8:01 UTC

29 points

2 comments

3 min read

Escaping Skeuomorphism

Stuart Johnson

20 Dec 2023 3:51 UTC

29 points

0 comments

8 min read

Ronny and Nate discuss what sorts of minds humanity is likely to find by Machine Learning

So8res and Ronny Fernandez

19 Dec 2023 23:39 UTC

43 points

30 comments

25 min read

[Question] What are the best Siderea posts?

mike_hawke

19 Dec 2023 23:07 UTC

18 points

2 comments

1 min read

Meaning & Agency

abramdemski

19 Dec 2023 22:27 UTC

93 points

17 comments

14 min read

s/acc: Safe Accelerationism Manifesto

lorepieri

19 Dec 2023 22:19 UTC

−4 points

5 comments

2 min read

(lorenzopieri.com)

Don’t Share Information Exfohazardous on Others’ AI-Risk Models

Thane Ruthenis

19 Dec 2023 20:09 UTC

70 points

11 comments

1 min read

Paper: Tell, Don’t Show- Declarative facts influence how LLMs generalize

Owain_Evans and Alex Meinke

19 Dec 2023 19:14 UTC

45 points

4 comments

6 min read

(arxiv.org)

Interview: Applications w/ Alice Rigg

jacobhaimes

19 Dec 2023 19:03 UTC

12 points

0 comments

1 min read

(into-ai-safety.github.io)

How does a toy 2 digit subtraction transformer predict the sign of the output?

Evan Anders

19 Dec 2023 18:56 UTC

14 points

0 comments

8 min read

(evanhanders.blog)

Incremental AI Risks from Proxy-Simulations

kmenou

19 Dec 2023 18:56 UTC

2 points

0 comments

1 min read

(individual.utoronto.ca)

Goal-Completeness is like Turing-Completeness for AGI

Liron

19 Dec 2023 18:12 UTC

51 points

26 comments

3 min read

SociaLLM: proposal for a language model design for personalised apps, social science, and AI safety research

Roman Leventov

19 Dec 2023 16:49 UTC

17 points

5 comments

3 min read

Chording “The Next Right Thing”

jefftk

19 Dec 2023 15:40 UTC

11 points

0 comments

2 min read

(www.jefftk.com)

Monthly Roundup #13: December 2023

Zvi

19 Dec 2023 15:10 UTC

32 points

5 comments

26 min read

(thezvi.wordpress.com)

Effective Aspersions: How the Nonlinear Investigation Went Wrong

TracingWoodgrains

19 Dec 2023 12:00 UTC

190 points

172 comments

31 min read

2 reviews

A Universal Emergent Decomposition of Retrieval Tasks in Language Models

Alexandre Variengien and Eric Winsor

19 Dec 2023 11:52 UTC

84 points

3 comments

10 min read

(arxiv.org)

Assessment of AI safety agendas: think about the downside risk

Roman Leventov

19 Dec 2023 9:00 UTC

13 points

1 comment

1 min read

Constellations are Younger than Continents

Jeffrey Heninger

19 Dec 2023 6:12 UTC

271 points

21 comments

2 min read

The Dark Arts

lsusr and Lyrongolem

19 Dec 2023 4:41 UTC

137 points

49 comments

9 min read

When scientists consider whether their research will end the world

Harlan

19 Dec 2023 3:47 UTC

30 points

4 comments

11 min read

(blog.aiimpacts.org)

Is the far future inevitably zero sum?

Srdjan Miletic

19 Dec 2023 1:45 UTC

8 points

2 comments

2 min read

(dissent.blog)

The ‘Neglected Approaches’ Approach: AE Studio’s Alignment Agenda

Cameron Berg, Kvee, Trent Hodgeson and Marc Carauleanu

18 Dec 2023 20:35 UTC

194 points

23 comments

12 min read

1 review

The Shortest Path Between Scylla and Charybdis

Thane Ruthenis

18 Dec 2023 20:08 UTC

50 points

8 comments

5 min read

OpenAI: Preparedness framework

Zach Stein-Perlman

18 Dec 2023 18:30 UTC

70 points

23 comments

4 min read

(openai.com)

[Valence series] 5. “Valence Disorders” in Mental Health & Personality

Steven Byrnes

18 Dec 2023 15:26 UTC

46 points

13 comments

14 min read

Discussion: Challenges with Unsupervised LLM Knowledge Discovery

Seb Farquhar, Vikrant Varma, zac_kenton, gasteigerjo, Vlad Mikulik and Rohin Shah

18 Dec 2023 11:58 UTC

149 points

21 comments

10 min read

Interpreting the Learning of Deceit

RogerDearnaley

18 Dec 2023 8:12 UTC

32 points

14 comments

9 min read

Talk: “AI Would Be A Lot Less Alarming If We Understood Agents”

johnswentworth

17 Dec 2023 23:46 UTC

58 points

3 comments

1 min read

(www.youtube.com)

∀: a story

Richard_Ngo

17 Dec 2023 22:42 UTC

42 points

1 comment

8 min read

(www.narrativeark.xyz)

Reviving a 2015 MacBook

jefftk

17 Dec 2023 21:00 UTC

13 points

0 comments

1 min read

(www.jefftk.com)

A Common-Sense Case For Mutually-Misaligned AGIs Allying Against Humans

Thane Ruthenis

17 Dec 2023 20:28 UTC

29 points

7 comments

11 min read

The Limits of Artificial Consciousness: A Biology-Based Critique of Chalmers’ Fading Qualia Argument

Štěpán Los

17 Dec 2023 19:11 UTC

−6 points

9 comments

17 min read

What makes teaching math special

Viliam

17 Dec 2023 14:15 UTC

45 points

27 comments

11 min read

The predictive power of dissipative adaptation

dr_s

17 Dec 2023 14:01 UTC

59 points

16 comments

19 min read

Linkpost: Francesca v Harvard

Linch

17 Dec 2023 6:18 UTC

5 points

5 comments

2 min read

(www.francesca-v-harvard.org)

The Serendipity of Density

jefftk

17 Dec 2023 3:50 UTC

40 points

4 comments

1 min read

(www.jefftk.com)