The problem with proportional extrapolation

pathos_bot

30 Jan 2024 23:40 UTC

8 points

0 comments

1 min read

Counterfactual Mechanism Networks

StrivingForLegibility

30 Jan 2024 20:30 UTC

5 points

0 comments

5 min read

Control vs Selection: Civilisation is best at control, but navigating AGI requires selection

VojtaKovarik

30 Jan 2024 19:06 UTC

7 points

1 comment

1 min read

AI governance frames

NathanBarnard

30 Jan 2024 18:18 UTC

3 points

0 comments

3 min read

Deciding What Project/Org to Start: A Guide to Prioritization Research

Alexandra Bos

30 Jan 2024 18:13 UTC

8 points

0 comments

7 min read

on neodymium magnets

bhauth

30 Jan 2024 15:58 UTC

47 points

6 comments

4 min read

(www.bhauth.com)

[Question] Can we create self-improving AIs that perfect their own ethics?

Gabi QUENE

30 Jan 2024 14:45 UTC

1 point

10 comments

1 min read

Childhood and Education Roundup #4

Zvi

30 Jan 2024 13:50 UTC

44 points

10 comments

24 min read

(thezvi.wordpress.com)

Last call for submissions for TAIS 2024!

Blaine

30 Jan 2024 12:08 UTC

4 points

0 comments

1 min read

(tais2024.cc)

[Question] Has anyone actually changed their mind regarding Sleeping Beauty problem?

Ape in the coat

30 Jan 2024 8:34 UTC

15 points

51 comments

1 min read

San Fernando Valley Rationality: February 15, 2024

Thomas Broadley

30 Jan 2024 4:40 UTC

3 points

0 comments

1 min read

The case for more ambitious language model evals

Jozdien

30 Jan 2024 0:01 UTC

121 points

30 comments

5 min read

A short ‘derivation’ of Watanabe’s Free Energy Formula

Wuschel Schulz

29 Jan 2024 23:41 UTC

13 points

6 comments

7 min read

How important is AI hacking as LLMs advance?

Artem Karpov

29 Jan 2024 18:41 UTC

1 point

0 comments

6 min read

LLM Psychometrics: A Speculative Approach to AI Safety

pskl

29 Jan 2024 18:38 UTC

3 points

4 comments

1 min read

(pascal.cc)

[Question] How to write better?

TeaTieAndHat

29 Jan 2024 17:02 UTC

8 points

24 comments

1 min read

Processor clock speeds are not how fast AIs think

Ege Erdil

29 Jan 2024 14:39 UTC

142 points

55 comments

2 min read

Natural selection for ingame character build optimisation

Kongo Landwalker

29 Jan 2024 11:34 UTC

8 points

5 comments

2 min read

Analogy Bank for AI Safety

utilistrutil

29 Jan 2024 2:35 UTC

23 points

0 comments

8 min read

Minneapolis-St Paul ACX Article Club: Meditation and LSD

25Hour

29 Jan 2024 1:24 UTC

3 points

0 comments

1 min read

Simple distribution approximation: When sampled 100 times, can language models yield 80% A and 20% B?

Teun van der Weij, Felix Hofstätter and Francis Rhys Ward

29 Jan 2024 0:24 UTC

39 points

5 comments

4 min read

Why I take short timelines seriously

Niki Dupuis

28 Jan 2024 22:27 UTC

122 points

29 comments

4 min read

Win Friends and Influence People Ch. 2: The Bombshell

gull

28 Jan 2024 21:40 UTC

37 points

13 comments

17 min read

(www.google.com)

Riga ACX February 2024 Meetup: 2023 in Review

Anastasia

28 Jan 2024 21:36 UTC

4 points

0 comments

1 min read

Things You’re Allowed to Do: At the Dentist

rbinnn

28 Jan 2024 18:39 UTC

39 points

16 comments

1 min read

(metavee.github.io)

[Question] What exactly did that great AI future involve again?

lemonhope

28 Jan 2024 10:10 UTC

15 points

27 comments

1 min read

Palworld development blog post

bhauth

28 Jan 2024 5:56 UTC

84 points

13 comments

1 min read

(note.com)

Virtually Rational—VRChat Meetup

Tomás B. and the gears to ascension

28 Jan 2024 5:52 UTC

25 points

3 comments

1 min read

[Stanford Daily] Table Talk

sudo

28 Jan 2024 3:15 UTC

6 points

1 comment

9 min read

(stanforddaily.com)

AI Law-a-Thon

Iknownothing

28 Jan 2024 2:30 UTC

5 points

3 comments

1 min read

Chapter 1 of How to Win Friends and Influence People

gull

28 Jan 2024 0:32 UTC

53 points

5 comments

17 min read

(www.google.com)

Epistemic Hell

rogersbacon

27 Jan 2024 17:13 UTC

86 points

20 comments

14 min read

David Burns Thinks Psychotherapy Is a Learnable Skill. Git Gud.

Morpheus

27 Jan 2024 13:21 UTC

28 points

20 comments

11 min read

(podcast.clearerthinking.org)

Aligned AI is dual use technology

lc

27 Jan 2024 6:50 UTC

58 points

31 comments

2 min read

Questions I’d Want to Ask an AGI+ to Test Its Understanding of Ethics

sweenesm

26 Jan 2024 23:40 UTC

14 points

6 comments

4 min read

An Invitation to Refrain from Downvoting Posts into Net-Negative Karma

MikkW

26 Jan 2024 20:13 UTC

3 points

12 comments

1 min read

The Good Balsamic Vinegar

jenn

26 Jan 2024 19:30 UTC

52 points

4 comments

2 min read

(jenn.site)

The Perspective-based Explanation to the Reflective Inconsistency Paradox

dadadarren

26 Jan 2024 19:00 UTC

10 points

16 comments

8 min read

To Boldly Code

StrivingForLegibility

26 Jan 2024 18:25 UTC

26 points

4 comments

3 min read

Incorporating Mechanism Design Into Decision Theory

StrivingForLegibility

26 Jan 2024 18:25 UTC

17 points

4 comments

4 min read

Making every researcher seek grants is a broken model

jasoncrawford

26 Jan 2024 16:06 UTC

185 points

42 comments

4 min read

1 review

(rootsofprogress.org)

Notes on Innocence

David Gross

26 Jan 2024 14:45 UTC

13 points

21 comments

18 min read

Stacked Laptop Monitor

jefftk

26 Jan 2024 14:10 UTC

22 points

5 comments

1 min read

(www.jefftk.com)

Surgery Works Well Without The FDA

Maxwell Tabarrok

26 Jan 2024 13:31 UTC

41 points

28 comments

4 min read

(maximumprogress.substack.com)

[Question] Workshop (hackathon, residence program, etc.) about for-profit AI Safety projects?

Roman Leventov

26 Jan 2024 9:49 UTC

21 points

5 comments

1 min read

Without fundamental advances, misalignment and catastrophe are the default outcomes of training powerful AI

Jeremy Gillen and peterbarnett

26 Jan 2024 7:22 UTC

164 points

65 comments

57 min read

2 reviews

Approximately Bayesian Reasoning: Knightian Uncertainty, Goodhart, and the Look-Elsewhere Effect

RogerDearnaley

26 Jan 2024 3:58 UTC

25 points

2 comments

11 min read

Musings on Cargo Cult Consciousness

Gareth Davidson

25 Jan 2024 23:00 UTC

−13 points

11 comments

17 min read

RAND report finds no effect of current LLMs on viability of bioterrorism attacks

StellaAthena

25 Jan 2024 19:17 UTC

94 points

14 comments

1 min read

(www.rand.org)

[Question] Bayesian Reflection Principles and Ignorance of the Future

crickets

25 Jan 2024 19:00 UTC

5 points

3 comments

1 min read