All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29

Leading The Parade

johnswentworth31 Jan 2024 22:39 UTC

150 points

32 comments9 min readLW link 1 review

Proposal for an AI Safety Prize

sweenesm31 Jan 2024 18:35 UTC

3 points

0 comments2 min readLW link

Literally Everything is Infinite

Spiral31 Jan 2024 18:31 UTC

−9 points

8 comments5 min readLW link

What fuels your ambition?

Cissy31 Jan 2024 18:30 UTC

29 points

1 comment5 min readLW link

(www.moremyself.xyz)

“Genlangs” and Zipf’s Law: Do languages generated by ChatGPT statistically look human?

Justin-Diamond31 Jan 2024 18:30 UTC

2 points

2 comments1 min readLW link

(arxiv.org)

AI, Intellectual Property, and the Techno-Optimist Revolution

Justin-Diamond31 Jan 2024 18:30 UTC

1 point

0 comments1 min readLW link

(www.researchgate.net)

My Alignment “Plan”: Avoid Strong Optimisation and Align Economy

VojtaKovarik31 Jan 2024 17:03 UTC

24 points

9 comments7 min readLW link

Per protocol analysis as medical malpractice

braces31 Jan 2024 16:22 UTC

57 points

11 comments1 min readLW link

Adam Smith Meets AI Doomers

James_Miller31 Jan 2024 15:53 UTC

35 points

10 comments5 min readLW link

Ten Modes of Culture War Discourse

jchan31 Jan 2024 13:58 UTC

59 points

16 comments15 min readLW link

Without Fundamental Advances, Rebellion and Coup d’État are the Inevitable Outcomes of Dictators & Monarchs Trying to Control Large, Capable Countries

Roko31 Jan 2024 10:14 UTC

27 points

34 comments1 min readLW link

Explaining Impact Markets

Saul Munn31 Jan 2024 9:51 UTC

95 points

2 comments3 min readLW link

(www.brasstacks.blog)

Exploring OpenAI’s Latent Directions: Tests, Observations, and Poking Around

Johnny Lin31 Jan 2024 6:01 UTC

26 points

4 comments14 min readLW link

Clip keys together with tiny carabiners

Brendan Long31 Jan 2024 4:26 UTC

11 points

5 comments1 min readLW link

(www.brendanlong.com)

The problem with proportional extrapolation

pathos_bot30 Jan 2024 23:40 UTC

8 points

0 comments1 min readLW link

Counterfactual Mechanism Networks

StrivingForLegibility30 Jan 2024 20:30 UTC

5 points

0 comments5 min readLW link

Control vs Selection: Civilisation is best at control, but navigating AGI requires selection

VojtaKovarik30 Jan 2024 19:06 UTC

7 points

1 comment1 min readLW link

AI governance frames

NathanBarnard30 Jan 2024 18:18 UTC

3 points

0 comments3 min readLW link

Deciding What Project/Org to Start: A Guide to Prioritization Research

Alexandra Bos30 Jan 2024 18:13 UTC

8 points

0 comments7 min readLW link

on neodymium magnets

bhauth30 Jan 2024 15:58 UTC

47 points

6 comments4 min readLW link

(www.bhauth.com)

[Question] Can we create self-improving AIs that perfect their own ethics?

Gabi QUENE30 Jan 2024 14:45 UTC

1 point

10 comments1 min readLW link

Childhood and Education Roundup #4

Zvi30 Jan 2024 13:50 UTC

44 points

10 comments24 min readLW link

(thezvi.wordpress.com)

Last call for submissions for TAIS 2024!

Blaine30 Jan 2024 12:08 UTC

4 points

0 comments1 min readLW link

(tais2024.cc)

[Question] Has anyone actually changed their mind regarding Sleeping Beauty problem?

Ape in the coat30 Jan 2024 8:34 UTC

15 points

50 comments1 min readLW link

San Fernando Valley Rationality: February 15, 2024

Thomas Broadley30 Jan 2024 4:40 UTC

3 points

0 comments1 min readLW link

The case for more ambitious language model evals

Jozdien30 Jan 2024 0:01 UTC

119 points

30 comments5 min readLW link

A short ‘derivation’ of Watanabe’s Free Energy Formula

Wuschel Schulz29 Jan 2024 23:41 UTC

13 points

6 comments7 min readLW link

How important is AI hacking as LLMs advance?

Artem Karpov29 Jan 2024 18:41 UTC

1 point

0 comments6 min readLW link

LLM Psychometrics: A Speculative Approach to AI Safety

pskl29 Jan 2024 18:38 UTC

3 points

4 comments1 min readLW link

(pascal.cc)

[Question] How to write better?

TeaTieAndHat29 Jan 2024 17:02 UTC

8 points

24 comments1 min readLW link

Processor clock speeds are not how fast AIs think

Ege Erdil29 Jan 2024 14:39 UTC

142 points

55 comments2 min readLW link

Natural selection for ingame character build optimisation

Kongo Landwalker29 Jan 2024 11:34 UTC

8 points

5 comments2 min readLW link

Analogy Bank for AI Safety

utilistrutil29 Jan 2024 2:35 UTC

23 points

0 comments8 min readLW link

Minneapolis-St Paul ACX Article Club: Meditation and LSD

25Hour29 Jan 2024 1:24 UTC

3 points

0 comments1 min readLW link

Simple distribution approximation: When sampled 100 times, can language models yield 80% A and 20% B?

Teun van der Weij, Felix Hofstätter and Francis Rhys Ward

29 Jan 2024 0:24 UTC

39 points

5 comments4 min readLW link

Why I take short timelines seriously

NicholasKees28 Jan 2024 22:27 UTC

122 points

29 comments4 min readLW link

Win Friends and Influence People Ch. 2: The Bombshell

gull28 Jan 2024 21:40 UTC

37 points

13 comments17 min readLW link

(www.google.com)

Riga ACX February 2024 Meetup: 2023 in Review

Anastasia28 Jan 2024 21:36 UTC

4 points

0 comments1 min readLW link

Things You’re Allowed to Do: At the Dentist

rbinnn28 Jan 2024 18:39 UTC

39 points

16 comments1 min readLW link

(metavee.github.io)

[Question] What exactly did that great AI future involve again?

lemonhope28 Jan 2024 10:10 UTC

15 points

27 comments1 min readLW link

Palworld development blog post

bhauth28 Jan 2024 5:56 UTC

84 points

13 comments1 min readLW link

(note.com)

Virtually Rational—VRChat Meetup

Tomás B. and the gears to ascension

28 Jan 2024 5:52 UTC

25 points

3 comments1 min readLW link

[Stanford Daily] Table Talk

sudo28 Jan 2024 3:15 UTC

6 points

1 comment9 min readLW link

(stanforddaily.com)

AI Law-a-Thon

Iknownothing28 Jan 2024 2:30 UTC

5 points

3 comments1 min readLW link

Chapter 1 of How to Win Friends and Influence People

gull28 Jan 2024 0:32 UTC

53 points

5 comments17 min readLW link

(www.google.com)

Epistemic Hell

rogersbacon27 Jan 2024 17:13 UTC

86 points

20 comments14 min readLW link

David Burns Thinks Psychotherapy Is a Learnable Skill. Git Gud.

Morpheus27 Jan 2024 13:21 UTC

28 points

20 comments11 min readLW link

(podcast.clearerthinking.org)

Aligned AI is dual use technology

lc27 Jan 2024 6:50 UTC

58 points

31 comments2 min readLW link

Questions I’d Want to Ask an AGI+ to Test Its Understanding of Ethics

sweenesm26 Jan 2024 23:40 UTC

14 points

6 comments4 min readLW link

An Invitation to Refrain from Downvoting Posts into Net-Negative Karma

MikkW26 Jan 2024 20:13 UTC

3 points

12 comments1 min readLW link