All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 293031

A short ‘derivation’ of Watanabe’s Free Energy Formula

Wuschel Schulz29 Jan 2024 23:41 UTC

13 points

6 comments7 min readLW link

How important is AI hacking as LLMs advance?

Artem Karpov29 Jan 2024 18:41 UTC

1 point

0 comments6 min readLW link

LLM Psychometrics: A Speculative Approach to AI Safety

pskl29 Jan 2024 18:38 UTC

3 points

4 comments1 min readLW link

(pascal.cc)

[Question] How to write better?

TeaTieAndHat29 Jan 2024 17:02 UTC

8 points

24 comments1 min readLW link

Processor clock speeds are not how fast AIs think

Ege Erdil29 Jan 2024 14:39 UTC

142 points

55 comments2 min readLW link

Natural selection for ingame character build optimisation

Kongo Landwalker29 Jan 2024 11:34 UTC

8 points

5 comments2 min readLW link

Analogy Bank for AI Safety

utilistrutil29 Jan 2024 2:35 UTC

23 points

0 comments8 min readLW link

Minneapolis-St Paul ACX Article Club: Meditation and LSD

25Hour29 Jan 2024 1:24 UTC

3 points

0 comments1 min readLW link

Simple distribution approximation: When sampled 100 times, can language models yield 80% A and 20% B?

Teun van der Weij, Felix Hofstätter and Francis Rhys Ward

29 Jan 2024 0:24 UTC

39 points

5 comments4 min readLW link

Why I take short timelines seriously

NicholasKees28 Jan 2024 22:27 UTC

122 points

29 comments4 min readLW link

Win Friends and Influence People Ch. 2: The Bombshell

gull28 Jan 2024 21:40 UTC

37 points

13 comments17 min readLW link

(www.google.com)

Riga ACX February 2024 Meetup: 2023 in Review

Anastasia28 Jan 2024 21:36 UTC

4 points

0 comments1 min readLW link

Things You’re Allowed to Do: At the Dentist

rbinnn28 Jan 2024 18:39 UTC

39 points

16 comments1 min readLW link

(metavee.github.io)

[Question] What exactly did that great AI future involve again?

lemonhope28 Jan 2024 10:10 UTC

15 points

27 comments1 min readLW link

Palworld development blog post

bhauth28 Jan 2024 5:56 UTC

84 points

13 comments1 min readLW link

(note.com)

Virtually Rational—VRChat Meetup

Tomás B. and the gears to ascension

28 Jan 2024 5:52 UTC

25 points

3 comments1 min readLW link

[Stanford Daily] Table Talk

sudo28 Jan 2024 3:15 UTC

6 points

1 comment9 min readLW link

(stanforddaily.com)

AI Law-a-Thon

Iknownothing28 Jan 2024 2:30 UTC

5 points

3 comments1 min readLW link

Chapter 1 of How to Win Friends and Influence People

gull28 Jan 2024 0:32 UTC

53 points

5 comments17 min readLW link

(www.google.com)

Epistemic Hell

rogersbacon27 Jan 2024 17:13 UTC

86 points

20 comments14 min readLW link

David Burns Thinks Psychotherapy Is a Learnable Skill. Git Gud.

Morpheus27 Jan 2024 13:21 UTC

28 points

20 comments11 min readLW link

(podcast.clearerthinking.org)

Aligned AI is dual use technology

lc27 Jan 2024 6:50 UTC

58 points

31 comments2 min readLW link

Questions I’d Want to Ask an AGI+ to Test Its Understanding of Ethics

sweenesm26 Jan 2024 23:40 UTC

14 points

6 comments4 min readLW link

An Invitation to Refrain from Downvoting Posts into Net-Negative Karma

MikkW26 Jan 2024 20:13 UTC

3 points

12 comments1 min readLW link

The Good Balsamic Vinegar

jenn26 Jan 2024 19:30 UTC

52 points

4 comments2 min readLW link

(jenn.site)

The Perspective-based Explanation to the Reflective Inconsistency Paradox

dadadarren26 Jan 2024 19:00 UTC

10 points

16 comments8 min readLW link

To Boldly Code

StrivingForLegibility26 Jan 2024 18:25 UTC

26 points

4 comments3 min readLW link

Incorporating Mechanism Design Into Decision Theory

StrivingForLegibility26 Jan 2024 18:25 UTC

17 points

4 comments4 min readLW link

Making every researcher seek grants is a broken model

jasoncrawford26 Jan 2024 16:06 UTC

184 points

42 comments4 min readLW link 1 review

(rootsofprogress.org)

Notes on Innocence

David Gross26 Jan 2024 14:45 UTC

13 points

21 comments18 min readLW link

Stacked Laptop Monitor

jefftk26 Jan 2024 14:10 UTC

22 points

5 comments1 min readLW link

(www.jefftk.com)

Surgery Works Well Without The FDA

Maxwell Tabarrok26 Jan 2024 13:31 UTC

41 points

28 comments4 min readLW link

(maximumprogress.substack.com)

[Question] Workshop (hackathon, residence program, etc.) about for-profit AI Safety projects?

Roman Leventov26 Jan 2024 9:49 UTC

21 points

5 comments1 min readLW link

Without fundamental advances, misalignment and catastrophe are the default outcomes of training powerful AI

Jeremy Gillen and peterbarnett

26 Jan 2024 7:22 UTC

161 points

65 comments57 min readLW link 2 reviews

Approximately Bayesian Reasoning: Knightian Uncertainty, Goodhart, and the Look-Elsewhere Effect

RogerDearnaley26 Jan 2024 3:58 UTC

25 points

2 comments11 min readLW link

Musings on Cargo Cult Consciousness

Gareth Davidson25 Jan 2024 23:00 UTC

−13 points

11 comments17 min readLW link

RAND report finds no effect of current LLMs on viability of bioterrorism attacks

StellaAthena25 Jan 2024 19:17 UTC

94 points

14 comments1 min readLW link

(www.rand.org)

[Question] Bayesian Reflection Principles and Ignorance of the Future

crickets25 Jan 2024 19:00 UTC

5 points

3 comments1 min readLW link

“Does your paradigm beget new, good, paradigms?”

Raemon25 Jan 2024 18:23 UTC

40 points

6 comments2 min readLW link

AI #48: The Talk of Davos

Zvi25 Jan 2024 16:20 UTC

38 points

9 comments36 min readLW link

(thezvi.wordpress.com)

Importing a Python File by Name

jefftk25 Jan 2024 16:00 UTC

12 points

7 comments1 min readLW link

(www.jefftk.com)

[Repost] The Copenhagen Interpretation of Ethics

mesaoptimizer25 Jan 2024 15:20 UTC

83 points

4 comments5 min readLW link

(web.archive.org)

Nash Bargaining between Subagents doesn’t solve the Shutdown Problem

A.H.25 Jan 2024 10:47 UTC

23 points

1 comment9 min readLW link

Status-oriented spending

Adam Zerner25 Jan 2024 6:46 UTC

14 points

19 comments4 min readLW link

Protecting agent boundaries

Chris Lakin25 Jan 2024 4:13 UTC

11 points

6 comments2 min readLW link

[Question] Is a random box of gas predictable after 20 seconds?

Thomas Kwa and habryka

24 Jan 2024 23:00 UTC

38 points

35 comments1 min readLW link

[Question] Will quantum randomness affect the 2028 election?

Thomas Kwa and habryka

24 Jan 2024 22:54 UTC

66 points

52 comments1 min readLW link

AISN #30: Investments in Compute and Military AI Plus, Japan and Singapore’s National AI Safety Institutes

Dan H and Corin Katzke

24 Jan 2024 19:38 UTC

27 points

1 comment6 min readLW link

(newsletter.safe.ai)

Krueger Lab AI Safety Internship 2024

Joey Bream24 Jan 2024 19:17 UTC

3 points

0 comments1 min readLW link

Agents that act for reasons: a thought experiment

Michele Campolo24 Jan 2024 16:47 UTC

3 points

0 comments3 min readLW link