Page 2
Eliminating Cookie Banners is Hard · jefftk · Jan 13, 2024, 3:00 AM · 23 points · 15 comments · 3 min read · LW link (www.jefftk.com)
Introducing Alignment Stress-Testing at Anthropic · evhub · Jan 12, 2024, 11:51 PM · 182 points · 23 comments · 2 min read · LW link
D&D.Sci(-fi): Colonizing the SuperHyperSphere · abstractapplic · Jan 12, 2024, 11:36 PM · 48 points · 23 comments · 2 min read · LW link
Commonwealth Fusion Systems is the Same Scale as OpenAI · Jeffrey Heninger · Jan 12, 2024, 9:43 PM · 22 points · 13 comments · 2 min read · LW link
Throughput vs. Latency · alkjash and Ruby · Jan 12, 2024, 9:37 PM · 29 points · 2 comments · 13 min read · LW link
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training · evhub, Carson Denison, Meg, Monte M, David Duvenaud, Nicholas Schiefer and Ethan Perez · Jan 12, 2024, 7:51 PM · 305 points · 95 comments · 3 min read · LW link (arxiv.org)
METAPHILOSOPHY—A Philosophizing through logical consequences · Seremonia · Jan 12, 2024, 6:47 PM · −7 points · 7 comments · 1 min read · LW link
Idealism, Realistic & Pragmatic · Seremonia · Jan 12, 2024, 6:16 PM · −7 points · 3 comments · 1 min read · LW link
The existential threat of humans. · Spiritus Dei · Jan 12, 2024, 5:50 PM · −24 points · 0 comments · 3 min read · LW link
[Question] Concrete examples of doing agentic things? · Jacob G-W · Jan 12, 2024, 3:59 PM · 13 points · 10 comments · 1 min read · LW link
Land Reclamation is in the 9th Circle of Stagnation Hell · Maxwell Tabarrok · Jan 12, 2024, 1:36 PM · 54 points · 6 comments · 2 min read · LW link (maximumprogress.substack.com)
What good is G-factor if you’re dumped in the woods? A field report from a camp counselor. · Hastings · Jan 12, 2024, 1:17 PM · 149 points · 22 comments · 1 min read · LW link
A Chinese Room Containing a Stack of Stochastic Parrots · RogerDearnaley · Jan 12, 2024, 6:29 AM · 20 points · 3 comments · 5 min read · LW link
Decent plan prize announcement (1 paragraph, $1k) · lemonhope · Jan 12, 2024, 6:27 AM · 25 points · 19 comments · 1 min read · LW link
introduction to solid oxide electrolytes · bhauth · Jan 12, 2024, 5:35 AM · 17 points · 0 comments · 4 min read · LW link (www.bhauth.com)
Apply to the 2024 PIBBSS Summer Research Fellowship · Nora_Ammann, DusanDNesic and Lucas Teixeira · Jan 12, 2024, 4:06 AM · 39 points · 1 comment · 2 min read · LW link
A Benchmark for Decision Theories · StrivingForLegibility · Jan 11, 2024, 6:54 PM · 10 points · 0 comments · 2 min read · LW link
An even deeper atheism · Joe Carlsmith · Jan 11, 2024, 5:28 PM · 125 points · 47 comments · 15 min read · LW link
Motivating Alignment of LLM-Powered Agents: Easy for AGI, Hard for ASI? · RogerDearnaley · Jan 11, 2024, 12:56 PM · 35 points · 4 comments · 39 min read · LW link
Reprograming the Mind: Meditation as a Tool for Cognitive Optimization · Jonas Hallgren · Jan 11, 2024, 12:03 PM · 32 points · 3 comments · 11 min read · LW link
AI-Generated Music for Learning · nomagicpill · Jan 11, 2024, 4:11 AM · 9 points · 1 comment · 1 min read · LW link (210ethan.github.io)
Introduce a Speed Maximum · jefftk · Jan 11, 2024, 2:50 AM · 36 points · 28 comments · 2 min read · LW link (www.jefftk.com)
[Question] Prediction markets are consistently underconfident. Why? · Sinclair Chen · Jan 11, 2024, 2:44 AM · 11 points · 4 comments · 1 min read · LW link
Trying to align humans with inclusive genetic fitness · peterbarnett · Jan 11, 2024, 12:13 AM · 23 points · 5 comments · 10 min read · LW link
Universal Love Integration Test: Hitler · Raemon · Jan 10, 2024, 11:55 PM · 76 points · 65 comments · 9 min read · LW link
The Perceptron Controversy · Yuxi_Liu · Jan 10, 2024, 11:07 PM · 65 points · 18 comments · 1 min read · LW link (yuxi-liu-wired.github.io)
The Aspiring Rationalist Congregation · maia · Jan 10, 2024, 10:52 PM · 86 points · 23 comments · 10 min read · LW link
An Actually Intuitive Explanation of the Oberth Effect · Isaac King · Jan 10, 2024, 8:23 PM · 63 points · 37 comments · 6 min read · LW link
Beware the suboptimal routine · jwfiredragon · Jan 10, 2024, 7:02 PM · 13 points · 3 comments · 3 min read · LW link
The true cost of fences · pleiotroth · Jan 10, 2024, 7:01 PM · 3 points · 2 comments · 4 min read · LW link
“Dark Constitution” for constraining some superintelligences · Valentine · Jan 10, 2024, 4:02 PM · 3 points · 9 comments · 1 min read · LW link (www.anarchonomicon.com)
[Question] rabbit (a new AI company) and Large Action Model (LAM) · MiguelDev · Jan 10, 2024, 1:57 PM · 17 points · 3 comments · 1 min read · LW link
Saving the world sucks · Defective Altruism · Jan 10, 2024, 5:55 AM · 50 points · 29 comments · 3 min read · LW link
[Question] Questions about Solomonoff induction · mukashi · Jan 10, 2024, 1:16 AM · 7 points · 11 comments · 1 min read · LW link
AI as a natural disaster · Neil · Jan 10, 2024, 12:42 AM · 11 points · 1 comment · 7 min read · LW link
Stop being surprised by the passage of time · duck_master and 00aleae · Jan 10, 2024, 12:36 AM · −2 points · 1 comment · 3 min read · LW link
A discussion of normative ethics · Gordon Seidoh Worley and Adam Zerner · Jan 9, 2024, 11:29 PM · 10 points · 6 comments · 25 min read · LW link
On the Contrary, Steelmanning Is Normal; ITT-Passing Is Niche · Zack_M_Davis · Jan 9, 2024, 11:12 PM · 45 points · 31 comments · 4 min read · LW link
[Question] What’s the protocol for if a novice has ML ideas that are unlikely to work, but might improve capabilities if they do work? · drocta · Jan 9, 2024, 10:51 PM · 6 points · 2 comments · 2 min read · LW link
Goodbye, Shoggoth: The Stage, its Animatronics, & the Puppeteer – a New Metaphor · RogerDearnaley · Jan 9, 2024, 8:42 PM · 48 points · 8 comments · 36 min read · LW link
Bent or Blunt Hoods? · jefftk · Jan 9, 2024, 8:10 PM · 23 points · 0 comments · 1 min read · LW link (www.jefftk.com)
2024 ACX Predictions: Blind/Buy/Sell/Hold · Zvi · Jan 9, 2024, 7:30 PM · 33 points · 2 comments · 31 min read · LW link (thezvi.wordpress.com)
Announcing the Double Crux Bot · sanyer, Sofia Vanhanen and sarah.bluhm · Jan 9, 2024, 6:54 PM UTC · 53 points · 10 comments · 3 min read · LW link
Does AI risk “other” the AIs? · Joe Carlsmith · Jan 9, 2024, 5:51 PM UTC · 60 points · 3 comments · 8 min read · LW link
AI demands unprecedented reliability · Jono · Jan 9, 2024, 4:30 PM UTC · 22 points · 5 comments · 2 min read · LW link
Uncertainty in all its flavours · Cleo Nardo · Jan 9, 2024, 4:21 PM UTC · 34 points · 6 comments · 35 min read · LW link
Compensating for Life Biases · Jonathan Moregård · Jan 9, 2024, 2:39 PM UTC · 24 points · 6 comments · 3 min read · LW link (honestliving.substack.com)
Can Morality Be Quantified? · Julius · Jan 9, 2024, 6:35 AM UTC · 3 points · 0 comments · 5 min read · LW link
Learning Math in Time for Alignment · Nicholas / Heather Kross · Jan 9, 2024, 1:02 AM UTC · 32 points · 5 comments · 3 min read · LW link
Brief Thoughts on Justifications for Paternalism · Srdjan Miletic · Jan 9, 2024, 12:36 AM UTC · 4 points · 0 comments · 4 min read · LW link (dissent.blog)