Progress links and short notes, 2024-12-16

jasoncrawford

Dec 16, 2024, 5:24 PM

7 points

0 comments, 2 min read, LW link

(newsletter.rootsofprogress.org)

Effective Altruism FAQ

omnizoid

Dec 16, 2024, 4:27 PM

0 points

7 comments, 12 min read, LW link

Variably compressibly studies are fun

dkl9

Dec 16, 2024, 4:00 PM

0 points

0 comments, 2 min read, LW link

(dkl9.net)

AIs Will Increasingly Attempt Shenanigans

Zvi

Dec 16, 2024, 3:20 PM

114 points

2 comments, 26 min read, LW link

(thezvi.wordpress.com)

Testing which LLM architectures can do hidden serial reasoning

Filip Sondej

Dec 16, 2024, 1:48 PM

81 points

9 comments, 4 min read, LW link

NeuroAI for AI safety: A Differential Path

nz and Patrick Mineault

Dec 16, 2024, 1:17 PM

22 points

0 comments, 7 min read, LW link

(arxiv.org)

Circling as practice for “just be yourself”

Kaj_Sotala

Dec 16, 2024, 7:40 AM

86 points

5 comments, 4 min read, LW link

(kajsotala.fi)

Reanalyzing the 2023 Expert Survey on Progress in AI

AI Impacts

Dec 16, 2024, 6:10 AM

8 points

0 comments, 1 min read, LW link

(blog.aiimpacts.org)

Ideas for benchmarking LLM creativity

gwern

Dec 16, 2024, 5:18 AM

60 points

11 comments, 1 min read, LW link

(gwern.net)

Comparing the AirFanta 3Pro to the Coway AP-1512

jefftk

Dec 16, 2024, 1:40 AM

13 points

0 comments, 1 min read, LW link

(www.jefftk.com)

[Question] are IQ tests a good measure of intelligence?

KvmanThinking

Dec 15, 2024, 11:06 PM

0 points

5 comments, 1 min read, LW link

Madison Secular Solstice

svfritz

Dec 15, 2024, 9:52 PM

1 point

0 comments, 1 min read, LW link

[Question] Is AI alignment a purely functional property?

Roko

Dec 15, 2024, 9:42 PM

13 points

8 comments, 1 min read, LW link

[Question] How counterfactual are logical counterfactuals?

Donald Hobson

Dec 15, 2024, 9:16 PM

11 points

10 comments, 1 min read, LW link

Debunking the myth of safe AI

henophilia

Dec 15, 2024, 5:44 PM

−11 points

8 comments, 1 min read, LW link

(henophilia.substack.com)

Introducing Avatarism: A Rational Framework for Building actual Heaven

ratiba ro

Dec 15, 2024, 5:17 PM

2 points

2 comments, 2 min read, LW link

A Public Choice Take on Effective Altruism

vaishnav92

Dec 15, 2024, 4:58 PM

9 points

4 comments, 3 min read, LW link

(www.optimaloutliers.com)

World Models I’m Currently Building

temporary

Dec 15, 2024, 4:29 PM

5 points

1 comment, 1 min read, LW link

(samuelshadrach.com)

Dress Up For Secular Solstice

Gordon H.S.

Dec 15, 2024, 4:28 PM

33 points

13 comments, 7 min read, LW link

Remap your caps lock key

bilalchughtai

Dec 15, 2024, 2:03 PM

80 points

18 comments, 1 min read, LW link

Effective Evil’s AI Misalignment Plan

lsusr

Dec 15, 2024, 7:39 AM

83 points

9 comments, 3 min read, LW link

Write Good Enough Code, Quickly

Oliver Daniels

Dec 15, 2024, 4:45 AM

19 points

10 comments, 8 min read, LW link

How to Edit an Essay into a Solstice Speech?

Czynski

Dec 15, 2024, 4:30 AM

5 points

1 comment, 1 min read, LW link

(thepdv.wordpress.com)

How Your Physiology Affects the Mind’s Projection Fallacy

YanLyutnev

Dec 14, 2024, 9:10 PM

−1 points

0 comments, 6 min read, LW link

Introducing the Evidence Color Wheel

Larry Lee

Dec 14, 2024, 4:08 PM

6 points

0 comments, 3 min read, LW link

An Illustrated Summary of “Robust Agents Learn Causal World Model”

Dalcy

Dec 14, 2024, 3:02 PM

67 points

2 comments, 10 min read, LW link

Best-of-N Jailbreaking

John Hughes, saraprice, Aengus Lynch, Rylan Schaeffer, Fazl, Henry Sleight, Ethan Perez and mrinank_sharma

Dec 14, 2024, 4:58 AM

78 points

5 comments, 2 min read, LW link

(arxiv.org)

D&D.Sci Dungeonbuilding: the Dungeon Tournament

aphyer

Dec 14, 2024, 4:30 AM

49 points

16 comments, 3 min read, LW link

Creating Interpretable Latent Spaces with Gradient Routing

Jacob G-W

Dec 14, 2024, 4:00 AM

26 points

6 comments, 2 min read, LW link

(jacobgw.com)

Probability of death by suicide by a 26 year old

John Wiseman

Dec 14, 2024, 3:33 AM

−25 points

4 comments, 1 min read, LW link

Matryoshka Sparse Autoencoders

Noa Nabeshima

Dec 14, 2024, 2:52 AM

98 points

15 comments, 11 min read, LW link

[Question] What is MIRI currently doing?

Roko

Dec 14, 2024, 2:39 AM

32 points

14 comments, 1 min read, LW link

The o1 System Card Is Not About o1

Zvi

Dec 13, 2024, 8:30 PM

116 points

5 comments, 16 min read, LW link

(thezvi.wordpress.com)

Arch-anarchy and The Fable of the Dragon-Tyrant

Peter lawless

Dec 13, 2024, 8:15 PM

−10 points

0 comments, 1 min read, LW link

Communications in Hard Mode (My new job at MIRI)

tanagrabeast

Dec 13, 2024, 8:13 PM

206 points

25 comments, 5 min read, LW link

First Thoughts on Detachmentism

Jacob Peterson

Dec 13, 2024, 1:19 AM

−11 points

5 comments, 9 min read, LW link

How to Build Heaven: A Constrained Boltzmann Brain Generator

High Tides

Dec 13, 2024, 1:04 AM

−8 points

3 comments, 5 min read, LW link

Representing Irrationality in Game Theory

Larry Lee

Dec 13, 2024, 12:50 AM

−1 points

3 comments, 11 min read, LW link

“Charity” as a conflationary alliance term

Jan_Kulveit

Dec 12, 2024, 9:49 PM

35 points

2 comments, 5 min read, LW link

Just one more exposure bro

Chipmonk

Dec 12, 2024, 9:37 PM

52 points

6 comments, 2 min read, LW link

(chrislakin.blog)

The Dangers of Mirrored Life

Niko_McCarty and fin

Dec 12, 2024, 8:58 PM

119 points

9 comments, 29 min read, LW link

(www.asimov.press)

Effective Networking as Sending Hard to Fake Signals

vaishnav92

Dec 12, 2024, 8:32 PM

26 points

2 comments, 7 min read, LW link

(www.optimaloutliers.com)

Mini PAPR Review

jefftk

Dec 12, 2024, 7:10 PM

10 points

0 comments, 2 min read, LW link

(www.jefftk.com)

Biological risk from the mirror world

jasoncrawford

Dec 12, 2024, 7:07 PM

334 points

38 comments, 7 min read, LW link

(newsletter.rootsofprogress.org)

Naturalistic dualism

Arturo Macias

Dec 12, 2024, 4:19 PM

−4 points

0 comments, 4 min read, LW link

AI #94: Not Now, Google

Zvi

Dec 12, 2024, 3:40 PM

49 points

3 comments, 64 min read, LW link

(thezvi.wordpress.com)

Consciousness, Intelligence, and AI – Some Quick Notes [call it a mini-ramble]

Bill Benzon

Dec 12, 2024, 3:04 PM

−3 points

0 comments, 4 min read, LW link

The Dissolution of AI Safety

Roko

Dec 12, 2024, 10:34 AM

8 points

44 comments, 1 min read, LW link

(www.transhumanaxiology.com)

Is Optimization Correct?

Yoshinori Okamoto

Dec 12, 2024, 10:27 AM

−9 points

0 comments, 2 min read, LW link

AXRP Episode 38.3 - Erik Jenner on Learned Look-Ahead

DanielFilan

Dec 12, 2024, 5:40 AM

20 points

0 comments, 16 min read, LW link