All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Rationality Research Report: Towards 10x OODA Looping?

RaemonFeb 24, 2024, 9:06 PM

117 points

25 comments15 min readLW link

Let’s ask some of the largest LLMs for tips and ideas on how to take over the world

Super AGIFeb 24, 2024, 8:35 PM

1 point

0 comments7 min readLW link

Exercise: Planmaking, Surprise Anticipation, and “Baba is You”

RaemonFeb 24, 2024, 8:33 PM

67 points

31 comments6 min readLW link

In search of God.

Spiritus DeiFeb 24, 2024, 6:59 PM

−19 points

3 comments7 min readLW link

Impossibility of Anthropocentric-Alignment

False NameFeb 24, 2024, 6:31 PM

−8 points

2 comments39 min readLW link

The Inner Alignment Problem

Jakub HalmešFeb 24, 2024, 5:55 PM

1 point

1 comment3 min readLW link

(jakubhalmes.substack.com)

We Need Major, But Not Radical, FDA Reform

Maxwell TabarrokFeb 24, 2024, 4:54 PM

42 points

12 comments7 min readLW link

(www.maximum-progress.com)

After Overmorrow: Scattered Musings on the Immediate Post-AGI World

Yuli_BanFeb 24, 2024, 3:49 PM

−3 points

0 comments26 min readLW link

[Question] CDT vs. EDT on Deterrence

Terence CoelhoFeb 24, 2024, 3:41 PM

1 point

9 comments1 min readLW link

Balancing Games

jefftkFeb 24, 2024, 2:40 PM

62 points

18 comments1 min readLW link

(www.jefftk.com)

How well do truth probes generalise?

mishajwFeb 24, 2024, 2:12 PM

93 points

11 comments9 min readLW link

Rawls’s Veil of Ignorance Doesn’t Make Any Sense

Arjun PanicksseryFeb 24, 2024, 1:18 PM

9 points

9 comments1 min readLW link

[Question] Can someone explain to me what went wrong with ChatGPT?

Valentin BaltadzhievFeb 24, 2024, 11:50 AM

9 points

1 comment1 min readLW link

The Sense Of Physical Necessity: A Naturalism Demo (Introduction)

LoganStrohlFeb 24, 2024, 2:56 AM

59 points

1 comment6 min readLW link

Instrumental deception and manipulation in LLMs—a case study

Olli JärviniemiFeb 24, 2024, 2:07 AM

39 points

13 comments12 min readLW link

A starting point for making sense of task structure (in machine learning)

Kaarel, RP and jake_mendel

Feb 24, 2024, 1:51 AM

45 points

2 comments12 min readLW link

Why you, personally, should want a larger human population

jasoncrawfordFeb 23, 2024, 7:48 PM

32 points

32 comments5 min readLW link

(rootsofprogress.org)

Deliberative Cognitive Algorithms as Scaffolding

Cole WyethFeb 23, 2024, 5:15 PM

20 points

4 comments3 min readLW link

The Shutdown Problem: Incomplete Preferences as a Solution

EJTFeb 23, 2024, 4:01 PM

53 points

33 comments42 min readLW link

In set theory, everything is a set

Jacob G-WFeb 23, 2024, 2:35 PM

11 points

9 comments2 min readLW link

The role of philosophical thinking in understanding large language models: Calibrating and closing the gap between first-person experience and underlying mechanisms

Bill BenzonFeb 23, 2024, 12:19 PM

4 points

0 comments10 min readLW link

Deep and obvious points in the gap between your thoughts and your pictures of thought

KatjaGraceFeb 23, 2024, 7:30 AM

42 points

6 comments1 min readLW link

(worldspiritsockpuppet.com)

Parasocial relationship logic

KatjaGraceFeb 23, 2024, 7:30 AM

20 points

1 comment1 min readLW link

(worldspiritsockpuppet.com)

Shaming with and without naming

KatjaGraceFeb 23, 2024, 7:30 AM

17 points

5 comments2 min readLW link

(worldspiritsockpuppet.com)

Complexity of value but not disvalue implies more focus on s-risk. Moral uncertainty and preference utilitarianism also do.

Chi NguyenFeb 23, 2024, 6:10 AM

52 points

18 comments LW link

[Question] Does increasing the power of a multimodal LLM get you an agentic AI?

yanni kyriacosFeb 23, 2024, 4:14 AM

3 points

3 comments1 min readLW link

The natural boundaries between people

ChipmonkFeb 23, 2024, 1:09 AM

23 points

2 comments8 min readLW link

(chipmonk.substack.com)

Contra Ngo et al. “Every ‘Every Bay Area House Party’ Bay Area House Party”

Ricki HeicklenFeb 22, 2024, 11:56 PM

186 points

5 comments4 min readLW link

(bayesshammai.substack.com)

AI #52: Oops

ZviFeb 22, 2024, 9:50 PM

50 points

9 comments29 min readLW link

(thezvi.wordpress.com)

Embed your second brain in your first brain

dkl9Feb 22, 2024, 9:46 PM

10 points

3 comments1 min readLW link

(dkl9.net)

The Gemini Incident

ZviFeb 22, 2024, 9:00 PM

80 points

19 comments18 min readLW link

(thezvi.wordpress.com)

Some Thoughts On Using Auctions For Land Valuation

harsimonyFeb 22, 2024, 7:54 PM

0 points

9 comments9 min readLW link

(progressandpoverty.substack.com)

The Binding of Isaac & Transparent Newcomb’s Problem

suvjectibityFeb 22, 2024, 6:56 PM

−11 points

0 comments10 min readLW link

Language Models Don’t Learn the Physical Manifestation of Language

Bruce W. Lee and Jaehyuk Lim

Feb 22, 2024, 6:52 PM

39 points

23 comments1 min readLW link

(arxiv.org)

Sora What

ZviFeb 22, 2024, 6:10 PM

47 points

3 comments9 min readLW link

(thezvi.wordpress.com)

Do sparse autoencoders find “true features”?

Demian TillFeb 22, 2024, 6:06 PM

74 points

33 comments11 min readLW link

Everything Wrong with Roko’s Claims about an Engineered Pandemic

WitheringWeightsFeb 22, 2024, 3:59 PM

96 points

10 comments16 min readLW link

The One and a Half Gemini

ZviFeb 22, 2024, 1:10 PM

73 points

4 comments8 min readLW link

(thezvi.wordpress.com)

[Question] How do I make predictions about the future to make sense of what to do with my life?

Raj ThimmiahFeb 22, 2024, 11:22 AM

8 points

1 comment1 min readLW link

How are voluntary commitments on vulnerability reporting going?

Adam JonesFeb 22, 2024, 8:43 AM

23 points

1 comment1 min readLW link

(adamjones.me)

Notes on Internal Objectives in Toy Models of Agents

Paul CologneseFeb 22, 2024, 8:02 AM

16 points

0 comments8 min readLW link

The Byronic Hero Always Loses

Cole WyethFeb 22, 2024, 1:31 AM

32 points

4 comments2 min readLW link

Job Listing: Managing Editor / Writer

Gretta DulebaFeb 21, 2024, 11:41 PM

43 points

2 comments1 min readLW link

The Pareto Best and the Curse of Doom

ScrewtapeFeb 21, 2024, 11:10 PM

120 points

21 comments9 min readLW link

AISN #31: A New AI Policy Bill in California Plus, Precedents for AI Governance and The EU AI Office

Dan HFeb 21, 2024, 9:58 PM

17 points

0 comments6 min readLW link

(newsletter.safe.ai)

Analogies between scaling labs and misaligned superintelligent AI

scasperFeb 21, 2024, 7:29 PM

77 points

5 comments4 min readLW link

Extinction Risks from AI: Invisible to Science?

VojtaKovarik, Chris van Merwijk and Ida Mattsson

Feb 21, 2024, 6:07 PM

24 points

7 comments1 min readLW link

(arxiv.org)

Extinction-level Goodhart’s Law as a Property of the Environment

VojtaKovarik and Ida Mattsson

Feb 21, 2024, 5:56 PM

23 points

0 comments10 min readLW link

Dynamics Crucial to AI Risk Seem to Make for Complicated Models

VojtaKovarik and Ida Mattsson

Feb 21, 2024, 5:54 PM

19 points

0 comments9 min readLW link

Which Model Properties are Necessary for Evaluating an Argument?

VojtaKovarik and Ida Mattsson

Feb 21, 2024, 5:52 PM

18 points

2 comments7 min readLW link