All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 181920 21 22 23 24 25 26 27 28 29 30

Anthropic is (probably) not meeting its RSP security commitments

habryka18 Nov 2025 23:34 UTC

129 points

22 comments5 min readLW link

Considerations for setting the FLOP thresholds in our example international AI agreement

Aaron_Scher and peterbarnett

18 Nov 2025 23:31 UTC

54 points

5 comments7 min readLW link

Jailbreaking AI models to Phish Elderly Victims

Simon Lermen and Fred Heiding

18 Nov 2025 23:17 UTC

17 points

0 comments2 min readLW link

(simonlermen.substack.com)

Beren’s Essay on Obedience and Alignment

StanislavKrym18 Nov 2025 22:50 UTC

33 points

0 comments9 min readLW link

(www.beren.io)

Towards A Unified Theory Of Alignment

kenneth myers18 Nov 2025 22:03 UTC

4 points

3 comments4 min readLW link

[Question] Why are FICO scores effective?

Hruss18 Nov 2025 21:53 UTC

8 points

3 comments2 min readLW link

Bologna December Meetup

Luca Petrolati18 Nov 2025 20:19 UTC

3 points

0 comments1 min readLW link

The Aura of a Dark Lord

Dentosal18 Nov 2025 20:07 UTC

25 points

0 comments3 min readLW link

Reading LLM chain of thought makes me more rational

Michael Steele18 Nov 2025 19:53 UTC

1 point

0 comments1 min readLW link

New Report: An International Agreement to Prevent the Premature Creation of Artificial Superintelligence

peterbarnett, Aaron_Scher, David Abecassis and Brian Abeyta

18 Nov 2025 19:09 UTC

223 points

23 comments3 min readLW link

Sign language as a generally-useful means of communication (even if you have good hearing)

beyarkay (Boyd Kane)18 Nov 2025 18:34 UTC

7 points

2 comments1 min readLW link

(boydkane.com)

Victor Taelin’s notes on Gemini 3

Gunnar_Zarncke18 Nov 2025 18:30 UTC

32 points

1 comment3 min readLW link

(x.com)

On Writing #2

Zvi18 Nov 2025 17:30 UTC

46 points

4 comments14 min readLW link

(thezvi.wordpress.com)

GPT 5.1 Follows Custom Instructions and Glazes

Zvi18 Nov 2025 17:30 UTC

28 points

1 comment20 min readLW link

(thezvi.wordpress.com)

ARC progress update: Competing with sampling

Eric Neyman, Victor Lecomte, Wilson Wu, Mikewins, Jacob_Hilton and George Robinson

18 Nov 2025 17:22 UTC

131 points

11 comments21 min readLW link

Status Is The Game Of The Losers’ Bracket

johnswentworth18 Nov 2025 17:08 UTC

94 points

48 comments4 min readLW link

Kairos is the new home for the Global Challenges Project, and we’re hiring for a GCP Director

Topaz and agucova

18 Nov 2025 13:54 UTC

6 points

0 comments1 min readLW link

Reconstellation: construct a flywheel for personal change

teebarnett18 Nov 2025 12:30 UTC

13 points

2 comments12 min readLW link

The Illegible Chain-of-Thought Menagerie

Artem Karpov18 Nov 2025 12:01 UTC

3 points

0 comments8 min readLW link

A Call for Better Risk Modelling

Jan Wehner and Charbel-Raphaël

18 Nov 2025 9:08 UTC

20 points

0 comments4 min readLW link

Eat The Richtext

dreeves18 Nov 2025 7:57 UTC

46 points

1 comment2 min readLW link

Memories of a British Boarding School #1

Ben Pace18 Nov 2025 7:57 UTC

36 points

0 comments5 min readLW link

Preference Weighting and the Abilene Paradox

Screwtape18 Nov 2025 7:56 UTC

29 points

1 comment8 min readLW link

Don’t grow your org fast

Ruby18 Nov 2025 7:47 UTC

19 points

2 comments9 min readLW link

Continuity

abramdemski18 Nov 2025 5:59 UTC

27 points

4 comments3 min readLW link

How Colds Spread

RobertM18 Nov 2025 5:25 UTC

248 points

32 comments10 min readLW link

Aim for single piece flow

habryka18 Nov 2025 5:22 UTC

123 points

21 comments5 min readLW link

I store some memories spatially and I don’t know why

Alex_Altair18 Nov 2025 2:54 UTC

11 points

3 comments2 min readLW link

(namelessvirtue.com)

An Analogue Of Set Relationships For Distributions

johnswentworth and David Lorell

18 Nov 2025 1:03 UTC

53 points

4 comments3 min readLW link

No One Reads the Original Work

Algon18 Nov 2025 0:00 UTC

51 points

10 comments2 min readLW link

Middlemen Are Eating the World (And That’s Good, Actually)

Linch17 Nov 2025 22:26 UTC

48 points

4 comments4 min readLW link

(inchpin.substack.com)

[Question] Are there examples of communities where AI is making epistemics better now?

Ben Goldhaber17 Nov 2025 21:47 UTC

18 points

0 comments2 min readLW link

Generalisation Hacking: a first look at adversarial generalisation failures in deliberative alignment

Cam and Puria

17 Nov 2025 21:44 UTC

54 points

2 comments8 min readLW link

Varieties Of Doom

jdp17 Nov 2025 21:36 UTC

173 points

70 comments57 min readLW link

(minihf.com)

Omniscience one bit at a time: Chapter 5

Dentosal17 Nov 2025 21:31 UTC

9 points

1 comment2 min readLW link

The Barriers to Your Unemployment

claywren17 Nov 2025 21:08 UTC

9 points

0 comments7 min readLW link

Thoughts and experiences on using AI for learning

Mitali M17 Nov 2025 21:07 UTC

6 points

0 comments1 min readLW link

Cooling the brain to boost human IQ

Michael Steele17 Nov 2025 21:02 UTC

8 points

10 comments3 min readLW link

AI 2025 - Last Shipmas

Simon Lermen17 Nov 2025 19:39 UTC

66 points

5 comments7 min readLW link

Knowing Whether AI Alignment Is a One-Shot Problem Is a One-Shot Problem

MichaelDickens17 Nov 2025 19:11 UTC

32 points

2 comments3 min readLW link

Lessons from building a model organism testbed

joshc, sarun0, Annie Sorkin and michaelwaves

17 Nov 2025 17:58 UTC

22 points

1 comment14 min readLW link

# How the Crypto Bros and Poker Pros Blew the Whistle on UFOs. Prediction by @Grok, xAI January 2026

Krantz17 Nov 2025 16:19 UTC

−19 points

0 comments2 min readLW link

Close open loops

habryka17 Nov 2025 16:00 UTC

62 points

0 comments3 min readLW link

Lobsang’s Children

Tomás B.17 Nov 2025 15:12 UTC

61 points

0 comments23 min readLW link

50 Shades of Red

Aprillion17 Nov 2025 13:52 UTC

4 points

0 comments3 min readLW link

75 and 750 Words on Legal Personhood

Stephen Martin17 Nov 2025 13:50 UTC

21 points

0 comments3 min readLW link

Considerations regarding being nice to AIs

MattAlexander17 Nov 2025 13:05 UTC

8 points

0 comments15 min readLW link

A Market of Whispering Earrings

mrmoxon17 Nov 2025 13:02 UTC

2 points

0 comments2 min readLW link

Human behavior is an intuition-pump for AI risk

invertedpassion17 Nov 2025 11:46 UTC

4 points

0 comments16 min readLW link

On Comparative Advantage & AGI

CharlesD17 Nov 2025 9:33 UTC

11 points

0 comments3 min readLW link