All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 202420252026

All Jan Feb Mar Apr May JunJulAug Sep Oct Nov Dec

All 1 234 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

If Anyone Builds It, Everyone Dies: Advertisement design competition

yams2 Jul 2025 23:14 UTC

86 points

37 comments1 min readLW link

(intelligence.org)

Dialects for Humans: Sounding Distinct from LLMs

nebrelbug2 Jul 2025 23:03 UTC

9 points

2 comments2 min readLW link

Congress Asks Better Questions

Zvi2 Jul 2025 22:10 UTC

48 points

1 comment17 min readLW link

(thezvi.wordpress.com)

Eating Honey is (Probably) Fine, Actually

Linch2 Jul 2025 22:09 UTC

36 points

0 comments3 min readLW link

(linch.substack.com)

On Paying Attention

Alex Darby2 Jul 2025 21:52 UTC

5 points

0 comments7 min readLW link

Curing PMDD with Hair Loss Pills

David Lorell2 Jul 2025 21:35 UTC

105 points

4 comments8 min readLW link

[Question] RSS feed for 1 LW user?

Commander Zander2 Jul 2025 20:19 UTC

10 points

2 comments1 min readLW link

Thought Anchors: Which LLM Reasoning Steps Matter?

Uzay Macar, Paul Bogdan, Neel Nanda and Arthur Conmy

2 Jul 2025 20:16 UTC

35 points

6 comments6 min readLW link

(www.thought-anchors.com)

Cyberpunk Yoga

Commander Zander2 Jul 2025 20:04 UTC

7 points

0 comments3 min readLW link

The influence conjecture and its implcations

Bastian Gronager2 Jul 2025 19:36 UTC

−1 points

0 comments5 min readLW link

Idea on Bayes’ Theorem

BJ33832 Jul 2025 19:27 UTC

3 points

3 comments1 min readLW link

The Prisoner’s Dilemma—A Problematic Poster-Child

James Stephen Brown2 Jul 2025 19:10 UTC

9 points

0 comments3 min readLW link

AI Task Length Horizons in Offensive Cybersecurity

Sean Peters2 Jul 2025 18:36 UTC

73 points

10 comments12 min readLW link

Slicing the (Kosher) Hate Salami

ymeskhout2 Jul 2025 18:11 UTC

22 points

5 comments11 min readLW link

(www.ymeskhout.com)

Race and Gender Bias As An Example of Unfaithful Chain of Thought in the Wild

Adam Karvonen and Sam Marks

2 Jul 2025 16:35 UTC

191 points

26 comments4 min readLW link

Executive Belocracy: Review of Organization Types

belos2 Jul 2025 15:56 UTC

−1 points

0 comments11 min readLW link

(bestofagreatlot.substack.com)

There are two fundamentally different constraints on schemers

Buck2 Jul 2025 15:51 UTC

63 points

0 comments4 min readLW link

Mythbusting the supposed “1,000+ AI state bills that would hobble innovation”

sjadler2 Jul 2025 4:49 UTC

6 points

0 comments1 min readLW link

(stevenadler.substack.com)

[Question] Are LLMs being trained using LessWrong text?

Cedar2 Jul 2025 3:00 UTC

7 points

4 comments1 min readLW link

“What’s my goal?”

Raemon2 Jul 2025 2:58 UTC

132 points

9 comments2 min readLW link

Use AI to Dimensionalize

Jordan Rubin2 Jul 2025 2:43 UTC

10 points

1 comment3 min readLW link

(jordanmrubin.substack.com)

Why Engaging with Global Majority AI Policy Matters

Heramb2 Jul 2025 1:46 UTC

9 points

0 comments2 min readLW link

Lessons from Building Secular Ritual: A Winter Solstice Experiment

joshuamerriam2 Jul 2025 0:55 UTC

9 points

0 comments4 min readLW link

On The Formal Definition of Alignment

Davey2 Jul 2025 0:05 UTC

4 points

3 comments1 min readLW link

AI-202X: a game between humans and AGIs aligned to different futures?

StanislavKrym1 Jul 2025 23:37 UTC

5 points

0 comments16 min readLW link

Aether July 2025 Update

RohanS, Rauno Arike and Shubhorup Biswas

1 Jul 2025 21:08 UTC

26 points

7 comments3 min readLW link

AI Moratorium Stripped From BBB

Zvi1 Jul 2025 18:50 UTC

70 points

4 comments6 min readLW link

(thezvi.wordpress.com)

Manipulating Self-Preference In LLMs

Matthew Nguyen, Jou Barzdukas, Matthew Bozoukov and Hongyu Fu

1 Jul 2025 18:03 UTC

13 points

0 comments7 min readLW link

A Simple Explanation of AGI Risk

TurnTrout1 Jul 2025 16:18 UTC

58 points

4 comments5 min readLW link

(turntrout.com)

Authors Have a Responsibility to Communicate Clearly

TurnTrout1 Jul 2025 15:41 UTC

127 points

29 comments6 min readLW link

(turntrout.com)

Road to AnimalHarmBench

Arturs and Constance Li

1 Jul 2025 13:38 UTC

−1 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Embedded Altruism [slides]

owencb1 Jul 2025 13:02 UTC

22 points

3 comments1 min readLW link

Senate Strikes Potential AI Moratorium

T_W1 Jul 2025 11:49 UTC

16 points

0 comments1 min readLW link

(www.reuters.com)

[Question] Can AIs be shown their messages aren’t tampered with?

mruwnik1 Jul 2025 9:39 UTC

4 points

10 comments1 min readLW link

SLT for AI Safety

Jesse Hoogland1 Jul 2025 4:52 UTC

78 points

0 comments3 min readLW link

Problematic Professors

Eggs1 Jul 2025 2:54 UTC

16 points

5 comments2 min readLW link

I can’t tell if my ideas are good anymore because I talked to robots too much

Tyson30 Jun 2025 21:21 UTC

13 points

10 comments1 min readLW link

Q1 AI Benchmark Results: Pro Forecasters Crush Bots

Ben Wilson30 Jun 2025 21:12 UTC

14 points

0 comments22 min readLW link

(www.metaculus.com)

ACX Meetup Cape Town

tegan30 Jun 2025 21:11 UTC

1 point

0 comments1 min readLW link

The best simple argument for Pausing AI?

Gary Marcus30 Jun 2025 20:38 UTC

155 points

23 comments1 min readLW link

Hiring* an AI** Artist for LessWrong/Lightcone

Raemon30 Jun 2025 19:01 UTC

30 points

8 comments1 min readLW link

SAE on activation differences

Santiago Aranguri, jacob_drori and Neel Nanda

30 Jun 2025 17:50 UTC

45 points

3 comments5 min readLW link

The Spectrum of Attention: From Empathy to Hypnosis

jimmy30 Jun 2025 17:42 UTC

14 points

2 comments14 min readLW link

Substack and Other Blog Recommendations

Zvi30 Jun 2025 17:20 UTC

30 points

7 comments16 min readLW link

(thezvi.wordpress.com)

What We Learned Trying to Diff Base and Chat Models (And Why It Matters)

Clément Dumas, Julian Minder and Neel Nanda

30 Jun 2025 17:17 UTC

106 points

2 comments7 min readLW link

Don’t Eat Honey

Bentham's Bulldog30 Jun 2025 15:57 UTC

−15 points

70 comments6 min readLW link

Primary-budget voting registration

eg30 Jun 2025 15:39 UTC

1 point

4 comments2 min readLW link

Project Vend: Can Claude run a small shop?

Gunnar_Zarncke30 Jun 2025 15:22 UTC

53 points

8 comments1 min readLW link

(www.anthropic.com)

If you want to be vegan but you worry about health effects of no meat, consider being vegan except for mussels/oysters

KatWoods30 Jun 2025 13:28 UTC

80 points

15 comments1 min readLW link

How dangerous is encoded reasoning?

Artem Karpov30 Jun 2025 11:54 UTC

17 points

0 comments10 min readLW link