3 Jul 2025 22:56 UTC

54 points

0 comments2 min readLW link

(intelligence.org)

Making Sense of Consciousness Part 2: Attention

sarahconstantin3 Jul 2025 21:20 UTC

16 points

1 comment6 min readLW link

(sarahconstantin.substack.com)

Battle of the Sexes—how to solve any (solvable) dispute

James Stephen Brown3 Jul 2025 19:21 UTC

7 points

0 comments3 min readLW link

(nonzerosum.games)

How worker co-ops can help restore social trust

B Jacobs3 Jul 2025 19:13 UTC

12 points

7 comments6 min readLW link

(bobjacobs.substack.com)

The Ultimatum Game—take it or leave it

James Stephen Brown3 Jul 2025 19:05 UTC

11 points

1 comment2 min readLW link

(nonzerosum.games)

A comment on Bayesian vs. frequentist statistical practice

bilibili3 Jul 2025 17:47 UTC

0 points

0 comments1 min readLW link

AISN #58: Senate Removes State AI Regulation Moratorium

Corin Katzke and Dan H

3 Jul 2025 17:26 UTC

6 points

0 comments4 min readLW link

(newsletter.safe.ai)

Research Note: Our scheming precursor evals had limited predictive power for our in-context scheming evals

Marius Hobbhahn3 Jul 2025 15:57 UTC

75 points

0 comments1 min readLW link

(www.apolloresearch.ai)

AI #123: Moratorium Moratorium

Zvi3 Jul 2025 15:40 UTC

33 points

1 comment49 min readLW link

(thezvi.wordpress.com)

Call for suggestions—AI safety course

Boaz Barak3 Jul 2025 14:30 UTC

54 points

23 comments1 min readLW link

Why I am not a polygenic score nihilist

David Hugh-Jones3 Jul 2025 13:38 UTC

6 points

0 comments2 min readLW link

(wyclif.substack.com)

Hunch: minimalism is correct

Adam Zerner3 Jul 2025 5:03 UTC

18 points

12 comments2 min readLW link

If Anyone Builds It, Everyone Dies: Advertisement design competition

yams2 Jul 2025 23:14 UTC

86 points

37 comments1 min readLW link

(intelligence.org)

Dialects for Humans: Sounding Distinct from LLMs

nebrelbug2 Jul 2025 23:03 UTC

9 points

2 comments2 min readLW link

Congress Asks Better Questions

Zvi2 Jul 2025 22:10 UTC

48 points

1 comment17 min readLW link

(thezvi.wordpress.com)

Eating Honey is (Probably) Fine, Actually

Linch2 Jul 2025 22:09 UTC

36 points

0 comments3 min readLW link

(linch.substack.com)

On Paying Attention

Alex Darby2 Jul 2025 21:52 UTC

5 points

0 comments7 min readLW link

Curing PMDD with Hair Loss Pills

David Lorell2 Jul 2025 21:35 UTC

105 points

4 comments8 min readLW link

[Question] RSS feed for 1 LW user?

Commander Zander2 Jul 2025 20:19 UTC

10 points

2 comments1 min readLW link

Thought Anchors: Which LLM Reasoning Steps Matter?

Uzay Macar, Paul Bogdan, Neel Nanda and Arthur Conmy

2 Jul 2025 20:16 UTC

35 points

6 comments6 min readLW link

(www.thought-anchors.com)

Cyberpunk Yoga

Commander Zander2 Jul 2025 20:04 UTC

7 points

0 comments3 min readLW link

The influence conjecture and its implcations

Bastian Gronager2 Jul 2025 19:36 UTC

−1 points

0 comments5 min readLW link

Idea on Bayes’ Theorem

BJ33832 Jul 2025 19:27 UTC

3 points

3 comments1 min readLW link

The Prisoner’s Dilemma—A Problematic Poster-Child

James Stephen Brown2 Jul 2025 19:10 UTC

9 points

0 comments3 min readLW link

AI Task Length Horizons in Offensive Cybersecurity

Sean Peters2 Jul 2025 18:36 UTC

73 points

10 comments12 min readLW link

Slicing the (Kosher) Hate Salami

ymeskhout2 Jul 2025 18:11 UTC

22 points

5 comments11 min readLW link

(www.ymeskhout.com)

Race and Gender Bias As An Example of Unfaithful Chain of Thought in the Wild

Adam Karvonen and Sam Marks

2 Jul 2025 16:35 UTC

191 points

26 comments4 min readLW link

Executive Belocracy: Review of Organization Types

belos2 Jul 2025 15:56 UTC

−1 points

0 comments11 min readLW link

(bestofagreatlot.substack.com)

There are two fundamentally different constraints on schemers

Buck2 Jul 2025 15:51 UTC

63 points

0 comments4 min readLW link

Mythbusting the supposed “1,000+ AI state bills that would hobble innovation”

sjadler2 Jul 2025 4:49 UTC

6 points

0 comments1 min readLW link

(stevenadler.substack.com)

[Question] Are LLMs being trained using LessWrong text?

Cedar2 Jul 2025 3:00 UTC

7 points

4 comments1 min readLW link

“What’s my goal?”

Raemon2 Jul 2025 2:58 UTC

132 points

9 comments2 min readLW link

Use AI to Dimensionalize

Jordan Rubin2 Jul 2025 2:43 UTC

10 points

1 comment3 min readLW link

(jordanmrubin.substack.com)

Why Engaging with Global Majority AI Policy Matters

Heramb2 Jul 2025 1:46 UTC

9 points

0 comments2 min readLW link

Lessons from Building Secular Ritual: A Winter Solstice Experiment

joshuamerriam2 Jul 2025 0:55 UTC

9 points

0 comments4 min readLW link

On The Formal Definition of Alignment

Davey2 Jul 2025 0:05 UTC

4 points

3 comments1 min readLW link

AI-202X: a game between humans and AGIs aligned to different futures?

StanislavKrym1 Jul 2025 23:37 UTC

5 points

0 comments16 min readLW link

Aether July 2025 Update

RohanS, Rauno Arike and Shubhorup Biswas

1 Jul 2025 21:08 UTC

26 points

7 comments3 min readLW link

AI Moratorium Stripped From BBB

Zvi1 Jul 2025 18:50 UTC

70 points

4 comments6 min readLW link

(thezvi.wordpress.com)

Manipulating Self-Preference In LLMs

Matthew Nguyen, Jou Barzdukas, Matthew Bozoukov and Hongyu Fu

1 Jul 2025 18:03 UTC

13 points

0 comments7 min readLW link

A Simple Explanation of AGI Risk

TurnTrout1 Jul 2025 16:18 UTC

58 points

4 comments5 min readLW link

(turntrout.com)

Authors Have a Responsibility to Communicate Clearly

TurnTrout1 Jul 2025 15:41 UTC

127 points

29 comments6 min readLW link

(turntrout.com)

Road to AnimalHarmBench

Arturs and Constance Li

1 Jul 2025 13:38 UTC

−1 points

0 comments1 min readLW link

(forum.effectivealtruism.org)

Embedded Altruism [slides]

owencb1 Jul 2025 13:02 UTC

22 points

3 comments1 min readLW link

Senate Strikes Potential AI Moratorium

T_W1 Jul 2025 11:49 UTC

16 points

0 comments1 min readLW link

(www.reuters.com)

[Question] Can AIs be shown their messages aren’t tampered with?

mruwnik1 Jul 2025 9:39 UTC

4 points

10 comments1 min readLW link

SLT for AI Safety

Jesse Hoogland1 Jul 2025 4:52 UTC

78 points

0 comments3 min readLW link

Problematic Professors

Eggs1 Jul 2025 2:54 UTC

16 points

5 comments2 min readLW link

I can’t tell if my ideas are good anymore because I talked to robots too much

Tyson30 Jun 2025 21:21 UTC

13 points

10 comments1 min readLW link

Q1 AI Benchmark Results: Pro Forecasters Crush Bots

Ben Wilson30 Jun 2025 21:12 UTC

14 points

0 comments22 min readLW link

(www.metaculus.com)