Job Listing (closed): CBAI Operations Associates

Maite Abadia-Manthei 21 Jul 2025 22:53 UTC
1 point
0 comments
1 min read
LW link
(www.cbai.ai)

If Anyone Builds It, Everyone Dies: Call for Translators (for Supplementary Materials)

yams 21 Jul 2025 22:37 UTC
112 points
12 comments
1 min read
LW link

Why Reality Has A Well-Known Math Bias

Linch 21 Jul 2025 22:13 UTC
42 points
18 comments
1 min read
LW link
(linch.substack.com)

Questions about animal welfare markets

Austin Chen 21 Jul 2025 21:54 UTC
9 points
0 comments
5 min read
LW link

Directly Try Solving Alignment for 5 weeks

Kabir Kumar 21 Jul 2025 21:51 UTC
71 points
2 comments
6 min read
LW link
(beta.ai-plans.com)

Navigating Respect: How to bid boldly, and when to humble yourself preemptively

jimmy 21 Jul 2025 20:30 UTC
14 points
2 comments
12 min read
LW link

Grizzly Man screening, tacos, carlsmith discussion

Quinn 21 Jul 2025 19:48 UTC
6 points
0 comments
1 min read
LW link

[Question] Refining Generalized Hangriness: Emotional Processing as Thinking Tech

M. Key 21 Jul 2025 18:49 UTC
10 points
1 comment
7 min read
LW link

Detecting High-Stakes Interactions with Activation Probes

21 Jul 2025 18:21 UTC
50 points
0 comments
4 min read
LW link

GDM also claims IMO gold medal

Yair Halberstadt 21 Jul 2025 17:18 UTC
61 points
3 comments
1 min read
LW link
(deepmind.google)

Visualizing AI Alignment Failures as Topological Navigation Errors in Conceptual Space

CC4CI 21 Jul 2025 16:54 UTC
1 point
0 comments
1 min read
LW link

LLM Daydreaming (gwern.net)

Noosphere89 21 Jul 2025 16:50 UTC
18 points
2 comments
10 min read
LW link
(gwern.net)

[Question] Moral realism—basic Q

Dagon 21 Jul 2025 16:20 UTC
7 points
12 comments
1 min read
LW link

HRT in Menopause: A candidate for a case study of epistemology in epidemiology, statistics & medicine

foodforthought 21 Jul 2025 16:18 UTC
40 points
2 comments
4 min read
LW link

Using Older AI Models as a Form of Boycott

Jacob1 21 Jul 2025 12:18 UTC
6 points
2 comments
1 min read
LW link

Substack for Best Posts

jefftk 21 Jul 2025 12:10 UTC
11 points
1 comment
2 min read
LW link
(www.jefftk.com)

Monthly Roundup #32: July 2025

Zvi 21 Jul 2025 12:00 UTC
41 points
10 comments
37 min read
LW link
(thezvi.wordpress.com)

Reasons to vote in non-deterministic elections

B Jacobs 21 Jul 2025 11:09 UTC
8 points
1 comment
8 min read
LW link
(bobjacobs.substack.com)

Creative writing with LLMs, part 1: Prompting for fiction

Kaj_Sotala 21 Jul 2025 8:47 UTC
38 points
10 comments
20 min read
LW link

Just Make a New Rule!

Zack_M_Davis 21 Jul 2025 5:54 UTC
8 points
24 comments
4 min read
LW link

[Fiction] Our Trial

Nina Panickssery 21 Jul 2025 3:56 UTC
68 points
1 comment
3 min read
LW link
(ninapanickssery.substack.com)

My First Month with Math Academy: An Experience Report from a Middle School Dropout.

L.M.Sherlock 21 Jul 2025 3:18 UTC
5 points
0 comments
29 min read
LW link
(lmsherlock.substack.com)

AI Safety course intro blog

boazbarak 21 Jul 2025 2:35 UTC
16 points
0 comments
1 min read
LW link
(windowsontheory.org)

An Outsider’s Roadmap into AI Safety Research (2025)

Luis M. Montoya 21 Jul 2025 2:03 UTC
5 points
3 comments
10 min read
LW link

[Question] Help me learn more about AI

Mark Tranter 21 Jul 2025 1:49 UTC
1 point
0 comments
1 min read
LW link

Unbounded Embedded Agency: AEDT w.r.t. rOSI

Cole Wyeth 20 Jul 2025 23:46 UTC
29 points
0 comments
17 min read
LW link

AI-Oriented Investments

PeterMcCluskey 20 Jul 2025 21:31 UTC
28 points
0 comments
1 min read
LW link
(bayesianinvestor.com)

On The Shoulders of Substrates—how one phenomenon lays the foundation for the next

James Stephen Brown 20 Jul 2025 21:11 UTC
14 points
1 comment
3 min read
LW link
(nonzerosum.games)

Life of Posts?

jmh 20 Jul 2025 21:04 UTC
10 points
3 comments
1 min read
LW link

LLMs Can’t See Pixels or Characters

Brendan Long 20 Jul 2025 20:00 UTC
100 points
44 comments
4 min read
LW link
(www.brendanlong.com)

Operationalizing Functional Consciousness: A Framework for AI Rights

Rudyon 20 Jul 2025 17:50 UTC
−5 points
1 comment
1 min read
LW link
(kanarya.group)

Do “adult developmental stages” theories have any pre-theoretic motivation?

Said Achmiz 20 Jul 2025 14:37 UTC
35 points
19 comments
3 min read
LW link

Parallel Parking and possibly Instrumental Convergence

CstineSublime 20 Jul 2025 10:37 UTC
2 points
10 comments
3 min read
LW link

Plato’s Trolley

dr_s 20 Jul 2025 10:07 UTC
36 points
11 comments
7 min read
LW link

Shallow Water is Dangerous Too

jefftk 20 Jul 2025 2:30 UTC
222 points
24 comments
2 min read
LW link
(www.jefftk.com)

Your AI Safety org could get EU funding up to €9.08M. Here’s how (+ free personalized support) Update: Webinar 18/8 Link Below

SamuelK 20 Jul 2025 1:30 UTC
65 points
3 comments
3 min read
LW link

Make More Grayspaces

Duncan Sabien (Inactive) 19 Jul 2025 22:22 UTC
296 points
65 comments
13 min read
LW link

Cheating at Bets with the Even Odds Algorithm

omark 19 Jul 2025 22:06 UTC
12 points
3 comments
6 min read
LW link

Can We Trust the Judge? A novel method of Modelling Human Bias and Systematic Error in Debate-Based Scalable Oversight

Andreea Zaman 19 Jul 2025 21:44 UTC
1 point
0 comments
7 min read
LW link

Peeling Back The Remoteness of Sources

adamShimi 19 Jul 2025 17:41 UTC
16 points
1 comment
13 min read
LW link
(formethods.substack.com)

Sequential Coherence: A Bottleneck in Automation

19 Jul 2025 15:27 UTC
26 points
2 comments
11 min read
LW link

How Misaligned AI Personas Lead to Human Extinction – Step by Step

Writer 19 Jul 2025 13:59 UTC
14 points
0 comments
7 min read
LW link
(youtu.be)

L0 is not a neutral hyperparameter

19 Jul 2025 13:51 UTC
24 points
3 comments
5 min read
LW link

From Messy Shelves to Master Librarians: Toy-Model Exploration of Block-Diagonal Geometry in LM Activations

Yuxiao 19 Jul 2025 12:26 UTC
5 points
1 comment
4 min read
LW link

OpenAI Claims IMO Gold Medal

Mikhail Samin 19 Jul 2025 9:58 UTC
77 points
74 comments
1 min read
LW link
(x.com)

On the deep (uncurable?) vulnerability of MCPs

awu 19 Jul 2025 2:50 UTC
5 points
6 comments
1 min read
LW link
(www.generalanalysis.com)

[Question] Best way to ask laypeople for conditional probabilities in a Bayes net?

Zack Friedman 19 Jul 2025 2:45 UTC
11 points
1 comment
1 min read
LW link

[Question] Get sued or kill someone: The trolly problems of Psychological practice.

Brad Dunn 18 Jul 2025 23:35 UTC
12 points
2 comments
3 min read
LW link

resume limiting

bhauth 18 Jul 2025 23:31 UTC
18 points
13 comments
2 min read
LW link
(www.bhauth.com)

[Linkpost] How Am I Getting Along with AI?

Gunnar_Zarncke 18 Jul 2025 22:26 UTC
11 points
0 comments
1 min read
LW link
(jessiefischbein.substack.com)