What We Talk About When We Talk About Ob­jec­tive Functions

WaddingtonMay 18, 2025, 10:48 PM
8 points
0 comments6 min readLW link

Let­ter to The Honourable Evan Solomon

AnnapurnaMay 18, 2025, 4:34 PM
−1 points
0 comments2 min readLW link
(jorgevelez.substack.com)

Gödel, Escher, Bach in the age of LLMs

TahpMay 18, 2025, 4:22 PM
21 points
4 comments12 min readLW link
(passwordpaper.com)

Limits to Con­trol Workshop

May 18, 2025, 4:05 PM
12 points
2 comments3 min readLW link

ChatGPT de­ceives users that it’s cleared its mem­ory when it hasn’t

danielechlinMay 18, 2025, 3:17 PM
15 points
10 comments2 min readLW link

Model­ing ver­sus Implementation

Cole WyethMay 18, 2025, 1:38 PM
27 points
10 comments3 min readLW link

ALICE de­tects the con­ver­sion of lead into gold at the LHC

azerganteMay 18, 2025, 9:38 AM
4 points
0 comments1 min readLW link
(home.cern)

Euro­pean Links (18.05.25)

Martin SustrikMay 18, 2025, 4:20 AM
16 points
5 comments2 min readLW link
(250bpm.substack.com)

Google Logo Li­ga­ture Bug

jefftkMay 18, 2025, 2:40 AM
49 points
7 comments1 min readLW link
(www.jefftk.com)

Who wants a free AI Safety Do­main?

Zohar JacksonMay 18, 2025, 12:37 AM
5 points
0 comments1 min readLW link

Can Rea­son­ing Models Avoid the Most For­bid­den Tech­nique?

Brendan LongMay 17, 2025, 11:26 PM
8 points
8 comments3 min readLW link
(www.brendanlong.com)

What OpenAI Told Cal­ifor­nia’s At­tor­ney General

garrisonMay 17, 2025, 11:14 PM
108 points
3 commentsLW link
(www.obsolete.pub)

Mul­tipo­lar AI is Underrated

Allison DuettmannMay 17, 2025, 10:03 PM
16 points
1 comment16 min readLW link

[Question] Will we sur­vive if AI solves en­g­ineer­ing be­fore de­cep­tion?

Knight LeeMay 17, 2025, 7:22 PM
21 points
13 comments1 min readLW link

Seven ways to Im­prove the In­ter­nal Model Principle

Alfred HarwoodMay 17, 2025, 4:38 PM
14 points
0 comments13 min readLW link

D&D.Sci: The Choos­ing Ones

abstractapplicMay 17, 2025, 3:26 PM
46 points
17 comments1 min readLW link

The ab­sent-minded variations

dr_sMay 17, 2025, 6:57 AM
24 points
13 comments9 min readLW link

Book Re­view: The Art of Happiness

ScrewtapeMay 17, 2025, 4:56 AM
37 points
23 comments11 min readLW link

Man­age­ment is the Near Future

jefftkMay 17, 2025, 2:50 AM
52 points
10 comments2 min readLW link
(www.jefftk.com)

Proof Sec­tion to an In­tro­duc­tion to Re­in­force­ment Learn­ing for Un­der­stand­ing In­fra-Bayesianism

Brittany GelbMay 17, 2025, 2:36 AM
3 points
0 comments9 min readLW link

An In­tro­duc­tion to Re­in­force­ment Learn­ing for Un­der­stand­ing In­fra-Bayesianism

Brittany GelbMay 17, 2025, 2:34 AM
11 points
0 comments20 min readLW link

Me­mory De­cod­ing Jour­nal Club: “Sy­nap­tic ar­chi­tec­ture of a mem­ory en­gram in the mouse hip­pocam­pus.”

Devin WardMay 16, 2025, 11:55 PM
3 points
0 comments1 min readLW link

So­cial Anx­iety Isn’t About Be­ing Liked

ChipmonkMay 16, 2025, 10:26 PM
136 points
21 comments2 min readLW link
(chrislakin.blog)

Events: De­bate & Fic­tion Project

abramdemskiMay 16, 2025, 9:51 PM
39 points
1 comment1 min readLW link

How Fast Can Al­gorithms Ad­vance Ca­pa­bil­ities? | Epoch Gra­di­ent Update

henryjMay 16, 2025, 9:38 PM
37 points
8 comments6 min readLW link
(epoch.ai)

P-Values Know When You’re Cheating

EggsMay 16, 2025, 8:34 PM
21 points
2 comments2 min readLW link

Minds are magic

k64May 16, 2025, 7:10 PM
0 points
1 comment2 min readLW link

US-China trade talks should pave way for AI safety treaty [SCMP cross­post]

otto.bartenMay 16, 2025, 4:55 PM
10 points
0 comments3 min readLW link

Direct Real­ism is prob­a­bly false

TerriLeafMay 16, 2025, 4:36 PM
−3 points
19 comments3 min readLW link

Re­gard­ing South Africa

ZviMay 16, 2025, 4:10 PM
71 points
5 comments11 min readLW link
(thezvi.wordpress.com)

Notes on Consciousness

CSDDMay 16, 2025, 2:17 PM
3 points
3 comments1 min readLW link

re­flect­ing on criticism

Vadim GolubMay 16, 2025, 11:59 AM
4 points
5 comments10 min readLW link

Gen­er­at­ing the Fun­niest Joke with RL (ac­cord­ing to GPT-4.1)

aggMay 16, 2025, 5:09 AM
99 points
22 comments4 min readLW link

In­ter­pretable Fine Tun­ing Re­search Up­date and Work­ing Prototype

Matthew KhoriatyMay 16, 2025, 3:44 AM
9 points
0 comments4 min readLW link

It Is Un­ten­able That Near-Fu­ture AI Sce­nario Models Like “AI 2027” Don’t In­clude Open Source AI

Andrew DicksonMay 16, 2025, 2:20 AM
36 points
17 comments5 min readLW link

Ap­ply to Visit­ing Fel­lows at Con­stel­la­tion, due June 13

Ella MarkianosMay 16, 2025, 2:20 AM
1 point
0 comments2 min readLW link

Para­noid Debating

DresdenHeartMay 16, 2025, 2:20 AM
1 point
0 comments1 min readLW link

Bay Area Sum­mer Solstice

May 16, 2025, 12:20 AM
20 points
0 comments1 min readLW link

Stay­ing in a Cap­sule Hotel

jefftkMay 16, 2025, 12:20 AM
19 points
2 comments1 min readLW link
(www.jefftk.com)

Re­search­ing Syn­thetic Con­scious­ness: sound ap­peal­ing?

Brad DunnMay 15, 2025, 10:29 PM
10 points
1 comment1 min readLW link

Start­ing Over: What to tell Sarah, at the edge of pro­fes­sional oblivion.

Brad DunnMay 15, 2025, 9:34 PM
11 points
1 comment20 min readLW link

Tax-Op­ti­mized Risk in Port­fo­lio Allocation

Brendan LongMay 15, 2025, 6:53 PM
6 points
0 comments1 min readLW link
(www.brendanlong.com)

AI Safety Thurs­days: Un­der­stand­ing The Self-Other Over­lap Approach

Juliana EberschlagMay 15, 2025, 6:41 PM
2 points
0 comments1 min readLW link

Some skep­ti­cism about skep­ti­cism about effi­cacy of paus­ing AI

extinction-bountiesMay 15, 2025, 6:15 PM
5 points
1 comment2 min readLW link

time is event based

thiccythotMay 15, 2025, 6:07 PM
45 points
1 comment4 min readLW link

Con­sider Others’ Cost Tolerances

nomagicpillMay 15, 2025, 5:43 PM
23 points
2 comments4 min readLW link
(nomagicpill.github.io)

Prob­lems with in­struc­tion-fol­low­ing as an al­ign­ment target

Seth HerdMay 15, 2025, 3:41 PM
48 points
14 comments10 min readLW link

AI #116: If Any­one Builds It, Every­one Dies

ZviMay 15, 2025, 3:10 PM
47 points
5 comments42 min readLW link
(thezvi.wordpress.com)

Counter-con­sid­er­a­tions on AI arms races

May 15, 2025, 2:54 PM
22 points
0 comments18 min readLW link

AlphaEvolve

mannatvjainMay 15, 2025, 2:14 PM
29 points
0 comments5 min readLW link
(deepmind.google)