Sam Altman’s sister claims Sam sexually abused her—Part 4: Timeline, continued

pythagoras5015 · Apr 13, 2025, 11:41 PM
1 point
0 comments · 51 min read · LW link

The Structure of the Pain of Change

ReverendBayes · Apr 13, 2025, 9:51 PM
7 points
0 comments · 10 min read · LW link

Luna Lovegood and the Chamber of Secrets, Part 4

Apr 13, 2025, 8:55 PM
3 points
0 comments · 4 min read · LW link

Thoughts on the Double Impact Project

Mati_Roy · Apr 13, 2025, 7:07 PM
27 points
14 comments · 2 min read · LW link

Intro to Multi-Agent Safety

james__p · Apr 13, 2025, 5:40 PM
11 points
0 comments · 5 min read · LW link

[Question] How far are Western welfare states from coddling the population into becoming useless?

StanislavKrym · Apr 13, 2025, 5:08 PM
−24 points
5 comments · 1 min read · LW link

Vestigial reasoning in RL

Caleb Biddulph · Apr 13, 2025, 3:40 PM
50 points
8 comments · 9 min read · LW link

Four Types of Disagreement

silentbob · Apr 13, 2025, 11:22 AM
50 points
2 comments · 5 min read · LW link

How I switched careers from software engineer to AI policy operations

Lucie Philippon · Apr 13, 2025, 6:37 AM
58 points
1 comment · 5 min read · LW link

Steelmanning heuristic arguments

Dmitry Vaintrob · Apr 13, 2025, 1:09 AM
73 points
0 comments · 17 min read · LW link

MONA: Three Month Later—Updates and Steganography Without Optimization Pressure

Apr 12, 2025, 11:15 PM
31 points
0 comments · 5 min read · LW link

The Era of the Dividual—are we falling apart?

James Stephen Brown · Apr 12, 2025, 10:35 PM
3 points
2 comments · 4 min read · LW link

Commitment Races are a technical problem ASI can easily solve

Knight Lee · Apr 12, 2025, 10:22 PM
7 points
6 comments · 6 min read · LW link

The King’s Gift: How Institutions Rebrand Responsibility into Illusion

Hu Yichao · Apr 12, 2025, 7:38 PM
1 point
0 comments · 1 min read · LW link

Experts have it easy

beyarkay · Apr 12, 2025, 7:32 PM
23 points
3 comments · 9 min read · LW link

find_purpose.exe

heatdeathandtaxes · Apr 12, 2025, 7:31 PM
−1 points
0 comments · 5 min read · LW link
(heatdeathandtaxes.substack.com)

The Cynic Wasps in the Beehive

mempko · Apr 12, 2025, 7:30 PM
−3 points
0 comments · 1 min read · LW link
(blog.mempko.com)

Luna Lovegood and the Chamber of Secrets, Part 3

Apr 12, 2025, 7:20 PM
3 points
0 comments · 2 min read · LW link

[Question] What is autism?

Adam Zerner · Apr 12, 2025, 6:12 PM
18 points
7 comments · 1 min read · LW link

College Advice For People Like Me

henryj · Apr 12, 2025, 2:36 PM
50 points
5 comments · 17 min read · LW link
(www.henryjosephson.com)

Why does LW not put much more focus on AI governance and outreach?

Apr 12, 2025, 2:24 PM
78 points
31 comments · 2 min read · LW link

[Question] Is Local Order a Clue to Universal Entropy? How a Failed Professor Searches for a ‘Sacred Motivational Order’

P. João · Apr 12, 2025, 1:39 PM
2 points
2 comments · 2 min read · LW link

What are good safety standards for open source AIs from China?

ChristianKl · Apr 12, 2025, 1:06 PM
10 points
2 comments · 1 min read · LW link

Will US tariffs push data centers for large model training offshore?

ChristianKl · Apr 12, 2025, 12:47 PM
20 points
3 comments · 1 min read · LW link

Self propagating story.

Canaletto · Apr 12, 2025, 12:32 PM
3 points
0 comments · 8 min read · LW link

Calling Bullshit—the Cheatsheet

Niklas Lehmann · Apr 12, 2025, 11:43 AM
13 points
4 comments · 2 min read · LW link

The Internal Model Principle: A Straightforward Explanation

Alfred Harwood · Apr 12, 2025, 10:58 AM
22 points
1 comment · 19 min read · LW link

ACX Spring Meetup 2025 @ Klang Valley, Malaysia

Yi-Yang · Apr 12, 2025, 7:31 AM
2 points
0 comments · 1 min read · LW link

Distributed whistleblowing

samuelshadrach · Apr 12, 2025, 6:36 AM
5 points
5 comments · 4 min read · LW link
(samuelshadrach.com)

[Question] How likely are the USA to decay and how will it influence the AI development?

StanislavKrym · Apr 12, 2025, 4:42 AM
10 points
0 comments · 1 min read · LW link

[Question] Does this game have a name?

Mis-Understandings · Apr 12, 2025, 1:52 AM
4 points
4 comments · 1 min read · LW link

Bias Mitigation in Language Models by Steering Features

akankshanc · Apr 12, 2025, 12:10 AM
1 point
0 comments · 9 min read · LW link
(akankshanc.io)

Do we want too much from a potentially godlike AGI?

StanislavKrym · Apr 11, 2025, 11:33 PM
−1 points
0 comments · 2 min read · LW link

How training-gamers might function (and win)

Vivek Hebbar · Apr 11, 2025, 9:26 PM
107 points
5 comments · 13 min read · LW link

The limits of black-box evaluations: two hypotheticals

TFD · Apr 11, 2025, 8:45 PM
1 point
0 comments · 4 min read · LW link
(www.thefloatingdroid.com)

Comments on “AI 2027”

Randaly · Apr 11, 2025, 8:32 PM
19 points
14 comments · 7 min read · LW link

Debunk the myth -Testing the generalized reasoning ability of LLM

Defender7762 · Apr 11, 2025, 8:17 PM
1 point
5 comments · 4 min read · LW link

Theories of Impact for Causality in AI Safety

alexisbellot · Apr 11, 2025, 8:16 PM
11 points
1 comment · 6 min read · LW link

Why Bigger Models Generalize Better

PapersToAGI · Apr 11, 2025, 7:54 PM
1 point
0 comments · 2 min read · LW link

Can LLMs learn Steganographic Reasoning via RL?

Apr 11, 2025, 4:33 PM
28 points
2 comments · 6 min read · LW link

My day in 2035

Tenoke · Apr 11, 2025, 4:31 PM
19 points
2 comments · 7 min read · LW link
(svilentodorov.xyz)

Youth Lockout

Xavi CF · Apr 11, 2025, 3:05 PM
47 points
6 comments · 5 min read · LW link

[Question] Is the ethics of interaction with primitive peoples already solved?

StanislavKrym · Apr 11, 2025, 2:56 PM
−4 points
0 comments · 1 min read · LW link

OpenAI Responses API changes models’ behavior

Apr 11, 2025, 1:27 PM
53 points
6 comments · 2 min read · LW link

Weird Random Newcomb Problem

Tapatakt · Apr 11, 2025, 1:09 PM
21 points
16 comments · 4 min read · LW link

On Google’s Safety Plan

Zvi · Apr 11, 2025, 12:51 PM
57 points
6 comments · 33 min read · LW link
(thezvi.wordpress.com)

Luna Lovegood and the Chamber of Secrets, Part 2

Apr 11, 2025, 12:42 PM
2 points
1 comment · 3 min read · LW link

Paper

dynomight · Apr 11, 2025, 12:20 PM
43 points
12 comments · 3 min read · LW link

Why are neuro-symbolic systems not considered when it comes to AI Safety?

Edy Nastase · Apr 11, 2025, 9:41 AM
3 points
6 comments · 1 min read · LW link

Crash scenario 1: Rapidly mobilise for a 2025 AI crash

Remmelt · Apr 11, 2025, 6:54 AM
12 points
4 comments · 1 min read · LW link