
An Introduction to AI Sandbagging

26 Apr 2024 13:40 UTC
41 points
1 comment · 8 min read · LW link

My hour of memoryless lucidity

Eric Neyman · 4 May 2024 1:40 UTC
39 points
1 comment · 5 min read · LW link
(ericneyman.wordpress.com)

KAN: Kolmogorov-Arnold Networks

Gunnar_Zarncke · 1 May 2024 16:50 UTC
10 points
10 comments · 1 min read · LW link
(arxiv.org)

Why I’m doing PauseAI

Joseph Miller · 30 Apr 2024 16:21 UTC
92 points
10 comments · 4 min read · LW link

“AI Safety for Fleshy Humans” an AI Safety explainer by Nicky Case

habryka · 3 May 2024 18:10 UTC
45 points
7 comments · 4 min read · LW link
(aisafety.dance)

Transformers Represent Belief State Geometry in their Residual Stream

Adam Shai · 16 Apr 2024 21:16 UTC
349 points
79 comments · 12 min read · LW link

If you weren’t such an idiot...

2 Mar 2024 0:01 UTC
119 points
60 comments · 2 min read · LW link
(markxu.com)

[Question] Which skincare products are evidence-based?

Vanessa Kosoy · 2 May 2024 15:22 UTC
81 points
24 comments · 1 min read · LW link

LLM+Planners hybridisation for friendly AGI

installgentoo · 3 May 2024 8:40 UTC
6 points
2 comments · 1 min read · LW link

[Question] Were there any ancient rationalists?

OliverHayman · 3 May 2024 18:26 UTC
11 points
3 comments · 1 min read · LW link

Why is AGI/ASI Inevitable?

DeathlessAmaranth · 2 May 2024 18:27 UTC
14 points
6 comments · 1 min read · LW link

A list of core AI safety problems and how I hope to solve them

davidad · 26 Aug 2023 15:12 UTC
161 points
26 comments · 5 min read · LW link

Please stop publishing ideas/insights/research about AI

Tamsin Leake · 2 May 2024 14:54 UTC
22 points
48 comments · 4 min read · LW link

Disentangling Competence and Intelligence

Robert Kralisch · 29 Apr 2024 0:12 UTC
23 points
7 comments · 6 min read · LW link

Key takeaways from our EA and alignment research surveys

3 May 2024 18:10 UTC
69 points
2 comments · 21 min read · LW link

An Unintentional Compliment

28 Apr 2024 20:04 UTC
23 points
2 comments · 4 min read · LW link

An explanation of evil in an organized world

KatjaGrace · 2 May 2024 5:20 UTC
26 points
9 comments · 2 min read · LW link
(worldspiritsockpuppet.com)

Coherence of Caches and Agents

johnswentworth · 1 Apr 2024 23:04 UTC
73 points
7 comments · 11 min read · LW link

[Question] Can stealth aircraft be detected optically?

Yair Halberstadt · 2 May 2024 7:47 UTC
18 points
24 comments · 1 min read · LW link

Ironing Out the Squiggles

Zack_M_Davis · 29 Apr 2024 16:13 UTC
140 points
33 comments · 11 min read · LW link