LLMs as a Planning Overhang

Larks, 14 Jul 2024 2:54 UTC
14 points
0 comments, 2 min read, LW link

Ice: The Penultimate Frontier

Roko, 13 Jul 2024 23:44 UTC
26 points
4 comments, 1 min read, LW link
(transhumanaxiology.substack.com)

Trust as a bottleneck to growing teams quickly

benkuhn, 13 Jul 2024 18:00 UTC
16 points
1 comment, 5 min read, LW link
(www.benkuhn.net)

Kinds of Motivation

Sable, 13 Jul 2024 15:52 UTC
6 points
0 comments, 7 min read, LW link
(affablyevil.substack.com)

A simple case for extreme inner misalignment

Richard_Ngo, 13 Jul 2024 15:40 UTC
47 points
16 comments, 7 min read, LW link

Thought Experiments Website

minmi_drover, 13 Jul 2024 4:47 UTC
12 points
3 comments, 1 min read, LW link

Memorising molecular structures

dkl9, 12 Jul 2024 22:40 UTC
8 points
0 comments, 2 min read, LW link
(dkl9.net)

Robin Hanson AI X-Risk Debate — Highlights and Analysis

Liron, 12 Jul 2024 21:31 UTC
36 points
3 comments, 45 min read, LW link
(www.youtube.com)

Designing Artificial Wisdom: The Wise Workflow Research Organization

Jordan Arel, 12 Jul 2024 19:18 UTC
2 points
0 comments, 8 min read, LW link

Whiteboard Pen Magazines are Useful

Johannes C. Mayer, 12 Jul 2024 17:15 UTC
24 points
5 comments, 1 min read, LW link

Alignment: “Do what I would have wanted you to do”

Oleg Trott, 12 Jul 2024 16:47 UTC
13 points
43 comments, 1 min read, LW link

Virtue taxation

Dentosal, 12 Jul 2024 14:56 UTC
9 points
1 comment, 2 min read, LW link

Moving away from physical continuity

ProgramCrafter, 12 Jul 2024 5:05 UTC
2 points
1 comment, 1 min read, LW link

Transformer Circuit Faithfulness Metrics Are Not Robust

12 Jul 2024 3:47 UTC
78 points
0 comments, 7 min read, LW link
(arxiv.org)

On Artificial Wisdom

Jordan Arel, 12 Jul 2024 0:20 UTC
3 points
0 comments, 13 min read, LW link

Yoshua Bengio: Reasoning through arguments against taking AI safety seriously

Judd Rosenblatt, 11 Jul 2024 23:53 UTC
68 points
1 comment, 1 min read, LW link
(yoshuabengio.org)

Podcast: “How the Smart Money teaches trading with Ricki Heicklen” (Patrick McKenzie interviewing)

rossry, 11 Jul 2024 22:49 UTC
20 points
2 comments, 1 min read, LW link
(www.complexsystemspodcast.com)

Superbabies: Putting The Pieces Together

sarahconstantin, 11 Jul 2024 20:40 UTC
121 points
7 comments, 10 min read, LW link
(sarahconstantin.substack.com)

Sherlockian Abduction Master List

Cole Wyeth, 11 Jul 2024 20:27 UTC
39 points
44 comments, 19 min read, LW link

Thoughts to niplav on lie-detection, truthfwl mechanisms, and wealth-inequality

11 Jul 2024 18:55 UTC
7 points
8 comments, 4 min read, LW link