Donat­ing to help Democrats win in the 2024 elec­tions: re­search, de­ci­sion sup­port, and recommendations

Michael Cohn14 Jul 2024 22:57 UTC
−1 points
1 comment6 min readLW link

Four ways I’ve made bad decisions

Sodium14 Jul 2024 22:18 UTC
18 points
1 comment3 min readLW link

patent pro­cess problems

bhauth14 Jul 2024 21:12 UTC
33 points
13 comments5 min readLW link
(www.bhauth.com)

Break­ing Cir­cuit Breakers

14 Jul 2024 18:57 UTC
53 points
13 comments1 min readLW link
(confirmlabs.org)

Clopen sandwiches

dkl914 Jul 2024 13:07 UTC
4 points
0 comments1 min readLW link
(dkl9.net)

Child Handrail Returns

jefftk14 Jul 2024 12:40 UTC
12 points
0 comments1 min readLW link
(www.jefftk.com)

A (para­con­sis­tent) logic to deal with in­con­sis­tent preferences

B Jacobs14 Jul 2024 11:17 UTC
6 points
2 comments4 min readLW link
(bobjacobs.substack.com)

Robert Caro And Mechanis­tic Models In Biography

adamShimi14 Jul 2024 10:56 UTC
24 points
5 comments7 min readLW link
(epistemologicalfascinations.substack.com)

An In­tro­duc­tion to Rep­re­sen­ta­tion Eng­ineer­ing—an ac­ti­va­tion-based paradigm for con­trol­ling LLMs

j_we14 Jul 2024 10:37 UTC
37 points
6 comments17 min readLW link

LLMs as a Plan­ning Overhang

Larks14 Jul 2024 2:54 UTC
38 points
8 comments2 min readLW link

Brief notes on the Wikipe­dia game

Olli Järviniemi14 Jul 2024 2:28 UTC
68 points
9 comments4 min readLW link

Spark in the Dark Guest Spots

jefftk14 Jul 2024 1:40 UTC
6 points
0 comments1 min readLW link
(www.jefftk.com)

Ice: The Penul­ti­mate Frontier

Roko13 Jul 2024 23:44 UTC
66 points
56 comments1 min readLW link
(transhumanaxiology.substack.com)

Trust as a bot­tle­neck to grow­ing teams quickly

benkuhn13 Jul 2024 18:00 UTC
44 points
3 comments5 min readLW link
(www.benkuhn.net)

Stitch­ing SAEs of differ­ent sizes

13 Jul 2024 17:19 UTC
39 points
12 comments12 min readLW link

Kinds of Motivation

Sable13 Jul 2024 15:52 UTC
7 points
2 comments7 min readLW link
(affablyevil.substack.com)

A sim­ple case for ex­treme in­ner misalignment

Richard_Ngo13 Jul 2024 15:40 UTC
84 points
41 comments7 min readLW link

Real­ity Testing

Ben Turtel13 Jul 2024 15:20 UTC
−2 points
1 comment6 min readLW link
(bturtel.substack.com)

The world is awful. The world is much bet­ter. The world can be much bet­ter: The An­i­ma­tion.

Writer13 Jul 2024 14:03 UTC
10 points
0 comments1 min readLW link
(youtu.be)

The Modern Prob­lems with Conformity

Zero Contradictions13 Jul 2024 8:20 UTC
0 points
5 comments1 min readLW link
(expandingrationality.substack.com)

De­sign­ing Ar­tifi­cial Wis­dom: GitWise and AlphaWise

Jordan Arel13 Jul 2024 6:46 UTC
2 points
0 comments7 min readLW link

OpenAI’s In­tel­li­gence Levels

infinibot2713 Jul 2024 6:25 UTC
1 point
0 comments1 min readLW link
(www.bloomberg.com)

Some de­sir­able prop­er­ties of au­to­mated wisdom

Marius Adrian Nicoară13 Jul 2024 6:05 UTC
3 points
2 comments6 min readLW link

Thought Ex­per­i­ments Website

minmi_drover13 Jul 2024 4:47 UTC
11 points
11 comments1 min readLW link

A Se­cond Wet­suit Summer

jefftk13 Jul 2024 2:00 UTC
19 points
2 comments1 min readLW link
(www.jefftk.com)

Ti­maeus is hiring!

12 Jul 2024 23:42 UTC
67 points
6 comments2 min readLW link

Con­sider at­tend­ing the AI Se­cu­rity Fo­rum ’24, a 1-day pre-DEFCON event

Charlie Rogers-Smith12 Jul 2024 23:01 UTC
21 points
0 comments1 min readLW link

Me­moris­ing molec­u­lar structures

dkl912 Jul 2024 22:40 UTC
6 points
0 comments2 min readLW link
(dkl9.net)

Robin Han­son AI X-Risk De­bate — High­lights and Analysis

Liron12 Jul 2024 21:31 UTC
46 points
7 comments45 min readLW link
(www.youtube.com)

De­sign­ing Ar­tifi­cial Wis­dom: The Wise Work­flow Re­search Organization

Jordan Arel12 Jul 2024 19:18 UTC
2 points
0 comments8 min readLW link

White­board Pen Magaz­ines are Useful

Johannes C. Mayer12 Jul 2024 17:15 UTC
41 points
8 comments1 min readLW link

Align­ment: “Do what I would have wanted you to do”

Oleg Trott12 Jul 2024 16:47 UTC
11 points
48 comments1 min readLW link

Virtue taxation

Dentosal12 Jul 2024 14:56 UTC
9 points
1 comment2 min readLW link

Most smart and skil­led peo­ple are out­side of the EA/​ra­tio­nal­ist com­mu­nity: an analysis

titotal12 Jul 2024 12:13 UTC
109 points
39 comments14 min readLW link
(open.substack.com)

2024 Free­dom Com­mu­ni­ties Events

Tudor Iliescu12 Jul 2024 8:04 UTC
−6 points
1 comment1 min readLW link

Faith­ful vs In­ter­pretable Sparse Au­toen­coder Evals

Louka Ewington-Pitsos12 Jul 2024 5:37 UTC
2 points
0 comments12 min readLW link

Mov­ing away from phys­i­cal continuity

ProgramCrafter12 Jul 2024 5:05 UTC
2 points
1 comment1 min readLW link

Trans­former Cir­cuit Faith­ful­ness Met­rics Are Not Robust

12 Jul 2024 3:47 UTC
104 points
5 comments7 min readLW link
(arxiv.org)

On Ar­tifi­cial Wisdom

Jordan Arel12 Jul 2024 0:20 UTC
3 points
0 comments14 min readLW link

Yoshua Ben­gio: Rea­son­ing through ar­gu­ments against tak­ing AI safety seriously

Judd Rosenblatt11 Jul 2024 23:53 UTC
70 points
3 comments1 min readLW link
(yoshuabengio.org)

Pod­cast: “How the Smart Money teaches trad­ing with Ricki He­ick­len” (Pa­trick McKen­zie in­ter­view­ing)

rossry11 Jul 2024 22:49 UTC
20 points
2 comments1 min readLW link
(www.complexsystemspodcast.com)

Su­perba­bies: Put­ting The Pie­ces Together

sarahconstantin11 Jul 2024 20:40 UTC
216 points
37 comments10 min readLW link
(sarahconstantin.substack.com)

Sher­lock­ian Ab­duc­tion Master List

Cole Wyeth11 Jul 2024 20:27 UTC
52 points
66 comments36 min readLW link

Thoughts to ni­plav on lie-de­tec­tion, truth­fwl mechanisms, and wealth-inequality

11 Jul 2024 18:55 UTC
7 points
8 comments11 min readLW link

Games for AI Control

11 Jul 2024 18:40 UTC
45 points
0 comments5 min readLW link

Video In­tro to Guaran­teed Safe AI

11 Jul 2024 17:53 UTC
27 points
0 comments1 min readLW link
(youtu.be)

Effec­tive Empathy

Thac011 Jul 2024 15:14 UTC
4 points
1 comment1 min readLW link

AI #72: Deny­ing the Future

Zvi11 Jul 2024 15:00 UTC
45 points
8 comments41 min readLW link
(thezvi.wordpress.com)

The Best Bits From Build, Baby, Build

Maxwell Tabarrok11 Jul 2024 14:09 UTC
23 points
0 comments4 min readLW link
(www.maximum-progress.com)

[Question] What Other Lines of Work are Safe from AI Au­toma­tion?

RogerDearnaley11 Jul 2024 10:01 UTC
35 points
35 comments5 min readLW link