Should CA, TX, OK, and LA merge into a gi­ant swing state, just for elec­tions?

Thomas KwaNov 6, 2024, 11:01 PM
115 points
35 comments1 min readLW link

New Fund­ing Cat­e­gory Open in Fore­sight’s AI Safety Grants

Allison DuettmannNov 6, 2024, 10:59 PM
15 points
0 comments1 min readLW link

Scat­tered thoughts on what it means for an LLM to believe

TheManxLoinerNov 6, 2024, 10:10 PM
5 points
4 comments5 min readLW link

The Bayesian Con­spir­acy Live Recording

EneaszNov 6, 2024, 4:25 PM
9 points
0 comments1 min readLW link

An­thropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-PerlmanNov 6, 2024, 4:00 PM
95 points
33 comments1 min readLW link
(alignment.anthropic.com)

Meme Talk­ing Points

ymeskhoutNov 6, 2024, 3:27 PM
34 points
0 comments3 min readLW link

Ad­vi­sors for Smaller Ma­jor Donors?

jefftkNov 6, 2024, 2:30 PM
18 points
2 comments3 min readLW link
(www.jefftk.com)

Scis­sors State­ments for Pres­i­dent?

AnnaSalamonNov 6, 2024, 10:38 AM
118 points
32 comments1 min readLW link

[Question] How to cite LessWrong as an aca­demic source?

PhilosophicalSoulNov 6, 2024, 8:28 AM
6 points
6 comments1 min readLW link

How to put Cal­ifor­nia and Texas on the cam­paign trail!

Yair HalberstadtNov 6, 2024, 6:08 AM
25 points
4 comments1 min readLW link

LDT (and ev­ery­thing else) can be irrational

Christopher KingNov 6, 2024, 4:05 AM
10 points
15 comments2 min readLW link

Join my new sub­scriber chat

sarahconstantinNov 6, 2024, 2:30 AM
7 points
0 comments1 min readLW link
(sarahconstantin.substack.com)

Grace­ful Degradation

ScrewtapeNov 5, 2024, 11:57 PM
83 points
8 comments4 min readLW link

An al­ter­na­tive ap­proach to superbabies

Towards_KeeperhoodNov 5, 2024, 10:56 PM
48 points
19 comments3 min readLW link

Ap­ply to be a men­tor in SPAR!

agucovaNov 5, 2024, 9:32 PM
5 points
0 commentsLW link

Go­ing Beyond “im­ma­tu­rity”

moisentinelNov 5, 2024, 8:51 PM
−3 points
2 comments2 min readLW link

In­tent al­ign­ment as a step­ping-stone to value alignment

Seth HerdNov 5, 2024, 8:43 PM
37 points
8 comments3 min readLW link

Why Re­cur­sion Phar­ma­ceu­ti­cals aban­doned cell paint­ing for bright­field imaging

Abhishaike MahajanNov 5, 2024, 2:51 PM
29 points
1 comment18 min readLW link
(www.owlposting.com)

Win­ning isn’t enough

Nov 5, 2024, 11:37 AM
38 points
18 comments9 min readLW link

An­thropic—The case for tar­geted regulation

anagumaNov 5, 2024, 7:07 AM
11 points
0 comments2 min readLW link
(www.anthropic.com)

The Shal­low Bench

Karl FaulksNov 5, 2024, 5:07 AM
48 points
5 comments3 min readLW link

Us­ing Nar­ra­tive Prompt­ing to Ex­tract Policy Fore­casts from LLMs

Max GhenisNov 5, 2024, 4:37 AM
5 points
0 comments1 min readLW link

ML4Good (AI Safety Boot­camp) - Ex­pe­rience report

JanEbbingNov 5, 2024, 1:18 AM
13 points
0 comments3 min readLW link

Catas­trophic Cy­ber Ca­pa­bil­ities Bench­mark (3CB): Ro­bustly Eval­u­at­ing LLM Agent Cy­ber Offense Capabilities

Nov 5, 2024, 1:01 AM
8 points
0 comments6 min readLW link
(www.apartresearch.com)

[Question] Could or­cas be (trained to be) smarter than hu­mans? 

Towards_KeeperhoodNov 4, 2024, 11:29 PM
56 points
23 comments1 min readLW link

Me­tastatic Cancer Treat­ment Since 2010: The Suc­cess Stories

sarahconstantinNov 4, 2024, 10:50 PM
51 points
2 comments6 min readLW link
(sarahconstantin.substack.com)

Bay Win­ter Sols­tice 2024: Speech Auditions

ozymandiasNov 4, 2024, 10:31 PM
32 points
1 comment1 min readLW link

Em­pa­thy/​Sys­tem­iz­ing Quo­tient is a poor/​bi­ased model for the autism/​sex link

tailcalledNov 4, 2024, 9:11 PM
43 points
0 comments7 min readLW link

Distributed espionage

margetmagentaNov 4, 2024, 7:43 PM
3 points
0 comments1 min readLW link

GPT-8 may not be ASI

rvzlxax409Nov 4, 2024, 7:31 PM
−2 points
1 comment3 min readLW link

AI timelines don’t ac­count for base rate of tech progress

rvzlxax409Nov 4, 2024, 7:31 PM
−10 points
2 comments1 min readLW link

Up­date on the Mys­te­ri­ous Trump Buy­ers on Polymarket

AnnapurnaNov 4, 2024, 7:22 PM
19 points
9 comments1 min readLW link
(jorgevelez.substack.com)

[In­tu­itive self-mod­els] 8. Root­ing Out Free Will Intuitions

Steven ByrnesNov 4, 2024, 6:16 PM
70 points
19 comments24 min readLW link

Op­tion control

Joe CarlsmithNov 4, 2024, 5:54 PM
28 points
0 comments54 min readLW link

[Question] Notic­ing the World

EvolutionByDesignNov 4, 2024, 4:41 PM
4 points
1 comment1 min readLW link

The cur­rent state of RSPs

Zach Stein-PerlmanNov 4, 2024, 4:00 PM
23 points
2 comments9 min readLW link

[Question] Does the “an­cient wis­dom” ar­gu­ment have any val­idity? If a par­tic­u­lar teach­ing or tra­di­tion is old, to what ex­tent does this make it more trust­wor­thy?

SpectrumDTNov 4, 2024, 3:20 PM
18 points
49 comments1 min readLW link

A brief his­tory of the au­to­mated corporation

owencbNov 4, 2024, 2:35 PM
26 points
1 comment5 min readLW link
(strangecities.substack.com)

Ab­strac­tions are not Natural

Alfred HarwoodNov 4, 2024, 11:10 AM
25 points
21 comments11 min readLW link

[Linkpost] Build­ing Altru­is­tic and Mo­ral AI Agent with Brain-in­spired Affec­tive Em­pa­thy Mechanisms

Gunnar_ZarnckeNov 4, 2024, 10:15 AM
13 points
0 comments1 min readLW link
(arxiv.org)

Con­text-de­pen­dent consequentialism

Nov 4, 2024, 9:29 AM
31 points
6 comments27 min readLW link

Sur­vival with­out dignity

L Rudolf LNov 4, 2024, 2:29 AM
366 points
29 comments15 min readLW link
(nosetgauge.substack.com)

Drug de­vel­op­ment costs can range over two or­ders of magnitude

rossryNov 3, 2024, 11:13 PM
38 points
0 comments11 min readLW link

Redefin­ing Tol­er­ance: Beyond Pop­per’s Paradox

mindprison3 Nov 2024 22:23 UTC
−1 points
0 comments3 min readLW link

Goal: Un­der­stand Intelligence

Johannes C. Mayer3 Nov 2024 21:20 UTC
14 points
19 comments1 min readLW link

Cur­rent safety train­ing tech­niques do not fully trans­fer to the agent setting

3 Nov 2024 19:24 UTC
158 points
9 comments5 min readLW link

Why our poli­ti­ci­ans aren’t Median

Yair Halberstadt3 Nov 2024 14:03 UTC
62 points
15 comments3 min readLW link

Hu­man Bio­di­ver­sity (Part 4: As­tral Codex Ten)

Evan_Gaensbauer3 Nov 2024 4:20 UTC
−13 points
6 commentsLW link
(reflectivealtruism.com)

Un­der­stand­ing in­com­pa­ra­bil­ity ver­sus in­com­men­su­ra­bil­ity in re­la­tion to RLHF

artemiocobb2 Nov 2024 22:57 UTC
1 point
1 comment2 min readLW link

elec­tric turbofans

bhauth2 Nov 2024 22:50 UTC
63 points
2 comments5 min readLW link
(bhauth.com)