Care Doesn’t Scale

stavrosOct 28, 2024, 11:57 AM
27 points
1 comment1 min readLW link
(stevenscrawls.com)

Your mem­ory even­tu­ally drives con­fi­dence in each hy­poth­e­sis to 1 or 0

Crazy philosopherOct 28, 2024, 9:00 AM
3 points
6 comments1 min readLW link

Nerdtri­tion: sim­ple diets via spread­sheet abuse

dkl9Oct 27, 2024, 9:45 PM
8 points
0 comments3 min readLW link
(dkl9.net)

AGI Fermi Paradox

jrincaycOct 27, 2024, 8:14 PM
0 points
2 comments2 min readLW link

Sub­sti­tut­ing Talk­box for Breath Controller

jefftkOct 27, 2024, 7:10 PM
11 points
0 comments1 min readLW link
(www.jefftk.com)

Open Source Repli­ca­tion of An­thropic’s Cross­coder pa­per for model-diffing

Oct 27, 2024, 6:46 PM
48 points
4 comments5 min readLW link

Hiring a writer to co-au­thor with me (Spencer Green­berg for Clear­erThink­ing.org)

spencergOct 27, 2024, 5:34 PM
16 points
0 commentsLW link

In­ter­view with Bill O’Rourke—Rus­sian Cor­rup­tion, Putin, Ap­plied Ethics, and More

JohnGreerOct 27, 2024, 5:11 PM
3 points
0 comments6 min readLW link

On Shifgrethor

JustisMillsOct 27, 2024, 3:30 PM
67 points
18 comments2 min readLW link
(justismills.substack.com)

The hos­tile telepaths problem

ValentineOct 27, 2024, 3:26 PM
383 points
89 comments15 min readLW link

[Question] What are some good ways to form opinions on con­tro­ver­sial sub­jects in the cur­rent and up­com­ing era?

Terence CoelhoOct 27, 2024, 2:33 PM
9 points
21 comments1 min readLW link

Video lec­tures on the learn­ing-the­o­retic agenda

Vanessa KosoyOct 27, 2024, 12:01 PM
75 points
0 comments1 min readLW link
(www.youtube.com)

Dario Amodei’s “Machines of Lov­ing Grace” sound in­cred­ibly dan­ger­ous, for Humans

Super AGIOct 27, 2024, 5:05 AM
8 points
1 comment1 min readLW link

Elec­tro­static Air­ships?

DaemonicSigilOct 27, 2024, 4:32 AM
64 points
13 comments3 min readLW link
(pbement.com)

A suite of Vi­sion Sparse Au­toen­coders

Oct 27, 2024, 4:05 AM
25 points
0 comments1 min readLW link

Ways to think about alignment

Abhimanyu Pallavi SudhirOct 27, 2024, 1:40 AM
6 points
0 comments4 min readLW link

[Question] Is there a CFAR hand­book au­dio op­tion?

FinalFormal2Oct 26, 2024, 5:08 PM
16 points
0 comments1 min readLW link

Retrieval Aug­mented Ge­n­e­sis II — Holy Texts Se­man­tics Analysis

João Ribeiro MedeirosOct 26, 2024, 5:00 PM
−1 points
0 comments11 min readLW link

A su­perfi­cially plau­si­ble promis­ing al­ter­nate Earth with­out lockstep

LorecOct 26, 2024, 4:04 PM
−2 points
3 comments4 min readLW link

Galatea and the windup toy

Nicolas VillarrealOct 26, 2024, 2:52 PM
−3 points
0 comments13 min readLW link
(nicolasdvillarreal.substack.com)

Why is there Noth­ing rather than Some­thing?

Logan ZoellnerOct 26, 2024, 12:37 PM
27 points
3 comments4 min readLW link

The Sum­moned Heroine’s Pre­dic­tion Mar­kets Keep Pro­vid­ing Fi­nan­cial Ser­vices To The De­mon King!

abstractapplicOct 26, 2024, 12:34 PM
164 points
16 comments7 min readLW link

AI Safety Camp 10

Oct 26, 2024, 11:08 AM
38 points
9 comments18 min readLW link

Arith­metic Models: Bet­ter Than You Think

kqrOct 26, 2024, 9:42 AM
28 points
4 comments11 min readLW link
(entropicthoughts.com)

The Case For Bullying

Alexej GerstmaierOct 26, 2024, 4:56 AM
−50 points
8 comments1 min readLW link
(lexposedtruth.com)

Is the Power Grid Sus­tain­able?

jefftkOct 26, 2024, 2:30 AM
36 points
38 comments2 min readLW link
(www.jefftk.com)

[Question] (i no longer en­dorse this post) - cry­on­ics is a pas­cal’s mug­ging?

KvmanThinkingOct 25, 2024, 11:24 PM
−12 points
4 comments1 min readLW link

A Case for Con­scious Sig­nifi­cance rather than Free Will.

James Stephen BrownOct 25, 2024, 11:20 PM
10 points
2 comments6 min readLW link

In­tro­duc­ing Kairos: a new AI safety field­build­ing or­ga­ni­za­tion (the new home for SPAR and FSP)

agucovaOct 25, 2024, 9:59 PM
14 points
0 commentsLW link

Brief anal­y­sis of OP Tech­ni­cal AI Safety Funding

22tomOct 25, 2024, 7:37 PM
76 points
5 comments1 min readLW link

UK AISI: Early les­sons from eval­u­at­ing fron­tier AI systems

Zach Stein-PerlmanOct 25, 2024, 7:00 PM
26 points
0 comments2 min readLW link
(www.aisi.gov.uk)

Lab gov­er­nance read­ing list

Zach Stein-PerlmanOct 25, 2024, 6:00 PM
20 points
3 comments1 min readLW link

En­abling New Ap­pli­ca­tions with To­day’s Mechanis­tic In­ter­pretabil­ity Toolkit

ananya_joshiOct 25, 2024, 5:53 PM
3 points
0 comments3 min readLW link

OpenAI’s cy­ber­se­cu­rity is prob­a­bly reg­u­lated by NIS Regulations

Adam JonesOct 25, 2024, 11:06 AM
11 points
2 comments2 min readLW link
(adamjones.me)

Linkpost: Me­moran­dum on Ad­vanc­ing the United States’ Lead­er­ship in Ar­tifi­cial Intelligence

NisanOct 25, 2024, 4:37 AM
60 points
2 comments1 min readLW link
(www.whitehouse.gov)

Mak­ing a Pedalboard

jefftkOct 25, 2024, 12:10 AM
10 points
0 comments1 min readLW link
(www.jefftk.com)

What You Can Give In­stead of Advice

Karl FaulksOct 24, 2024, 11:10 PM
13 points
2 comments1 min readLW link

[Question] is it pos­si­ble to com­ment anony­mously on a post?

KvmanThinkingOct 24, 2024, 10:24 PM
2 points
2 comments1 min readLW link

Log­i­cal Proof for the Emer­gence and Sub­strate In­de­pen­dence of Sentience

rifeOct 24, 2024, 9:08 PM
4 points
31 comments1 min readLW link
(awakenmoon.ai)

Against Job Boards: Hu­man Cap­i­tal and the Leg­i­bil­ity Trap

vaishnav92Oct 24, 2024, 8:50 PM
6 points
1 comment5 min readLW link

IAPS: Map­ping Tech­ni­cal Safety Re­search at AI Companies

Zach Stein-PerlmanOct 24, 2024, 8:30 PM
42 points
13 commentsLW link
(www.iaps.ai)

Our Digi­tal and Biolog­i­cal Children

EneaszOct 24, 2024, 6:36 PM
28 points
0 comments3 min readLW link
(deathisbad.substack.com)

Reflec­tions on the Me­tas­trate­gies Workshop

gw24 Oct 2024 18:30 UTC
41 points
5 comments11 min readLW link

How Should We Mea­sure In­tel­li­gence Models: Why Use Fre­quency of Ele­men­tal In­for­ma­tion Operations

hwj2024 Oct 2024 16:54 UTC
1 point
0 comments5 min readLW link

Meta AI (FAIR) lat­est pa­per in­te­grates sys­tem-1 and sys­tem-2 think­ing into rea­son­ing mod­els.

happy friday24 Oct 2024 16:54 UTC
8 points
0 comments1 min readLW link

Balanc­ing La­bel Quan­tity and Qual­ity for Scal­able Elicitation

Alex Mallen24 Oct 2024 16:49 UTC
31 points
1 comment2 min readLW link

Claude Son­net 3.5.1 and Haiku 3.5

Zvi24 Oct 2024 14:50 UTC
51 points
9 comments16 min readLW link
(thezvi.wordpress.com)

Big tech tran­si­tions are slow (with im­pli­ca­tions for AI)

jasoncrawford24 Oct 2024 14:25 UTC
36 points
16 comments4 min readLW link
(blog.rootsofprogress.org)

Deriva­tive AT a discontinuity

Alok Singh24 Oct 2024 2:48 UTC
9 points
5 comments10 min readLW link

how to rapidly as­similate new information

dhruvmethi24 Oct 2024 2:18 UTC
9 points
3 comments8 min readLW link