Failures in Kindness

silentbobMar 26, 2024, 9:30 PM
421 points
60 comments9 min readLW link

AE Stu­dio @ SXSW: We need more AI con­scious­ness re­search (and fur­ther re­sources)

Mar 26, 2024, 8:59 PM
67 points
8 comments3 min readLW link

Melt­down: In­ter­face for llama.cpp and ChatGPT

nextcallerMar 26, 2024, 6:29 PM
5 points
2 comments1 min readLW link

Timelines to Trans­for­ma­tive AI: an investigation

Zershaaneh QureshiMar 26, 2024, 6:28 PM
20 points
2 comments50 min readLW link

Sum­mer Pro­gram for High-School­ers to start work­ing on im­pact­ful projects

nonplusMar 26, 2024, 6:26 PM
−1 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Modern Trans­form­ers are AGI, and Hu­man-Level

abramdemskiMar 26, 2024, 5:46 PM
218 points
87 comments5 min readLW link

My In­ter­view With Cade Metz on His Re­port­ing About Slate Star Codex

Zack_M_DavisMar 26, 2024, 5:18 PM
189 points
187 comments6 min readLW link

Bare­foot FAQ

dkl9Mar 26, 2024, 4:29 PM
0 points
8 comments2 min readLW link
(dkl9.net)

[Question] What’s Your Best AI Safety “Quip”?

False NameMar 26, 2024, 3:35 PM
−2 points
0 comments1 min readLW link

[Question] What is the na­ture of hu­mans gen­eral in­tel­li­gence and it’s im­pli­ca­tions for AGI?

Will_PearsonMar 26, 2024, 3:20 PM
5 points
4 comments1 min readLW link

Eco­nomics Roundup #1

ZviMar 26, 2024, 2:00 PM
25 points
4 comments23 min readLW link
(thezvi.wordpress.com)

En­hanc­ing biose­cu­rity with lan­guage mod­els: defin­ing re­search directions

micMar 26, 2024, 12:30 PM
12 points
0 commentsLW link
(papers.ssrn.com)

Le­gal­ity as a Ca­reer Harm Assess­ment Heuristic

jefftkMar 26, 2024, 11:20 AM
9 points
7 comments2 min readLW link
(www.jefftk.com)

[Question] Orthog­o­nal­ity Th­e­sis seems wrong

Donatas LučiūnasMar 26, 2024, 7:33 AM
−7 points
26 comments1 min readLW link

Per­cep­tual Blindspots: How to In­crease Self-Awareness

Declan MolonyMar 26, 2024, 5:37 AM
14 points
3 comments2 min readLW link

Retro fun­der pro­file & Man­i­fund team recs (ACX Grants 2024: Im­pact Mar­ket)

Saul MunnMar 26, 2024, 3:29 AM
9 points
0 commentsLW link

LessOn­line (May 31—June 2, Berkeley, CA)

Ben PaceMar 26, 2024, 2:34 AM
101 points
24 comments1 min readLW link
(Less.Online)

Run­ning a Ba­sic ACX Every­where Meetup

ScrewtapeMar 26, 2024, 1:57 AM
8 points
0 comments3 min readLW link

Pod­cast in­ter­view se­ries fea­tur­ing Dr. Peter Park

jacobhaimesMar 26, 2024, 12:25 AM
3 points
0 comments2 min readLW link
(into-ai-safety.github.io)

Third-party test­ing as a key in­gre­di­ent of AI policy

Zac Hatfield-DoddsMar 25, 2024, 10:40 PM
11 points
1 comment12 min readLW link
(www.anthropic.com)

Idea: Safe Fal­lback Reg­u­la­tions for Widely De­ployed AI Systems

Aaron_ScherMar 25, 2024, 9:27 PM
8 points
0 comments6 min readLW link

An­nounc­ing Neu­ron­pe­dia: Plat­form for ac­cel­er­at­ing re­search into Sparse Autoencoders

Mar 25, 2024, 9:17 PM
93 points
7 comments7 min readLW link

Test­ing ChatGPT for cell type recognition

MetacelsusMar 25, 2024, 7:59 PM
7 points
2 comments3 min readLW link
(denovo.substack.com)

Should ra­tio­nal­ists be spiritual /​ Spiritu­al­ity as over­com­ing delusion

Mar 25, 2024, 4:48 PM
49 points
57 comments29 min readLW link

Photo Cu­ra­tion Approach

jefftkMar 25, 2024, 3:10 PM
9 points
3 comments2 min readLW link
(www.jefftk.com)

On attunement

Joe CarlsmithMar 25, 2024, 12:47 PM
100 points
12 comments22 min readLW link

On Lex Frid­man’s Se­cond Pod­cast with Altman

ZviMar 25, 2024, 12:20 PM
51 points
10 comments10 min readLW link
(thezvi.wordpress.com)

On the Con­fu­sion be­tween In­ner and Outer Misalignment

Chris_LeongMar 25, 2024, 11:59 AM
17 points
10 comments1 min readLW link

A Bit For You

Ronak_MehtaMar 24, 2024, 10:18 PM
0 points
0 comments2 min readLW link
(ronakrm.github.io)

All About Con­cave and Con­vex Agents

mako yassMar 24, 2024, 9:37 PM
64 points
24 comments8 min readLW link

Do not delete your mis­al­igned AGI.

mako yassMar 24, 2024, 9:37 PM
62 points
13 comments3 min readLW link

[Question] Define “Agent” (Embed­ded)

ApolloniaMar 24, 2024, 8:14 PM
10 points
1 comment1 min readLW link

[Question] Could LLMs Help Gen­er­ate New Con­cepts in Hu­man Lan­guage?

Pekka LampeltoMar 24, 2024, 8:13 PM
10 points
4 comments2 min readLW link

Wittgen­stein and the Pri­vate Lan­guage Argument

TMFOWMar 24, 2024, 8:06 PM
4 points
0 comments14 min readLW link
(tmfow.substack.com)

Self-Play By Analogy

Amica TerraMar 24, 2024, 8:06 PM
−2 points
2 comments7 min readLW link

Can quan­tised au­toen­coders find and in­ter­pret cir­cuits in lan­guage mod­els?

charlieoneillMar 24, 2024, 8:05 PM
30 points
4 comments24 min readLW link

Man­dolin Harp Sen­sor Placement

jefftkMar 24, 2024, 6:40 PM
11 points
0 comments1 min readLW link
(www.jefftk.com)

AI Align­ment and the Clas­si­cal Hu­man­ist Tradition

PeteJMar 24, 2024, 1:37 PM
−1 points
4 comments2 min readLW link

UNGA Re­s­olu­tion on AI: 5 Key Take­aways Look­ing to Fu­ture Policy

HerambMar 24, 2024, 12:23 PM
3 points
0 comments3 min readLW link
(forum.effectivealtruism.org)

[Question] Are (Mo­tor)sports like F1 a good thing to cal­ibrate es­ti­mates against?

CstineSublimeMar 24, 2024, 9:07 AM
4 points
2 comments1 min readLW link

Nu­clear Quan­tum Im­mor­tal­ity Hack­ing

NezekMar 23, 2024, 10:08 PM
−3 points
2 comments2 min readLW link

As Many Ideas

ScrewtapeMar 23, 2024, 6:55 PM
7 points
0 comments1 min readLW link

My De­tailed Notes & Com­men­tary from Sec­u­lar Solstice

Jeffrey HeningerMar 23, 2024, 6:48 PM
35 points
16 comments13 min readLW link

Gen­eral Thoughts on Sec­u­lar Solstice

Jeffrey HeningerMar 23, 2024, 6:48 PM
101 points
60 comments8 min readLW link

How to make food/​wa­ter test­ing cheaper/​more scal­able? [eg for pu­rity/​toxin test­ing]

Alex K. Chen (parrot)Mar 23, 2024, 5:28 AM
9 points
2 comments1 min readLW link

Pro­to­typ­ing Pluck Sensors

jefftkMar 23, 2024, 1:30 AM
9 points
0 comments2 min readLW link
(www.jefftk.com)

Dangers of Closed-Loop AI

Gordon Seidoh WorleyMar 22, 2024, 11:52 PM
35 points
9 comments2 min readLW link

Why The In­sects Scream

Bentham's BulldogMar 22, 2024, 7:47 PM
4 points
11 comments9 min readLW link

What does “au­to­di­dact” mean?

bhauthMar 22, 2024, 6:37 PM
22 points
19 comments1 min readLW link

[Linkpost] Vague Ver­biage in Forecasting

trevorMar 22, 2024, 6:05 PM
11 points
9 comments3 min readLW link
(goodjudgment.com)