Min­i­mal Mo­ti­va­tion of Nat­u­ral Latents

14 Oct 2024 22:51 UTC
47 points
14 comments3 min readLW link

How long should poli­ti­cal (and other) terms be?

ohmurphy14 Oct 2024 21:38 UTC
5 points
0 comments1 min readLW link
(ohmurphy.substack.com)

Ex­am­ples of How I Use LLMs

jefftk14 Oct 2024 17:10 UTC
31 points
2 comments2 min readLW link
(www.jefftk.com)

It’s im­por­tant to know when to stop: Mechanis­tic Ex­plo­ra­tion of Gemma 2 List Generation

Gerard Boxo14 Oct 2024 17:04 UTC
9 points
0 comments6 min readLW link
(gboxo.github.io)

[Question] LW re­sources on child­hood ex­pe­riences?

nahir9159514 Oct 2024 17:04 UTC
10 points
7 comments1 min readLW link

Free Will, Neu­rotyp­i­cal Dom­i­nance, and the Path to ASI and Neu­ral­inks: Evolv­ing Beyond Scarcity

j_passeri14 Oct 2024 16:54 UTC
−1 points
3 comments3 min readLW link

Break­throughs, Neu­ro­di­ver­gence, and Work­ing Out­side the System

j_passeri14 Oct 2024 16:54 UTC
2 points
3 comments2 min readLW link

The case for un­learn­ing that re­moves in­for­ma­tion from LLM weights

Fabien Roger14 Oct 2024 14:08 UTC
102 points
18 comments6 min readLW link

Cir­cuits in Su­per­po­si­tion: Com­press­ing many small neu­ral net­works into one

14 Oct 2024 13:06 UTC
131 points
9 comments13 min readLW link

Beyond Defen­sive Technology

ejk6414 Oct 2024 11:34 UTC
11 points
1 comment10 min readLW link

Why Stop AI is bar­ri­cad­ing OpenAI

Remmelt14 Oct 2024 7:12 UTC
−16 points
32 comments6 min readLW link
(docs.google.com)

The Ex­plore vs. Ex­ploit Dilemma

nathanjzhao14 Oct 2024 6:20 UTC
1 point
0 comments1 min readLW link
(nathanzhao.cc)

AI Align­ment via Slow Sub­strates: Early Em­piri­cal Re­sults With StarCraft II

Lester Leong14 Oct 2024 4:05 UTC
60 points
9 comments12 min readLW link

some ques­tion­able space launch guns

bhauth13 Oct 2024 22:52 UTC
17 points
0 comments4 min readLW link
(bhauth.com)

[Question] What are your fa­vorite books or blogs that are out of print, or whose do­mains have ex­pired (es­pe­cially if they also aren’t on LibGen/​Way­back/​etc, or on Ama­zon)?

Arjun Panickssery13 Oct 2024 20:21 UTC
13 points
4 comments1 min readLW link

The Hopium Wars: the AGI En­tente Delusion

Max Tegmark13 Oct 2024 17:00 UTC
236 points
60 comments9 min readLW link

Parental Writ­ing Selec­tion Bias

jefftk13 Oct 2024 14:00 UTC
52 points
3 comments1 min readLW link
(www.jefftk.com)

Per­sonal Philosophy

Xor13 Oct 2024 3:01 UTC
3 points
0 comments2 min readLW link

Con­ta­gious Beliefs—Si­mu­lat­ing Poli­ti­cal Alignment

James Stephen Brown13 Oct 2024 0:27 UTC
8 points
0 comments2 min readLW link
(nonzerosum.games)

Bi­nary en­cod­ing as a sim­ple ex­plicit con­struc­tion for superposition

tailcalled12 Oct 2024 21:18 UTC
12 points
0 comments1 min readLW link

[Question] How Should We Use Limited Time to Max­i­mize Long-Term Im­pact?

queelius12 Oct 2024 20:02 UTC
10 points
3 comments1 min readLW link

A Per­centage Model of a Person

Sable12 Oct 2024 17:55 UTC
41 points
5 comments9 min readLW link
(affablyevil.substack.com)

AI Com­pute gov­er­nance: Ver­ify­ing AI chip location

Farhan12 Oct 2024 17:36 UTC
6 points
0 comments6 min readLW link

Ge­offrey Hin­ton on the Past, Pre­sent, and Fu­ture of AI

Stephen McAleese12 Oct 2024 16:41 UTC
23 points
5 comments18 min readLW link

[Question] I = W/​T?

HNX12 Oct 2024 15:15 UTC
0 points
3 comments1 min readLW link

AI re­search as­sis­tants com­pe­ti­tion 2024Q3: Tie be­tween Elicit and You.com

Elizabeth12 Oct 2024 15:10 UTC
64 points
4 comments3 min readLW link
(acesounderglass.com)

SAE fea­tures for re­fusal and syco­phancy steer­ing vectors

12 Oct 2024 14:54 UTC
29 points
4 comments7 min readLW link

Prices are Bounties

Maxwell Tabarrok12 Oct 2024 14:51 UTC
51 points
13 comments2 min readLW link
(www.maximum-progress.com)

Differ­en­tial knowl­edge interconnection

Roman Leventov12 Oct 2024 12:52 UTC
6 points
0 comments7 min readLW link

Most ar­gu­ments for AI Doom are ei­ther bad or weak

Logan Zoellner12 Oct 2024 11:57 UTC
4 points
100 comments3 min readLW link

Kas­sel ACX/​LW Meetup

Fernand012 Oct 2024 7:47 UTC
2 points
0 comments1 min readLW link

Neu­ral Net­work And New­ton’s Se­cond Law

Max Ma12 Oct 2024 6:25 UTC
−10 points
0 comments1 min readLW link

[Question] If the DoJ goes through with the Google breakup,where does Deep­mind end up?

O O12 Oct 2024 5:06 UTC
5 points
1 comment1 min readLW link

My mo­ti­va­tion and the­ory of change for work­ing in AI healthtech

Andrew_Critch12 Oct 2024 0:36 UTC
180 points
39 comments14 min readLW link

HDBSCAN is Sur­pris­ingly Effec­tive at Find­ing In­ter­pretable Clusters of the SAE De­coder Matrix

11 Oct 2024 23:06 UTC
8 points
2 comments10 min readLW link

Chang­ing the Mind of an LLM

testingthewaters11 Oct 2024 22:25 UTC
2 points
0 comments5 min readLW link

EIS XIV: Is mechanis­tic in­ter­pretabil­ity about to be prac­ti­cally use­ful?

scasper11 Oct 2024 22:13 UTC
68 points
4 comments7 min readLW link

Dario Amodei — Machines of Lov­ing Grace

Matrice Jacobine11 Oct 2024 21:43 UTC
63 points
26 comments1 min readLW link
(darioamodei.com)

“Deep Galac­tic Chillout” a space to re­lax dur­ing SF tech week & meet whole­some, fun people

Jared M.11 Oct 2024 19:50 UTC
1 point
0 comments1 min readLW link

Open let­ter to young EAs

Leif Wenar11 Oct 2024 19:49 UTC
9 points
10 comments1 min readLW link

The Great Bootstrap

KristianRonn11 Oct 2024 19:46 UTC
12 points
0 comments15 min readLW link

Em­brac­ing com­plex­ity when de­vel­op­ing and eval­u­at­ing AI re­spon­si­bly

Aliya Amirova11 Oct 2024 17:46 UTC
3 points
9 comments9 min readLW link

How much I’m pay­ing for AI pro­duc­tivity soft­ware (and the fu­ture of AI use)

jacquesthibs11 Oct 2024 17:11 UTC
59 points
18 comments8 min readLW link
(jacquesthibodeau.com)

AI: The Philoso­pher’s Stone of the 21st Century

HNX11 Oct 2024 16:55 UTC
0 points
2 comments29 min readLW link

[Question] Who cre­ated the Less Wrong Gather Town?

Arepo11 Oct 2024 8:53 UTC
2 points
1 comment1 min readLW link

A Heuris­tic Proof of Prac­ti­cal Aligned Superintelligence

Roko11 Oct 2024 5:05 UTC
7 points
6 comments1 min readLW link
(transhumanaxiology.substack.com)

An AI crash is our best bet for re­strict­ing AI

Remmelt11 Oct 2024 2:12 UTC
26 points
3 comments1 min readLW link

A Triple Decker for Elfland

jefftk11 Oct 2024 1:50 UTC
25 points
0 comments1 min readLW link
(www.jefftk.com)

OODA your OODA Loop

Raemon11 Oct 2024 0:50 UTC
38 points
3 comments3 min readLW link

Scal­ing pre­dic­tion mar­kets with meta-markets

Dentosal10 Oct 2024 21:17 UTC
1 point
0 comments2 min readLW link