Places of Lov­ing Grace [Story]

ankFeb 18, 2025, 11:49 PM
−1 points
0 comments4 min readLW link

Are SAE fea­tures from the Base Model still mean­ingful to LLaVA?

Shan23ChenFeb 18, 2025, 10:16 PM
8 points
2 comments10 min readLW link
(www.lesswrong.com)

Sparse Au­toen­coder Fea­tures for Clas­sifi­ca­tions and Transferability

Shan23ChenFeb 18, 2025, 10:14 PM
5 points
0 comments1 min readLW link
(arxiv.org)

A fable on AI x-risk

bgaesopFeb 18, 2025, 8:15 PM
8 points
4 comments1 min readLW link

The Un­earned Priv­ilege We Rarely Dis­cuss: Cog­ni­tive Capability

DiegoRojasFeb 18, 2025, 8:06 PM
−21 points
7 comments3 min readLW link

Call for Ap­pli­ca­tions: XLab Sum­mer Re­search Fel­low­ship

JoNeedsSleepFeb 18, 2025, 7:19 PM
9 points
0 comments1 min readLW link

AISN #48: Utility Eng­ineer­ing and EnigmaEval

Feb 18, 2025, 7:15 PM
4 points
0 comments4 min readLW link
(newsletter.safe.ai)

Ab­stract Math­e­mat­i­cal Con­cepts vs. Ab­strac­tions Over Real-World Systems

Thane RuthenisFeb 18, 2025, 6:04 PM
32 points
10 comments4 min readLW link

How ac­cu­rate was my “Altered Traits” book re­view?

lsusrFeb 18, 2025, 5:00 PM
41 points
3 comments3 min readLW link

Med­i­cal Roundup #4

ZviFeb 18, 2025, 1:40 PM
24 points
3 comments10 min readLW link
(thezvi.wordpress.com)

Dear AGI,

Nathan YoungFeb 18, 2025, 10:48 AM
88 points
11 comments3 min readLW link

There are a lot of up­com­ing re­treats/​con­fer­ences be­tween March and July (2025)

Feb 18, 2025, 9:30 AM
6 points
0 comments1 min readLW link

Sea Change

Charlie SandersFeb 18, 2025, 6:03 AM
−2 points
2 comments5 min readLW link
(www.dailymicrofiction.com)

Born on Third Base: The Case for In­her­it­ing Noth­ing and Build­ing Every­thing

charlieoneillFeb 18, 2025, 12:47 AM
−24 points
16 comments2 min readLW link

Do mod­els know when they are be­ing eval­u­ated?

Feb 17, 2025, 11:13 PM
59 points
6 comments12 min readLW link

AGI Safety & Align­ment @ Google Deep­Mind is hiring

Rohin ShahFeb 17, 2025, 9:11 PM
102 points
19 comments10 min readLW link

The Peeperi (un­finished) - By Katja Grace

Nathan YoungFeb 17, 2025, 7:33 PM
22 points
0 comments3 min readLW link
(docs.google.com)

Progress links and short notes, 2025-02-17

jasoncrawfordFeb 17, 2025, 7:18 PM
8 points
0 comments7 min readLW link
(newsletter.rootsofprogress.org)

Claude 3.5 Son­net (New)’s AGI scenario

Nathan YoungFeb 17, 2025, 6:47 PM
5 points
2 comments5 min readLW link

Talk­ing to lay­men about AI de­vel­op­ment

David SteelFeb 17, 2025, 6:42 PM
8 points
0 comments1 min readLW link

On the Re­birth of Aris­toc­racy in the Amer­i­can Regime

shawkisukkarFeb 17, 2025, 4:18 PM
−16 points
3 comments9 min readLW link
(shawkisukkar.substack.com)

Ascetic hedonism

dkl9Feb 17, 2025, 3:56 PM
15 points
9 comments2 min readLW link
(dkl9.net)

AIS Ber­lin, events, op­por­tu­ni­ties and the flipped game­board—Field­builders Newslet­ter, Fe­bru­ary 2025

Feb 17, 2025, 2:16 PM
6 points
0 comments3 min readLW link

Monthly Roundup #27: Fe­bru­ary 2025

ZviFeb 17, 2025, 2:10 PM
27 points
3 comments44 min readLW link
(thezvi.wordpress.com)

What new x- or s-risk field­build­ing or­gani­sa­tions would you like to see? An EOI form. (FBB #3)

gergogasparFeb 17, 2025, 12:39 PM
6 points
0 comments2 min readLW link

A His­tory of the Fu­ture, 2025-2040

L Rudolf LFeb 17, 2025, 12:03 PM
235 points
42 comments75 min readLW link
(nosetgauge.substack.com)

Ther­mo­dy­namic en­tropy = Kol­mogorov complexity

Aram EbtekarFeb 17, 2025, 5:56 AM
70 points
12 comments1 min readLW link
(doi.org)

THE ARCHIVE

Jason ReidFeb 17, 2025, 1:12 AM
7 points
0 comments6 min readLW link

[Question] What are the sur­viv­ing wor­lds like?

KvmanThinkingFeb 17, 2025, 12:41 AM
21 points
2 comments1 min readLW link

arch-an­ar­chist read­ing list

Peter lawless Feb 16, 2025, 10:47 PM
2 points
1 comment1 min readLW link

Cy­berE­con­omy. The Limits to Growth

Feb 16, 2025, 9:02 PM
−3 points
0 comments23 min readLW link

Co­op­er­a­tion for AI safety must tran­scend geopoli­ti­cal interference

Matrice JacobineFeb 16, 2025, 6:18 PM
7 points
6 commentsLW link
(www.scmp.com)

[Question] Pro­gram­ming Lan­guage Early Fund­ing?

J Thomas MorosFeb 16, 2025, 5:34 PM
2 points
6 comments3 min readLW link

[Closed] Gaug­ing In­ter­est for a Learn­ing-The­o­retic Agenda Men­tor­ship Programme

Vanessa KosoyFeb 16, 2025, 4:24 PM
54 points
5 comments2 min readLW link

Celtic Knots on Ein­stein Lattice

BenFeb 16, 2025, 3:56 PM
47 points
11 comments2 min readLW link

It’s been ten years. I pro­pose HPMOR An­niver­sary Par­ties.

ScrewtapeFeb 16, 2025, 1:43 AM
153 points
3 comments1 min readLW link

Come join Dove­tail’s agent foun­da­tions fel­low­ship talks & discussion

Alex_AltairFeb 15, 2025, 10:10 PM
24 points
0 comments1 min readLW link

Quan­tify­ing the Qual­i­ta­tive: Towards a Bayesian Ap­proach to Per­sonal Insight

Pruthvi KumarFeb 15, 2025, 7:50 PM
1 point
0 comments6 min readLW link

Knit­ting a Sweater in a Burn­ing House

CrimsonChinFeb 15, 2025, 7:50 PM
27 points
2 comments2 min readLW link

Microplas­tics: Much Less Than You Wanted To Know

Feb 15, 2025, 7:08 PM
82 points
8 comments13 min readLW link

Prefer­ence for un­cer­tainty and im­pact over­es­ti­ma­tion bias in al­tru­is­tic sys­tems.

LuckFeb 15, 2025, 12:27 PM
1 point
0 comments1 min readLW link

Ar­tifi­cial Static Place In­tel­li­gence: Guaran­teed Alignment

ankFeb 15, 2025, 11:08 AM
2 points
2 comments2 min readLW link

The cur­rent AI strate­gic land­scape: one bear’s perspective

Matrice JacobineFeb 15, 2025, 9:49 AM
11 points
0 commentsLW link
(philosophybear.substack.com)

6 (Po­ten­tial) Mis­con­cep­tions about AI Intellectuals

ozziegooenFeb 14, 2025, 11:51 PM
18 points
11 commentsLW link

[Question] Should Open Philan­thropy Make an Offer to Buy OpenAI?

mrtreasureFeb 14, 2025, 11:18 PM
25 points
1 comment1 min readLW link

A com­pu­ta­tional no-co­in­ci­dence principle

Eric NeymanFeb 14, 2025, 9:39 PM
148 points
38 comments6 min readLW link
(www.alignment.org)

Hope­ful hy­poth­e­sis, the Per­sona Juke­box.

Donald HobsonFeb 14, 2025, 7:24 PM
11 points
4 comments3 min readLW link

In­tro­duc­tion to Ex­pected Value Fanaticism

Petra KosonenFeb 14, 2025, 7:05 PM
9 points
8 comments1 min readLW link
(utilitarianism.net)

In­trin­sic Di­men­sion of Prompts in LLMs

Karthik ViswanathanFeb 14, 2025, 7:02 PM
3 points
0 comments4 min readLW link

Ob­jec­tive Real­ism: A Per­spec­tive Beyond Hu­man Constructs

ApatheosFeb 14, 2025, 7:02 PM
−12 points
1 comment2 min readLW link