Rationalist Storytelling (French)

Camille Berger 19 Feb 2024 22:25 UTC
3 points
0 comments · 1 min read · LW link

Abs-E (or, speak only in the positive)

dkl9 19 Feb 2024 21:14 UTC
22 points
20 comments · 2 min read · LW link
(dkl9.net)

Retirement Accounts and Short Timelines

jefftk 19 Feb 2024 18:50 UTC
83 points
35 comments · 2 min read · LW link
(www.jefftk.com)

Relational Thinking in Animals and Humans

Bruce W. Lee 19 Feb 2024 18:34 UTC
4 points
0 comments · 4 min read · LW link
(psycnet.apa.org)

How Technical AI Safety Researchers Can Help Implement Punitive Damages to Mitigate Catastrophic AI Risk

Gabriel Weil 19 Feb 2024 18:00 UTC
18 points
0 comments · 4 min read · LW link

Protocol evaluations: good analogies vs control

Fabien Roger 19 Feb 2024 18:00 UTC
35 points
10 comments · 11 min read · LW link

When Should Copyright Get Shorter?

Maxwell Tabarrok 19 Feb 2024 16:03 UTC
11 points
14 comments · 4 min read · LW link
(www.maximum-progress.com)

Auto-matching hidden layers in Pytorch LLMs

chanind 19 Feb 2024 12:40 UTC
2 points
0 comments · 3 min read · LW link

I’d also take $7 trillion

bhauth 19 Feb 2024 3:31 UTC
45 points
12 comments · 10 min read · LW link
(www.bhauth.com)

On coincidences and Bayesian reasoning, as applied to the origins of COVID-19

viking_math 19 Feb 2024 1:14 UTC
62 points
28 comments · 14 min read · LW link

Solution to the two envelopes problem for moral weights

MichaelStJules 19 Feb 2024 0:15 UTC
9 points
1 comment · 1 min read · LW link

Conspiracy Investigation Done Right

ymeskhout 19 Feb 2024 0:09 UTC
21 points
0 comments · 6 min read · LW link

Scientific Method

Andrij “Androniq” Ghorbunov 18 Feb 2024 21:06 UTC
20 points
4 comments · 30 min read · LW link

[Question] Weighing reputational and moral consequences of leaving Russia or staying

spza 18 Feb 2024 19:36 UTC
29 points
24 comments · 1 min read · LW link

Things I’ve Grieved

Raemon 18 Feb 2024 19:32 UTC
122 points
6 comments · 2 min read · LW link

Senses of “knowing” a person

dkl9 18 Feb 2024 19:13 UTC
3 points
0 comments · 1 min read · LW link
(dkl9.net)

The Jolly Green Giant Chronicles [ChatGPT]

Bill Benzon 18 Feb 2024 17:28 UTC
4 points
0 comments · 8 min read · LW link

Intuition for 1 + 2 + 3 + … = −1/12

Shankar Sivarajan 18 Feb 2024 16:46 UTC
13 points
28 comments · 3 min read · LW link

No Clickbait—Misalignment Database

Kabir Kumar 18 Feb 2024 5:35 UTC
5 points
10 comments · 1 min read · LW link

Idea: NV⁻ Centers for Brain Interpretability

James Camacho 18 Feb 2024 5:28 UTC
10 points
1 comment · 3 min read · LW link

Celiacs don’t need to live in fear

futurehumdrum 18 Feb 2024 2:30 UTC
16 points
4 comments · 4 min read · LW link

“What if we could redesign society from scratch? The promise of charter cities.” [Rational Animations video]

Jackson Wagner 18 Feb 2024 0:57 UTC
39 points
7 comments · 1 min read · LW link
(www.youtube.com)

Social media use probably induces excessive mediocrity

trevor 17 Feb 2024 22:49 UTC
7 points
11 comments · 12 min read · LW link

Evaluating Solar

jefftk 17 Feb 2024 21:50 UTC
26 points
5 comments · 2 min read · LW link
(www.jefftk.com)

Opinions survey 2 (with rationalism score at the end)

tailcalled 17 Feb 2024 12:03 UTC
2 points
11 comments · 1 min read · LW link
(docs.google.com)

Achieving AI Alignment through Deliberate Uncertainty in Multiagent Systems

Florian_Dietz 17 Feb 2024 8:45 UTC
3 points
0 comments · 13 min read · LW link

Communication, consciousness, and belief strength measures

Jakub Smékal 17 Feb 2024 5:45 UTC
1 point
0 comments · 3 min read · LW link

San Fernando Valley Rationality: February 22, 2024

Thomas Broadley 17 Feb 2024 1:58 UTC
3 points
0 comments · 1 min read · LW link

Self-Awareness: Taxonomy and eval suite proposal

Daniel Kokotajlo 17 Feb 2024 1:47 UTC
61 points
0 comments · 11 min read · LW link

Opinions survey (with rationalism score at the end)

tailcalled 17 Feb 2024 0:41 UTC
8 points
14 comments · 1 min read · LW link
(docs.google.com)

Phallocentricity in GPT-J’s bizarre stratified ontology

mwatkins 17 Feb 2024 0:16 UTC
55 points
37 comments · 9 min read · LW link

FUTARCHY NOW BABY

sapphire 17 Feb 2024 0:03 UTC
−8 points
7 comments · 1 min read · LW link

Making the “stance” explicit

NicholasKees 16 Feb 2024 23:57 UTC
23 points
3 comments · 2 min read · LW link

2023 Survey Results

Screwtape 16 Feb 2024 22:24 UTC
150 points
26 comments · 44 min read · LW link

Physics-based early warning signal shows that AMOC is on tipping course

Annapurna 16 Feb 2024 22:07 UTC
19 points
3 comments · 1 min read · LW link
(www.science.org)

Kingfisher Winter Tour 2024

jefftk 16 Feb 2024 21:40 UTC
8 points
0 comments · 1 min read · LW link
(www.jefftk.com)

The Pointer Resolution Problem

Jozdien 16 Feb 2024 21:25 UTC
41 points
20 comments · 3 min read · LW link

Every “Every Bay Area House Party” Bay Area House Party

Richard_Ngo 16 Feb 2024 18:53 UTC
174 points
6 comments · 4 min read · LW link

“No-one in my org puts money in their pension”

Tobes 16 Feb 2024 18:33 UTC
248 points
16 comments · 9 min read · LW link
(seekingtobejolly.substack.com)

Addressing Feature Suppression in SAEs

16 Feb 2024 18:32 UTC
81 points
3 comments · 10 min read · LW link

Retrospective: PIBBSS Fellowship 2023

16 Feb 2024 17:48 UTC
31 points
1 comment · 8 min read · LW link

Fatebook for Chrome: Make and embed forecasts anywhere on the web

16 Feb 2024 16:08 UTC
14 points
3 comments · 1 min read · LW link

“Arctic Instincts? The universal principles of Arctic psychological adaptation and the origins of East Asian psychology”—Call for Reviewers (Seeds of Science)

rogersbacon 16 Feb 2024 15:02 UTC
0 points
0 comments · 2 min read · LW link

The Altman Technocracy

PhilosophicalSoul 16 Feb 2024 13:27 UTC
5 points
31 comments · 2 min read · LW link

Discord space for people with FTX clawbacks/claims request

kotrfa 16 Feb 2024 9:04 UTC
1 point
0 comments · 1 min read · LW link
(forum.effectivealtruism.org)

OpenAI’s Sora is an agent

CBiddulph 16 Feb 2024 7:35 UTC
93 points
25 comments · 4 min read · LW link

Massapequa (Long Island), New York – ACX/SSC Meetup

Gabriel Weil 16 Feb 2024 1:24 UTC
4 points
0 comments · 1 min read · LW link

Offering AI safety support calls for ML professionals

Vael Gates 15 Feb 2024 23:48 UTC
61 points
1 comment · 1 min read · LW link

7. Evolution and Ethics

RogerDearnaley 15 Feb 2024 23:38 UTC
2 points
6 comments · 6 min read · LW link

Mapping the semantic void III: Exploring neighbourhoods

mwatkins 15 Feb 2024 23:01 UTC
13 points
0 comments · 10 min read · LW link