ACX meetup [De­cem­ber]

sallatik28 Nov 2022 22:06 UTC
2 points
0 comments1 min readLW link

Us­ing mechanis­tic in­ter­pretabil­ity to find in-dis­tri­bu­tion failure in toy transformers

Charlie George28 Nov 2022 19:39 UTC
6 points
0 comments4 min readLW link

Cur­rent Trends in Eco­nomics and their Shortcoming

joshlevent28 Nov 2022 19:36 UTC
5 points
0 comments2 min readLW link

Solv­ing for the op­ti­mal work-life bal­ance with ge­o­met­ric rationality

Eric Neyman28 Nov 2022 17:02 UTC
20 points
1 comment8 min readLW link

Dis­cussing how to al­ign Trans­for­ma­tive AI if it’s de­vel­oped very soon

elifland28 Nov 2022 16:17 UTC
37 points
2 comments28 min readLW link

(DIY) FMT for Anti-Aging & Biohacking

Anton Rodenhauser28 Nov 2022 15:49 UTC
26 points
7 comments6 min readLW link

Search­ing for Search

28 Nov 2022 15:31 UTC
91 points
8 comments14 min readLW link1 review

My take on Ja­cob Can­nell’s take on AGI safety

Steven Byrnes28 Nov 2022 14:01 UTC
71 points
15 comments30 min readLW link1 review

On the Di­plo­macy AI

Zvi28 Nov 2022 13:20 UTC
127 points
29 comments11 min readLW link
(thezvi.wordpress.com)

The Sin­gu­lar Value De­com­po­si­tions of Trans­former Weight Ma­tri­ces are Highly Interpretable

28 Nov 2022 12:54 UTC
196 points
33 comments31 min readLW link

[Question] How to cor­rect for mul­ti­plic­ity with AI-gen­er­ated mod­els?

Lao Mein28 Nov 2022 3:51 UTC
4 points
0 comments1 min readLW link

Good Fu­tures Ini­ti­a­tive: Win­ter Pro­ject Internship

Aris27 Nov 2022 23:41 UTC
28 points
4 comments4 min readLW link

Geo­met­ric Ra­tion­al­ity is Not VNM Rational

Scott Garrabrant27 Nov 2022 19:36 UTC
149 points
26 comments3 min readLW link

Align­ing my web server with de­vops prac­tices: part 1 (back­ups)

VipulNaik27 Nov 2022 18:38 UTC
17 points
0 comments18 min readLW link

Some thoughts about nat­u­ral com­pu­ta­tion and interactions

Adam Shai27 Nov 2022 18:15 UTC
11 points
1 comment3 min readLW link

Re­view: LOVE in a simbox

PeterMcCluskey27 Nov 2022 17:41 UTC
32 points
4 comments9 min readLW link
(bayesianinvestor.com)

Mastodon’s Du­bi­ous Crawler Exemption

jefftk27 Nov 2022 14:20 UTC
9 points
3 comments1 min readLW link
(www.jefftk.com)

Re­ward Is Not Ne­c­es­sary: How To Create A Com­po­si­tional Self-Pre­serv­ing Agent For Life-Long Learning

Capybasilisk27 Nov 2022 14:05 UTC
3 points
0 comments1 min readLW link
(arxiv.org)

[Question] About probabilities

mikbp27 Nov 2022 8:19 UTC
5 points
9 comments1 min readLW link

Always know where your ab­strac­tions break

lsusr27 Nov 2022 6:32 UTC
78 points
6 comments2 min readLW link

Science and Math

lsusr27 Nov 2022 4:05 UTC
19 points
6 comments1 min readLW link

Microstartup Sto­ries: Ini­tial Thoughts

Adam Zerner27 Nov 2022 1:22 UTC
17 points
2 comments9 min readLW link

Don’t al­ign agents to eval­u­a­tions of plans

TurnTrout26 Nov 2022 21:16 UTC
42 points
49 comments18 min readLW link

What videos should Ra­tional An­i­ma­tions make?

Writer26 Nov 2022 20:28 UTC
30 points
24 comments1 min readLW link

The First Filter

26 Nov 2022 19:37 UTC
67 points
5 comments1 min readLW link

Re­spect­ing your Lo­cal Preferences

Scott Garrabrant26 Nov 2022 19:04 UTC
73 points
1 comment4 min readLW link

[Question] Opinions on the sleep synap­tic home­osta­sis hy­poth­e­sis?

Angela Pretorius26 Nov 2022 19:01 UTC
3 points
0 comments1 min readLW link

Why square er­rors?

Aprillion (Peter Hozák)26 Nov 2022 13:40 UTC
41 points
11 comments2 min readLW link

[Question] As­sum­ing that at least one re­li­gion is true, what would you ex­pect it to be?

risedive26 Nov 2022 8:34 UTC
−9 points
9 comments1 min readLW link

Three Align­ment Schemas & Their Problems

Shoshannah Tekofsky26 Nov 2022 4:25 UTC
19 points
1 comment6 min readLW link

The many types of blog posts

Adam Zerner26 Nov 2022 3:57 UTC
10 points
2 comments4 min readLW link

New Fron­tiers in Mojibake

Adam Scherlis26 Nov 2022 2:37 UTC
60 points
7 comments6 min readLW link1 review
(adam.scherlis.com)

Semi-con­duc­tor/​AI Stock Dis­cus­sion.

sapphire25 Nov 2022 23:35 UTC
29 points
24 comments1 min readLW link

NEFFA Should Allow Small Children

jefftk25 Nov 2022 23:00 UTC
10 points
2 comments2 min readLW link
(www.jefftk.com)

Pod­cast: Shoshan­nah Tekofsky on skil­ling up in AI safety, vis­it­ing Berkeley, and de­vel­op­ing novel re­search ideas

Akash25 Nov 2022 20:47 UTC
37 points
2 comments9 min readLW link

The man and the tool

pedroalvarado25 Nov 2022 19:51 UTC
−1 points
0 comments4 min readLW link

[Question] What AI newslet­ters or sub­stacks about AI do you recom­mend?

wunan25 Nov 2022 19:29 UTC
6 points
1 comment1 min readLW link

Mechanis­tic anomaly de­tec­tion and ELK

paulfchristiano25 Nov 2022 18:50 UTC
133 points
21 comments21 min readLW link
(ai-alignment.com)

The Least Con­tro­ver­sial Ap­pli­ca­tion of Geo­met­ric Rationality

Scott Garrabrant25 Nov 2022 16:50 UTC
60 points
22 comments4 min readLW link

Planes are still decades away from dis­plac­ing most bird jobs

guzey25 Nov 2022 16:49 UTC
159 points
13 comments3 min readLW link

Take part in our gi­ant study of cog­ni­tive abil­ities and get a cus­tomized re­port of your strengths and weak­nesses!

spencerg25 Nov 2022 16:28 UTC
8 points
1 comment1 min readLW link
(www.guidedtrack.com)

Guardian AI (Misal­igned sys­tems are all around us.)

Jessica Rumbelow25 Nov 2022 15:55 UTC
15 points
6 comments2 min readLW link

In­tu­itions by ML re­searchers may get pro­gres­sively worse con­cern­ing likely can­di­dates for trans­for­ma­tive AI

Viktor Rehnberg25 Nov 2022 15:49 UTC
7 points
0 comments2 min readLW link

Refin­ing the Sharp Left Turn threat model, part 2: ap­ply­ing al­ign­ment techniques

25 Nov 2022 14:36 UTC
39 points
9 comments6 min readLW link
(vkrakovna.wordpress.com)

[Question] Who holds all the USDT?

ChristianKl25 Nov 2022 11:58 UTC
17 points
6 comments1 min readLW link

Fair Col­lec­tive Effi­cient Altruism

Jobst Heitzig25 Nov 2022 9:38 UTC
2 points
1 comment5 min readLW link

[Question] If hu­man­ity one day dis­cov­ers that it is a form of dis­ease that threat­ens to de­stroy the uni­verse, should it al­low it­self to be shut down?

shminux25 Nov 2022 8:27 UTC
4 points
12 comments1 min readLW link

Could a sin­gle alien mes­sage de­stroy us?

25 Nov 2022 7:32 UTC
59 points
23 comments6 min readLW link
(youtu.be)

How do I start a pro­gram­ming ca­reer in the West?

Lao Mein25 Nov 2022 6:37 UTC
38 points
7 comments2 min readLW link

The AI Safety com­mu­nity has four main work groups, Strat­egy, Gover­nance, Tech­ni­cal and Move­ment Building

peterslattery25 Nov 2022 3:45 UTC
1 point
0 comments6 min readLW link