Are short timelines ac­tu­ally bad?

joshcFeb 5, 2023, 9:21 PM
61 points
7 comments3 min readLW link

Stan­zas On Power Calculation

DirectedEvolutionFeb 5, 2023, 7:15 PM
9 points
0 comments1 min readLW link

A List of things I might do with a Proof Oracle

Logan ZoellnerFeb 5, 2023, 6:14 PM
−14 points
13 comments3 min readLW link

Teach­ing Sim­ple Boundaries

jefftkFeb 5, 2023, 5:30 PM
23 points
0 comments2 min readLW link
(www.jefftk.com)

Control

TsviBTFeb 5, 2023, 4:16 PM
21 points
14 comments9 min readLW link

Have an idea? Come to Oxford to dis­cuss and write (20 – 24 March)

Feb 5, 2023, 3:05 PM
20 points
0 comments1 min readLW link

H5N1 - thread for in­for­ma­tion shar­ing, plan­ning, and action

MathiasKBFeb 5, 2023, 12:44 PM
31 points
8 commentsLW link

Se­cond call: CFP for Re­bel­lion and Di­sobe­di­ence in AI workshop

Ram RachumFeb 5, 2023, 12:18 PM
2 points
0 comments2 min readLW link

Re­search Direc­tion: Be the AGI you want to see in the world

Feb 5, 2023, 7:15 AM
44 points
0 comments7 min readLW link

Sex is Good, Actually

Gordon Seidoh WorleyFeb 5, 2023, 6:33 AM
41 points
8 comments4 min readLW link

Ques­tions about AI that bother me

Eleni AngelouFeb 5, 2023, 5:04 AM
13 points
6 comments2 min readLW link

Eval­u­a­tions (of new AI Safety re­searchers) can be noisy

LawrenceCFeb 5, 2023, 4:15 AM
132 points
11 comments16 min readLW link1 review

Pan­demic Pre­dic­tion Check­list: H5N1 (6/​14)

DirectedEvolutionFeb 5, 2023, 3:26 AM
50 points
10 comments7 min readLW link

Pod­cast with Oli Habryka on LessWrong /​ Light­cone Infrastructure

DanielFilanFeb 5, 2023, 2:52 AM
88 points
20 comments1 min readLW link
(thefilancabinet.com)

Mislead­ing Fast Charg­ing Specs

jefftkFeb 5, 2023, 2:50 AM
9 points
3 comments1 min readLW link
(www.jefftk.com)

I hired 5 peo­ple to sit be­hind me and make me pro­duc­tive for a month

Simon BerensFeb 5, 2023, 1:19 AM
252 points
83 comments10 min readLW link
(www.simonberens.com)

Mo­dal Fix­point Co­op­er­a­tion with­out Löb’s Theorem

Andrew_CritchFeb 5, 2023, 12:58 AM
134 points
34 comments3 min readLW link1 review

Who in­vented knit­ting? The plot thick­ens

eukaryoteFeb 5, 2023, 12:24 AM
60 points
9 comments19 min readLW link
(eukaryotewritesblog.com)

Some mis­cel­la­neous thoughts on ChatGPT, sto­ries, and me­chan­i­cal interpretability

Bill BenzonFeb 4, 2023, 7:35 PM
2 points
0 comments3 min readLW link

O(“AGI Safety”)>O(“Stop Tyrants”)

AnthonyRepettoFeb 4, 2023, 6:38 PM
−4 points
11 comments1 min readLW link

Monthly Doom Ar­gu­ment Threads? Doom Ar­gu­ment Wiki?

LVSNFeb 4, 2023, 4:59 PM
3 points
0 comments1 min readLW link

The Fu­ture of Struc­tured Self Improvement

EvenflairFeb 4, 2023, 4:02 PM
27 points
4 comments1 min readLW link
(guildoftherose.org)

Em­pa­thy as a nat­u­ral con­se­quence of learnt re­ward models

berenFeb 4, 2023, 3:35 PM
48 points
27 comments13 min readLW link

Mech In­terp Pro­ject Ad­vis­ing Call: Me­mori­sa­tion in GPT-2 Small

Neel NandaFeb 4, 2023, 2:17 PM
7 points
0 comments1 min readLW link

Do IQ tests mea­sure in­tel­li­gence? - A pre­dic­tion mar­ket on my fu­ture be­liefs about the topic

tailcalledFeb 4, 2023, 11:19 AM
1 point
10 comments1 min readLW link
(manifold.markets)

AXRP Epi­sode 19 - Mechanis­tic In­ter­pretabil­ity with Neel Nanda

DanielFilanFeb 4, 2023, 3:00 AM
45 points
0 comments117 min readLW link

The 2/​3 rule for multi-fac­tor authentication

RomanHaukssonFeb 4, 2023, 2:57 AM
4 points
0 comments1 min readLW link
(roman.computer)

Path-Depen­dence in ChatGPT’s Poli­ti­cal Outputs

lsusrFeb 4, 2023, 2:02 AM
28 points
4 comments4 min readLW link

Fuck­ing God­damn Ba­sics of Ra­tion­al­ist Discourse

LoganStrohlFeb 4, 2023, 1:47 AM
356 points
103 comments1 min readLW link3 reviews

Small Talk is Good, Actually

Gordon Seidoh WorleyFeb 4, 2023, 12:38 AM
52 points
9 comments3 min readLW link

Up­date on Book Re­view Dom­i­nant As­surance Contract

Arjun PanicksseryFeb 3, 2023, 11:16 PM
9 points
0 commentsLW link

[Question] 2+2=π√2+n

Logan ZoellnerFeb 3, 2023, 10:27 PM
16 points
15 comments1 min readLW link

[Question] If I en­counter a ca­pa­bil­ities pa­per that kinda spooks me, what should I do with it?

the gears to ascensionFeb 3, 2023, 9:37 PM
28 points
8 comments1 min readLW link

[Question] What Are The Pre­con­di­tions/​Pr­ereq­ui­sites for Asymp­totic Anal­y­sis?

DragonGodFeb 3, 2023, 9:26 PM
8 points
2 comments1 min readLW link

[Linkpost] Google in­vested $300M in An­thropic in late 2022

Orpheus16Feb 3, 2023, 7:13 PM
73 points
14 comments1 min readLW link
(www.ft.com)

Many AI gov­er­nance pro­pos­als have a trade­off be­tween use­ful­ness and feasibility

Feb 3, 2023, 6:49 PM
22 points
2 comments2 min readLW link

Re­ply to Dun­can Sa­bien on Strawmanning

Zack_M_DavisFeb 3, 2023, 5:57 PM
43 points
11 comments4 min readLW link

Semi-rare plain lan­guage words that are great to remember

LVSNFeb 3, 2023, 4:33 PM
4 points
7 comments1 min readLW link

[Question] What qual­ities does an AGI need to have to re­al­ize the risk of false vac­uum, with­out hard­cod­ing physics the­o­ries into it?

RationalSieveFeb 3, 2023, 4:00 PM
1 point
4 comments1 min readLW link

Hous­ing and Tran­sit Roundup #3

ZviFeb 3, 2023, 3:10 PM
21 points
6 comments16 min readLW link
(thezvi.wordpress.com)

Ta­boo P(doom)

NathanBarnardFeb 3, 2023, 10:37 AM
14 points
10 comments1 min readLW link

ChatGPT: Tan­tal­iz­ing af­terthoughts in search of story tra­jec­to­ries [in­duc­tion heads]

Bill BenzonFeb 3, 2023, 10:35 AM
4 points
0 comments20 min readLW link

Jor­dan Peter­son: Guru/​Villain

Bryan Frances3 Feb 2023 9:02 UTC
−14 points
6 comments9 min readLW link

[Question] What is the risk of ask­ing a coun­ter­fac­tual or­a­cle a ques­tion that already had its an­swer erased?

Chris_Leong3 Feb 2023 3:13 UTC
7 points
0 comments1 min readLW link

I don’t think MIRI “gave up”

Raemon3 Feb 2023 0:26 UTC
106 points
64 comments4 min readLW link

What fact that you know is true but most peo­ple aren’t ready to ac­cept it?

lorepieri3 Feb 2023 0:06 UTC
47 points
211 comments1 min readLW link

[Question] Monotonous Work

Gideon Bauer2 Feb 2023 21:35 UTC
1 point
0 comments1 min readLW link

Is AI risk as­sess­ment too an­thro­pocen­tric?

Craig Mattson2 Feb 2023 21:34 UTC
3 points
6 comments1 min readLW link

Hal­i­fax Monthly Meetup: In­tro­duc­tion to Effec­tive Altruism

Ideopunk2 Feb 2023 21:10 UTC
10 points
0 comments1 min readLW link

Con­di­tion­ing Pre­dic­tive Models: Outer al­ign­ment via care­ful conditioning

2 Feb 2023 20:28 UTC
72 points
15 comments57 min readLW link