Why and how to write things on the Internet

benkuhn29 Dec 2022 22:40 UTC
20 points
2 comments15 min readLW link
(www.benkuhn.net)

Friendly and Un­friendly AGI are Indistinguishable

ErgoEcho29 Dec 2022 22:13 UTC
−4 points
4 comments4 min readLW link
(neologos.co)

200 COP in MI: Look­ing for Cir­cuits in the Wild

Neel Nanda29 Dec 2022 20:59 UTC
16 points
5 comments13 min readLW link

Thoughts on the im­pli­ca­tions of GPT-3, two years ago and NOW [here be drag­ons, we’re swim­ming, fly­ing and talk­ing with them]

Bill Benzon29 Dec 2022 20:05 UTC
0 points
0 comments5 min readLW link

Covid 12/​29/​22: Next Up is XBB.1.5

Zvi29 Dec 2022 18:20 UTC
33 points
4 comments10 min readLW link
(thezvi.wordpress.com)

En­trepreneur­ship ETG Might Be Bet­ter Than 80k Thought

Xodarap29 Dec 2022 17:51 UTC
33 points
0 comments1 min readLW link

In­ter­nal In­ter­faces Are a High-Pri­or­ity In­ter­pretabil­ity Target

Thane Ruthenis29 Dec 2022 17:49 UTC
26 points
6 comments7 min readLW link

CFP for Re­bel­lion and Di­sobe­di­ence in AI workshop

Ram Rachum29 Dec 2022 16:08 UTC
15 points
0 comments1 min readLW link

My scorched-earth policy on New Year’s resolutions

PatrickDFarley29 Dec 2022 14:45 UTC
29 points
2 comments4 min readLW link

Don’t feed the void. She is fat enough!

Johannes C. Mayer29 Dec 2022 14:18 UTC
11 points
0 comments1 min readLW link

[Question] Is there any unified re­source on Eliezer’s fa­tigue?

Johannes C. Mayer29 Dec 2022 14:04 UTC
8 points
2 comments1 min readLW link

Log­i­cal Prob­a­bil­ity of Gold­bach’s Con­jec­ture: Prov­able Rule or Coin­ci­dence?

avturchin29 Dec 2022 13:37 UTC
5 points
15 comments8 min readLW link

Where do you get your ca­pa­bil­ities from?

tailcalled29 Dec 2022 11:39 UTC
24 points
27 comments6 min readLW link

The com­mer­cial in­cen­tive to in­ten­tion­ally train AI to de­ceive us

Derek M. Jones29 Dec 2022 11:30 UTC
5 points
1 comment4 min readLW link
(shape-of-code.com)

In­finite neck­lace: the line as a circle

Alok Singh29 Dec 2022 10:41 UTC
5 points
2 comments1 min readLW link

Pri­vacy Tradeoffs

jefftk29 Dec 2022 3:40 UTC
13 points
1 comment2 min readLW link
(www.jefftk.com)

Against John Searle, Gary Mar­cus, the Chi­nese Room thought ex­per­i­ment and its world

philosophybear29 Dec 2022 3:26 UTC
21 points
43 comments8 min readLW link

Large Lan­guage Models Suggest a Path to Ems

anithite29 Dec 2022 2:20 UTC
17 points
2 comments5 min readLW link

[Question] Book recom­men­da­tions for the his­tory of ML?

Eleni Angelou28 Dec 2022 23:50 UTC
2 points
2 comments1 min readLW link

Rock-Paper-Scis­sors Can Be Weird

winwonce28 Dec 2022 23:12 UTC
14 points
3 comments1 min readLW link

200 COP in MI: The Case for Analysing Toy Lan­guage Models

Neel Nanda28 Dec 2022 21:07 UTC
39 points
3 comments7 min readLW link

200 Con­crete Open Prob­lems in Mechanis­tic In­ter­pretabil­ity: Introduction

Neel Nanda28 Dec 2022 21:06 UTC
103 points
0 comments10 min readLW link

Effec­tive ways to find love?

anonymoususer28 Dec 2022 20:46 UTC
8 points
6 comments1 min readLW link

Clas­si­cal logic based on propo­si­tions-as-sub­s­in­gle­ton-types

Thomas Kehrenberg28 Dec 2022 20:16 UTC
3 points
0 comments16 min readLW link

In Defense of Wrap­per-Minds

Thane Ruthenis28 Dec 2022 18:28 UTC
23 points
38 comments3 min readLW link

[Question] What is the best way to ap­proach Ex­pected Value calcu­la­tions when pay­offs are highly skewed?

jmh28 Dec 2022 14:42 UTC
8 points
16 comments1 min readLW link

Band­wagon effect: Bias in Eval­u­at­ing AGI X-Risks

28 Dec 2022 7:54 UTC
−1 points
0 comments1 min readLW link

Get­ting up to Speed on the Speed Prior in 2022

robertzk28 Dec 2022 7:49 UTC
36 points
5 comments65 min readLW link

[Question] World su­per­pow­ers, par­tic­u­larly the United States, still main­tain large con­ven­tional mil­i­taries de­spite nu­clear de­ter­rence. Why?

niederman28 Dec 2022 5:38 UTC
9 points
8 comments1 min readLW link
(maxniederman.com)

[Question] What does “prob­a­bil­ity” re­ally mean?

sisyphus28 Dec 2022 3:20 UTC
5 points
20 comments1 min readLW link

Zoom­ing the Chrome Au­dio Player

jefftk28 Dec 2022 2:30 UTC
9 points
0 comments1 min readLW link
(www.jefftk.com)

What AI Safety Ma­te­ri­als Do ML Re­searchers Find Com­pel­ling?

28 Dec 2022 2:03 UTC
175 points
34 comments2 min readLW link

South Bay ACX/​LW Meetup

IS28 Dec 2022 1:59 UTC
3 points
0 comments1 min readLW link

Re­gard­ing Blake Le­moine’s claim that LaMDA is ‘sen­tient’, he might be right (sorta), but per­haps not for the rea­sons he thinks

philosophybear28 Dec 2022 1:55 UTC
9 points
1 comment6 min readLW link

Fun­da­men­tal Uncer­tainty: Chap­ter 5 - How do we know what we know?

Gordon Seidoh Worley28 Dec 2022 1:28 UTC
10 points
2 comments12 min readLW link

Is check­ing that a state of the world is not dystopian eas­ier than con­struct­ing a non-dystopian state?

No77e27 Dec 2022 20:57 UTC
5 points
3 comments1 min readLW link

Crypto-cur­rency as pro-al­ign­ment mechanism

False Name27 Dec 2022 17:45 UTC
−10 points
2 comments2 min readLW link

My Reser­va­tions about Dis­cov­er­ing La­tent Knowl­edge (Burns, Ye, et al)

Robert_AIZI27 Dec 2022 17:27 UTC
50 points
0 comments4 min readLW link
(aizi.substack.com)

Things that can kill you quickly: What ev­ery­one should know about first aid

jasoncrawford27 Dec 2022 16:23 UTC
166 points
21 comments2 min readLW link1 review
(jasoncrawford.org)

[Question] Why The Fo­cus on Ex­pected Utility Max­imisers?

DragonGod27 Dec 2022 15:49 UTC
116 points
84 comments3 min readLW link

Pre­sump­tive Listen­ing: stick­ing to fa­mil­iar con­cepts and miss­ing the outer rea­son­ing paths

Remmelt27 Dec 2022 15:40 UTC
−14 points
8 comments2 min readLW link
(mflb.com)

Mere ex­po­sure effect: Bias in Eval­u­at­ing AGI X-Risks

27 Dec 2022 14:05 UTC
0 points
2 comments1 min readLW link

Hous­ing and Trans­porta­tion Roundup #2

Zvi27 Dec 2022 13:10 UTC
25 points
0 comments12 min readLW link
(thezvi.wordpress.com)

[Question] Are tul­pas moral pa­tients?

ChristianKl27 Dec 2022 11:30 UTC
16 points
28 comments1 min readLW link

Reflec­tions on my 5-month al­ign­ment up­skil­ling grant

Jay Bailey27 Dec 2022 10:51 UTC
82 points
4 comments8 min readLW link

In­sti­tu­tions Can­not Res­train Dark-Triad AI Exploitation

27 Dec 2022 10:34 UTC
5 points
0 comments5 min readLW link
(mflb.com)

In­tro­duc­tion: Bias in Eval­u­at­ing AGI X-Risks

27 Dec 2022 10:27 UTC
1 point
0 comments3 min readLW link

MDPs and the Bel­l­man Equa­tion, In­tu­itively Explained

Jack O'Brien27 Dec 2022 5:50 UTC
11 points
3 comments14 min readLW link

How ‘Hu­man-Hu­man’ dy­nam­ics give way to ‘Hu­man-AI’ and then ‘AI-AI’ dynamics

27 Dec 2022 3:16 UTC
−2 points
5 comments2 min readLW link
(mflb.com)

Nine Points of Col­lec­tive Insanity

27 Dec 2022 3:14 UTC
−2 points
3 comments1 min readLW link
(mflb.com)