Seek­ing Stu­dent Sub­mis­sions: Edit Your Source Code Contest

ArisAug 26, 2022, 2:08 AM
28 points
5 comments2 min readLW link

Pivotal acts us­ing an un­al­igned AGI?

Simon FischerAug 21, 2022, 5:13 PM
28 points
3 comments7 min readLW link

Thoughts on ‘List of Lethal­ities’

Alex Lawsen Aug 17, 2022, 6:33 PM
27 points
0 comments10 min readLW link

Ethan Perez on the In­verse Scal­ing Prize, Lan­guage Feed­back and Red Teaming

Michaël TrazziAug 24, 2022, 4:35 PM
26 points
0 comments3 min readLW link
(theinsideview.ai)

Dwarves & D.Sci: Data Fortress Eval­u­a­tion & Ruleset

aphyerAug 16, 2022, 12:15 AM
26 points
10 comments8 min readLW link

Where are the red lines for AI?

Karl von WendtAug 5, 2022, 9:34 AM
26 points
10 comments6 min readLW link

So­cratic Duck­ing, OODA Loops, Frame-by-Frame Debugging

CFAR!DuncanAug 4, 2022, 5:44 PM
26 points
1 comment5 min readLW link

Ar­gu­ment by In­tel­lec­tual Ordeal

lcAug 12, 2022, 1:03 PM
26 points
5 comments5 min readLW link

Matt Ygle­sias on AI Policy

Grant DemareeAug 17, 2022, 11:57 PM
25 points
1 comment1 min readLW link
(www.slowboring.com)

Ar­tifi­cial Mo­ral Ad­vi­sors: A New Per­spec­tive from Mo­ral Psychology

David GrossAug 28, 2022, 4:37 PM
25 points
1 comment1 min readLW link
(dl.acm.org)

Bridg­ing Ex­pected Utility Max­i­miza­tion and Optimization

Daniel HerrmannAug 5, 2022, 8:18 AM
25 points
5 comments14 min readLW link

Emer­gent Abil­ities of Large Lan­guage Models [Linkpost]

aogAug 10, 2022, 6:02 PM
25 points
2 comments1 min readLW link
(arxiv.org)

Ad­ver­sar­ial epistemology

jchanAug 24, 2022, 4:57 PM
25 points
15 comments3 min readLW link

Bench­mark­ing Pro­pos­als on Risk Scenarios

Paul BricmanAug 20, 2022, 10:01 AM
25 points
2 comments14 min readLW link

Google AI in­te­grates PaLM with robotics: SayCan up­date [Linkpost]

Evan R. MurphyAug 24, 2022, 8:54 PM
25 points
0 comments1 min readLW link
(sites.research.google)

An­nual AGI Bench­mark­ing Event

Lawrence PhillipsAug 27, 2022, 12:06 AM
24 points
3 comments2 min readLW link
(www.metaculus.com)

What Games Th­ese Days?

jefftkAug 18, 2022, 2:30 PM
24 points
6 comments3 min readLW link
(www.jefftk.com)

Pre­cur­sor check­ing for de­cep­tive alignment

evhubAug 3, 2022, 10:56 PM
24 points
0 comments14 min readLW link

Steelmin­ing via Analogy

Paul BricmanAug 13, 2022, 9:59 AM
24 points
0 comments2 min readLW link
(paulbricman.com)

What are the Red Flags for Neu­ral Net­work Suffer­ing? - Seeds of Science call for reviewers

rogersbaconAug 2, 2022, 10:37 PM
24 points
6 comments1 min readLW link

What’s the Most Im­pres­sive Thing That GPT-4 Could Plau­si­bly Do?

bayesedAug 26, 2022, 3:34 PM
24 points
22 comments1 min readLW link

No One-Size-Fit-All Epistemic Strategy

adamShimiAug 20, 2022, 12:56 PM
24 points
2 comments2 min readLW link

Three pillars for avoid­ing AGI catas­tro­phe: Tech­ni­cal al­ign­ment, de­ploy­ment de­ci­sions, and coordination

LintzAAug 3, 2022, 11:15 PM
24 points
0 comments11 min readLW link

An­nounc­ing the Distil­la­tion for Align­ment Practicum (DAP)

Aug 18, 2022, 7:50 PM
23 points
3 comments3 min readLW link

Bos­ton Rents Over Time II

jefftkAug 6, 2022, 9:20 PM
23 points
0 comments2 min readLW link
(www.jefftk.com)

Please (re)ex­plain your per­sonal jargon

Nathan Helm-BurgerAug 22, 2022, 2:30 PM
23 points
4 comments4 min readLW link

[Question] How do you get a job as a soft­ware de­vel­oper?

lsusrAug 15, 2022, 2:45 PM
22 points
24 comments1 min readLW link

Run­ning a Ba­sic Meetup

ScrewtapeAug 4, 2022, 9:49 PM
21 points
1 comment2 min readLW link

*New* Canada AI Safety & Gover­nance community

Wyatt Tessari L'AlliéAug 29, 2022, 6:45 PM
21 points
0 comments1 min readLW link

How evolu­tion suc­ceeds and fails at value alignment

OcracokeAug 21, 2022, 7:14 AM
21 points
2 comments4 min readLW link

Pro­ject pro­posal: Test­ing the IBP defi­ni­tion of agent

Aug 9, 2022, 1:09 AM
21 points
4 comments2 min readLW link

Em­brac­ing the Op­po­si­tion’s Point

YuliaAug 21, 2022, 9:51 AM
21 points
14 comments5 min readLW link
(yuliaverse.substack.com)

What Makes an Idea Un­der­stand­able? On Ar­chi­tec­turally and Cul­turally Nat­u­ral Ideas.

Aug 16, 2022, 2:09 AM
21 points
2 comments16 min readLW link

Pre­dic­tIt is clos­ing due to CFTC chang­ing its mind

eigenAug 6, 2022, 3:34 AM
20 points
4 comments1 min readLW link

A brief note on Sim­plic­ity Bias

carboniferous_umbraculum Aug 14, 2022, 2:05 AM
20 points
0 comments4 min readLW link

Cam­bist Booking

ScrewtapeAug 4, 2022, 10:40 PM
20 points
3 comments4 min readLW link

La­men­ta­tions, Gaza and Empathy

Yair HalberstadtAug 7, 2022, 7:55 AM
20 points
2 comments3 min readLW link

Let­ter from lead­ing Soviet Aca­demi­ci­ans to party and gov­ern­ment lead­ers of the Soviet Union re­gard­ing signs of de­cline and struc­tural prob­lems of the eco­nomic-poli­ti­cal sys­tem (1970)

M. Y. ZuoAug 1, 2022, 10:35 PM
20 points
10 comments16 min readLW link

Have you con­sid­ered get­ting rid of death?

WillaAug 29, 2022, 1:31 AM
20 points
19 comments1 min readLW link
(immortalityisgreat.substack.com)

And the Rev­enues Are So Small

ZviAug 15, 2022, 1:00 PM
19 points
5 comments11 min readLW link
(thezvi.wordpress.com)

Refine’s Se­cond Blog Post Day

adamShimiAug 20, 2022, 1:01 PM
19 points
0 comments1 min readLW link

A suffi­ciently para­noid pa­per­clip maximizer

RomanSAug 8, 2022, 11:17 AM
19 points
10 comments2 min readLW link

[Question] Is there any writ­ing about prompt en­g­ineer­ing for hu­mans?

Alex HollowAug 1, 2022, 12:52 PM
18 points
8 comments1 min readLW link

Vague con­cepts, fam­ily re­sem­blance and cluster prop­er­ties

Q HomeAug 20, 2022, 10:21 AM
18 points
8 comments7 min readLW link

Me­tac­u­lus and medians

rossryAug 6, 2022, 3:34 AM
18 points
4 comments4 min readLW link

Sur­prised by ELK re­port’s coun­terex­am­ple to De­bate, IDA

Evan R. MurphyAug 4, 2022, 2:12 AM
18 points
0 comments5 min readLW link

Please Do Fight the Hypothetical

Lone PineAug 29, 2022, 8:35 AM
18 points
6 comments3 min readLW link

AI al­ign­ment as “nav­i­gat­ing the space of in­tel­li­gent be­havi­our”

Nora_AmmannAug 23, 2022, 1:28 PM
18 points
0 comments6 min readLW link

An­nounc­ing: Mechanism De­sign for AI Safety—Read­ing Group

Rubi J. HudsonAug 9, 2022, 4:21 AM
18 points
3 comments4 min readLW link

My Plan to Build Aligned Superintelligence

apollonianbluesAug 21, 2022, 1:16 PM
18 points
7 comments8 min readLW link