Hu­mans, chim­panzees and other animals

gjmMay 30, 2023, 11:53 PM
21 points

15 votes

Overall karma indicates overall quality.

18 comments1 min readLW link

The case for re­mov­ing al­ign­ment and ML re­search from the train­ing dataset

berenMay 30, 2023, 8:54 PM
50 points

23 votes

Overall karma indicates overall quality.

8 comments5 min readLW link

Why Job Dis­place­ment Pre­dic­tions are Wrong: Ex­pla­na­tions of Cog­ni­tive Automation

Moritz WallawitschMay 30, 2023, 8:43 PM
−5 points

3 votes

Overall karma indicates overall quality.

0 comments8 min readLW link

PaLM-2 & GPT-4 in “Ex­trap­o­lat­ing GPT-N perfor­mance”

Lukas FinnvedenMay 30, 2023, 6:33 PM
57 points

27 votes

Overall karma indicates overall quality.

6 comments6 min readLW link

Why I don’t think that the prob­a­bil­ity that AGI kills ev­ery­one is roughly 1 (but rather around 0.995).

BastumannenMay 30, 2023, 5:54 PM
−6 points

5 votes

Overall karma indicates overall quality.

0 comments2 min readLW link

AI X-risk is a pos­si­ble solu­tion to the Fermi Paradox

magic9mushroomMay 30, 2023, 5:42 PM
5 points

15 votes

Overall karma indicates overall quality.

22 comments2 min readLW link2 reviews

LIMA: Less Is More for Alignment

Ulisse MiniMay 30, 2023, 5:10 PM
16 points

3 votes

Overall karma indicates overall quality.

6 comments1 min readLW link
(arxiv.org)

Boomerang—pro­to­col to dis­solve some com­mit­ment races

Filip SondejMay 30, 2023, 4:21 PM
37 points

17 votes

Overall karma indicates overall quality.

10 comments8 min readLW link

An­nounc­ing Apollo Research

May 30, 2023, 4:17 PM
217 points

94 votes

Overall karma indicates overall quality.

11 comments8 min readLW link

Ad­vice for new al­ign­ment peo­ple: Info Max

Jonas HallgrenMay 30, 2023, 3:42 PM
23 points

14 votes

Overall karma indicates overall quality.

4 comments5 min readLW link

[Question] Who is li­able for AI?

jmhMay 30, 2023, 1:54 PM
14 points

3 votes

Overall karma indicates overall quality.

4 comments1 min readLW link

AI Safety Newslet­ter #8: Rogue AIs, how to screen for AI risks, and grants for re­search on demo­cratic gov­er­nance of AI

May 30, 2023, 11:52 AM
20 points

8 votes

Overall karma indicates overall quality.

0 comments6 min readLW link
(newsletter.safe.ai)

The bul­ls­eye frame­work: My case against AI doom

titotalMay 30, 2023, 11:52 AM
89 points

65 votes

Overall karma indicates overall quality.

35 comments17 min readLW link

State­ment on AI Ex­tinc­tion—Signed by AGI Labs, Top Aca­demics, and Many Other Notable Figures

Dan HMay 30, 2023, 9:05 AM
382 points

165 votes

Overall karma indicates overall quality.

78 comments1 min readLW link1 review
(www.safe.ai)

The­o­ret­i­cal Limi­ta­tions of Au­tore­gres­sive Models

Gabriel WuMay 30, 2023, 2:37 AM
20 points

11 votes

Overall karma indicates overall quality.

1 comment10 min readLW link
(gabrieldwu.github.io)

A book re­view for “An­i­mal Weapons” and cross-ap­ply­ing the les­sons to x-risk

Habeeb AbdulfatahMay 30, 2023, 12:58 AM
−6 points

4 votes

Overall karma indicates overall quality.

1 comment1 min readLW link
(www.super-linear.org)

Without a tra­jec­tory change, the de­vel­op­ment of AGI is likely to go badly

Max HMay 29, 2023, 11:42 PM
21 points

7 votes

Overall karma indicates overall quality.

2 comments13 min readLW link

Win­ners-take-how-much?

YonatanKMay 29, 2023, 9:56 PM
3 points

5 votes

Overall karma indicates overall quality.

2 comments3 min readLW link

Re­ply to a fer­til­ity doc­tor con­cern­ing poly­genic em­bryo screening

GeneSmithMay 29, 2023, 9:50 PM
59 points

29 votes

Overall karma indicates overall quality.

6 comments8 min readLW link

Sen­tience matters

So8resMay 29, 2023, 9:25 PM
144 points

90 votes

Overall karma indicates overall quality.

96 comments2 min readLW link

Wikipe­dia as an in­tro­duc­tion to the al­ign­ment problem

SoerenMindMay 29, 2023, 6:43 PM
83 points

46 votes

Overall karma indicates overall quality.

10 comments1 min readLW link
(en.wikipedia.org)

[Question] What are some of the best in­tro­duc­tions/​break­downs of AI ex­is­ten­tial risk for those un­fa­mil­iar?

Isaac KingMay 29, 2023, 5:04 PM
17 points

5 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

Creat­ing Flash­cards with LLMs

Diogo CruzMay 29, 2023, 4:55 PM
15 points

12 votes

Overall karma indicates overall quality.

3 comments9 min readLW link

On the Im­pos­si­bil­ity of In­tel­li­gent Paper­clip Maximizers

Michael SimkinMay 29, 2023, 4:55 PM
−21 points

13 votes

Overall karma indicates overall quality.

5 comments4 min readLW link

Min­i­mum Vi­able Exterminator

Richard HorvathMay 29, 2023, 4:32 PM
14 points

12 votes

Overall karma indicates overall quality.

5 comments5 min readLW link

An LLM-based “ex­em­plary ac­tor”

Roman LeventovMay 29, 2023, 11:12 AM
16 points

5 votes

Overall karma indicates overall quality.

0 comments12 min readLW link

Align­ing an H-JEPA agent via train­ing on the out­puts of an LLM-based “ex­em­plary ac­tor”

Roman LeventovMay 29, 2023, 11:08 AM
12 points

8 votes

Overall karma indicates overall quality.

10 comments30 min readLW link

Gem­ini will bring the next big timeline update

p.b.May 29, 2023, 6:05 AM
50 points

36 votes

Overall karma indicates overall quality.

6 comments1 min readLW link

Pro­posed Align­ment Tech­nique: OSNR (Out­put San­i­ti­za­tion via Nois­ing and Re­con­struc­tion) for Safer Usage of Po­ten­tially Misal­igned AGI

sudoMay 29, 2023, 1:35 AM
14 points

4 votes

Overall karma indicates overall quality.

9 comments6 min readLW link

Mo­ral­ity is Ac­ci­den­tal & Self-Congratulatory

ymeskhoutMay 29, 2023, 12:40 AM
26 points

32 votes

Overall karma indicates overall quality.

40 comments5 min readLW link

TinyS­to­ries: Small Lan­guage Models That Still Speak Co­her­ent English

Ulisse MiniMay 28, 2023, 10:23 PM
67 points

34 votes

Overall karma indicates overall quality.

8 comments2 min readLW link
(arxiv.org)

“Mem­branes” is bet­ter ter­minol­ogy than “bound­aries” alone

May 28, 2023, 10:16 PM
30 points

14 votes

Overall karma indicates overall quality.

12 comments3 min readLW link

The king token

p.b.May 28, 2023, 7:18 PM
17 points

8 votes

Overall karma indicates overall quality.

0 comments4 min readLW link

Lan­guage Agents Re­duce the Risk of Ex­is­ten­tial Catastrophe

May 28, 2023, 7:10 PM
39 points

35 votes

Overall karma indicates overall quality.

14 comments26 min readLW link

Devil’s Ad­vo­cate: Ad­verse Selec­tion Against Con­scien­tious­ness

lionhearted (Sebastian Marshall)May 28, 2023, 5:53 PM
10 points

6 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

Re­acts now en­abled on 100% of posts, though still just ex­per­i­ment­ing

RubyMay 28, 2023, 5:36 AM
88 points

37 votes

Overall karma indicates overall quality.

73 comments2 min readLW link

Kelly bet­ting vs ex­pec­ta­tion max­i­miza­tion

MorgneticFieldMay 28, 2023, 1:54 AM
35 points

23 votes

Overall karma indicates overall quality.

33 comments5 min readLW link

Twin Cities ACX Meetup—June 2023

Timothy M.May 27, 2023, 8:11 PM
1 point

1 vote

Overall karma indicates overall quality.

1 comment1 min readLW link

Pro­ject Idea: Challenge Groups for Align­ment Researchers

Adam ZernerMay 27, 2023, 8:10 PM
13 points

9 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

In­tro­spec­tive Bayes

False NameMay 27, 2023, 7:35 PM
−3 points

7 votes

Overall karma indicates overall quality.

2 comments16 min readLW link

Should Ra­tional An­i­ma­tions in­vite view­ers to read con­tent on LessWrong?

WriterMay 27, 2023, 7:26 PM
40 points

15 votes

Overall karma indicates overall quality.

9 comments3 min readLW link

Who are the Ex­perts on Cry­on­ics?

Mati_RoyMay 27, 2023, 7:24 PM
30 points

15 votes

Overall karma indicates overall quality.

9 comments1 min readLW link
(biostasis.substack.com)

AI and Planet Earth are in­com­pat­i­ble.

archeonMay 27, 2023, 6:59 PM
−4 points

7 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

South Bay ACX/​LW Meetup

ISMay 27, 2023, 5:25 PM
2 points

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link

Hands-On Ex­pe­rience Is Not Magic

Thane RuthenisMay 27, 2023, 4:57 PM
22 points

32 votes

Overall karma indicates overall quality.

14 comments5 min readLW link

Is Deon­tolog­i­cal AI Safe? [Feed­back Draft]

May 27, 2023, 4:39 PM
19 points

16 votes

Overall karma indicates overall quality.

15 comments20 min readLW link

San Fran­cisco ACX Meetup “First Satur­day” June 3, 1 pm

guenaelMay 27, 2023, 1:58 PM
1 point

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link

Papers on pro­tein design

alexlyzhovMay 27, 2023, 1:18 AM
9 points

5 votes

Overall karma indicates overall quality.

0 comments3 min readLW link

D&D.Sci 5E: Re­turn of the League of Defenders

aphyerMay 26, 2023, 8:39 PM
42 points

14 votes

Overall karma indicates overall quality.

11 comments3 min readLW link

Seek­ing (Paid) Case Stud­ies on Standards

HoldenKarnofskyMay 26, 2023, 5:58 PM
69 points

21 votes

Overall karma indicates overall quality.

9 comments11 min readLW link