Eval­u­at­ing Ev­i­dence Re­con­struc­tions of Mock Crimes -Sub­mis­sion 2

Alan E DunneMay 24, 2023, 10:17 PM
−1 points

2 votes

Overall karma indicates overall quality.

1 comment3 min readLW link

[Linkpost] In­ter­pretabil­ity Dreams

DanielFilanMay 24, 2023, 9:08 PM
39 points

13 votes

Overall karma indicates overall quality.

2 comments2 min readLW link
(transformer-circuits.pub)

Rishi Su­nak men­tions “ex­is­ten­tial threats” in talk with OpenAI, Deep­Mind, An­thropic CEOs

May 24, 2023, 9:06 PM
34 points

18 votes

Overall karma indicates overall quality.

1 comment1 min readLW link
(www.gov.uk)

If you’re not a morn­ing per­son, con­sider quit­ting allergy pills

Brendan LongMay 24, 2023, 8:11 PM
8 points

5 votes

Overall karma indicates overall quality.

3 comments1 min readLW link
(www.brendanlong.com)

Adum­bra­tions on AGI from an outsider

nicholashaldenMay 24, 2023, 5:41 PM
57 points

34 votes

Overall karma indicates overall quality.

44 comments8 min readLW link
(nicholashalden.home.blog)

Open Thread With Ex­per­i­men­tal Fea­ture: Reactions

jimrandomhMay 24, 2023, 4:46 PM
101 points

48 votes

Overall karma indicates overall quality.

189 comments3 min readLW link

A re­jec­tion of the Orthog­o­nal­ity Thesis

ArisCMay 24, 2023, 4:37 PM
−2 points

16 votes

Overall karma indicates overall quality.

11 comments2 min readLW link
(medium.com)

Aligned AI via mon­i­tor­ing ob­jec­tives in Au­toGPT-like systems

Paul CologneseMay 24, 2023, 3:59 PM
27 points

14 votes

Overall karma indicates overall quality.

4 comments4 min readLW link

The Office of Science and Tech­nol­ogy Policy put out a re­quest for in­for­ma­tion on A.I.

HiroSakurabaMay 24, 2023, 1:33 PM
60 points

25 votes

Overall karma indicates overall quality.

4 comments1 min readLW link
(www.whitehouse.gov)

ChatGPT (May 2023) on De­sign­ing Friendly Superintelligence

Mitchell_PorterMay 24, 2023, 10:47 AM
5 points

7 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(singularitypolitics.wordpress.com)

No—AI is just as en­ergy-effi­cient as your brain.

Maxwell ClarkeMay 24, 2023, 2:30 AM
11 points

16 votes

Overall karma indicates overall quality.

7 comments1 min readLW link

[Question] What pro­jects and efforts are there to pro­mote AI safety re­search?

Christopher KingMay 24, 2023, 12:33 AM
4 points

2 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

My May 2023 pri­ori­ties for AI x-safety: more em­pa­thy, more unifi­ca­tion of con­cerns, and less vil­ifi­ca­tion of OpenAI

Andrew_CritchMay 24, 2023, 12:02 AM
268 points

137 votes

Overall karma indicates overall quality.

39 comments8 min readLW link

AI Safety Newslet­ter #7: Dis­in­for­ma­tion, Gover­nance Recom­men­da­tions for AI labs, and Se­nate Hear­ings on AI

May 23, 2023, 9:47 PM
25 points

8 votes

Overall karma indicates overall quality.

0 comments6 min readLW link
(newsletter.safe.ai)

The Po­lar­ity Prob­lem [Draft]

May 23, 2023, 9:05 PM
24 points

12 votes

Overall karma indicates overall quality.

3 comments44 min readLW link

Progress links and tweets, 2023-05-23

jasoncrawfordMay 23, 2023, 8:15 PM
16 points

6 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(rootsofprogress.org)

Yoshua Ben­gio: How Rogue AIs may Arise

harfeMay 23, 2023, 6:28 PM
92 points

41 votes

Overall karma indicates overall quality.

12 comments18 min readLW link
(yoshuabengio.org)

‘Fun­da­men­tal’ vs ‘ap­plied’ mechanis­tic in­ter­pretabil­ity research

Lee SharkeyMay 23, 2023, 6:26 PM
65 points

31 votes

Overall karma indicates overall quality.

6 comments3 min readLW link

Co­er­cion is an adap­ta­tion to scarcity; trust is an adap­ta­tion to abundance

Richard_NgoMay 23, 2023, 6:14 PM
90 points

35 votes

Overall karma indicates overall quality.

11 comments4 min readLW link

[Question] Is “brit­tle al­ign­ment” good enough?

the8thbitMay 23, 2023, 5:35 PM
9 points

4 votes

Overall karma indicates overall quality.

5 comments3 min readLW link

Will Ar­tifi­cial Su­per­in­tel­li­gence Kill Us?

James_MillerMay 23, 2023, 4:27 PM
33 points

12 votes

Overall karma indicates overall quality.

2 comments22 min readLW link

Phone Num­ber Jingle

jefftkMay 23, 2023, 3:20 PM
11 points

3 votes

Overall karma indicates overall quality.

12 comments1 min readLW link
(www.jefftk.com)

GPT4 is ca­pa­ble of writ­ing de­cent long-form sci­ence fic­tion (with the right prompts)

RomanSMay 23, 2023, 1:41 PM
22 points

14 votes

Overall karma indicates overall quality.

28 comments65 min readLW link

[Question] Do hu­mans still provide value in cor­re­spon­dence chess?

Jonathan PaulsonMay 23, 2023, 12:15 PM
24 points

9 votes

Overall karma indicates overall quality.

16 comments1 min readLW link

[Linkpost] The AGI Show podcast

Soroush PourMay 23, 2023, 9:52 AM
4 points

3 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

Data and “to­kens” a 30 year old hu­man “trains” on

Jose Miguel Cruz y CelisMay 23, 2023, 5:34 AM
16 points

11 votes

Overall karma indicates overall quality.

15 comments1 min readLW link

How I learned to stop wor­ry­ing and love skill trees

junk heap homotopyMay 23, 2023, 4:08 AM
83 points

51 votes

Overall karma indicates overall quality.

3 comments1 min readLW link

T-Shirt Size Distribution

jefftkMay 23, 2023, 2:40 AM
9 points

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.jefftk.com)

AI self-im­prove­ment is possible

bhauthMay 23, 2023, 2:32 AM
18 points

7 votes

Overall karma indicates overall quality.

3 comments8 min readLW link

Wor­ry­ing less about acausal extortion

RaemonMay 23, 2023, 2:08 AM
40 points

31 votes

Overall karma indicates overall quality.

12 comments13 min readLW link

Self-lead­er­ship and self-love dis­solve anger and trauma

Richard_NgoMay 22, 2023, 10:30 PM
74 points

37 votes

Overall karma indicates overall quality.

7 comments5 min readLW link

A Man­i­fold mar­ket no­tice: Binance

Scrooge McduckMay 22, 2023, 10:24 PM
15 points

11 votes

Overall karma indicates overall quality.

13 comments1 min readLW link

I don’t want to talk about AI

KirstenHMay 22, 2023, 9:23 PM
34 points

23 votes

Overall karma indicates overall quality.

11 comments2 min readLW link
(ealifestyles.substack.com)

Ac­ti­va­tion ad­di­tions in a small resi­d­ual network

Garrett BakerMay 22, 2023, 8:28 PM
22 points

9 votes

Overall karma indicates overall quality.

4 comments3 min readLW link

[Linkpost] “Gover­nance of su­per­in­tel­li­gence” by OpenAI

Daniel_EthMay 22, 2023, 8:15 PM
67 points

35 votes

Overall karma indicates overall quality.

20 comments2 min readLW link
(openai.com)

Two Pie­ces of Ad­vice About How to Re­mem­ber Things

Bentham's BulldogMay 22, 2023, 6:10 PM
13 points

10 votes

Overall karma indicates overall quality.

3 comments4 min readLW link

Why I Believe LLMs Do Not Have Hu­man-like Emotions

OneManyNoneMay 22, 2023, 3:46 PM
13 points

13 votes

Overall karma indicates overall quality.

6 comments7 min readLW link

AI Safety in China: Part 2

Lao MeinMay 22, 2023, 2:50 PM
103 points

63 votes

Overall karma indicates overall quality.

28 comments2 min readLW link

Con­jec­ture in­ter­nal sur­vey: AGI timelines and prob­a­bil­ity of hu­man ex­tinc­tion from ad­vanced AI

Maris SalaMay 22, 2023, 2:31 PM
155 points

75 votes

Overall karma indicates overall quality.

5 comments3 min readLW link
(www.conjecture.dev)

Papers, Please #1: Var­i­ous Papers on Em­ploy­ment, Wages and Productivity

ZviMay 22, 2023, 12:00 PM
42 points

15 votes

Overall karma indicates overall quality.

2 comments8 min readLW link
(thezvi.wordpress.com)

In Defense of «The Army of Jakoths»

MikkWMay 22, 2023, 11:59 AM
−14 points

11 votes

Overall karma indicates overall quality.

10 comments4 min readLW link

Speed of in­for­ma­tion in­put is a bot­tle­neck for rationality

MikkWMay 22, 2023, 10:24 AM
13 points

4 votes

Overall karma indicates overall quality.

0 comments4 min readLW link

Distil­la­tion of Neu­rotech and Align­ment Work­shop Jan­uary 2023

May 22, 2023, 7:17 AM
52 points

26 votes

Overall karma indicates overall quality.

9 comments14 min readLW link

The Treach­er­ous Turn is finished! (AI-takeover-themed table­top RPG)

Daniel KokotajloMay 22, 2023, 5:49 AM
55 points

27 votes

Overall karma indicates overall quality.

5 comments2 min readLW link
(thetreacherousturn.ai)

The Stan­ley Parable: Mak­ing philos­o­phy fun

Nathan1123May 22, 2023, 2:15 AM
6 points

4 votes

Overall karma indicates overall quality.

3 comments3 min readLW link

Sea Monsters

Adam ZernerMay 22, 2023, 12:58 AM
30 points

16 votes

Overall karma indicates overall quality.

11 comments5 min readLW link

The Army of Jakoths (a parable)

MikkWMay 21, 2023, 10:48 PM
−6 points

9 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

A&I (Rihanna ‘S&M’ par­ody lyrics)

nahojMay 21, 2023, 10:34 PM
−2 points

6 votes

Overall karma indicates overall quality.

0 comments2 min readLW link

Four Bat­tle­grounds: Power in the Age of Ar­tifi­cial In­tel­li­gence (Book re­view)

PeterMcCluskeyMay 21, 2023, 9:19 PM
25 points

12 votes

Overall karma indicates overall quality.

0 comments4 min readLW link
(bayesianinvestor.com)

Gen­der Vec­tors in ROME’s La­tent Space

XodarapMay 21, 2023, 6:46 PM
14 points

8 votes

Overall karma indicates overall quality.

2 comments3 min readLW link