An A.I. Safety Pre­sen­ta­tion at RIT

Nicholas KrossMar 27, 2023, 11:49 PM
8 points

3 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.youtube.com)

Which AI out­puts should hu­mans check for shenani­gans, to avoid AI takeover? A sim­ple model

Tom DavidsonMar 27, 2023, 11:36 PM
16 points

9 votes

Overall karma indicates overall quality.

3 comments8 min readLW link

The Prospect of an AI Winter

Erich_GrunewaldMar 27, 2023, 8:55 PM
62 points

26 votes

Overall karma indicates overall quality.

24 comments15 min readLW link
(www.erichgrunewald.com)

[Question] Best ar­gu­ments against the out­side view that AGI won’t be a huge deal, thus we sur­vive.

Noosphere89Mar 27, 2023, 8:49 PM
4 points

7 votes

Overall karma indicates overall quality.

7 comments1 min readLW link

EA & LW Fo­rum Weekly Sum­mary (20th − 26th March 2023)

Zoe WilliamsMar 27, 2023, 8:46 PM
4 points

1 vote

Overall karma indicates overall quality.

0 comments6 min readLW link

Three of my be­liefs about up­com­ing AGI

Robert_AIZIMar 27, 2023, 8:27 PM
6 points

3 votes

Overall karma indicates overall quality.

0 comments3 min readLW link
(aizi.substack.com)

No­body knows how to re­li­ably test for AI safety

marcusarvanMar 27, 2023, 7:48 PM
1 point

1 vote

Overall karma indicates overall quality.

0 comments5 min readLW link

New blog: Planned Obsolescence

Ajeya CotraMar 27, 2023, 7:46 PM
96 points

45 votes

Overall karma indicates overall quality.

7 comments1 min readLW link
(www.planned-obsolescence.org)

South Bay ACX/​SSC Spring Mee­tups Everywhere

allisonaMar 27, 2023, 7:39 PM
2 points

2 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

[Question] Re­sources to see how peo­ple think/​ap­proach math­e­mat­ics and prob­lem-solving

zefMar 27, 2023, 7:12 PM
7 points

3 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

Stag­ger­ing Hunters

ScrewtapeMar 27, 2023, 7:11 PM
12 points

6 votes

Overall karma indicates overall quality.

2 comments5 min readLW link

Neu­rotech­nol­ogy is Crit­i­cal for AI Alignment

Milan CvitkovicMar 27, 2023, 6:27 PM
10 points

7 votes

Overall karma indicates overall quality.

3 comments1 min readLW link
(milan.cvitkovic.net)

[Question] Best re­sources to learn philos­o­phy of mind and AI?

Sky MooMar 27, 2023, 6:22 PM
1 point

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link

the ten­sor is a lonely place

jml6Mar 27, 2023, 6:22 PM
−11 points

4 votes

Overall karma indicates overall quality.

0 comments4 min readLW link
(ekjsgrjelrbno.substack.com)

[Question] Ber­mudez In­ter­face Problem

Motor VehicleMar 27, 2023, 6:11 PM
1 point

1 vote

Overall karma indicates overall quality.

2 comments1 min readLW link

Would you be a bet­ter RLHF la­beler than GPT-4?

kacheMar 27, 2023, 6:10 PM
1 point

3 votes

Overall karma indicates overall quality.

1 comment1 min readLW link

LLM Pow­ered LW Search

odraode17Mar 27, 2023, 6:09 PM
−1 points

2 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

An­nounc­ing the Swiss Ex­is­ten­tial Risk Ini­ti­a­tive (CHERI) 2023 Re­search Fellowship

Tobias HMar 27, 2023, 4:36 PM
3 points

1 vote

Overall karma indicates overall quality.

0 comments2 min readLW link

In­dus­tri­al­iza­tion/​Com­put­er­i­za­tion Analogies

Gordon Seidoh WorleyMar 27, 2023, 4:34 PM
16 points

9 votes

Overall karma indicates overall quality.

2 comments2 min readLW link

Les­sons from Con­ver­gent Evolu­tion for AI Alignment

Mar 27, 2023, 4:25 PM
54 points

27 votes

Overall karma indicates overall quality.

9 comments8 min readLW link

GPT-4 is bad at strate­gic thinking

Christopher KingMar 27, 2023, 3:11 PM
22 points

15 votes

Overall karma indicates overall quality.

8 comments1 min readLW link

The salt in pasta wa­ter fallacy

Thomas SepulchreMar 27, 2023, 2:53 PM
235 points

164 votes

Overall karma indicates overall quality.

52 comments3 min readLW link2 reviews

CAIS-in­spired ap­proach to­wards safer and more in­ter­pretable AGIs

Peter HroššoMar 27, 2023, 2:36 PM
13 points

5 votes

Overall karma indicates overall quality.

7 comments1 min readLW link

An Overview of Sparks of Ar­tifi­cial Gen­eral In­tel­li­gence: Early ex­per­i­ments with GPT-4

AnnapurnaMar 27, 2023, 1:44 PM
10 points

7 votes

Overall karma indicates overall quality.

0 comments7 min readLW link
(jorgevelez.substack.com)

A Hive­mind of GPT-4 bots REALLY IS A HIVEMIND!

Erlja Jkdf.Mar 27, 2023, 12:44 PM
−10 points

5 votes

Overall karma indicates overall quality.

1 comment1 min readLW link

Du­ploish Mar­ble Runs

jefftkMar 27, 2023, 12:20 PM
26 points

12 votes

Overall karma indicates overall quality.

1 comment1 min readLW link
(www.jefftk.com)

GPT-4 Plugs In

ZviMar 27, 2023, 12:10 PM
198 points

108 votes

Overall karma indicates overall quality.

47 comments6 min readLW link
(thezvi.wordpress.com)

Please help me sense-check my as­sump­tions about the needs of the AI Safety com­mu­nity and re­lated ca­reer plans

peterslatteryMar 27, 2023, 8:23 AM
6 points

4 votes

Overall karma indicates overall quality.

4 comments2 min readLW link

Prac­ti­cal Pit­falls of Causal Scrubbing

Mar 27, 2023, 7:47 AM
87 points

36 votes

Overall karma indicates overall quality.

17 comments13 min readLW link

[Question] What If: An Earthquake in Taiwan?

SableMar 27, 2023, 7:31 AM
8 points

3 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

What can we learn from Lex Frid­man’s in­ter­view with Sam Alt­man?

Karl von WendtMar 27, 2023, 6:27 AM
56 points

33 votes

Overall karma indicates overall quality.

22 comments9 min readLW link

[Question] Steel­man­ning OpenAI’s Short-Timelines Slow-Take­off Goal

FinalFormal2Mar 27, 2023, 2:55 AM
5 points

3 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

The de­fault out­come for al­igned AGI still looks pretty bad

GeneSmithMar 27, 2023, 12:02 AM
14 points

19 votes

Overall karma indicates overall quality.

19 comments3 min readLW link

LLM Mo­du­lar­ity: The Separa­bil­ity of Ca­pa­bil­ities in Large Lan­guage Models

NickyPMar 26, 2023, 9:57 PM
99 points

55 votes

Overall karma indicates overall quality.

3 comments41 min readLW link

Test­ing ChatGPT for white lies

twkaiserMar 26, 2023, 9:32 PM
3 points

2 votes

Overall karma indicates overall quality.

2 comments6 min readLW link

Don’t take bad op­tions away from people

Dumbledore's ArmyMar 26, 2023, 8:12 PM
42 points

66 votes

Overall karma indicates overall quality.

100 comments5 min readLW link

What would a com­pute mon­i­tor­ing plan look like? [Linkpost]

Orpheus16Mar 26, 2023, 7:33 PM
158 points

76 votes

Overall karma indicates overall quality.

10 comments4 min readLW link
(arxiv.org)

[Question] GPT-4 Specs: 1 Trillion Pa­ram­e­ters?

infinibot27Mar 26, 2023, 6:56 PM
6 points

5 votes

Overall karma indicates overall quality.

8 comments1 min readLW link

Sen­tience in Machines—How Do We Test for This Ob­jec­tively?

Mayowa OsiboduMar 26, 2023, 6:56 PM
−2 points

5 votes

Overall karma indicates overall quality.

0 comments2 min readLW link
(www.researchgate.net)

If it quacks like a duck...

RationalMindsetMar 26, 2023, 6:54 PM
−4 points

5 votes

Overall karma indicates overall quality.

0 comments4 min readLW link

Chronos­ta­sis: The Time-Cap­sule Co­nun­drum of Lan­guage Models

RationalMindsetMar 26, 2023, 6:54 PM
−5 points

6 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

[Question] What hap­pens with log­i­cal in­duc­tion when...

Donald HobsonMar 26, 2023, 6:31 PM
18 points

9 votes

Overall karma indicates overall quality.

2 comments1 min readLW link

Draft: In­tro­duc­tion to optimization

Alex_AltairMar 26, 2023, 5:25 PM
43 points

24 votes

Overall karma indicates overall quality.

8 comments16 min readLW link

Chat bot as CEO at NetDragon Websoft

ChristianKlMar 26, 2023, 4:01 PM
8 points

2 votes

Overall karma indicates overall quality.

2 comments1 min readLW link
(www.firstpost.com)

Dat­a­point: me­dian 10% AI x-risk men­tioned on Dutch pub­lic TV channel

Chris van MerwijkMar 26, 2023, 12:50 PM
17 points

7 votes

Overall karma indicates overall quality.

1 comment1 min readLW link

[Question] How Poli­tics in­ter­acts with AI ?

qbolecMar 26, 2023, 9:53 AM
−11 points

5 votes

Overall karma indicates overall quality.

4 comments1 min readLW link

De­scrip­tive vs. speci­fi­able values

TsviBTMar 26, 2023, 9:10 AM
17 points

9 votes

Overall karma indicates overall quality.

2 comments2 min readLW link

The al­ign­ment sta­bil­ity problem

Seth HerdMar 26, 2023, 2:10 AM
35 points

15 votes

Overall karma indicates overall quality.

15 comments4 min readLW link

Sur­vey on lifel­og­gers for a re­search project

Mati_RoyMar 26, 2023, 12:02 AM
20 points

7 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

Man­i­fold: If okay AGI, why?

Eliezer YudkowskyMar 25, 2023, 10:43 PM
120 points

58 votes

Overall karma indicates overall quality.

37 comments1 min readLW link
(manifold.markets)