Testing for consequence-blindness in LLMs using the HI-ADS unit test.

David Scott Krueger (formerly: capybaralet) · Nov 24, 2023, 11:35 PM
25 points
2 comments · 2 min read · LW link

Epoch is hiring an ML Distributed Systems Senior Researcher

Nov 24, 2023, 10:33 PM
2 points
0 comments · 4 min read · LW link
(careers.rethinkpriorities.org)

Article Discussion And Free Pizza—St Paul

25Hour · Nov 24, 2023, 9:02 PM
1 point
0 comments · 1 min read · LW link

Why focus on schemers in particular (Sections 1.3 and 1.4 of “Scheming AIs”)

Joe Carlsmith · Nov 24, 2023, 7:18 PM
8 points
0 comments · 22 min read · LW link

Surviving and Shaping Long-Term Competitions: Lessons from Net Assessment

Nov 24, 2023, 6:18 PM
5 points
0 comments · 13 min read · LW link

Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense

So8res · Nov 24, 2023, 5:37 PM
197 points
84 comments · 5 min read · LW link · 1 review

The Limitations of GPT-4

p.b. · Nov 24, 2023, 3:30 PM
27 points
12 comments · 4 min read · LW link

Progress links digest, 2023-11-24: Bottlenecks of aging, Starship launches, and much more

jasoncrawford · Nov 24, 2023, 3:25 PM
40 points
1 comment · 14 min read · LW link
(rootsofprogress.org)

[Question] What’s the evidence that LLMs will scale up efficiently beyond GPT4? i.e. couldn’t GPT5, etc., be very inefficient?

M. Y. Zuo · Nov 24, 2023, 3:22 PM
9 points
6 comments · 1 min read · LW link

Sapience, understanding, and “AGI”

Seth Herd · Nov 24, 2023, 3:13 PM
15 points
3 comments · 6 min read · LW link

Insulate your ideas

Logan Kieller · Nov 24, 2023, 2:08 PM
18 points
5 comments · 2 min read · LW link
(logankieller.substack.com)

Bordeaux, Gironde, France – irregular ACX Meetup 2023-12-09

vi21maobk9vp · Nov 24, 2023, 11:17 AM
5 points
1 comment · 1 min read · LW link

[Question] A Question For People Who Believe In God

yanni kyriacos · Nov 24, 2023, 5:22 AM
3 points
38 comments · 1 min read · LW link

[Question] First and Last Questions for GPT-5*

Mitchell_Porter · Nov 24, 2023, 5:03 AM
15 points
5 comments · 1 min read · LW link

4. A Moral Case for Evolved-Sapience-Chauvinism

RogerDearnaley · Nov 24, 2023, 4:56 AM
10 points
0 comments · 4 min read · LW link

Detecting What’s Been Seen

jefftk · Nov 24, 2023, 3:30 AM
23 points
0 comments · 2 min read · LW link
(www.jefftk.com)

[Question] Help to find a blog I don’t remember the name of

JavierCC · Nov 23, 2023, 10:49 PM
3 points
2 comments · 1 min read · LW link

[Question] What did you change your mind about in the last year?

mike_hawke · Nov 23, 2023, 8:53 PM
41 points
16 comments · 1 min read · LW link

A few Superhuman examples of Superaligned Superintelligence from Google Bard (Thanksgiving 2023)

Nov 23, 2023, 7:06 PM
−9 points
1 comment · 17 min read · LW link

Prepsgiving, A Convergently Instrumental Human Practice

JenniferRM · Nov 23, 2023, 5:24 PM
39 points
0 comments · 8 min read · LW link

AI #39: The Week of OpenAI

Zvi · Nov 23, 2023, 3:10 PM
67 points
8 comments · 28 min read · LW link
(thezvi.wordpress.com)

3. Uploading

RogerDearnaley · Nov 23, 2023, 7:39 AM
21 points
5 comments · 8 min read · LW link

2. AIs as Economic Agents

RogerDearnaley · Nov 23, 2023, 7:07 AM
9 points
2 comments · 6 min read · LW link

Thomas Kwa’s research journal

Nov 23, 2023, 5:11 AM
79 points
1 comment · 6 min read · LW link

Never Drop A Ball

Screwtape · Nov 23, 2023, 4:15 AM
101 points
8 comments · 6 min read · LW link · 1 review

Possible OpenAI’s Q* breakthrough and DeepMind’s AlphaGo-type systems plus LLMs

Burny · Nov 23, 2023, 3:16 AM
37 points
25 comments · 2 min read · LW link

Boston Secular Solstice: Call for Singers and Musicans

jefftk · Nov 23, 2023, 2:40 AM
16 points
2 comments · 1 min read · LW link
(www.jefftk.com)

My Mental Model of Infohazards

MadHatter · Nov 23, 2023, 2:37 AM
8 points
34 comments · 2 min read · LW link · 1 review

Saturating the Difficulty Levels of Alignment

Johannes C. Mayer · Nov 23, 2023, 12:39 AM
6 points
0 comments · 2 min read · LW link

Sacramento LW/ACX Meetup

mcint · Nov 22, 2023, 11:52 PM
1 point
0 comments · 1 min read · LW link

Sam Altman’s ouster at OpenAI was precipitated by letter to board about AI breakthrough—Reuters

Jonathan Yan · Nov 22, 2023, 11:17 PM
18 points
11 comments · 1 min read · LW link
(www.reuters.com)

Foresight Institute: 2023 Progress & 2024 Plans for funding beneficial technology development

Allison Duettmann · Nov 22, 2023, 10:09 PM
24 points
1 comment · 6 min read · LW link

AISC project: TinyEvals

Jett Janiak · Nov 22, 2023, 8:47 PM
22 points
0 comments · 4 min read · LW link

The proposal to add a ``Last Judge″ to an AI, does not remove the urgency, of making progress on the ``what alignment target should be aimed at?″ question.

ThomasCederborg · Nov 22, 2023, 6:59 PM
1 point
0 comments · 18 min read · LW link

Neither Copernicus, Galileo, nor Kepler had proof

Meow P · Nov 22, 2023, 6:41 PM
4 points
10 comments · 1 min read · LW link
(www.cricetuscricetus.co.uk)

OpenAI: The Battle of the Board

Zvi · Nov 22, 2023, 5:30 PM
281 points
83 comments · 11 min read · LW link
(thezvi.wordpress.com)

Altman returns as OpenAI CEO with new board

Seth Herd · Nov 22, 2023, 4:04 PM
6 points
3 comments · 1 min read · LW link

A taxonomy of non-schemer models (Section 1.2 of “Scheming AIs”)

Joe Carlsmith · Nov 22, 2023, 3:24 PM
13 points
0 comments · 13 min read · LW link

AI debate: test yourself against chess ‘AIs’

Richard Willis · Nov 22, 2023, 2:58 PM
26 points
35 comments · 4 min read · LW link

Public Call for Interest in Mathematical Alignment

Davidmanheim · Nov 22, 2023, 1:22 PM
90 points
9 comments · 1 min read · LW link

How “Pinky Promise” diplomacy once stopped a war in the Middle East

positivesum · Nov 22, 2023, 12:03 PM
15 points
9 comments · 1 min read · LW link
(tryingtruly.substack.com)

Alignment, conflict, powerseeking

Oliver Sourbut · Nov 22, 2023, 9:47 AM
6 points
1 comment · 1 min read · LW link

[Bias] Restricting freedom is more harmful than it seems

lsusr · Nov 22, 2023, 9:44 AM
17 points
15 comments · 1 min read · LW link

Portable Chargers are Great

jefftk · Nov 22, 2023, 2:50 AM
21 points
2 comments · 1 min read · LW link
(www.jefftk.com)

Atlantis: Berkeley event venue available for rent

Jonas V · Nov 22, 2023, 1:47 AM
45 points
0 comments · 2 min read · LW link

[Question] How much should e-signatures have to cost a country?

FlorianH · Nov 21, 2023, 10:45 PM
5 points
5 comments · 1 min read · LW link

My first conversation with Annie Altman

Remmelt · Nov 21, 2023, 9:58 PM
8 points
3 comments · 1 min read · LW link
(open.spotify.com)

Userscript to always show LW comments in context vs at the top

Vlad Sitalo · Nov 21, 2023, 5:53 PM
44 points
8 comments · 1 min read · LW link

Dialogue on the Claim: “OpenAI’s Firing of Sam Altman (And Shortly-Subsequent Events) On Net Reduced Existential Risk From AGI”

Nov 21, 2023, 5:39 PM
73 points
84 comments · 11 min read · LW link

AI Alignment [progress] this Week (11/19/2023)

Logan Zoellner · Nov 21, 2023, 4:09 PM
18 points
3 comments · 5 min read · LW link
(midwitalignment.substack.com)