OpenAI ap­points Re­tired U.S. Army Gen­eral Paul M. Naka­sone to Board of Directors

Joel BurgetJun 13, 2024, 9:28 PM
35 points

20 votes

Overall karma indicates overall quality.

10 comments1 min readLW link
(openai.com)

AI #68: Re­mark­ably Rea­son­able Reactions

ZviJun 13, 2024, 4:30 PM
46 points

27 votes

Overall karma indicates overall quality.

11 comments50 min readLW link
(thezvi.wordpress.com)

Four Fu­tures For Cog­ni­tive Labor

Maxwell TabarrokJun 13, 2024, 12:56 PM
14 points

11 votes

Overall karma indicates overall quality.

11 comments4 min readLW link
(www.maximum-progress.com)

Un­der­rated Proverbs

Arjun PanicksseryJun 13, 2024, 12:30 PM
13 points

14 votes

Overall karma indicates overall quality.

9 comments1 min readLW link
(arjunpanickssery.substack.com)

[Paper] AI Sand­bag­ging: Lan­guage Models can Strate­gi­cally Un­der­perform on Evaluations

Jun 13, 2024, 10:04 AM
84 points

35 votes

Overall karma indicates overall quality.

10 comments2 min readLW link
(arxiv.org)

Prob­a­bly Not a Ghost Story

George IngebretsenJun 12, 2024, 10:55 PM
27 points

13 votes

Overall karma indicates overall quality.

4 comments3 min readLW link

AiPhone

ZviJun 12, 2024, 10:20 PM
63 points

23 votes

Overall karma indicates overall quality.

4 comments14 min readLW link
(thezvi.wordpress.com)

microwave drilling is impractical

bhauthJun 12, 2024, 10:16 PM
59 points

32 votes

Overall karma indicates overall quality.

19 comments4 min readLW link
(www.bhauth.com)

Phonose­man­tic Duplication

bitcoinssgJun 12, 2024, 8:19 PM
5 points

3 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

My AI Model Delta Com­pared To Christiano

johnswentworthJun 12, 2024, 6:19 PM
191 points

92 votes

Overall karma indicates overall quality.

74 comments4 min readLW link

AI: 4 lev­els of im­pact [micro­p­ost]

Mati_RoyJun 12, 2024, 4:58 PM
8 points

5 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

Ag­grega­tive prin­ci­ples ap­prox­i­mate util­i­tar­ian principles

Cleo NardoJun 12, 2024, 4:27 PM
28 points

10 votes

Overall karma indicates overall quality.

3 comments23 min readLW link

Sticker Short­cut Fal­lacy — The Real Worst Ar­gu­ment in the World

ymeskhoutJun 12, 2024, 2:52 PM
27 points

23 votes

Overall karma indicates overall quality.

15 comments4 min readLW link
(www.ymeskhout.com)

Long-Term Fu­ture Fund: May 2023 to March 2024 Pay­out recommendations

LinchJun 12, 2024, 1:46 PM
40 points

15 votes

Overall karma indicates overall quality.

0 comments13 min readLW link

An­thropic’s Cer­tifi­cate of Incorporation

Zach Stein-PerlmanJun 12, 2024, 1:00 PM
115 points

39 votes

Overall karma indicates overall quality.

7 comments4 min readLW link

Calcu­lance: A “Core” Ability

milanroskoJun 12, 2024, 7:21 AM
4 points

10 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

AXRP Epi­sode 33 - RLHF Prob­lems with Scott Emmons

DanielFilanJun 12, 2024, 3:30 AM
34 points

9 votes

Overall karma indicates overall quality.

0 comments56 min readLW link

[New Fea­ture] Your Sub­scribed Feed

Jun 11, 2024, 10:45 PM
77 points

27 votes

Overall karma indicates overall quality.

13 comments4 min readLW link

Open Thread Sum­mer 2024

habrykaJun 11, 2024, 8:57 PM
22 points

7 votes

Overall karma indicates overall quality.

99 comments1 min readLW link

Can effi­ciency-ad­justable re­port­ing thresh­olds close a loop­hole in Bi­den’s ex­ec­u­tive or­der on AI?

Jemal YoungJun 11, 2024, 8:56 PM
4 points

2 votes

Overall karma indicates overall quality.

1 comment2 min readLW link

“Full Au­toma­tion” is a Slip­pery Metric

ozziegooenJun 11, 2024, 7:56 PM
30 points

15 votes

Overall karma indicates overall quality.

1 comment2 min readLW link

AI take­off and nu­clear war

owencbJun 11, 2024, 7:36 PM
80 points

23 votes

Overall karma indicates overall quality.

6 comments11 min readLW link
(strangecities.substack.com)

[Question] What do peo­ple think about the poly­mar­ket Eth Etf re­s­olu­tion?

edge_retainerJun 11, 2024, 6:34 PM
1 point

1 vote

Overall karma indicates overall quality.

0 comments1 min readLW link

Let’s De­sign A School, Part 3.1: Bring­ing it all to­gether with the Sieve Model

SableJun 11, 2024, 5:03 PM
13 points

3 votes

Overall karma indicates overall quality.

2 comments7 min readLW link
(affablyevil.substack.com)

How to elimi­nate cut?

jessicataJun 11, 2024, 3:54 PM
22 points

9 votes

Overall karma indicates overall quality.

0 comments14 min readLW link
(unstableontology.com)

my favourite Scott Sum­ner blog posts

DMMFJun 11, 2024, 2:40 PM
26 points

9 votes

Overall karma indicates overall quality.

0 comments3 min readLW link
(danfrank.ca)

[Question] Is any­one de­vel­op­ing op­ti­mi­sa­tion-ro­bust in­ter­pretabil­ity meth­ods?

JonoJun 11, 2024, 1:14 PM
6 points

3 votes

Overall karma indicates overall quality.

0 comments1 min readLW link

Keep the Grass Guessing

JackOfAllTradesJun 11, 2024, 7:29 AM
4 points

4 votes

Overall karma indicates overall quality.

0 comments2 min readLW link

“Me­tas­trate­gic Brain­storm­ing”, a core build­ing-block skill

RaemonJun 11, 2024, 4:27 AM
64 points

35 votes

Overall karma indicates overall quality.

5 comments6 min readLW link

AI De­bate Sta­bil­ity: Ad­dress­ing Self-Defeat­ing Responses

Annie SorkinJun 11, 2024, 3:03 AM
9 points

5 votes

Overall karma indicates overall quality.

0 comments3 min readLW link

Cor­rigi­bil­ity could make things worse

ThomasCederborgJun 11, 2024, 12:55 AM
9 points

10 votes

Overall karma indicates overall quality.

6 comments6 min readLW link

DPO/​PPO-RLHF on LLMs in­cen­tivizes syco­phancy, ex­ag­ger­a­tion and de­cep­tive hal­lu­ci­na­tion, but not mis­al­igned powerseeking

tailcalledJun 10, 2024, 9:20 PM
29 points

16 votes

Overall karma indicates overall quality.

13 comments2 min readLW link

Plop! Goes the Concept

Jonathan MoregårdJun 10, 2024, 7:23 PM
6 points

3 votes

Overall karma indicates overall quality.

0 comments8 min readLW link
(honestliving.substack.com)

What can we learn from or­cas?

JonasbJun 10, 2024, 6:01 PM
1 point

4 votes

Overall karma indicates overall quality.

0 comments8 min readLW link
(www.denominations.io)

How to build a data cen­ter, by Con­struc­tion Physics

TheManxLoinerJun 10, 2024, 5:38 PM
2 points

3 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.construction-physics.com)

Ob­ser­va­tions for do­ing de­bate with mod­els be­hind APIs

PoD123Jun 10, 2024, 4:22 PM
3 points

2 votes

Overall karma indicates overall quality.

0 comments3 min readLW link

My AI Model Delta Com­pared To Yudkowsky

johnswentworthJun 10, 2024, 4:12 PM
291 points

149 votes

Overall karma indicates overall quality.

103 comments4 min readLW link

[Question] Good ways to mon­e­tar­ily profit from the in­creas­ing de­mand for power?

Matt GoldenbergJun 10, 2024, 3:29 PM
12 points

8 votes

Overall karma indicates overall quality.

5 comments1 min readLW link

The Evolu­tion to­wards the Blank Slate

Arturo MaciasJun 10, 2024, 3:20 PM
−6 points

8 votes

Overall karma indicates overall quality.

0 comments3 min readLW link

10 Public “I was wrong” Ad­mis­sions by Scien­tists and Intellectuals

Hashem ElAssadJun 10, 2024, 2:19 PM
0 points

7 votes

Overall karma indicates overall quality.

3 comments1 min readLW link

[Valence se­ries] 4. Valence & Lik­ing /​ Admiring

Steven ByrnesJun 10, 2024, 2:19 PM
48 points

11 votes

Overall karma indicates overall quality.

12 comments15 min readLW link

5. Open Cor­rigi­bil­ity Questions

Max HarmsJun 10, 2024, 2:09 PM
30 points

11 votes

Overall karma indicates overall quality.

0 comments7 min readLW link

4. Ex­ist­ing Writ­ing on Corrigibility

Max HarmsJun 10, 2024, 2:08 PM
55 points

16 votes

Overall karma indicates overall quality.

17 comments106 min readLW link

On Dwarksh’s Pod­cast with Leopold Aschenbrenner

ZviJun 10, 2024, 12:40 PM
102 points

33 votes

Overall karma indicates overall quality.

7 comments59 min readLW link
(thezvi.wordpress.com)

Sum­mary of Si­tu­a­tional Aware­ness—The Decade Ahead

OscarJun 10, 2024, 8:44 AM
6 points

7 votes

Overall karma indicates overall quality.

2 comments1 min readLW link
(forum.effectivealtruism.org)

Why I don’t be­lieve in the placebo effect

transhumanist_atom_understanderJun 10, 2024, 2:37 AM
135 points

70 votes

Overall karma indicates overall quality.

22 comments9 min readLW link

Soviet com­edy film recommendations

Nina PanicksseryJun 9, 2024, 11:40 PM
42 points

20 votes

Overall karma indicates overall quality.

11 comments2 min readLW link
(open.substack.com)

The Data Wall is Important

JustisMillsJun 9, 2024, 10:54 PM
40 points

19 votes

Overall karma indicates overall quality.

20 comments2 min readLW link
(justismills.substack.com)

Two Fam­ily Dance Flyers

jefftkJun 9, 2024, 8:50 PM
13 points

5 votes

Overall karma indicates overall quality.

0 comments1 min readLW link
(www.jefftk.com)

[Question] What hap­pens to ex­ist­ing life sen­tences un­der LEV?

O OJun 9, 2024, 5:49 PM
5 points

2 votes

Overall karma indicates overall quality.

7 comments1 min readLW link