[Question] Help to find a blog I don’t re­mem­ber the name of

JavierCC23 Nov 2023 22:49 UTC
3 points
2 comments1 min readLW link

[Question] What did you change your mind about in the last year?

mike_hawke23 Nov 2023 20:53 UTC
41 points
16 comments1 min readLW link

An Idea on How LLMs Can Show Self-Serv­ing Bias

Bruce W. Lee23 Nov 2023 20:25 UTC
6 points
6 comments3 min readLW link

A few Su­per­hu­man ex­am­ples of Su­per­al­igned Su­per­in­tel­li­gence from Google Bard (Thanks­giv­ing 2023)

23 Nov 2023 19:06 UTC
−9 points
1 comment17 min readLW link

Preps­giv­ing, A Con­ver­gently In­stru­men­tal Hu­man Practice

JenniferRM23 Nov 2023 17:24 UTC
35 points
0 comments7 min readLW link

AI #39: The Week of OpenAI

Zvi23 Nov 2023 15:10 UTC
67 points
8 comments28 min readLW link
(thezvi.wordpress.com)

3. Uploading

RogerDearnaley23 Nov 2023 7:39 UTC
21 points
5 comments8 min readLW link

2. AIs as Eco­nomic Agents

RogerDearnaley23 Nov 2023 7:07 UTC
9 points
2 comments6 min readLW link

Thomas Kwa’s re­search journal

23 Nov 2023 5:11 UTC
79 points
1 comment6 min readLW link

Never Drop A Ball

Screwtape23 Nov 2023 4:15 UTC
62 points
1 comment6 min readLW link

Pos­si­ble OpenAI’s Q* break­through and Deep­Mind’s AlphaGo-type sys­tems plus LLMs

Burny23 Nov 2023 3:16 UTC
37 points
25 comments2 min readLW link

Bos­ton Sec­u­lar Sols­tice: Call for Singers and Musicans

jefftk23 Nov 2023 2:40 UTC
16 points
2 comments1 min readLW link
(www.jefftk.com)

My Men­tal Model of Infohazards

MadHatter23 Nov 2023 2:37 UTC
7 points
33 comments2 min readLW link

Sat­u­rat­ing the Difficulty Levels of Alignment

Johannes C. Mayer23 Nov 2023 0:39 UTC
6 points
0 comments2 min readLW link

Sacra­mento LW/​ACX Meetup

mcint22 Nov 2023 23:52 UTC
1 point
0 comments1 min readLW link

Sam Alt­man’s ouster at OpenAI was pre­cip­i­tated by let­ter to board about AI break­through—Reuters

Jonathan Yan22 Nov 2023 23:17 UTC
18 points
11 comments1 min readLW link
(www.reuters.com)

Fore­sight In­sti­tute: 2023 Progress & 2024 Plans for fund­ing benefi­cial tech­nol­ogy development

Allison Duettmann22 Nov 2023 22:09 UTC
24 points
1 comment6 min readLW link

AISC pro­ject: TinyEvals

Jett22 Nov 2023 20:47 UTC
17 points
0 comments4 min readLW link

The pro­posal to add a ``Last Judge″ to an AI, does not re­move the ur­gency, of mak­ing progress on the ``what al­ign­ment tar­get should be aimed at?″ ques­tion.

ThomasCederborg22 Nov 2023 18:59 UTC
1 point
0 comments18 min readLW link

Nei­ther Coper­ni­cus, Gal­ileo, nor Ke­pler had proof

Meow P22 Nov 2023 18:41 UTC
4 points
10 comments1 min readLW link
(www.cricetuscricetus.co.uk)

So you want to save the world? An ac­count in paladinhood

Tamsin Leake22 Nov 2023 17:40 UTC
65 points
19 comments15 min readLW link
(carado.moe)

OpenAI: The Bat­tle of the Board

Zvi22 Nov 2023 17:30 UTC
277 points
82 comments11 min readLW link
(thezvi.wordpress.com)

Alt­man re­turns as OpenAI CEO with new board

Seth Herd22 Nov 2023 16:04 UTC
5 points
3 comments1 min readLW link

A tax­on­omy of non-schemer mod­els (Sec­tion 1.2 of “Schem­ing AIs”)

Joe Carlsmith22 Nov 2023 15:24 UTC
13 points
0 comments13 min readLW link

AI de­bate: test your­self against chess ‘AIs’

Richard Willis22 Nov 2023 14:58 UTC
26 points
35 comments4 min readLW link

Public Call for In­ter­est in Math­e­mat­i­cal Alignment

Davidmanheim22 Nov 2023 13:22 UTC
89 points
9 comments1 min readLW link

How “Pinky Promise” diplo­macy once stopped a war in the Mid­dle East

positivesum22 Nov 2023 12:03 UTC
15 points
9 comments1 min readLW link
(tryingtruly.substack.com)

Align­ment, con­flict, powerseeking

Oliver Sourbut22 Nov 2023 9:47 UTC
6 points
1 comment1 min readLW link

[Bias] Restrict­ing free­dom is more harm­ful than it seems

lsusr22 Nov 2023 9:44 UTC
18 points
15 comments1 min readLW link

Portable Charg­ers are Great

jefftk22 Nov 2023 2:50 UTC
21 points
2 comments1 min readLW link
(www.jefftk.com)

At­lantis: Berkeley event venue available for rent

Jonas V22 Nov 2023 1:47 UTC
45 points
0 comments2 min readLW link

[Question] How much should e-sig­na­tures have to cost a coun­try?

FlorianH21 Nov 2023 22:45 UTC
5 points
5 comments1 min readLW link

My first con­ver­sa­tion with An­nie Altman

Remmelt21 Nov 2023 21:58 UTC
8 points
3 comments1 min readLW link
(open.spotify.com)

User­script to always show LW com­ments in con­text vs at the top

Vlad Sitalo21 Nov 2023 17:53 UTC
44 points
8 comments1 min readLW link

Dialogue on the Claim: “OpenAI’s Firing of Sam Alt­man (And Shortly-Sub­se­quent Events) On Net Re­duced Ex­is­ten­tial Risk From AGI”

21 Nov 2023 17:39 UTC
73 points
84 comments11 min readLW link

AI Align­ment [progress] this Week (11/​19/​2023)

Logan Zoellner21 Nov 2023 16:09 UTC
17 points
3 comments5 min readLW link
(midwitalignment.substack.com)

Va­ri­eties of fake al­ign­ment (Sec­tion 1.1 of “Schem­ing AIs”)

Joe Carlsmith21 Nov 2023 15:00 UTC
15 points
0 comments12 min readLW link

Align­ment can im­prove gen­er­al­i­sa­tion through more ro­bustly do­ing what a hu­man wants—CoinRun example

Stuart_Armstrong21 Nov 2023 11:41 UTC
68 points
9 comments3 min readLW link

AI Safety Re­search Or­ga­ni­za­tion In­cu­ba­tion Pro­gram—Ex­pres­sion of Interest

21 Nov 2023 10:23 UTC
65 points
6 comments1 min readLW link

Scott Alexan­der is wrong about slurs

[deactivated]21 Nov 2023 8:43 UTC
−27 points
29 comments2 min readLW link

Steel­man­ning The Devil

Screwtape21 Nov 2023 7:28 UTC
10 points
0 comments5 min readLW link

How to type Alek­sander Mądry’s last name in LaTeX

DanielFilan21 Nov 2023 0:50 UTC
9 points
1 comment1 min readLW link
(danielfilan.com)

Why not elec­tric trains and ex­ca­va­tors?

bhauth21 Nov 2023 0:07 UTC
67 points
39 comments5 min readLW link
(www.bhauth.com)

Vote on worth­while OpenAI top­ics to discuss

21 Nov 2023 0:03 UTC
61 points
55 comments1 min readLW link

The na­tional se­cu­rity di­men­sion of OpenAI’s lead­er­ship struggle

Mitchell_Porter20 Nov 2023 23:57 UTC
3 points
3 comments2 min readLW link

[Question] What will you think about the Cur­rent Thing in a year?

mike_hawke20 Nov 2023 22:39 UTC
21 points
0 comments2 min readLW link

Me­tac­u­lus In­tro­duces New Fore­cast Scores, New Leader­board & Medals

ChristianWilliams20 Nov 2023 20:33 UTC
15 points
2 comments1 min readLW link
(www.metaculus.com)

[Question] “Use­less Box” AGI

Cago20 Nov 2023 19:07 UTC
1 point
2 comments1 min readLW link

[Question] Ad­vice on choos­ing an al­co­hol re­hab cen­ter?

Slingshot927120 Nov 2023 18:46 UTC
2 points
1 comment1 min readLW link

Agent Boundaries Aren’t Markov Blan­kets. [Un­less they’re non-causal; see com­ments.]

abramdemski20 Nov 2023 18:23 UTC
81 points
8 comments2 min readLW link