Con­scious­ness is ir­rele­vant—in­stead solve al­ign­ment by ask­ing this question

Oliver Siegel4 Mar 2023 22:06 UTC
−10 points
6 comments1 min readLW link

More money with less risk: sell ser­vices in­stead of model access

lukehmiles4 Mar 2023 20:51 UTC
9 points
3 comments1 min readLW link

Con­tra “Strong Co­her­ence”

DragonGod4 Mar 2023 20:05 UTC
39 points
24 comments1 min readLW link

The Prac­ti­tioner’s Path 2.0: A new frame­work for struc­tured self-improvement

Evenflair4 Mar 2023 19:19 UTC
32 points
2 comments11 min readLW link
(guildoftherose.org)

The Benefits of Distil­la­tion in Research

Jonas Hallgren4 Mar 2023 17:45 UTC
15 points
2 comments5 min readLW link

Op­ti­mal Mu­sic Choice

mbazzani4 Mar 2023 17:26 UTC
5 points
0 comments1 min readLW link

Why don’t more peo­ple talk about ecolog­i­cal psy­chol­ogy?

Ppau4 Mar 2023 17:03 UTC
21 points
10 comments7 min readLW link

Switch­ing to Elec­tric Mandolin

jefftk4 Mar 2023 15:40 UTC
16 points
0 comments1 min readLW link
(www.jefftk.com)

Pre­dic­tive Perfor­mance on Me­tac­u­lus vs. Man­i­fold Markets

nikos4 Mar 2023 8:10 UTC
18 points
0 comments5 min readLW link

Con­tra Han­son on AI Risk

Liron4 Mar 2023 8:02 UTC
36 points
23 comments8 min readLW link

Bite Sized Tasks

Johannes C. Mayer4 Mar 2023 3:31 UTC
18 points
2 comments2 min readLW link

How pop­u­lar is ChatGPT? Part 2: slower growth than Poké­mon GO

Richard Korzekwa 3 Mar 2023 23:40 UTC
42 points
4 comments6 min readLW link
(aiimpacts.org)

Acausal normalcy

Andrew_Critch3 Mar 2023 23:34 UTC
170 points
30 comments8 min readLW link

Com­ments on OpenAI’s “Plan­ning for AGI and be­yond”

So8res3 Mar 2023 23:01 UTC
148 points
2 comments14 min readLW link

Why are coun­ter­fac­tu­als elu­sive?

Martín Soto3 Mar 2023 20:13 UTC
14 points
6 comments2 min readLW link

Si­tu­a­tional aware­ness in Large Lan­guage Models

Simon Möller3 Mar 2023 18:59 UTC
28 points
2 comments7 min readLW link

AI Gover­nance & Strat­egy: Pri­ori­ties, tal­ent gaps, & opportunities

Akash3 Mar 2023 18:09 UTC
56 points
2 comments4 min readLW link

Ve­ganism and Acausal Trade

elbow9213 Mar 2023 17:44 UTC
−3 points
1 comment2 min readLW link

Mea­sur­ing Ads Opt-Out Compliance

jefftk3 Mar 2023 16:00 UTC
18 points
2 comments2 min readLW link
(www.jefftk.com)

ChatGPT tells sto­ries, and a note about re­verse en­g­ineer­ing: A Work­ing Paper

Bill Benzon3 Mar 2023 15:12 UTC
3 points
0 comments3 min readLW link

Group Wiki Walk

Screwtape3 Mar 2023 15:10 UTC
9 points
0 comments3 min readLW link

Robin Han­son’s lat­est AI risk po­si­tion statement

Liron3 Mar 2023 14:25 UTC
55 points
17 comments1 min readLW link
(www.overcomingbias.com)

A re­ply to Byrnes on the Free En­ergy Principle

Roman Leventov3 Mar 2023 13:03 UTC
27 points
16 comments14 min readLW link

state of my al­ign­ment re­search, and what needs work

Tamsin Leake3 Mar 2023 10:28 UTC
51 points
0 comments2 min readLW link
(carado.moe)

Syd­ney can play chess and kind of keep track of the board state

Erik Jenner3 Mar 2023 9:39 UTC
62 points
19 comments6 min readLW link

[Fic­tion] The boy in the glass dome

Kaj_Sotala3 Mar 2023 7:50 UTC
28 points
0 comments2 min readLW link
(kajsotala.fi)

The Waluigi Effect (mega-post)

Cleo Nardo3 Mar 2023 3:22 UTC
618 points
188 comments16 min readLW link

Aspiring AI safety re­searchers should ~argmax over AGI timelines

Ryan Kidd3 Mar 2023 2:04 UTC
26 points
8 comments2 min readLW link

ACX/​SSC/​LW meetup

Épiphanie Gédéon2 Mar 2023 23:37 UTC
8 points
0 comments1 min readLW link

Re­sults Pre­dic­tion Thread About How Differ­ent Fac­tors Affect AI X-Risk

MrThink2 Mar 2023 22:13 UTC
9 points
0 comments2 min readLW link

Why I’m not into the Free En­ergy Principle

Steven Byrnes2 Mar 2023 19:27 UTC
138 points
48 comments9 min readLW link

[Question] Lost in the sauce

JungleTact1cs2 Mar 2023 16:58 UTC
−5 points
12 comments1 min readLW link

AI #2

Zvi2 Mar 2023 14:50 UTC
66 points
18 comments55 min readLW link
(thezvi.wordpress.com)

Payor’s Lemma in Nat­u­ral Language

Andrew_Critch2 Mar 2023 12:22 UTC
60 points
0 comments2 min readLW link

Joscha Bach on Syn­thetic In­tel­li­gence [an­no­tated]

Roman Leventov2 Mar 2023 11:02 UTC
9 points
1 comment9 min readLW link
(www.jimruttshow.com)

[Question] If I want to test how good I would be as an AI safety re­searcher alongside my full-time job (with the hope of it be­com­ing my full-time ca­reer at some point), is this a good plan?

Malleable_shape2 Mar 2023 9:44 UTC
16 points
0 comments4 min readLW link

[Question] What are some sources re­lated to big-pic­ture AI strat­egy?

Jacob Watts2 Mar 2023 5:00 UTC
2 points
0 comments1 min readLW link

Job list­ing (closed): Sen­tience In­sti­tute is ac­cept­ing ap­pli­ca­tions for a researcher

michael_dello2 Mar 2023 4:40 UTC
6 points
0 comments5 min readLW link
(www.sentienceinstitute.org)

Reflec­tion Mechanisms as an Align­ment Tar­get—At­ti­tudes on “near-term” AI

2 Mar 2023 4:29 UTC
20 points
0 comments8 min readLW link

Live Kingfisher Album?

jefftk2 Mar 2023 3:40 UTC
11 points
0 comments1 min readLW link
(www.jefftk.com)

Don’t Jump or I’ll...

Double2 Mar 2023 2:58 UTC
13 points
7 comments4 min readLW link

Clippy, the friendly paperclipper

Seth Herd2 Mar 2023 0:02 UTC
1 point
11 comments2 min readLW link

Hu­man level AI can plau­si­bly take over the world

anithite1 Mar 2023 23:27 UTC
26 points
12 comments2 min readLW link

Ex­treme GDP growth is a bad op­er­at­ing defi­ni­tion of “slow take­off”

lc1 Mar 2023 22:25 UTC
24 points
1 comment1 min readLW link

Learn the math­e­mat­i­cal struc­ture, not the con­cep­tual structure

Adam Shai1 Mar 2023 22:24 UTC
88 points
35 comments2 min readLW link

The Parable of the King and the Ran­dom Process

moridinamael1 Mar 2023 22:18 UTC
288 points
22 comments6 min readLW link

To MIRI-style folk, you can’t simu­late the uni­verse from the beginning

the gears to ascension1 Mar 2023 21:38 UTC
2 points
19 comments2 min readLW link

OpenAI in­tro­duce ChatGPT API at 1/​10th the pre­vi­ous $/​token

Arthur Conmy1 Mar 2023 20:48 UTC
28 points
4 comments1 min readLW link
(openai.com)

Progress links and tweets, 2023-03-01

jasoncrawford1 Mar 2023 20:33 UTC
12 points
2 comments1 min readLW link
(rootsofprogress.org)

Ta­boo “com­pute over­hang”

Zach Stein-Perlman1 Mar 2023 19:15 UTC
17 points
8 comments1 min readLW link