Con­scious­ness is ir­rele­vant—in­stead solve al­ign­ment by ask­ing this question

Oliver SiegelMar 4, 2023, 10:06 PM
−10 points
6 comments1 min readLW link

More money with less risk: sell ser­vices in­stead of model access

lemonhopeMar 4, 2023, 8:51 PM
9 points
3 comments1 min readLW link

Con­tra “Strong Co­her­ence”

DragonGodMar 4, 2023, 8:05 PM
39 points
24 comments1 min readLW link

The Prac­ti­tioner’s Path 2.0: A new frame­work for struc­tured self-improvement

EvenflairMar 4, 2023, 7:19 PM
32 points
2 comments11 min readLW link
(guildoftherose.org)

The Benefits of Distil­la­tion in Research

Jonas HallgrenMar 4, 2023, 5:45 PM
15 points
2 comments5 min readLW link

Op­ti­mal Mu­sic Choice

mbazzaniMar 4, 2023, 5:26 PM
5 points
0 comments1 min readLW link

Why don’t more peo­ple talk about ecolog­i­cal psy­chol­ogy?

PpauMar 4, 2023, 5:03 PM
21 points
10 comments7 min readLW link

Switch­ing to Elec­tric Mandolin

jefftkMar 4, 2023, 3:40 PM
16 points
1 comment1 min readLW link
(www.jefftk.com)

Pre­dic­tive Perfor­mance on Me­tac­u­lus vs. Man­i­fold Markets

nikosMar 4, 2023, 8:10 AM
18 points
0 comments5 min readLW link

Con­tra Han­son on AI Risk

LironMar 4, 2023, 8:02 AM
36 points
23 comments8 min readLW link

Bite Sized Tasks

Johannes C. MayerMar 4, 2023, 3:31 AM
18 points
2 comments2 min readLW link

How pop­u­lar is ChatGPT? Part 2: slower growth than Poké­mon GO

Richard Korzekwa Mar 3, 2023, 11:40 PM
42 points
4 comments6 min readLW link
(aiimpacts.org)

Acausal normalcy

Andrew_CritchMar 3, 2023, 11:34 PM
195 points
36 comments8 min readLW link1 review

Com­ments on OpenAI’s “Plan­ning for AGI and be­yond”

So8resMar 3, 2023, 11:01 PM
148 points
2 comments14 min readLW link

Why are coun­ter­fac­tu­als elu­sive?

Martín SotoMar 3, 2023, 8:13 PM
14 points
6 comments2 min readLW link

Si­tu­a­tional aware­ness in Large Lan­guage Models

Simon MöllerMar 3, 2023, 6:59 PM
31 points
2 comments7 min readLW link

AI Gover­nance & Strat­egy: Pri­ori­ties, tal­ent gaps, & opportunities

Orpheus16Mar 3, 2023, 6:09 PM
56 points
2 comments4 min readLW link

Mea­sur­ing Ads Opt-Out Compliance

jefftkMar 3, 2023, 4:00 PM
18 points
2 comments2 min readLW link
(www.jefftk.com)

ChatGPT tells sto­ries, and a note about re­verse en­g­ineer­ing: A Work­ing Paper

Bill BenzonMar 3, 2023, 3:12 PM
3 points
0 comments3 min readLW link

Group Wiki Walk

ScrewtapeMar 3, 2023, 3:10 PM
9 points
0 comments3 min readLW link

Robin Han­son’s lat­est AI risk po­si­tion statement

LironMar 3, 2023, 2:25 PM
55 points
18 comments1 min readLW link
(www.overcomingbias.com)

A re­ply to Byrnes on the Free En­ergy Principle

Roman LeventovMar 3, 2023, 1:03 PM
28 points
16 comments14 min readLW link

Syd­ney can play chess and kind of keep track of the board state

Erik JennerMar 3, 2023, 9:39 AM
64 points
19 comments6 min readLW link

[Fic­tion] The boy in the glass dome

Kaj_SotalaMar 3, 2023, 7:50 AM
28 points
0 comments2 min readLW link
(kajsotala.fi)

The Waluigi Effect (mega-post)

Cleo NardoMar 3, 2023, 3:22 AM
629 points
188 comments16 min readLW link

Aspiring AI safety re­searchers should ~argmax over AGI timelines

Ryan KiddMar 3, 2023, 2:04 AM
29 points
8 comments2 min readLW link

ACX/​SSC/​LW meetup

Épiphanie GédéonMar 2, 2023, 11:37 PM
8 points
0 comments1 min readLW link

Re­sults Pre­dic­tion Thread About How Differ­ent Fac­tors Affect AI X-Risk

MrThinkMar 2, 2023, 10:13 PM
9 points
0 comments2 min readLW link

Why I’m not into the Free En­ergy Principle

Steven ByrnesMar 2, 2023, 7:27 PM
150 points
50 comments9 min readLW link1 review

[Question] Lost in the sauce

JungleTact1csMar 2, 2023, 4:58 PM
−5 points
12 comments1 min readLW link

AI #2

ZviMar 2, 2023, 2:50 PM
66 points
18 comments55 min readLW link
(thezvi.wordpress.com)

Payor’s Lemma in Nat­u­ral Language

Andrew_CritchMar 2, 2023, 12:22 PM
62 points
0 comments2 min readLW link

Joscha Bach on Syn­thetic In­tel­li­gence [an­no­tated]

Roman LeventovMar 2, 2023, 11:02 AM
10 points
1 comment9 min readLW link
(www.jimruttshow.com)

[Question] If I want to test how good I would be as an AI safety re­searcher alongside my full-time job (with the hope of it be­com­ing my full-time ca­reer at some point), is this a good plan?

Malleable_shapeMar 2, 2023, 9:44 AM
16 points
0 comments4 min readLW link

Job list­ing (closed): Sen­tience In­sti­tute is ac­cept­ing ap­pli­ca­tions for a researcher

michael_delloMar 2, 2023, 4:40 AM
6 points
0 comments5 min readLW link
(www.sentienceinstitute.org)

Reflec­tion Mechanisms as an Align­ment Tar­get—At­ti­tudes on “near-term” AI

Mar 2, 2023, 4:29 AM
21 points
0 comments8 min readLW link

Live Kingfisher Album?

jefftkMar 2, 2023, 3:40 AM
11 points
0 comments1 min readLW link
(www.jefftk.com)

Don’t Jump or I’ll...

DoubleMar 2, 2023, 2:58 AM
13 points
7 comments4 min readLW link

Clippy, the friendly paperclipper

Seth HerdMar 2, 2023, 12:02 AM
3 points
11 comments2 min readLW link

Hu­man level AI can plau­si­bly take over the world

anithiteMar 1, 2023, 11:27 PM
27 points
12 comments2 min readLW link

Ex­treme GDP growth is a bad op­er­at­ing defi­ni­tion of “slow take­off”

lcMar 1, 2023, 10:25 PM
24 points
1 comment1 min readLW link

Learn the math­e­mat­i­cal struc­ture, not the con­cep­tual structure

Adam ShaiMar 1, 2023, 10:24 PM
98 points
35 comments2 min readLW link

The Parable of the King and the Ran­dom Process

moridinamaelMar 1, 2023, 10:18 PM
312 points
26 comments6 min readLW link3 reviews

To MIRI-style folk, you can’t simu­late the uni­verse from the beginning

the gears to ascensionMar 1, 2023, 9:38 PM
2 points
19 comments2 min readLW link

OpenAI in­tro­duce ChatGPT API at 1/​10th the pre­vi­ous $/​token

Arthur ConmyMar 1, 2023, 8:48 PM
28 points
4 comments1 min readLW link
(openai.com)

Progress links and tweets, 2023-03-01

jasoncrawfordMar 1, 2023, 8:33 PM
12 points
2 comments1 min readLW link
(rootsofprogress.org)

Ta­boo “com­pute over­hang”

Zach Stein-PerlmanMar 1, 2023, 7:15 PM
21 points
8 comments1 min readLW link

Call for Cruxes by Rhyme, a Longter­mist His­tory Consultancy

LaraMar 1, 2023, 6:39 PM
1 point
0 comments3 min readLW link
(forum.effectivealtruism.org)

Fight­ing with­out hope

Orpheus16Mar 1, 2023, 6:15 PM
46 points
14 comments4 min readLW link1 review

Sun­light is yel­low par­allel rays plus blue isotropic light

Thomas KehrenbergMar 1, 2023, 5:58 PM
77 points
5 comments2 min readLW link