Who Aligns the Alignment Researchers?

Ben Smith
Mar 5, 2023, 11:22 PM
48 points
0 comments
11 min read
LW link

Startups are like firewood

Adam Zerner
Mar 5, 2023, 11:09 PM
26 points
2 comments
3 min read
LW link

A concerning observation from media coverage of AI industry dynamics

Justin Olive
Mar 5, 2023, 9:38 PM
8 points
3 comments
3 min read
LW link

Steven Pinker on ChatGPT and AGI (Feb 2023)

Evan R. Murphy
Mar 5, 2023, 9:34 PM
11 points
8 comments
1 min read
LW link
(news.harvard.edu)

Is it time to talk about AI doomsday prepping yet?

bokov
Mar 5, 2023, 9:17 PM
0 points
8 comments
1 min read
LW link

Coordination explosion before intelligence explosion...?

tailcalled
Mar 5, 2023, 8:48 PM
47 points
9 comments
2 min read
LW link

The Ogdoad

Tristan Miano
Mar 5, 2023, 8:01 PM
−15 points
1 comment
37 min read
LW link

[Question] What are some good ways to heighten my emotions?

oh54321
Mar 5, 2023, 6:06 PM
5 points
5 comments
1 min read
LW link

Research proposal: Leveraging Jungian archetypes to create values-based models

MiguelDev
Mar 5, 2023, 5:39 PM
5 points
2 comments
2 min read
LW link

Abusing Snap Circuits IC

jefftk
Mar 5, 2023, 5:00 PM
19 points
3 comments
3 min read
LW link
(www.jefftk.com)

Do humans derive values from fictitious imputed coherence?

TsviBT
Mar 5, 2023, 3:23 PM
45 points
8 comments
14 min read
LW link

The Inner-Compass Theorem

Tristan Miano
Mar 5, 2023, 3:21 PM
−18 points
12 comments
16 min read
LW link

Halifax Monthly Meetup: AI Safety Discussion

Ideopunk
Mar 5, 2023, 12:42 PM
10 points
0 comments
1 min read
LW link

Why kill everyone?

arisAlexis
Mar 5, 2023, 11:53 AM
7 points
5 comments
2 min read
LW link

Selective, Corrective, Structural: Three Ways of Making Social Systems Work

Said Achmiz
Mar 5, 2023, 8:45 AM
100 points
13 comments
2 min read
LW link

Substitute goods for leisure are abundant

Adam Zerner
Mar 5, 2023, 3:45 AM
20 points
7 comments
5 min read
LW link

[Question] Does polyamory at a workplace turn nepotism up to eleven?

Viliam
Mar 5, 2023, 12:57 AM
45 points
11 comments
2 min read
LW link

Why We MUST Build an (aligned) Artificial Superintelligence That Takes Over Human Society - A Thought Experiment

twkaiser
Mar 5, 2023, 12:47 AM
−13 points
12 comments
2 min read
LW link

Forecasts on Moore v Harper from Samotsvety

gregjustice
Mar 5, 2023, 12:47 AM
7 points
0 comments
1 min read
LW link
(samotsvety.org)

Why Not Just… Build Weak AI Tools For AI Alignment Research?

johnswentworth
Mar 5, 2023, 12:12 AM
184 points
18 comments
6 min read
LW link

Consciousness is irrelevant - instead solve alignment by asking this question

Oliver Siegel
Mar 4, 2023, 10:06 PM
−10 points
6 comments
1 min read
LW link

More money with less risk: sell services instead of model access

lemonhope
Mar 4, 2023, 8:51 PM
9 points
3 comments
1 min read
LW link

Contra “Strong Coherence”

DragonGod
Mar 4, 2023, 8:05 PM
39 points
24 comments
1 min read
LW link

The Practitioner’s Path 2.0: A new framework for structured self-improvement

Evenflair
Mar 4, 2023, 7:19 PM
32 points
2 comments
11 min read
LW link
(guildoftherose.org)

The Benefits of Distillation in Research

Jonas Hallgren
Mar 4, 2023, 5:45 PM
15 points
2 comments
5 min read
LW link

Optimal Music Choice

mbazzani
Mar 4, 2023, 5:26 PM
5 points
0 comments
1 min read
LW link

Why don’t more people talk about ecological psychology?

Ppau
Mar 4, 2023, 5:03 PM
21 points
10 comments
7 min read
LW link

Switching to Electric Mandolin

jefftk
Mar 4, 2023, 3:40 PM
16 points
1 comment
1 min read
LW link
(www.jefftk.com)

Predictive Performance on Metaculus vs. Manifold Markets

nikos
Mar 4, 2023, 8:10 AM
18 points
0 comments
5 min read
LW link

Contra Hanson on AI Risk

Liron
Mar 4, 2023, 8:02 AM
36 points
23 comments
8 min read
LW link

Bite Sized Tasks

Johannes C. Mayer
Mar 4, 2023, 3:31 AM
18 points
2 comments
2 min read
LW link

How popular is ChatGPT? Part 2: slower growth than Pokémon GO

Richard Korzekwa
Mar 3, 2023, 11:40 PM
42 points
4 comments
6 min read
LW link
(aiimpacts.org)

Acausal normalcy

Andrew_Critch
Mar 3, 2023, 11:34 PM
195 points
36 comments
8 min read
LW link
1 review

Comments on OpenAI’s “Planning for AGI and beyond”

So8res
Mar 3, 2023, 11:01 PM
148 points
2 comments
14 min read
LW link

Why are counterfactuals elusive?

Martín Soto
Mar 3, 2023, 8:13 PM
14 points
6 comments
2 min read
LW link

Situational awareness in Large Language Models

Simon Möller
Mar 3, 2023, 6:59 PM
31 points
2 comments
7 min read
LW link

AI Governance & Strategy: Priorities, talent gaps, & opportunities

Orpheus16
Mar 3, 2023, 6:09 PM
56 points
2 comments
4 min read
LW link

Measuring Ads Opt-Out Compliance

jefftk
Mar 3, 2023, 4:00 PM
18 points
2 comments
2 min read
LW link
(www.jefftk.com)

ChatGPT tells stories, and a note about reverse engineering: A Working Paper

Bill Benzon
Mar 3, 2023, 3:12 PM
3 points
0 comments
3 min read
LW link

Group Wiki Walk

Screwtape
Mar 3, 2023, 3:10 PM
9 points
0 comments
3 min read
LW link

Robin Hanson’s latest AI risk position statement

Liron
Mar 3, 2023, 2:25 PM
55 points
18 comments
1 min read
LW link
(www.overcomingbias.com)

A reply to Byrnes on the Free Energy Principle

Roman Leventov
Mar 3, 2023, 1:03 PM
28 points
16 comments
14 min read
LW link

Sydney can play chess and kind of keep track of the board state

Erik Jenner
Mar 3, 2023, 9:39 AM
64 points
19 comments
6 min read
LW link

[Fiction] The boy in the glass dome

Kaj_Sotala
Mar 3, 2023, 7:50 AM
28 points
0 comments
2 min read
LW link
(kajsotala.fi)

The Waluigi Effect (mega-post)

Cleo Nardo
Mar 3, 2023, 3:22 AM
629 points
188 comments
16 min read
LW link

Aspiring AI safety researchers should ~argmax over AGI timelines

Ryan Kidd
Mar 3, 2023, 2:04 AM
29 points
8 comments
2 min read
LW link

ACX/SSC/LW meetup

Épiphanie Gédéon
Mar 2, 2023, 11:37 PM
8 points
0 comments
1 min read
LW link

Results Prediction Thread About How Different Factors Affect AI X-Risk

MrThink
Mar 2, 2023, 10:13 PM
9 points
0 comments
2 min read
LW link

Why I’m not into the Free Energy Principle

Steven Byrnes
Mar 2, 2023, 7:27 PM
150 points
50 comments
9 min read
LW link
1 review

[Question] Lost in the sauce

JungleTact1cs
Mar 2, 2023, 4:58 PM
−5 points
12 comments
1 min read
LW link