Apply to the Cavendish Labs Fellowship (by 4/15)

3 Apr 2023 23:09 UTC
11 points
0 comments1 min readLW link
(forum.effectivealtruism.org)

Twin Cities ACX Meetup—April 2023

Timothy M.3 Apr 2023 23:07 UTC
5 points
3 comments1 min readLW link

Communicating effectively under Knightian norms

Richard_Ngo3 Apr 2023 22:39 UTC
88 points
54 comments6 min readLW link

If interpretability research goes well, it may get dangerous

So8res3 Apr 2023 21:48 UTC
197 points
10 comments2 min readLW link

Towards empathy in RL agents and beyond: Insights from cognitive science for AI Alignment

Marc Carauleanu3 Apr 2023 19:59 UTC
15 points
6 comments1 min readLW link
(clipchamp.com)

Monthly Roundup #5: April 2023

Zvi3 Apr 2023 18:50 UTC
26 points
12 comments14 min readLW link
(thezvi.wordpress.com)

Exploring non-anthropocentric aspects of AI existential safety

mishka3 Apr 2023 18:07 UTC
8 points
0 comments3 min readLW link

[Question] GJP on AGI

Suh_Prance_Alot3 Apr 2023 17:21 UTC
2 points
0 comments1 min readLW link

Do we have a plan for the “first critical try” problem?

Christopher King3 Apr 2023 16:27 UTC
−3 points
14 comments1 min readLW link

Exploratory Analysis of RLHF Transformers with TransformerLens

Curt Tigges3 Apr 2023 16:09 UTC
21 points
2 comments11 min readLW link
(blog.eleuther.ai)

AWS Has Raised Prices Before

jefftk3 Apr 2023 16:00 UTC
7 points
3 comments1 min readLW link
(www.jefftk.com)

Mati’s introduction to pausing giant AI experiments

Mati_Roy3 Apr 2023 15:56 UTC
7 points
0 comments2 min readLW link

Superintelligence will outsmart us or it isn’t superintelligence

Neil 3 Apr 2023 15:01 UTC
−4 points
4 comments1 min readLW link

AI-kills-everyone scenarios require robotic infrastructure, but not necessarily nanotech

avturchin3 Apr 2023 12:45 UTC
52 points
47 comments4 min readLW link

Orthogonality is expensive

beren3 Apr 2023 10:20 UTC
34 points
8 comments3 min readLW link

Repeated Play of Imperfect Newcomb’s Paradox in Infra-Bayesian Physicalism

Sven Nilsen3 Apr 2023 10:06 UTC
2 points
0 comments2 min readLW link

Effective Altruism Virtual Programs Apr-May 2023

Yve Nichols-Evans3 Apr 2023 6:40 UTC
1 point
0 comments1 min readLW link

Board Game Theory

Optimization Process3 Apr 2023 6:23 UTC
8 points
0 comments3 min readLW link

Planecrash Podcast

planecrashpodcast3 Apr 2023 4:34 UTC
10 points
5 comments1 min readLW link

[Question] I’m just starting to grasp Shard Theory. Is that a normal feeling?

twkaiser3 Apr 2023 3:08 UTC
−20 points
1 comment1 min readLW link

Rules for living in a 99.9+% lizardman world

at_the_zoo3 Apr 2023 2:39 UTC
−1 points
12 comments1 min readLW link

The Friendly Drunk Fool Alignment Strategy

JenniferRM3 Apr 2023 1:26 UTC
23 points
18 comments11 min readLW link

Slack Group: Rationalist Startup Founders

Adam Zerner3 Apr 2023 0:44 UTC
31 points
0 comments3 min readLW link

Orthogonality is Expensive

DragonGod3 Apr 2023 0:43 UTC
21 points
3 comments1 min readLW link
(www.beren.io)

GTP4 capable of limited recursive improving?

Boris Kashirin2 Apr 2023 21:38 UTC
2 points
3 comments1 min readLW link

[Question] Scared about the future of AI

eitan weiss2 Apr 2023 20:37 UTC
−1 points
0 comments1 min readLW link

“a dialogue with myself concerning eliezer yudkowsky” (not author)

the gears to ascension2 Apr 2023 20:12 UTC
13 points
18 comments3 min readLW link

Fine-insured bounties as AI deterrent

Virtual Instinct2 Apr 2023 19:44 UTC
1 point
0 comments2 min readLW link

[Question] What could EA’s new name be?

trevor2 Apr 2023 19:25 UTC
17 points
20 comments2 min readLW link

Exposure to Lizardman is Lethal

[DEACTIVATED] Duncan Sabien2 Apr 2023 18:57 UTC
70 points
96 comments3 min readLW link

Talkbox Bagpipe Drones

jefftk2 Apr 2023 18:50 UTC
5 points
0 comments1 min readLW link
(www.jefftk.com)

[Question] Is there a LessWrong-adjacent place to hire freelancers/seek freelance work?

nonzerosum2 Apr 2023 16:39 UTC
5 points
3 comments1 min readLW link

AISC 2023, Progress Report for March: Team Interpretable Architectures

2 Apr 2023 16:19 UTC
14 points
0 comments14 min readLW link

Ultimate ends may be easily hidable behind convergent subgoals

TsviBT2 Apr 2023 14:51 UTC
57 points
4 comments22 min readLW link

Transparency for Generalizing Alignment from Toy Models

Johannes C. Mayer2 Apr 2023 10:47 UTC
13 points
3 comments4 min readLW link

Ask First

intellectronica2 Apr 2023 10:45 UTC
3 points
1 comment1 min readLW link
(intellectronica.net)

[Question] When should a neural network-based approach for plant control systems be preferred over a traditional control method?

Bob Guran2 Apr 2023 10:18 UTC
11 points
0 comments1 min readLW link

Pessimism about AI Safety

2 Apr 2023 7:43 UTC
4 points
1 comment25 min readLW link

Some lesser-known megaproject ideas

Linch2 Apr 2023 1:14 UTC
19 points
4 comments1 min readLW link

Analysis of GPT-4 competence in assessing complex legal language: Example of Bill C-11 of the Canadian Parliament. - Part 1

M. Y. Zuo2 Apr 2023 0:01 UTC
12 points
2 comments14 min readLW link

Policy discussions follow strong contextualizing norms

Richard_Ngo1 Apr 2023 23:51 UTC
230 points
61 comments3 min readLW link

A report about LessWrong karma volatility from a different universe

Ben Pace1 Apr 2023 21:48 UTC
176 points
7 comments1 min readLW link

A Confession about the LessWrong Team

Ruby1 Apr 2023 21:47 UTC
87 points
5 comments2 min readLW link

Shutting down AI is not enough. We need to destroy all technology.

Matthew Barnett1 Apr 2023 21:03 UTC
150 points
36 comments1 min readLW link

A policy guaranteed to increase AI timelines

Richard Korzekwa 1 Apr 2023 20:50 UTC
46 points
1 comment2 min readLW link
(aiimpacts.org)

Why I Think the Current Trajectory of AI Research has Low P(doom) - LLMs

GaPa1 Apr 2023 20:35 UTC
2 points
1 comment10 min readLW link

Repairing the Effort Asymmetry

[DEACTIVATED] Duncan Sabien1 Apr 2023 20:23 UTC
41 points
11 comments2 min readLW link

Draft: Inferring minimizers

Alex_Altair1 Apr 2023 20:20 UTC
9 points
0 comments1 min readLW link

AI Safety via Luck

Jozdien1 Apr 2023 20:13 UTC
76 points
7 comments11 min readLW link

Ho Chi Minh ACX Meetup

cygnus1 Apr 2023 19:41 UTC
1 point
0 comments1 min readLW link