Progress links and tweets, 2023-04-24

jasoncrawford24 Apr 2023 21:17 UTC
16 points
1 comment2 min readLW link
(rootsofprogress.org)

Ideas for AI labs: Read­ing list

Zach Stein-Perlman24 Apr 2023 19:00 UTC
11 points
0 comments4 min readLW link

Deep learn­ing mod­els might be se­cretly (al­most) linear

beren24 Apr 2023 18:43 UTC
110 points
28 comments4 min readLW link

Sub­jec­tive AI/​ML Digest: April II

Boris T24 Apr 2023 18:33 UTC
1 point
0 comments1 min readLW link
(borisagain.substack.com)

The Tox­o­plasma of AGI Doom and Ca­pa­bil­ities?

Robert_AIZI24 Apr 2023 18:11 UTC
68 points
12 comments1 min readLW link

[Question] Mea­sures of In­ter­net Viral­ity and News Popularity

Fer32dwt34r3dfsz24 Apr 2023 17:43 UTC
4 points
4 comments1 min readLW link

A con­cise sum-up of the ba­sic ar­gu­ment for AI doom

Mergimio H. Doefevmil24 Apr 2023 17:37 UTC
11 points
6 comments2 min readLW link

A re­sponse to Con­jec­ture’s CoEm proposal

Kristian Freed24 Apr 2023 17:23 UTC
7 points
0 comments4 min readLW link

Ca­ma­raderie at scale: in search of shared identity

eq24 Apr 2023 16:46 UTC
8 points
2 comments8 min readLW link

A Hy­po­thet­i­cal Takeover Sce­nario Twit­ter Poll

Zvi24 Apr 2023 14:00 UTC
54 points
9 comments17 min readLW link
(thezvi.wordpress.com)

Cape Town, South Africa—ACX Mee­tups Every­where “Spring” 2023

moyamo24 Apr 2023 13:37 UTC
2 points
0 comments1 min readLW link

Cred­ible, costly, pseudonymity

M. Y. Zuo24 Apr 2023 13:35 UTC
1 point
11 comments1 min readLW link

On Ar­tifice and Intelligence

Jonathan Yan24 Apr 2023 13:26 UTC
2 points
0 comments1 min readLW link
(medium.com)

AGI ruin mostly rests on strong claims about al­ign­ment and de­ploy­ment, not about society

Rob Bensinger24 Apr 2023 13:06 UTC
70 points
8 comments6 min readLW link

For al­ign­ment, we should si­mul­ta­neously use mul­ti­ple the­o­ries of cog­ni­tion and value

Roman Leventov24 Apr 2023 10:37 UTC
22 points
5 comments5 min readLW link

Power laws in Speedrun­ning and Ma­chine Learning

24 Apr 2023 10:06 UTC
71 points
1 comment1 min readLW link
(arxiv.org)

[Question] “User does not meet the re­quire­ments to vote”

Monkle24 Apr 2023 9:53 UTC
4 points
3 comments1 min readLW link

The Brain is Not Close to Ther­mo­dy­namic Limits on Computation

DaemonicSigil24 Apr 2023 8:21 UTC
167 points
58 comments5 min readLW link

Value Learn­ing – Towards Re­solv­ing Con­fu­sion

PashaKamyshev24 Apr 2023 6:43 UTC
4 points
0 comments18 min readLW link

Sum­maries of top fo­rum posts (17th − 23rd April 2023)

Zoe Williams24 Apr 2023 4:13 UTC
18 points
0 comments1 min readLW link

Do LLMs dream of emer­gent sheep?

shminux24 Apr 2023 3:26 UTC
15 points
2 comments1 min readLW link

Not us­ing a pri­ori in­for­ma­tion for Rus­sian propaganda

EniScien24 Apr 2023 1:14 UTC
−3 points
4 comments1 min readLW link

Con­tra Yud­kowsky on AI Doom

jacob_cannell24 Apr 2023 0:20 UTC
110 points
111 comments9 min readLW link

Con­se­quen­tial­ism is in the Stars not Ourselves

DragonGod24 Apr 2023 0:02 UTC
7 points
19 comments5 min readLW link

When did hu­mans be­come self-aware?

Derek M. Jones23 Apr 2023 22:36 UTC
6 points
2 comments1 min readLW link
(vectors.substack.com)

[Question] Are there AI poli­cies that are ro­bustly net-pos­i­tive even when con­sid­er­ing differ­ent AI sce­nar­ios?

Noosphere8923 Apr 2023 21:46 UTC
11 points
1 comment1 min readLW link

Get­ting Started With Naturalism

LoganStrohl23 Apr 2023 21:02 UTC
58 points
2 comments11 min readLW link

[Question] Why do we care about agency for al­ign­ment?

Chris_Leong23 Apr 2023 18:10 UTC
22 points
19 comments1 min readLW link

Tam­ing the Fire of Intelligence

Peter Kuhn23 Apr 2023 17:41 UTC
0 points
7 comments5 min readLW link

Prevent­ing AI Mi­suse: State of the Art Re­search and its Flaws

Madhav Malhotra23 Apr 2023 17:37 UTC
15 points
0 comments11 min readLW link
(forum.effectivealtruism.org)

[Question] Could trans­former net­work mod­els learn mo­tor plan­ning like they can learn lan­guage and image gen­er­a­tion?

mu_(negative)23 Apr 2023 17:24 UTC
2 points
4 comments1 min readLW link

Could a su­per­in­tel­li­gence de­duce gen­eral rel­a­tivity from a fal­ling ap­ple? An investigation

titotal23 Apr 2023 12:49 UTC
147 points
39 comments9 min readLW link

Endo-, Dia-, Para-, and Ecto-sys­temic novelty

TsviBT23 Apr 2023 12:25 UTC
16 points
3 comments5 min readLW link

An In­tro to An­thropic Rea­son­ing us­ing the ‘Boy or Girl Para­dox’ as a toy example

TobyC23 Apr 2023 10:20 UTC
28 points
28 comments19 min readLW link

[Question] Se­man­tics, Syn­tax and Prag­mat­ics of the Mind?

Ben Amitay23 Apr 2023 6:13 UTC
2 points
0 comments1 min readLW link

A great talk for AI noobs (ac­cord­ing to an AI noob)

dov23 Apr 2023 5:34 UTC
10 points
1 comment1 min readLW link
(forum.effectivealtruism.org)

Bits of NEFFA

jefftk23 Apr 2023 2:20 UTC
5 points
0 comments1 min readLW link
(www.jefftk.com)

“Rate limit­ing” as a mod tool

Raemon23 Apr 2023 0:42 UTC
48 points
36 comments4 min readLW link

What should we cen­sor from train­ing data?

wassname22 Apr 2023 23:33 UTC
6 points
3 comments1 min readLW link

Ar­chi­tec­ture-aware op­ti­mi­sa­tion: train ImageNet and more with­out hyperparameters

Chris Mingard22 Apr 2023 21:50 UTC
6 points
2 comments2 min readLW link

OpenAI’s GPT-4 Safety Goals

PeterMcCluskey22 Apr 2023 19:11 UTC
3 points
3 comments4 min readLW link
(bayesianinvestor.com)

In­tro­duc­ing the Nuts and Bolts Of Naturalism

LoganStrohl22 Apr 2023 18:31 UTC
75 points
1 comment3 min readLW link

We Need To Know About Con­tinual Learning

michael_mjd22 Apr 2023 17:08 UTC
29 points
14 comments4 min readLW link

The Se­cu­rity Mind­set, S-Risk and Pub­lish­ing Pro­saic Align­ment Research

lukemarks22 Apr 2023 14:36 UTC
39 points
7 comments6 min readLW link

[Question] How did LW up­date p(doom) af­ter LLMs blew up?

FinalFormal222 Apr 2023 14:21 UTC
24 points
29 comments1 min readLW link

The Cruel Trade-Off Between AI Mi­suse and AI X-risk Concerns

simeon_c22 Apr 2023 13:49 UTC
24 points
1 comment2 min readLW link

five ways to say “Al­most Always” and ac­tu­ally mean it

Yudhister Kumar22 Apr 2023 10:38 UTC
17 points
3 comments2 min readLW link
(www.ykumar.org)

P(doom|su­per­in­tel­li­gence) or coin tosses and dice throws of hu­man val­ues (and other re­lated Ps).

Muyyd22 Apr 2023 10:06 UTC
−7 points
0 comments4 min readLW link

[Question] Is it al­lowed to post job post­ings here? I am look­ing for a new PhD stu­dent to work on AI In­ter­pretabil­ity. Can I ad­ver­tise my po­si­tion?

Tiberius22 Apr 2023 1:22 UTC
5 points
4 comments1 min readLW link

LessWrong mod­er­a­tion mes­sag­ing container

Raemon22 Apr 2023 1:19 UTC
21 points
13 comments1 min readLW link