Progress links and tweets, 2023-04-24

jasoncrawford 24 Apr 2023 21:17 UTC

16 points

1 comment 2 min read LW link

(rootsofprogress.org)

Ideas for AI labs: Reading list

Zach Stein-Perlman 24 Apr 2023 19:00 UTC

11 points

0 comments 4 min read LW link

Deep learning models might be secretly (almost) linear

beren 24 Apr 2023 18:43 UTC

117 points

29 comments 4 min read LW link

Subjective AI/ML Digest: April II

Boris T 24 Apr 2023 18:33 UTC

1 point

0 comments 1 min read LW link

(borisagain.substack.com)

The Toxoplasma of AGI Doom and Capabilities?

Robert_AIZI 24 Apr 2023 18:11 UTC

72 points

13 comments 1 min read LW link

[Question] Measures of Internet Virality and News Popularity

T431 24 Apr 2023 17:43 UTC

4 points

4 comments 1 min read LW link

A concise sum-up of the basic argument for AI doom

Mergimio H. Doefevmil 24 Apr 2023 17:37 UTC

11 points

6 comments 2 min read LW link

A response to Conjecture’s CoEm proposal

Kristian Freed 24 Apr 2023 17:23 UTC

7 points

0 comments 4 min read LW link

Camaraderie at scale: in search of shared identity

eq 24 Apr 2023 16:46 UTC

8 points

2 comments 8 min read LW link

A Hypothetical Takeover Scenario Twitter Poll

Zvi 24 Apr 2023 14:00 UTC

54 points

9 comments 17 min read LW link

(thezvi.wordpress.com)

Cape Town, South Africa—ACX Meetups Everywhere “Spring” 2023

moyamo 24 Apr 2023 13:37 UTC

2 points

0 comments 1 min read LW link

Credible, costly, pseudonymity

M. Y. Zuo 24 Apr 2023 13:35 UTC

1 point

11 comments 1 min read LW link

On Artifice and Intelligence

Jonathan Yan 24 Apr 2023 13:26 UTC

2 points

0 comments 1 min read LW link

(medium.com)

AGI ruin mostly rests on strong claims about alignment and deployment, not about society

Rob Bensinger 24 Apr 2023 13:06 UTC

70 points

8 comments 6 min read LW link

For alignment, we should simultaneously use multiple theories of cognition and value

Roman Leventov 24 Apr 2023 10:37 UTC

23 points

5 comments 5 min read LW link

Power laws in Speedrunning and Machine Learning

Jsevillamol and Ege Erdil 24 Apr 2023 10:06 UTC

71 points

1 comment 1 min read LW link

(arxiv.org)

[Question] “User does not meet the requirements to vote”

Monkle 24 Apr 2023 9:53 UTC

4 points

3 comments 1 min read LW link

The Brain is Not Close to Thermodynamic Limits on Computation

DaemonicSigil 24 Apr 2023 8:21 UTC

167 points

58 comments 5 min read LW link

Value Learning – Towards Resolving Confusion

PashaKamyshev 24 Apr 2023 6:43 UTC

4 points

0 comments 18 min read LW link

Summaries of top forum posts (17th − 23rd April 2023)

Zoe Williams 24 Apr 2023 4:13 UTC

18 points

0 comments 8 min read LW link

Do LLMs dream of emergent sheep?

Shmi 24 Apr 2023 3:26 UTC

16 points

2 comments 1 min read LW link

Not using a priori information for Russian propaganda

EniScien 24 Apr 2023 1:14 UTC

−5 points

4 comments 1 min read LW link

Contra Yudkowsky on AI Doom

jacob_cannell 24 Apr 2023 0:20 UTC

91 points

111 comments 9 min read LW link

Consequentialism is in the Stars not Ourselves

DragonGod 24 Apr 2023 0:02 UTC

7 points

19 comments 5 min read LW link

When did humans become self-aware?

Derek M. Jones 23 Apr 2023 22:36 UTC

6 points

2 comments 1 min read LW link

(vectors.substack.com)

[Question] Are there AI policies that are robustly net-positive even when considering different AI scenarios?

Noosphere89 23 Apr 2023 21:46 UTC

11 points

1 comment 1 min read LW link

Getting Started With Naturalism

LoganStrohl 23 Apr 2023 21:02 UTC

69 points

4 comments 11 min read LW link 1 review

[Question] Why do we care about agency for alignment?

Chris_Leong 23 Apr 2023 18:10 UTC

22 points

19 comments 1 min read LW link

Taming the Fire of Intelligence

Peter Kuhn 23 Apr 2023 17:41 UTC

0 points

7 comments 5 min read LW link

Preventing AI Misuse: State of the Art Research and its Flaws

Madhav Malhotra 23 Apr 2023 17:37 UTC

15 points

0 comments 11 min read LW link

(forum.effectivealtruism.org)

[Question] Could transformer network models learn motor planning like they can learn language and image generation?

mu_(negative) 23 Apr 2023 17:24 UTC

2 points

4 comments 1 min read LW link

Could a superintelligence deduce general relativity from a falling apple? An investigation

titotal 23 Apr 2023 12:49 UTC

150 points

39 comments 9 min read LW link

Endo-, Dia-, Para-, and Ecto-systemic novelty

TsviBT 23 Apr 2023 12:25 UTC

17 points

3 comments 5 min read LW link

An Intro to Anthropic Reasoning using the ‘Boy or Girl Paradox’ as a toy example

TobyC 23 Apr 2023 10:20 UTC

31 points

28 comments 19 min read LW link

[Question] Semantics, Syntax and Pragmatics of the Mind?

Ben Amitay 23 Apr 2023 6:13 UTC

2 points

0 comments 1 min read LW link

A great talk for AI noobs (according to an AI noob)

dov 23 Apr 2023 5:34 UTC

10 points

1 comment 1 min read LW link

(forum.effectivealtruism.org)

Bits of NEFFA

jefftk 23 Apr 2023 2:20 UTC

5 points

0 comments 1 min read LW link

(www.jefftk.com)

“Rate limiting” as a mod tool

Raemon 23 Apr 2023 0:42 UTC

48 points

36 comments 4 min read LW link

What should we censor from training data?

wassname 22 Apr 2023 23:33 UTC

16 points

4 comments 1 min read LW link

Architecture-aware optimisation: train ImageNet and more without hyperparameters

Chris Mingard 22 Apr 2023 21:50 UTC

6 points

2 comments 2 min read LW link

OpenAI’s GPT-4 Safety Goals

PeterMcCluskey 22 Apr 2023 19:11 UTC

3 points

3 comments 4 min read LW link

(bayesianinvestor.com)

Introducing the Nuts and Bolts Of Naturalism

LoganStrohl 22 Apr 2023 18:31 UTC

78 points

2 comments 3 min read LW link

We Need To Know About Continual Learning

michael_mjd 22 Apr 2023 17:08 UTC

30 points

14 comments 4 min read LW link

[Question] How did LW update p(doom) after LLMs blew up?

FinalFormal2 22 Apr 2023 14:21 UTC

24 points

29 comments 1 min read LW link

The Cruel Trade-Off Between AI Misuse and AI X-risk Concerns

simeon_c 22 Apr 2023 13:49 UTC

24 points

1 comment 2 min read LW link

five ways to say “Almost Always” and actually mean it

Yudhister Kumar [Deprecated] 22 Apr 2023 10:38 UTC

17 points

3 comments 2 min read LW link

(www.ykumar.org)

P(doom|superintelligence) or coin tosses and dice throws of human values (and other related Ps).

Muyyd 22 Apr 2023 10:06 UTC

−7 points

0 comments 4 min read LW link

[Question] Is it allowed to post job postings here? I am looking for a new PhD student to work on AI Interpretability. Can I advertise my position?

Tiberius 22 Apr 2023 1:22 UTC

5 points

4 comments 1 min read LW link

LessWrong moderation messaging container

Raemon 22 Apr 2023 1:19 UTC

21 points

13 comments 1 min read LW link

Neural network polytopes (Colab notebook)

Zach Furman 21 Apr 2023 22:42 UTC

11 points

0 comments 1 min read LW link

(colab.research.google.com)