A Brief Overview of AI Safety/Alignment Orgs, Fields, Researchers, and Resources for ML Researchers

Austin Witte · Feb 2, 2023, 1:02 AM
18 points
1 comment · 2 min read · LW link

Interviews with 97 AI Researchers: Quantitative Analysis

Feb 2, 2023, 1:01 AM
23 points
0 comments · 7 min read · LW link

“AI Risk Discussions” website: Exploring interviews from 97 AI Researchers

Feb 2, 2023, 1:00 AM
43 points
1 comment · LW link

Predicting researcher interest in AI alignment

Vael Gates · Feb 2, 2023, 12:58 AM
25 points
0 comments · LW link

Focus on the places where you feel shocked everyone’s dropping the ball

So8res · Feb 2, 2023, 12:27 AM
466 points
64 comments · 4 min read · LW link · 3 reviews

Exercise is Good, Actually

Gordon Seidoh Worley · Feb 2, 2023, 12:09 AM
91 points
27 comments · 3 min read · LW link

Product safety is a poor model for AI governance

Richard Korzekwa · Feb 1, 2023, 10:40 PM
36 points
0 comments · 5 min read · LW link
(aiimpacts.org)

Hinton: “mortal” efficient analog hardware may be learned-in-place, uncopyable

the gears to ascension · Feb 1, 2023, 10:19 PM
12 points
3 comments · 1 min read · LW link

Can we “cure” cancer?

jasoncrawford · Feb 1, 2023, 10:03 PM
41 points
31 comments · 2 min read · LW link
(rootsofprogress.org)

Eli Lifland on Navigating the AI Alignment Landscape

ozziegooen · Feb 1, 2023, 9:17 PM
9 points
1 comment · 31 min read · LW link
(quri.substack.com)

Schizophrenia as a deficiency in long-range cortex-to-cortex communication

Steven Byrnes · Feb 1, 2023, 7:32 PM
35 points
38 comments · 11 min read · LW link

AI Safety Arguments: An Interactive Guide

Lukas Trötzmüller · Feb 1, 2023, 7:26 PM
20 points
0 comments · 3 min read · LW link

More findings on Memorization and double descent

Marius Hobbhahn · Feb 1, 2023, 6:26 PM
53 points
2 comments · 19 min read · LW link

Language Models can be Utility-Maximising Agents

Raymond Douglas · Feb 1, 2023, 6:13 PM
22 points
1 comment · 2 min read · LW link

Trends in the dollar training cost of machine learning systems

Ben Cottier · Feb 1, 2023, 2:48 PM
23 points
0 comments · 2 min read · LW link
(epochai.org)

Polis: Why and How to Use it

brook · Feb 1, 2023, 2:03 PM
5 points
0 comments · LW link

Subitisation of Self

vitaliya · Feb 1, 2023, 9:18 AM
14 points
4 comments · 2 min read · LW link

Directed Babbling

Yudhister Kumar · Feb 1, 2023, 9:10 AM
20 points
1 comment · 3 min read · LW link
(www.ykumar.org)

Voting Results for the 2021 Review

Raemon · Feb 1, 2023, 8:02 AM
66 points
10 comments · 38 min read · LW link

Abstraction As Symmetry and Other Thoughts

Numendil · Feb 1, 2023, 6:25 AM
28 points
9 comments · 2 min read · LW link

The effect of horizon length on scaling laws

Jacob_Hilton · Feb 1, 2023, 3:59 AM
23 points
2 comments · 1 min read · LW link
(arxiv.org)

Contra Dance Lengths

jefftk · Feb 1, 2023, 3:30 AM
9 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Aiming for Convergence Is Like Discouraging Betting

Zack_M_Davis · Feb 1, 2023, 12:03 AM
62 points
18 comments · 11 min read · LW link · 1 review

On value in humans, other animals, and AI

Michele Campolo · Jan 31, 2023, 11:33 PM
3 points
17 comments · 5 min read · LW link

Criticism of the main framework in AI alignment

Michele Campolo · Jan 31, 2023, 11:01 PM
19 points
2 comments · 6 min read · LW link

Nice Clothes are Good, Actually

Gordon Seidoh Worley · Jan 31, 2023, 7:22 PM
72 points
28 comments · 4 min read · LW link

[Linkpost] Human-narrated audio version of “Is Power-Seeking AI an Existential Risk?”

Joe Carlsmith · Jan 31, 2023, 7:21 PM
12 points
1 comment · 1 min read · LW link

No Really, Attention is ALL You Need—Attention can do feedforward networks

Robert_AIZI · Jan 31, 2023, 6:48 PM
29 points
7 comments · 6 min read · LW link
(aizi.substack.com)

Talk to me about your summer/career plans

Orpheus16 · Jan 31, 2023, 6:29 PM
31 points
3 comments · 2 min read · LW link

Mechanistic Interpretability Quickstart Guide

Neel Nanda · Jan 31, 2023, 4:35 PM
42 points
3 comments · 6 min read · LW link
(www.neelnanda.io)

New Hackathon: Robustness to distribution changes and ambiguity

Charbel-Raphaël · Jan 31, 2023, 12:50 PM
12 points
3 comments · 1 min read · LW link

Squiggle: Why and how to use it

brook · Jan 31, 2023, 12:37 PM
3 points
0 comments · LW link

Beware of Fake Alternatives

silentbob · Jan 31, 2023, 10:21 AM
57 points
11 comments · 4 min read · LW link · 1 review

Inner Misalignment in “Simulator” LLMs

Adam Scherlis · Jan 31, 2023, 8:33 AM
84 points
12 comments · 4 min read · LW link

Why AI experts’ jobs are always decades from being automated

Allen Hoskins · Jan 31, 2023, 3:01 AM
0 points
1 comment · 5 min read · LW link
(open.substack.com)

Apply to HAIST/MAIA’s AI Governance Workshop in DC (Feb 17-20)

Jan 31, 2023, 2:06 AM
28 points
0 comments · 2 min read · LW link

EA & LW Forum Weekly Summary (23rd – 29th Jan ’23)

Zoe Williams · Jan 31, 2023, 12:36 AM
12 points
0 comments · LW link

Saying things because they sound good

Adam Zerner · Jan 31, 2023, 12:17 AM
23 points
6 comments · 2 min read · LW link

South Bay Meetup

DavidFriedman · Jan 30, 2023, 11:35 PM
2 points
0 comments · 1 min read · LW link

Peter Thiel’s speech at Oxford Debating Union on technological stagnation, Nuclear weapons, COVID, Environment, Alignment, ‘anti-anti anti-anti-classical liberalism’, Bostrom, LW, etc.

M. Y. Zuo · Jan 30, 2023, 11:31 PM
8 points
33 comments · 1 min read · LW link

Medical Image Registration: The obscure field where Deep Mesaoptimizers are already at the top of the benchmarks. (post + colab notebook)

Hastings · Jan 30, 2023, 10:46 PM
35 points
1 comment · 3 min read · LW link

Humans Can Be Manually Strategic

Screwtape · Jan 30, 2023, 10:35 PM
13 points
0 comments · 3 min read · LW link

Why I hate the “accident vs. misuse” AI x-risk dichotomy (quick thoughts on “structural risk”)

David Scott Krueger (formerly: capybaralet) · Jan 30, 2023, 6:50 PM
34 points
41 comments · 2 min read · LW link

2022 Unofficial LessWrong General Census

Screwtape · Jan 30, 2023, 6:36 PM
97 points
33 comments · 2 min read · LW link

Call for submissions: “(In)human Values and Artificial Agency”, ALIFE 2023

the gears to ascension · Jan 30, 2023, 5:37 PM
29 points
4 comments · 1 min read · LW link
(humanvaluesandartificialagency.com)

What I mean by “alignment is in large part about making cognition aimable at all”

So8res · Jan 30, 2023, 3:22 PM
171 points
25 comments · 2 min read · LW link

The Energy Requirements and Feasibility of Off-World Mining

clans · Jan 30, 2023, 3:07 PM
31 points
1 comment · 8 min read · LW link
(locationtbd.home.blog)

Whatever their arguments, Covid vaccine sceptics will probably never convince me

contrarianbrit · Jan 30, 2023, 1:42 PM
8 points
10 comments · 3 min read · LW link
(thomasprosser.substack.com)

Simulacra Levels Summary

Zvi · Jan 30, 2023, 1:40 PM
77 points
14 comments · 7 min read · LW link
(thezvi.wordpress.com)

A Few Principles of Successful AI Design

Vestozia · Jan 30, 2023, 10:42 AM
1 point
0 comments · 8 min read · LW link