Product safety is a poor model for AI governance

Richard Korzekwa 1 Feb 2023 22:40 UTC
36 points
0 comments · 5 min read · LW link
(aiimpacts.org)

Hinton: “mortal” efficient analog hardware may be learned-in-place, uncopyable

the gears to ascension 1 Feb 2023 22:19 UTC
10 points
3 comments · 1 min read · LW link

Can we “cure” cancer?

jasoncrawford 1 Feb 2023 22:03 UTC
41 points
31 comments · 2 min read · LW link
(rootsofprogress.org)

Eli Lifland on Navigating the AI Alignment Landscape

ozziegooen 1 Feb 2023 21:17 UTC
9 points
1 comment · 31 min read · LW link
(quri.substack.com)

Schizophrenia as a deficiency in long-range cortex-to-cortex communication

Steven Byrnes 1 Feb 2023 19:32 UTC
34 points
29 comments · 11 min read · LW link

AI Safety Arguments: An Interactive Guide

Lukas Trötzmüller 1 Feb 2023 19:26 UTC
20 points
0 comments · 3 min read · LW link

More findings on Memorization and double descent

Marius Hobbhahn 1 Feb 2023 18:26 UTC
53 points
2 comments · 19 min read · LW link

Language Models can be Utility-Maximising Agents

Raymond D 1 Feb 2023 18:13 UTC
22 points
1 comment · 2 min read · LW link

Trends in the dollar training cost of machine learning systems

Ben Cottier 1 Feb 2023 14:48 UTC
23 points
0 comments · 2 min read · LW link
(epochai.org)

Polis: Why and How to Use it

brook 1 Feb 2023 14:03 UTC
3 points
0 comments · 1 min read · LW link

Subitisation of Self

vitaliya 1 Feb 2023 9:18 UTC
14 points
4 comments · 2 min read · LW link

Directed Babbling

Yudhister Kumar 1 Feb 2023 9:10 UTC
20 points
1 comment · 3 min read · LW link
(www.ykumar.org)

Voting Results for the 2021 Review

Raemon 1 Feb 2023 8:02 UTC
66 points
10 comments · 38 min read · LW link

Abstraction As Symmetry and Other Thoughts

Numendil 1 Feb 2023 6:25 UTC
28 points
9 comments · 2 min read · LW link

The effect of horizon length on scaling laws

Jacob_Hilton 1 Feb 2023 3:59 UTC
23 points
2 comments · 1 min read · LW link
(arxiv.org)

Contra Dance Lengths

jefftk 1 Feb 2023 3:30 UTC
9 points
0 comments · 1 min read · LW link
(www.jefftk.com)

Aiming for Convergence Is Like Discouraging Betting

Zack_M_Davis 1 Feb 2023 0:03 UTC
60 points
17 comments · 11 min read · LW link

On value in humans, other animals, and AI

Michele Campolo 31 Jan 2023 23:33 UTC
3 points
17 comments · 5 min read · LW link

Criticism of the main framework in AI alignment

Michele Campolo 31 Jan 2023 23:01 UTC
19 points
2 comments · 6 min read · LW link

Nice Clothes are Good, Actually

Gordon Seidoh Worley 31 Jan 2023 19:22 UTC
62 points
28 comments · 4 min read · LW link

[Linkpost] Human-narrated audio version of “Is Power-Seeking AI an Existential Risk?”

Joe Carlsmith 31 Jan 2023 19:21 UTC
12 points
1 comment · 1 min read · LW link

No Really, Attention is ALL You Need—Attention can do feedforward networks

Robert_AIZI 31 Jan 2023 18:48 UTC
29 points
7 comments · 6 min read · LW link
(aizi.substack.com)

Talk to me about your summer/career plans

Akash 31 Jan 2023 18:29 UTC
31 points
3 comments · 2 min read · LW link

Mechanistic Interpretability Quickstart Guide

Neel Nanda 31 Jan 2023 16:35 UTC
42 points
3 comments · 6 min read · LW link
(www.neelnanda.io)

New Hackathon: Robustness to distribution changes and ambiguity

Charbel-Raphaël 31 Jan 2023 12:50 UTC
11 points
3 comments · 1 min read · LW link

Squiggle: Why and how to use it

brook 31 Jan 2023 12:37 UTC
3 points
0 comments · 1 min read · LW link

Beware of Fake Alternatives

silentbob 31 Jan 2023 10:21 UTC
50 points
10 comments · 4 min read · LW link

Inner Misalignment in “Simulator” LLMs

Adam Scherlis 31 Jan 2023 8:33 UTC
84 points
11 comments · 4 min read · LW link

Why AI experts’ jobs are always decades from being automated

Allen Hoskins 31 Jan 2023 3:01 UTC
0 points
1 comment · 5 min read · LW link
(open.substack.com)

Apply to HAIST/MAIA’s AI Governance Workshop in DC (Feb 17-20)

31 Jan 2023 2:06 UTC
28 points
0 comments · 2 min read · LW link

EA & LW Forum Weekly Summary (23rd − 29th Jan ’23)

Zoe Williams 31 Jan 2023 0:36 UTC
12 points
0 comments · 1 min read · LW link

Saying things because they sound good

Adam Zerner 31 Jan 2023 0:17 UTC
23 points
6 comments · 2 min read · LW link

South Bay Meetup

DavidFriedman 30 Jan 2023 23:35 UTC
2 points
0 comments · 1 min read · LW link

Peter Thiel’s speech at Oxford Debating Union on technological stagnation, Nuclear weapons, COVID, Environment, Alignment, ‘anti-anti anti-anti-classical liberalism’, Bostrom, LW, etc.

M. Y. Zuo 30 Jan 2023 23:31 UTC
8 points
33 comments · 1 min read · LW link

Medical Image Registration: The obscure field where Deep Mesaoptimizers are already at the top of the benchmarks. (post + colab notebook)

Hastings 30 Jan 2023 22:46 UTC
23 points
0 comments · 3 min read · LW link

Humans Can Be Manually Strategic

Screwtape 30 Jan 2023 22:35 UTC
13 points
0 comments · 3 min read · LW link

Why I hate the “accident vs. misuse” AI x-risk dichotomy (quick thoughts on “structural risk”)

David Scott Krueger (formerly: capybaralet) 30 Jan 2023 18:50 UTC
32 points
41 comments · 2 min read · LW link

2022 Unofficial LessWrong General Census

Screwtape 30 Jan 2023 18:36 UTC
97 points
33 comments · 2 min read · LW link

Call for submissions: “(In)human Values and Artificial Agency”, ALIFE 2023

the gears to ascension 30 Jan 2023 17:37 UTC
29 points
4 comments · 1 min read · LW link
(humanvaluesandartificialagency.com)

What I mean by “alignment is in large part about making cognition aimable at all”

So8res 30 Jan 2023 15:22 UTC
163 points
24 comments · 2 min read · LW link

The Energy Requirements and Feasibility of Off-World Mining

clans 30 Jan 2023 15:07 UTC
31 points
1 comment · 8 min read · LW link
(locationtbd.home.blog)

Whatever their arguments, Covid vaccine sceptics will probably never convince me

contrarianbrit 30 Jan 2023 13:42 UTC
8 points
10 comments · 3 min read · LW link
(thomasprosser.substack.com)

Simulacra Levels Summary

Zvi 30 Jan 2023 13:40 UTC
71 points
12 comments · 7 min read · LW link
(thezvi.wordpress.com)

A Few Principles of Successful AI Design

Vestozia 30 Jan 2023 10:42 UTC
1 point
0 comments · 8 min read · LW link

Against Boltzmann mesaoptimizers

porby 30 Jan 2023 2:55 UTC
76 points
6 comments · 4 min read · LW link

How Likely is Losing a Google Account?

jefftk 30 Jan 2023 0:20 UTC
52 points
11 comments · 3 min read · LW link
(www.jefftk.com)

Model-driven feedback could amplify alignment failures

aogara 30 Jan 2023 0:00 UTC
21 points
1 comment · 2 min read · LW link

Takeaways from calibration training

Olli Järviniemi 29 Jan 2023 19:09 UTC
38 points
1 comment · 3 min read · LW link

Structure, creativity, and novelty

TsviBT 29 Jan 2023 14:30 UTC
18 points
4 comments · 7 min read · LW link

What is the ground reality of countries taking steps to recalibrate AI development towards Alignment first?

Nebuch 29 Jan 2023 13:26 UTC
8 points
6 comments · 3 min read · LW link