- [Question] Monotonous Work · Gideon Bauer · Feb 2, 2023, 9:35 PM · 1 point, 0 comments, 1 min read
- Is AI risk assessment too anthropocentric? · Craig Mattson · Feb 2, 2023, 9:34 PM · 3 points, 6 comments, 1 min read
- Halifax Monthly Meetup: Introduction to Effective Altruism · Ideopunk · Feb 2, 2023, 9:10 PM · 10 points, 0 comments, 1 min read
- Conditioning Predictive Models: Outer alignment via careful conditioning · evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson and kcwoolverton · Feb 2, 2023, 8:28 PM · 72 points, 15 comments, 57 min read
- Conditioning Predictive Models: Large language models as predictors · evhub, Adam Jermyn, Johannes Treutlein, Rubi J. Hudson and kcwoolverton · Feb 2, 2023, 8:28 PM · 88 points, 4 comments, 13 min read
- Normative vs Descriptive Models of Agency · mattmacdermott · Feb 2, 2023, 8:28 PM · 26 points, 5 comments, 4 min read
- Andrew Huberman on How to Optimize Sleep · Leon Lang · Feb 2, 2023, 8:17 PM · 37 points, 6 comments, 6 min read
- [Question] How can I help inflammation-based nerve damage be temporary? · Optimization Process · Feb 2, 2023, 7:20 PM · 17 points, 4 comments, 1 min read
- More findings on maximal data dimension · Marius Hobbhahn · Feb 2, 2023, 6:33 PM · 27 points, 1 comment, 11 min read
- Heritability, Behaviorism, and Within-Lifetime RL · Steven Byrnes · Feb 2, 2023, 4:34 PM · 39 points, 3 comments, 4 min read
- Covid 2/2/23: The Emergency Ends on 5/11 · Zvi · Feb 2, 2023, 2:00 PM · 22 points, 6 comments, 7 min read · (thezvi.wordpress.com)
- You are probably not a good alignment researcher, and other blatant lies · junk heap homotopy · Feb 2, 2023, 1:55 PM · 83 points, 16 comments, 2 min read
- Don’t Judge a Tool by its Average Output · silentbob · Feb 2, 2023, 1:42 PM · 12 points, 2 comments, 4 min read
- Epoch Impact Report 2022 · Jsevillamol · Feb 2, 2023, 1:09 PM · 16 points, 0 comments
- You Don’t Exist, Duncan · Duncan Sabien (Inactive) · Feb 2, 2023, 8:37 AM · 252 points, 107 comments, 9 min read
- Temporally Layered Architecture for Adaptive, Distributed and Continuous Control · Roman Leventov · Feb 2, 2023, 6:29 AM · 6 points, 4 comments, 1 min read · (arxiv.org)
- Research agenda: Formalizing abstractions of computations · Erik Jenner · Feb 2, 2023, 4:29 AM · 93 points, 10 comments, 31 min read
- Progress links and tweets, 2023-02-01 · jasoncrawford · Feb 2, 2023, 2:25 AM · 10 points, 0 comments, 1 min read · (rootsofprogress.org)
- Retrospective on the AI Safety Field Building Hub · Vael Gates · Feb 2, 2023, 2:06 AM · 30 points, 0 comments
- How to export Android Chrome tabs to an HTML file in Linux (as of February 2023) · Adam Scherlis · Feb 2, 2023, 2:03 AM · 7 points, 3 comments, 2 min read · (adam.scherlis.com)
- Hacked Account Spam · jefftk · Feb 2, 2023, 1:50 AM · 13 points, 5 comments, 1 min read · (www.jefftk.com)
- A simple technique to reduce negative rumination · cranberry_bear · Feb 2, 2023, 1:33 AM · 9 points, 0 comments, 1 min read
- A Brief Overview of AI Safety/Alignment Orgs, Fields, Researchers, and Resources for ML Researchers · Austin Witte · Feb 2, 2023, 1:02 AM · 18 points, 1 comment, 2 min read
- Interviews with 97 AI Researchers: Quantitative Analysis · Maheen Shermohammed and Vael Gates · Feb 2, 2023, 1:01 AM · 23 points, 0 comments, 7 min read
- “AI Risk Discussions” website: Exploring interviews from 97 AI Researchers · Vael Gates, Lukas Trötzmüller, Maheen Shermohammed, michaelkeenan and zchuang · Feb 2, 2023, 1:00 AM · 43 points, 1 comment
- Predicting researcher interest in AI alignment · Vael Gates · Feb 2, 2023, 12:58 AM · 25 points, 0 comments
- Focus on the places where you feel shocked everyone’s dropping the ball · So8res · Feb 2, 2023, 12:27 AM · 463 points, 64 comments, 4 min read · 3 reviews
- Exercise is Good, Actually · Gordon Seidoh Worley · Feb 2, 2023, 12:09 AM · 91 points, 27 comments, 3 min read
- Product safety is a poor model for AI governance · Richard Korzekwa · Feb 1, 2023, 10:40 PM · 36 points, 0 comments, 5 min read · (aiimpacts.org)
- Hinton: “mortal” efficient analog hardware may be learned-in-place, uncopyable · the gears to ascension · Feb 1, 2023, 10:19 PM · 12 points, 3 comments, 1 min read
- Can we “cure” cancer? · jasoncrawford · Feb 1, 2023, 10:03 PM · 41 points, 31 comments, 2 min read · (rootsofprogress.org)
- Eli Lifland on Navigating the AI Alignment Landscape · ozziegooen · Feb 1, 2023, 9:17 PM · 9 points, 1 comment, 31 min read · (quri.substack.com)
- Schizophrenia as a deficiency in long-range cortex-to-cortex communication · Steven Byrnes · Feb 1, 2023, 7:32 PM · 35 points, 38 comments, 11 min read
- AI Safety Arguments: An Interactive Guide · Lukas Trötzmüller · Feb 1, 2023, 7:26 PM · 20 points, 0 comments, 3 min read
- More findings on Memorization and double descent · Marius Hobbhahn · Feb 1, 2023, 6:26 PM · 53 points, 2 comments, 19 min read
- Language Models can be Utility-Maximising Agents · Raymond D · Feb 1, 2023, 6:13 PM · 22 points, 1 comment, 2 min read
- Trends in the dollar training cost of machine learning systems · Ben Cottier · Feb 1, 2023, 2:48 PM · 23 points, 0 comments, 2 min read · (epochai.org)
- Polis: Why and How to Use it · brook · Feb 1, 2023, 2:03 PM · 5 points, 0 comments
- Subitisation of Self · vitaliya · Feb 1, 2023, 9:18 AM · 14 points, 4 comments, 2 min read
- Directed Babbling · Yudhister Kumar · Feb 1, 2023, 9:10 AM · 20 points, 1 comment, 3 min read · (www.ykumar.org)
- Voting Results for the 2021 Review · Raemon · Feb 1, 2023, 8:02 AM · 66 points, 10 comments, 38 min read
- Abstraction As Symmetry and Other Thoughts · Numendil · Feb 1, 2023, 6:25 AM · 28 points, 9 comments, 2 min read
- The effect of horizon length on scaling laws · Jacob_Hilton · 1 Feb 2023 3:59 UTC · 23 points, 2 comments, 1 min read · (arxiv.org)
- Contra Dance Lengths · jefftk · 1 Feb 2023 3:30 UTC · 9 points, 0 comments, 1 min read · (www.jefftk.com)
- Aiming for Convergence Is Like Discouraging Betting · Zack_M_Davis · 1 Feb 2023 0:03 UTC · 62 points, 18 comments, 11 min read · 1 review
- On value in humans, other animals, and AI · Michele Campolo · 31 Jan 2023 23:33 UTC · 3 points, 17 comments, 5 min read
- Criticism of the main framework in AI alignment · Michele Campolo · 31 Jan 2023 23:01 UTC · 19 points, 2 comments, 6 min read
- Nice Clothes are Good, Actually · Gordon Seidoh Worley · 31 Jan 2023 19:22 UTC · 72 points, 28 comments, 4 min read
- [Linkpost] Human-narrated audio version of “Is Power-Seeking AI an Existential Risk?” · Joe Carlsmith · 31 Jan 2023 19:21 UTC · 12 points, 1 comment, 1 min read
- No Really, Attention is ALL You Need—Attention can do feedforward networks · Robert_AIZI · 31 Jan 2023 18:48 UTC · 29 points, 7 comments, 6 min read · (aizi.substack.com)