List of requests for an AI slowdown/halt.

Cleo Nardo 14 Apr 2023 23:55 UTC

46 points

6 comments 1 min read LW link

[linkpost] “What Are Reasonable AI Fears?” by Robin Hanson, 2023-04-23

Arjun Panickssery 14 Apr 2023 23:26 UTC

26 points

16 comments 4 min read LW link

(quillette.com)

“Do X because decision theory” ~= “Do X because bayes theorem”

lc 14 Apr 2023 20:57 UTC

40 points

1 comment 2 min read LW link

LLMs and hallucination, like white on rice?

Bill Benzon 14 Apr 2023 19:53 UTC

5 points

0 comments 3 min read LW link

GPT-4 is easily controlled/exploited with tricky decision theoretic dilemmas.

scasper 14 Apr 2023 19:39 UTC

6 points

4 comments 2 min read LW link

On Caring about our AI Progeny

PeterMcCluskey 14 Apr 2023 19:32 UTC

22 points

5 comments 1 min read LW link

(bayesianinvestor.com)

Moderation notes re: recent Said/Duncan threads

Raemon 14 Apr 2023 18:06 UTC

52 points

560 comments 2 min read LW link

What we’ve learned so far from our technological temptations project

Richard Korzekwa 14 Apr 2023 17:46 UTC

15 points

4 comments 11 min read LW link

(aiimpacts.org)

[Question] How does consciousness interact with architecture?

FinalFormal2 14 Apr 2023 15:56 UTC

5 points

3 comments 1 min read LW link

Iqisa: A Library For Handling Forecasting Datasets

niplav 14 Apr 2023 15:16 UTC

27 points

0 comments 2 min read LW link

What’s this probability you’re reporting?

EOC and SCP

14 Apr 2023 15:07 UTC

19 points

10 comments 3 min read LW link

Navigating AI Risks (NAIR) #1: Slowing Down AI

simeon_c 14 Apr 2023 14:35 UTC

11 points

3 comments 1 min read LW link

(navigatingairisks.substack.com)

[Question] What would the FLI moratorium actually do?

ChristianKl 14 Apr 2023 13:14 UTC

17 points

7 comments 1 min read LW link

Research Report: Incorrectness Cascades

Robert_AIZI 14 Apr 2023 12:49 UTC

19 points

0 comments 10 min read LW link

(aizi.substack.com)

The self-unalignment problem

Jan_Kulveit and rosehadshar

14 Apr 2023 12:10 UTC

155 points

24 comments 10 min read LW link

AI Safety Europe Retreat 2023 Retrospective

Magdalena Wache 14 Apr 2023 9:05 UTC

43 points

0 comments 2 min read LW link

[Question] What’s the difference between Wisdom and Rationality?

Yoav Ravid 14 Apr 2023 6:22 UTC

8 points

4 comments 1 min read LW link

Shapley Value Attribution in Chain of Thought

leogao 14 Apr 2023 5:56 UTC

106 points

7 comments 4 min read LW link

A freshman year during the AI midgame: my approach to the next year

Buck 14 Apr 2023 0:38 UTC

154 points

15 comments 7 min read LW link 1 review

Against AI Understanding and Sentience: Large Language Models, Meaning, and the Patterns of Human Language Use

Jonathan Yan 13 Apr 2023 23:29 UTC

−1 points

0 comments 1 min read LW link

(philsci-archive.pitt.edu)

R0 Is Not Counterfactual

jefftk 13 Apr 2023 19:50 UTC

33 points

9 comments 2 min read LW link

(www.jefftk.com)

Subscripts for Probabilities

niplav 13 Apr 2023 18:32 UTC

67 points

9 comments 5 min read LW link

The Virus—Short Story

Michael Soareverix 13 Apr 2023 18:18 UTC

4 points

0 comments 4 min read LW link

First ACX Brno Meetup

adekcz 13 Apr 2023 17:42 UTC

2 points

0 comments 1 min read LW link

Polluting the agentic commons

hamandcheese 13 Apr 2023 17:42 UTC

7 points

4 comments 2 min read LW link

(www.secondbest.ca)

Cambridge LW Meetup: When Science Isn’t Enough

Tony Wang and Darmani

13 Apr 2023 17:36 UTC

2 points

0 comments 1 min read LW link

Even if human & AI alignment are just as easy, we are screwed

Matthew_Opitz 13 Apr 2023 17:32 UTC

35 points

5 comments 5 min read LW link

Was Homer a stochastic parrot? Meaning in literary texts and LLMs

Bill Benzon 13 Apr 2023 16:44 UTC

7 points

4 comments 3 min read LW link

AI #7: Free Agency

Zvi 13 Apr 2023 16:20 UTC

33 points

12 comments 47 min read LW link

(thezvi.wordpress.com)

Navigating the Open-Source AI Landscape: Data, Funding, and Safety

André Ferretti and mic

13 Apr 2023 15:29 UTC

32 points

7 comments 11 min read LW link

(forum.effectivealtruism.org)

On AutoGPT

Zvi 13 Apr 2023 12:30 UTC

248 points

47 comments 20 min read LW link

(thezvi.wordpress.com)

Identifying semantic neurons, mechanistic circuits & interpretability web apps

Esben Kran and Neel Nanda

13 Apr 2023 11:59 UTC

18 points

0 comments 8 min read LW link

Trying AgentGPT, an AutoGPT variant

Gunnar_Zarncke 13 Apr 2023 10:13 UTC

10 points

9 comments 1 min read LW link

Announcing Epoch’s dashboard of key trends and figures in Machine Learning

Jsevillamol 13 Apr 2023 7:33 UTC

35 points

7 comments 1 min read LW link

(epochai.org)

[Question] What is the best source to explain short AI timelines to a skeptical person?

trevor 13 Apr 2023 4:29 UTC

12 points

12 comments 1 min read LW link

“Aligned” foundation models don’t imply aligned systems

Max H 13 Apr 2023 4:13 UTC

39 points

11 comments 5 min read LW link

[Question] Using ChatGPT for memory reconsolidation?

warrenjordan 13 Apr 2023 1:27 UTC

3 points

2 comments 1 min read LW link

Independence Dividends

jefftk 13 Apr 2023 1:20 UTC

35 points

11 comments 1 min read LW link

(www.jefftk.com)

AI x-risk, approximately ordered by embarrassment

Alex Lawsen 12 Apr 2023 23:01 UTC

151 points

7 comments 19 min read LW link

AXRP Episode 20 - ‘Reform’ AI Alignment with Scott Aaronson

DanielFilan 12 Apr 2023 21:30 UTC

22 points

2 comments 68 min read LW link

Apply to >30 AI safety funders in one application with the Nonlinear Network

KatWoods, Emerson Spartz and Drew Spartz

12 Apr 2023 21:23 UTC

65 points

12 comments 2 min read LW link

AGI goal space is big, but narrowing might not be as hard as it seems.

Jacy Reese Anthis 12 Apr 2023 19:03 UTC

15 points

0 comments 3 min read LW link

Natural language alignment

Jacy Reese Anthis 12 Apr 2023 19:02 UTC

31 points

2 comments 2 min read LW link

Repugnant levels of violins

Solenoid_Entity 12 Apr 2023 17:11 UTC

74 points

10 comments 12 min read LW link

Progress links and tweets, 2023-04-12

jasoncrawford 12 Apr 2023 16:52 UTC

8 points

2 comments 1 min read LW link

(rootsofprogress.org)

A basic mathematical structure of intelligence

Golol 12 Apr 2023 16:49 UTC

4 points

6 comments 4 min read LW link

[Question] Should AutoGPT update us towards researching IDA?

Michaël Trazzi 12 Apr 2023 16:41 UTC

15 points

5 comments 1 min read LW link

Boxing lessons

yakimoff 12 Apr 2023 16:19 UTC

1 point

0 comments 1 min read LW link

Dazed and confused: Good olde’ walk around the Marin Headlands

yakimoff 12 Apr 2023 16:09 UTC

1 point

0 comments 1 min read LW link

Towards a solution to the alignment problem via objective detection and evaluation

Paul Colognese 12 Apr 2023 15:39 UTC

9 points

7 comments 12 min read LW link