All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 151617 18 19 20 21 22 23 24 25 26 27 28 29 30

SmartyHeaderCode: anomalous tokens for GPT3.5 and GPT-4

AdamYedidia15 Apr 2023 22:35 UTC

72 points

18 comments6 min readLW link

Open-source LLMs may prove Bostrom’s vulnerable world hypothesis

Roope Ahvenharju15 Apr 2023 19:16 UTC

1 point

1 comment1 min readLW link

[linkpost] Elon Musk plans AI start-up to rival OpenAI

Hatfield15 Apr 2023 19:06 UTC

11 points

11 comments1 min readLW link

(www.ft.com)

FLI report: Policymaking in the Pause

Zach Stein-Perlman15 Apr 2023 17:01 UTC

15 points

3 comments1 min readLW link

(futureoflife.org)

Reflective journal entries using GPT-4 and Obsidian that demand less willpower.

Solenoid_Entity15 Apr 2023 12:45 UTC

57 points

24 comments7 min readLW link

An example elevator pitch for AI doom

laserfiche15 Apr 2023 12:29 UTC

2 points

5 comments1 min readLW link

AI as Contact with our Collective Unconscious

Scott Broock15 Apr 2023 2:11 UTC

−4 points

6 comments4 min readLW link

The Truth About False

Thoth Hermes15 Apr 2023 1:01 UTC

−21 points

4 comments17 min readLW link

(thothhermes.substack.com)

The ‘ petertodd’ phenomenon

mwatkins15 Apr 2023 0:59 UTC

193 points

52 comments38 min readLW link 1 review

[Question] Concave Utility Question

Scott Garrabrant15 Apr 2023 0:14 UTC

55 points

36 comments2 min readLW link

List of requests for an AI slowdown/halt.

Cleo Nardo14 Apr 2023 23:55 UTC

46 points

6 comments1 min readLW link

[linkpost] “What Are Reasonable AI Fears?” by Robin Hanson, 2023-04-23

Arjun Panickssery14 Apr 2023 23:26 UTC

26 points

16 comments4 min readLW link

(quillette.com)

“Do X because decision theory” ~= “Do X because bayes theorem”

lc14 Apr 2023 20:57 UTC

40 points

1 comment2 min readLW link

LLMs and hallucination, like white on rice?

Bill Benzon14 Apr 2023 19:53 UTC

5 points

0 comments3 min readLW link

GPT-4 is easily controlled/exploited with tricky decision theoretic dilemmas.

scasper14 Apr 2023 19:39 UTC

6 points

4 comments2 min readLW link

On Caring about our AI Progeny

PeterMcCluskey14 Apr 2023 19:32 UTC

22 points

5 comments1 min readLW link

(bayesianinvestor.com)

Moderation notes re: recent Said/Duncan threads

Raemon14 Apr 2023 18:06 UTC

52 points

560 comments2 min readLW link

What we’ve learned so far from our technological temptations project

Richard Korzekwa 14 Apr 2023 17:46 UTC

15 points

4 comments11 min readLW link

(aiimpacts.org)

[Question] How does consciousness interact with architecture?

FinalFormal214 Apr 2023 15:56 UTC

5 points

3 comments1 min readLW link

Iqisa: A Library For Handling Forecasting Datasets

niplav14 Apr 2023 15:16 UTC

27 points

0 comments2 min readLW link

What’s this probability you’re reporting?

EOC and SCP

14 Apr 2023 15:07 UTC

19 points

10 comments3 min readLW link

Navigating AI Risks (NAIR) #1: Slowing Down AI

simeon_c14 Apr 2023 14:35 UTC

11 points

3 comments1 min readLW link

(navigatingairisks.substack.com)

[Question] What would the FLI moratorium actually do?

ChristianKl14 Apr 2023 13:14 UTC

17 points

7 comments1 min readLW link

Research Report: Incorrectness Cascades

Robert_AIZI14 Apr 2023 12:49 UTC

19 points

0 comments10 min readLW link

(aizi.substack.com)

The self-unalignment problem

Jan_Kulveit and rosehadshar

14 Apr 2023 12:10 UTC

160 points

24 comments10 min readLW link

AI Safety Europe Retreat 2023 Retrospective

Magdalena Wache14 Apr 2023 9:05 UTC

43 points

0 comments2 min readLW link

[Question] What’s the difference between Wisdom and Rationality?

Yoav Ravid14 Apr 2023 6:22 UTC

8 points

4 comments1 min readLW link

Shapley Value Attribution in Chain of Thought

leogao14 Apr 2023 5:56 UTC

106 points

7 comments4 min readLW link

A freshman year during the AI midgame: my approach to the next year

Buck14 Apr 2023 0:38 UTC

154 points

15 comments7 min readLW link 1 review

Against AI Understanding and Sentience: Large Language Models, Meaning, and the Patterns of Human Language Use

Jonathan Yan13 Apr 2023 23:29 UTC

−1 points

0 comments1 min readLW link

(philsci-archive.pitt.edu)

R0 Is Not Counterfactual

jefftk13 Apr 2023 19:50 UTC

33 points

9 comments2 min readLW link

(www.jefftk.com)

Subscripts for Probabilities

niplav13 Apr 2023 18:32 UTC

67 points

9 comments5 min readLW link

The Virus—Short Story

Michael Soareverix13 Apr 2023 18:18 UTC

4 points

0 comments4 min readLW link

First ACX Brno Meetup

adekcz13 Apr 2023 17:42 UTC

2 points

0 comments1 min readLW link

Polluting the agentic commons

hamandcheese13 Apr 2023 17:42 UTC

7 points

4 comments2 min readLW link

(www.secondbest.ca)

Cambridge LW Meetup: When Science Isn’t Enough

Tony Wang and Darmani

13 Apr 2023 17:36 UTC

2 points

0 comments1 min readLW link

Even if human & AI alignment are just as easy, we are screwed

Matthew_Opitz13 Apr 2023 17:32 UTC

35 points

5 comments5 min readLW link

Was Homer a stochastic parrot? Meaning in literary texts and LLMs

Bill Benzon13 Apr 2023 16:44 UTC

7 points

4 comments3 min readLW link

AI #7: Free Agency

Zvi13 Apr 2023 16:20 UTC

33 points

12 comments47 min readLW link

(thezvi.wordpress.com)

Navigating the Open-Source AI Landscape: Data, Funding, and Safety

André Ferretti13 Apr 2023 15:29 UTC

32 points

7 comments11 min readLW link

(forum.effectivealtruism.org)

On AutoGPT

Zvi13 Apr 2023 12:30 UTC

248 points

47 comments20 min readLW link

(thezvi.wordpress.com)

Identifying semantic neurons, mechanistic circuits & interpretability web apps

Esben Kran and Neel Nanda

13 Apr 2023 11:59 UTC

18 points

0 comments8 min readLW link

Trying AgentGPT, an AutoGPT variant

Gunnar_Zarncke13 Apr 2023 10:13 UTC

10 points

9 comments1 min readLW link

Announcing Epoch’s dashboard of key trends and figures in Machine Learning

Jsevillamol13 Apr 2023 7:33 UTC

35 points

7 comments1 min readLW link

(epochai.org)

[Question] What is the best source to explain short AI timelines to a skeptical person?

trevor13 Apr 2023 4:29 UTC

12 points

12 comments1 min readLW link

“Aligned” foundation models don’t imply aligned systems

Max H13 Apr 2023 4:13 UTC

39 points

11 comments5 min readLW link

[Question] Using ChatGPT for memory reconsolidation?

warrenjordan13 Apr 2023 1:27 UTC

3 points

2 comments1 min readLW link

Independence Dividends

jefftk13 Apr 2023 1:20 UTC

35 points

11 comments1 min readLW link

(www.jefftk.com)

AI x-risk, approximately ordered by embarrassment

Alex Lawsen 12 Apr 2023 23:01 UTC

151 points

7 comments19 min readLW link

AXRP Episode 20 - ‘Reform’ AI Alignment with Scott Aaronson

DanielFilan12 Apr 2023 21:30 UTC

22 points

2 comments68 min readLW link