All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 678 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Video essay: How Will We Know When AI is Conscious?

JanPro6 Sep 2023 18:10 UTC

11 points

7 comments1 min readLW link

(www.youtube.com)

My First Post

Jaivardhan Nawani6 Sep 2023 17:42 UTC

35 points

9 comments1 min readLW link

ActAdd: Steering Language Models without Optimization

technicalities, TurnTrout, lisathiergart, David Udell, Ulisse Mini and Monte M

6 Sep 2023 17:21 UTC

105 points

3 comments2 min readLW link

(arxiv.org)

Monthly Roundup #10: September 2023

Zvi6 Sep 2023 13:20 UTC

35 points

4 comments56 min readLW link

(thezvi.wordpress.com)

Find Hot French Food Near Me: A Follow-up

aphyer6 Sep 2023 12:32 UTC

77 points

19 comments2 min readLW link

Manifest 2023

Saul Munn and Austin Chen

6 Sep 2023 11:24 UTC

3 points

0 comments1 min readLW link

Last Chance: Get tickets to Manifest 2023! (Sep 22-24 in Berkeley)

Saul Munn and Austin Chen

6 Sep 2023 10:35 UTC

5 points

0 comments1 min readLW link

What I’ve been reading, September 2023

jasoncrawford6 Sep 2023 9:32 UTC

17 points

0 comments5 min readLW link

(rootsofprogress.org)

Decision Theory: A (Normative) Introduction

Pareto Optimal6 Sep 2023 8:22 UTC

−1 points

1 comment3 min readLW link

(paretooptimal.substack.com)

[Question] What’s the easiest way to make a luminator?

kuira6 Sep 2023 0:07 UTC

7 points

13 comments1 min readLW link

Ordinary claims require ordinary evidence

blake80865 Sep 2023 22:09 UTC

1 point

3 comments2 min readLW link

Conversation about paradigms, intellectual progress, social consensus, and AI

Ruby and RobertM

5 Sep 2023 21:30 UTC

14 points

6 comments1 min readLW link

What I would do if I wasn’t at ARC Evals

LawrenceC5 Sep 2023 19:19 UTC

220 points

10 comments13 min readLW link 1 review

The Evolutionary Pathway from Biological to Digital Intelligence: A Cosmic Perspective

George3605 Sep 2023 17:47 UTC

−17 points

0 comments4 min readLW link

The Illusion of Universal Morality: A Dynamic Perspective on Genetic Fitness and Ethical Complexity

George3605 Sep 2023 17:47 UTC

−9 points

7 comments2 min readLW link

Benchmarks for Detecting Measurement Tampering [Redwood Research]

ryan_greenblatt and Fabien Roger

5 Sep 2023 16:44 UTC

94 points

22 comments20 min readLW link 1 review

(arxiv.org)

[Question] Strongest real-world examples supporting AI risk claims?

rosehadshar5 Sep 2023 15:12 UTC

41 points

7 comments1 min readLW link

AISN #21: Google DeepMind’s GPT-4 Competitor, Military Investments in Autonomous Drones, The UK AI Safety Summit, and Case Studies in AI Policy

Dan H5 Sep 2023 15:03 UTC

15 points

0 comments5 min readLW link

(newsletter.safe.ai)

Who Has the Best Food?

Zvi5 Sep 2023 13:40 UTC

50 points

62 comments10 min readLW link

(thezvi.wordpress.com)

World, mind, and learnability: A note on the metaphysical structure of the cosmos [& LLMs]

Bill Benzon5 Sep 2023 12:19 UTC

4 points

1 comment5 min readLW link

Deleted

goktu5 Sep 2023 8:10 UTC

−12 points

1 comment1 min readLW link

Text Posts from the Kids Group: 2023 I

jefftk5 Sep 2023 2:00 UTC

75 points

3 comments7 min readLW link

(www.jefftk.com)

Action theory is not policy theory is not agent theory

Cole Wyeth5 Sep 2023 1:38 UTC

20 points

4 comments6 min readLW link

(colewyeth.com)

The purpose of the (Mosaic) law

mruwnik4 Sep 2023 23:38 UTC

7 points

5 comments6 min readLW link

Against the Open Source / Closed Source Dichotomy: Regulated Source as a Model for Responsible AI Development

alex.herwix4 Sep 2023 20:25 UTC

4 points

12 comments6 min readLW link

(forum.effectivealtruism.org)

Notes on nukes, IR, and AI from “Arsenals of Folly” (and other books)

tlevin4 Sep 2023 19:02 UTC

11 points

0 comments6 min readLW link

Hertford, Sourbut (rationality lessons from University Challenge)

Oliver Sourbut4 Sep 2023 18:44 UTC

30 points

7 comments14 min readLW link

(www.oliversourbut.net)

a rant on politician-engineer coalitional conflict

bhauth4 Sep 2023 17:15 UTC

64 points

12 comments4 min readLW link

How ForumMagnum builds communities of inquiry

Jim Fisher4 Sep 2023 16:52 UTC

35 points

21 comments5 min readLW link

Interpreting a matrix-valued word embedding with a mathematically proven characterization of all optima

Joseph Van Name4 Sep 2023 16:19 UTC

3 points

4 comments12 min readLW link

Hard Questions Are Language Bugs

George3d64 Sep 2023 14:44 UTC

30 points

13 comments7 min readLW link

(ontologi.cc)

Defunding My Mistake

ymeskhout4 Sep 2023 14:43 UTC

185 points

41 comments6 min readLW link

The omnizoid—Heighn FDT Debate #1: Why FDT Isn’t Crazy

Heighn4 Sep 2023 12:57 UTC

24 points

4 comments6 min readLW link

Paper: On measuring situational awareness in LLMs

Owain_Evans, Daniel Kokotajlo, Mikita Balesni, Tomek Korbak, Asa Cooper Stickland, Meg and Maximilian Kaufmann

4 Sep 2023 12:54 UTC

111 points

17 comments5 min readLW link

(arxiv.org)

Impending AGI doesn’t make everything else unimportant

Igor Ivanov4 Sep 2023 12:34 UTC

29 points

12 comments5 min readLW link

Open Thread – Autumn 2023

Raemon3 Sep 2023 22:54 UTC

26 points

113 comments1 min readLW link

What must be the case that ChatGPT would have memorized “To be or not to be”? – Three kinds of conceptual objects for LLMs

Bill Benzon3 Sep 2023 18:39 UTC

19 points

0 comments12 min readLW link

Fundamental question: What determines a mind’s effects?

TsviBT3 Sep 2023 17:15 UTC

16 points

4 comments13 min readLW link

An embedding decoder model, trained with a different objective on a different dataset, can decode another model’s embeddings surprisingly accurately

Logan Zoellner3 Sep 2023 11:34 UTC

20 points

1 comment1 min readLW link

Series of absurd upgrades in nature’s great search

lemonhope3 Sep 2023 9:35 UTC

15 points

8 comments1 min readLW link

Conservation of Expected Evidence and Random Sampling in Anthropics

Ape in the coat3 Sep 2023 6:55 UTC

9 points

9 comments7 min readLW link

The goal of physics

Jim Pivarski2 Sep 2023 23:08 UTC

47 points

4 comments5 min readLW link

Will value of paid sex drop right before the end of the world?

azamatvaliev2 Sep 2023 19:03 UTC

−9 points

0 comments4 min readLW link

PIBBSS Summer Symposium 2023

Nora_Ammann and DusanDNesic

2 Sep 2023 17:22 UTC

25 points

2 comments3 min readLW link

The smallest possible button (or: moth traps!)

Neil 2 Sep 2023 15:24 UTC

126 points

18 comments3 min readLW link

(neilwarren.substack.com)

Steven Harnad: Symbol grounding and the structure of dictionaries

Bill Benzon2 Sep 2023 12:28 UTC

5 points

3 comments2 min readLW link

Is Metaethics Unnecessary Given Intent-Aligned AI?

Caleb Biddulph2 Sep 2023 9:48 UTC

12 points

0 comments7 min readLW link

Rational Agents Cooperate in the Prisoner’s Dilemma

Isaac King2 Sep 2023 6:15 UTC

17 points

68 comments12 min readLW link

[Linkpost] Large language models converge toward human-like concept organization

Bogdan Ionut Cirstea2 Sep 2023 6:00 UTC

22 points

1 comment1 min readLW link

Plum Cooking Temperature

jefftk2 Sep 2023 1:30 UTC

11 points

0 comments1 min readLW link

(www.jefftk.com)