Betting and forecasting

CarlJ 9 Sep 2023 20:03 UTC

2 points

0 comments 1 min read LW link

AI presidents discuss AI alignment agendas

TurnTrout and Garrett Baker

9 Sep 2023 18:55 UTC

222 points

23 comments 1 min read LW link

(www.youtube.com)

Probabilistic argument relationships and an invitation to the argument mapping community

Alexander_Heckett 9 Sep 2023 18:45 UTC

13 points

4 comments 10 min read LW link

How teams went about their research at AI Safety Camp edition 8

Remmelt, Linda Linsefors and Kristi Uustalu

9 Sep 2023 16:34 UTC

28 points

0 comments 13 min read LW link

Panel discussion on AI consciousness with Rob Long and Jeff Sebo

Aaron Bergman 9 Sep 2023 3:38 UTC

10 points

0 comments 42 min read LW link

(www.youtube.com)

Possible Divergence in AGI Risk Tolerance between Selfish and Altruistic agents

Brad West 9 Sep 2023 0:23 UTC

1 point

1 comment 2 min read LW link

Capture the Flag Mechanistic Interpretability Challenges

Alejandro Acelas and Alexandre Variengien

8 Sep 2023 23:00 UTC

24 points

0 comments 7 min read LW link

[Question] What is to be done? (About the profit motive)

Connor Barber 8 Sep 2023 19:27 UTC

2 points

21 comments 1 min read LW link

What is the optimal frontier for due diligence?

RobertM and Ruby

8 Sep 2023 18:20 UTC

41 points

1 comment 1 min read LW link

Progress links digest, 2023-09-08: The Conservative Futurist, cargo airships, and more

jasoncrawford 8 Sep 2023 17:48 UTC

14 points

7 comments 5 min read LW link

(rootsofprogress.org)

The AI apocalypse myth.

Spiritus Dei 8 Sep 2023 17:43 UTC

−22 points

12 comments 2 min read LW link

Sum-threshold attacks

TsviBT 8 Sep 2023 17:13 UTC

259 points

57 comments 10 min read LW link

(tsvibt.blogspot.com)

Debate series: should we push for a pause on the development of AI?

Xodarap 8 Sep 2023 16:29 UTC

39 points

1 comment 1 min read LW link

AI Probability Trees—Joe Carlsmith (2022)

Nathan Young 8 Sep 2023 15:40 UTC

12 points

1 comment 8 min read LW link

Invading Australia (Endless Formerlies Most Beautiful, or What I Learned On My Holiday)

Oliver Sourbut 8 Sep 2023 15:33 UTC

14 points

1 comment 8 min read LW link

(www.oliversourbut.net)

Explaining grokking through circuit efficiency

Vikrant Varma and Rohin Shah

8 Sep 2023 14:39 UTC

102 points

11 comments 3 min read LW link

(arxiv.org)

Have Attention Spans Been Declining?

niplav 8 Sep 2023 14:11 UTC

75 points

23 comments 17 min read LW link 1 review

Explained Simply: Quantilizers

brook 8 Sep 2023 12:54 UTC

15 points

8 comments 1 min read LW link

(aisafetyexplained.substack.com)

Crossing the Rubicon.

Spiritus Dei 8 Sep 2023 4:19 UTC

−4 points

5 comments 13 min read LW link

[Question] What EY and LessWrong meant when (fill in the blank) found them.

Bill Benzon 8 Sep 2023 1:42 UTC

1 point

0 comments 1 min read LW link

Bring back the Colosseums

lc 8 Sep 2023 0:09 UTC

18 points

30 comments 1 min read LW link

Science to Be Done Internationally Using Blockchain

Victor Porton 7 Sep 2023 23:29 UTC

−18 points

0 comments 2 min read LW link

(science-dao.org)

A quick update from Nonlinear

KatWoods 7 Sep 2023 21:28 UTC

73 points

23 comments 2 min read LW link

[Linkpost] Frontier AI Taskforce: first progress report

Paul Colognese 7 Sep 2023 19:06 UTC

21 points

0 comments 4 min read LW link

(www.gov.uk)

[Question] How did you make your way back from meta?

matto 7 Sep 2023 17:23 UTC

25 points

28 comments 1 min read LW link

AI #28: Watching and Waiting

Zvi 7 Sep 2023 17:20 UTC

52 points

14 comments 45 min read LW link

(thezvi.wordpress.com)

[Question] Measure of complexity allowed by the laws of the universe and relative theory?

dr_s 7 Sep 2023 12:21 UTC

8 points

22 comments 1 min read LW link

Recreating the caring drive

Catnee 7 Sep 2023 10:41 UTC

43 points

15 comments 10 min read LW link 1 review

Sharing Information About Nonlinear

Ben Pace, the Vacationing Vagabond 7 Sep 2023 6:51 UTC

324 points

324 comments 34 min read LW link

Weekly Incidence vs Cumulative Infections

jefftk 7 Sep 2023 2:30 UTC

13 points

6 comments 1 min read LW link

(www.jefftk.com)

Improving Mathematical Accuracy in LLMs—History − 1

Abhay Chowdhry 7 Sep 2023 1:58 UTC

5 points

1 comment 9 min read LW link

Breaking RLHF “Safety” (And how to fix it?)

MPotter 7 Sep 2023 1:58 UTC

3 points

0 comments 4 min read LW link

Feedback-loops, Deliberate Practice, and Transfer Learning

Bird Concept and Raemon

7 Sep 2023 1:57 UTC

46 points

5 comments 1 min read LW link

Video essay: How Will We Know When AI is Conscious?

JanPro 6 Sep 2023 18:10 UTC

11 points

7 comments 1 min read LW link

(www.youtube.com)

My First Post

Jaivardhan Nawani 6 Sep 2023 17:42 UTC

35 points

9 comments 1 min read LW link

ActAdd: Steering Language Models without Optimization

technicalities, TurnTrout, lisathiergart, David Udell, Ulisse Mini and Monte M

6 Sep 2023 17:21 UTC

105 points

3 comments 2 min read LW link

(arxiv.org)

Monthly Roundup #10: September 2023

Zvi 6 Sep 2023 13:20 UTC

35 points

4 comments 56 min read LW link

(thezvi.wordpress.com)

Find Hot French Food Near Me: A Follow-up

aphyer 6 Sep 2023 12:32 UTC

77 points

19 comments 2 min read LW link

Manifest 2023

Saul Munn and Austin Chen

6 Sep 2023 11:24 UTC

3 points

0 comments 1 min read LW link

Last Chance: Get tickets to Manifest 2023! (Sep 22-24 in Berkeley)

Saul Munn and Austin Chen

6 Sep 2023 10:35 UTC

5 points

0 comments 1 min read LW link

What I’ve been reading, September 2023

jasoncrawford 6 Sep 2023 9:32 UTC

17 points

0 comments 5 min read LW link

(rootsofprogress.org)

Decision Theory: A (Normative) Introduction

Pareto Optimal 6 Sep 2023 8:22 UTC

−1 points

1 comment 3 min read LW link

(paretooptimal.substack.com)

[Question] What’s the easiest way to make a luminator?

kuira 6 Sep 2023 0:07 UTC

7 points

13 comments 1 min read LW link

Ordinary claims require ordinary evidence

blake8086 5 Sep 2023 22:09 UTC

1 point

3 comments 2 min read LW link

Conversation about paradigms, intellectual progress, social consensus, and AI

Ruby and RobertM

5 Sep 2023 21:30 UTC

14 points

6 comments 1 min read LW link

What I would do if I wasn’t at ARC Evals

LawrenceC 5 Sep 2023 19:19 UTC

220 points

10 comments 13 min read LW link 1 review

The Evolutionary Pathway from Biological to Digital Intelligence: A Cosmic Perspective

George360 5 Sep 2023 17:47 UTC

−17 points

0 comments 4 min read LW link

The Illusion of Universal Morality: A Dynamic Perspective on Genetic Fitness and Ethical Complexity

George360 5 Sep 2023 17:47 UTC

−9 points

7 comments 2 min read LW link

Benchmarks for Detecting Measurement Tampering [Redwood Research]

ryan_greenblatt and Fabien Roger

5 Sep 2023 16:44 UTC

94 points

22 comments 20 min read LW link 1 review

(arxiv.org)

[Question] Strongest real-world examples supporting AI risk claims?

rosehadshar 5 Sep 2023 15:12 UTC

41 points

7 comments 1 min read LW link