All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 567 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

Graceful Degradation

Screwtape5 Nov 2024 23:57 UTC

84 points

8 comments4 min readLW link

An alternative approach to superbabies

Towards_Keeperhood5 Nov 2024 22:56 UTC

48 points

19 comments3 min readLW link

Apply to be a mentor in SPAR!

agucova5 Nov 2024 21:32 UTC

5 points

0 comments1 min readLW link

Going Beyond “immaturity”

moisentinel5 Nov 2024 20:51 UTC

−3 points

2 comments2 min readLW link

Intent alignment as a stepping-stone to value alignment

Seth Herd5 Nov 2024 20:43 UTC

37 points

8 comments3 min readLW link

Why Recursion Pharmaceuticals abandoned cell painting for brightfield imaging

Abhishaike Mahajan5 Nov 2024 14:51 UTC

29 points

1 comment18 min readLW link

(www.owlposting.com)

Winning isn’t enough

JesseClifton and Anthony DiGiovanni

5 Nov 2024 11:37 UTC

44 points

35 comments9 min readLW link

Anthropic—The case for targeted regulation

anaguma5 Nov 2024 7:07 UTC

11 points

0 comments2 min readLW link

(www.anthropic.com)

The Shallow Bench

Karl Faulks5 Nov 2024 5:07 UTC

56 points

5 comments3 min readLW link

Using Narrative Prompting to Extract Policy Forecasts from LLMs

Max Ghenis5 Nov 2024 4:37 UTC

5 points

0 comments1 min readLW link

ML4Good (AI Safety Bootcamp) - Experience report

JanEbbing5 Nov 2024 1:18 UTC

16 points

0 comments3 min readLW link

Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities

Jonathan N, Andrey Anurin, Connor Axiotes and Esben Kran

5 Nov 2024 1:01 UTC

9 points

0 comments6 min readLW link

(www.apartresearch.com)

[Question] Could orcas be (trained to be) smarter than humans? 

Towards_Keeperhood4 Nov 2024 23:29 UTC

59 points

23 comments1 min readLW link

Metastatic Cancer Treatment Since 2010: The Success Stories

sarahconstantin4 Nov 2024 22:50 UTC

51 points

2 comments6 min readLW link

(sarahconstantin.substack.com)

Bay Winter Solstice 2024: Speech Auditions

ozymandias4 Nov 2024 22:31 UTC

32 points

1 comment1 min readLW link

Empathy/Systemizing Quotient is a poor/biased model for the autism/sex link

tailcalled4 Nov 2024 21:11 UTC

48 points

1 comment7 min readLW link 1 review

Distributed espionage

margetmagenta4 Nov 2024 19:43 UTC

3 points

0 comments1 min readLW link

GPT-8 may not be ASI

rvzlxax4094 Nov 2024 19:31 UTC

−2 points

1 comment3 min readLW link

AI timelines don’t account for base rate of tech progress

rvzlxax4094 Nov 2024 19:31 UTC

−10 points

2 comments1 min readLW link

Update on the Mysterious Trump Buyers on Polymarket

Annapurna4 Nov 2024 19:22 UTC

11 points

9 comments1 min readLW link

(jorgevelez.substack.com)

[Intuitive self-models] 8. Rooting Out Free Will Intuitions

Steven Byrnes4 Nov 2024 18:16 UTC

79 points

20 comments24 min readLW link

Option control

Joe Carlsmith4 Nov 2024 17:54 UTC

28 points

0 comments55 min readLW link

[Question] Noticing the World

EvolutionByDesign4 Nov 2024 16:41 UTC

4 points

1 comment1 min readLW link

The current state of RSPs

Zach Stein-Perlman4 Nov 2024 16:00 UTC

23 points

2 comments9 min readLW link

[Question] Does the “ancient wisdom” argument have any validity? If a particular teaching or tradition is old, to what extent does this make it more trustworthy?

SpectrumDT4 Nov 2024 15:20 UTC

19 points

49 comments1 min readLW link

A brief history of the automated corporation

owencb4 Nov 2024 14:35 UTC

26 points

1 comment5 min readLW link

(strangecities.substack.com)

Abstractions are not Natural

Alfred Harwood4 Nov 2024 11:10 UTC

25 points

21 comments11 min readLW link

[Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms

Gunnar_Zarncke4 Nov 2024 10:15 UTC

13 points

0 comments1 min readLW link

(arxiv.org)

Context-dependent consequentialism

Jeremy Gillen and mattmacdermott

4 Nov 2024 9:29 UTC

31 points

6 comments27 min readLW link

Survival without dignity

L Rudolf L4 Nov 2024 2:29 UTC

410 points

30 comments15 min readLW link 1 review

(nosetgauge.substack.com)

Drug development costs can range over two orders of magnitude

rossry3 Nov 2024 23:13 UTC

38 points

0 comments11 min readLW link

Redefining Tolerance: Beyond Popper’s Paradox

mindprison3 Nov 2024 22:23 UTC

−1 points

0 comments3 min readLW link

Goal: Understand Intelligence

Johannes C. Mayer3 Nov 2024 21:20 UTC

14 points

19 comments1 min readLW link

Current safety training techniques do not fully transfer to the agent setting

Simon Lermen and fidgetsinner

3 Nov 2024 19:24 UTC

162 points

9 comments5 min readLW link

Why our politicians aren’t Median

Yair Halberstadt3 Nov 2024 14:03 UTC

73 points

15 comments3 min readLW link

Human Biodiversity (Part 4: Astral Codex Ten)

Evan_Gaensbauer3 Nov 2024 4:20 UTC

−14 points

5 comments1 min readLW link

(reflectivealtruism.com)

Understanding incomparability versus incommensurability in relation to RLHF

artemiocobb2 Nov 2024 22:57 UTC

1 point

1 comment2 min readLW link

electric turbofans

bhauth2 Nov 2024 22:50 UTC

63 points

2 comments5 min readLW link

(bhauth.com)

Reality as Category-Theoretic State Machines: A Mathematical Framework

Wenitte Apiou2 Nov 2024 21:04 UTC

−8 points

0 comments2 min readLW link

The Median Researcher Problem

johnswentworth2 Nov 2024 20:16 UTC

167 points

74 comments1 min readLW link 2 reviews

Testing “True” Language Understanding in LLMs: A Simple Proposal

MtryaSam2 Nov 2024 19:12 UTC

9 points

2 comments2 min readLW link

Testing “True” Language Understanding in LLMs: A Simple Proposal

MtryaSam2 Nov 2024 19:12 UTC

−3 points

0 comments2 min readLW link

Fragile, Robust, and Antifragile Preference Satisfaction

adamShimi2 Nov 2024 17:25 UTC

19 points

0 comments5 min readLW link

(formethods.substack.com)

Higher Order Signs, Hallucination and Schizophrenia

Nicolas Villarreal2 Nov 2024 16:33 UTC

4 points

0 comments13 min readLW link

(nicolasdvillarreal.substack.com)

[Question] Is OpenAI net negative for AI Safety?

Lysandre Terrisse2 Nov 2024 16:18 UTC

4 points

0 comments1 min readLW link

Two arguments against longtermist thought experiments

momom22 Nov 2024 10:22 UTC

15 points

6 comments3 min readLW link

Both-Sidesism—When Fair & Balanced Goes Wrong

James Stephen Brown2 Nov 2024 3:04 UTC

3 points

15 comments6 min readLW link

(nonzerosum.games)

What can we learn from insecure domains?

Logan Zoellner1 Nov 2024 23:53 UTC

14 points

21 comments1 min readLW link

Science advances one funeral at a time

Cameron Berg, Kvee, Diogo de Lucena and Trent Hodgeson

1 Nov 2024 23:06 UTC

104 points

9 comments2 min readLW link

The Cartesian Crisis

mindprison1 Nov 2024 23:02 UTC

−5 points

2 comments2 min readLW link