All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan Feb Mar Apr May Jun Jul Aug Sep OctNovDec

All 1 2 3 4 5 6 7 8 9 10 111213 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

[Question] What are the primary drivers that caused selection pressure for intelligence in humans?

Towards_KeeperhoodNov 7, 2024, 9:40 AM

8 points

15 comments1 min readLW link

The Logistics of Distribution of Meaning: Against Epistemic Bureaucratization

SahilNov 7, 2024, 5:27 AM

27 points

7 comments12 min readLW link

SAEs are highly dataset dependent: a case study on the refusal direction

Connor Kissane, robertzk, Neel Nanda and Arthur Conmy

Nov 7, 2024, 5:22 AM

66 points

4 comments14 min readLW link

Should CA, TX, OK, and LA merge into a giant swing state, just for elections?

Thomas KwaNov 6, 2024, 11:01 PM

115 points

35 comments1 min readLW link

New Funding Category Open in Foresight’s AI Safety Grants

Allison DuettmannNov 6, 2024, 10:59 PM

15 points

0 comments1 min readLW link

Scattered thoughts on what it means for an LLM to believe

TheManxLoinerNov 6, 2024, 10:10 PM

5 points

4 comments5 min readLW link

The Bayesian Conspiracy Live Recording

EneaszNov 6, 2024, 4:25 PM

9 points

0 comments1 min readLW link

Anthropic: Three Sketches of ASL-4 Safety Case Components

Zach Stein-PerlmanNov 6, 2024, 4:00 PM

95 points

33 comments1 min readLW link

(alignment.anthropic.com)

Meme Talking Points

ymeskhoutNov 6, 2024, 3:27 PM

34 points

0 comments3 min readLW link

Advisors for Smaller Major Donors?

jefftkNov 6, 2024, 2:30 PM

18 points

2 comments3 min readLW link

(www.jefftk.com)

Scissors Statements for President?

AnnaSalamonNov 6, 2024, 10:38 AM

118 points

32 comments1 min readLW link

[Question] How to cite LessWrong as an academic source?

PhilosophicalSoulNov 6, 2024, 8:28 AM

6 points

6 comments1 min readLW link

How to put California and Texas on the campaign trail!

Yair HalberstadtNov 6, 2024, 6:08 AM

25 points

4 comments1 min readLW link

LDT (and everything else) can be irrational

Christopher KingNov 6, 2024, 4:05 AM

10 points

15 comments2 min readLW link

Join my new subscriber chat

sarahconstantinNov 6, 2024, 2:30 AM

7 points

0 comments1 min readLW link

(sarahconstantin.substack.com)

Graceful Degradation

ScrewtapeNov 5, 2024, 11:57 PM

83 points

8 comments4 min readLW link

An alternative approach to superbabies

Towards_KeeperhoodNov 5, 2024, 10:56 PM

48 points

19 comments3 min readLW link

Apply to be a mentor in SPAR!

agucovaNov 5, 2024, 9:32 PM

5 points

0 comments LW link

Going Beyond “immaturity”

moisentinelNov 5, 2024, 8:51 PM

−3 points

2 comments2 min readLW link

Intent alignment as a stepping-stone to value alignment

Seth HerdNov 5, 2024, 8:43 PM

37 points

8 comments3 min readLW link

Why Recursion Pharmaceuticals abandoned cell painting for brightfield imaging

Abhishaike MahajanNov 5, 2024, 2:51 PM

29 points

1 comment18 min readLW link

(www.owlposting.com)

Winning isn’t enough

JesseClifton and Anthony DiGiovanni

Nov 5, 2024, 11:37 AM

40 points

18 comments9 min readLW link

Anthropic—The case for targeted regulation

anagumaNov 5, 2024, 7:07 AM

11 points

0 comments2 min readLW link

(www.anthropic.com)

The Shallow Bench

Karl FaulksNov 5, 2024, 5:07 AM

48 points

5 comments3 min readLW link

Using Narrative Prompting to Extract Policy Forecasts from LLMs

Max GhenisNov 5, 2024, 4:37 AM

5 points

0 comments1 min readLW link

ML4Good (AI Safety Bootcamp) - Experience report

JanEbbingNov 5, 2024, 1:18 AM

13 points

0 comments3 min readLW link

Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities

Jonathan N, abra, Connor Axiotes and Esben Kran

Nov 5, 2024, 1:01 AM

8 points

0 comments6 min readLW link

(www.apartresearch.com)

[Question] Could orcas be (trained to be) smarter than humans? 

Towards_KeeperhoodNov 4, 2024, 11:29 PM

56 points

23 comments1 min readLW link

Metastatic Cancer Treatment Since 2010: The Success Stories

sarahconstantinNov 4, 2024, 10:50 PM

51 points

2 comments6 min readLW link

(sarahconstantin.substack.com)

Bay Winter Solstice 2024: Speech Auditions

ozymandiasNov 4, 2024, 10:31 PM

32 points

1 comment1 min readLW link

Empathy/Systemizing Quotient is a poor/biased model for the autism/sex link

tailcalledNov 4, 2024, 9:11 PM

43 points

0 comments7 min readLW link

Distributed espionage

margetmagentaNov 4, 2024, 7:43 PM

3 points

0 comments1 min readLW link

GPT-8 may not be ASI

rvzlxax409Nov 4, 2024, 7:31 PM

−2 points

1 comment3 min readLW link

AI timelines don’t account for base rate of tech progress

rvzlxax409Nov 4, 2024, 7:31 PM

−10 points

2 comments1 min readLW link

Update on the Mysterious Trump Buyers on Polymarket

AnnapurnaNov 4, 2024, 7:22 PM

19 points

9 comments1 min readLW link

(jorgevelez.substack.com)

[Intuitive self-models] 8. Rooting Out Free Will Intuitions

Steven ByrnesNov 4, 2024, 6:16 PM

70 points

19 comments24 min readLW link

Option control

Joe CarlsmithNov 4, 2024, 5:54 PM

28 points

0 comments54 min readLW link

[Question] Noticing the World

EvolutionByDesignNov 4, 2024, 4:41 PM

4 points

1 comment1 min readLW link

The current state of RSPs

Zach Stein-PerlmanNov 4, 2024, 4:00 PM

23 points

2 comments9 min readLW link

[Question] Does the “ancient wisdom” argument have any validity? If a particular teaching or tradition is old, to what extent does this make it more trustworthy?

SpectrumDTNov 4, 2024, 3:20 PM

18 points

49 comments1 min readLW link

A brief history of the automated corporation

owencbNov 4, 2024, 2:35 PM

26 points

1 comment5 min readLW link

(strangecities.substack.com)

Abstractions are not Natural

Alfred HarwoodNov 4, 2024, 11:10 AM

25 points

21 comments11 min readLW link

[Linkpost] Building Altruistic and Moral AI Agent with Brain-inspired Affective Empathy Mechanisms

Gunnar_ZarnckeNov 4, 2024, 10:15 AM

13 points

0 comments1 min readLW link

(arxiv.org)

Context-dependent consequentialism

Jeremy Gillen and mattmacdermott

Nov 4, 2024, 9:29 AM

31 points

6 comments27 min readLW link

Survival without dignity

L Rudolf LNov 4, 2024, 2:29 AM

369 points

29 comments15 min readLW link

(nosetgauge.substack.com)

Drug development costs can range over two orders of magnitude

rossryNov 3, 2024, 11:13 PM

38 points

0 comments11 min readLW link

Redefining Tolerance: Beyond Popper’s Paradox

mindprisonNov 3, 2024, 10:23 PM

−1 points

0 comments3 min readLW link

Goal: Understand Intelligence

Johannes C. MayerNov 3, 2024, 9:20 PM

14 points

19 comments1 min readLW link

Current safety training techniques do not fully transfer to the agent setting

Simon Lermen and Govind Pimpale

Nov 3, 2024, 7:24 PM

158 points

9 comments5 min readLW link

Why our politicians aren’t Median

Yair HalberstadtNov 3, 2024, 2:03 PM

62 points

15 comments3 min readLW link