All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All JanFebMar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 131415 16 17 18 19 20 21 22 23 24 25 26 27 28 29

An EA used deceptive messaging to advance her project; we need mechanisms to avoid deontologically dubious plans

Mikhail Samin13 Feb 2024 23:15 UTC

25 points

1 comment5 min readLW link

Useful starting code for interpretability

eggsyntax13 Feb 2024 23:13 UTC

26 points

2 comments1 min readLW link

Masterpiece

Richard_Ngo13 Feb 2024 23:10 UTC

178 points

22 comments4 min readLW link 1 review

(www.narrativeark.xyz)

A Bridge Between Utilitarianism & Stoicism

Jonathan Moregård13 Feb 2024 22:46 UTC

5 points

0 comments5 min readLW link

(honestliving.substack.com)

The “context window” analogy for human minds

Ruby13 Feb 2024 19:29 UTC

38 points

0 comments2 min readLW link

More on the Apple Vision Pro

Zvi13 Feb 2024 17:40 UTC

33 points

5 comments8 min readLW link

(thezvi.wordpress.com)

Linear White

Teja Prabhu13 Feb 2024 16:31 UTC

−3 points

3 comments3 min readLW link

(krez.expert)

Causality is Everywhere

silentbob13 Feb 2024 13:44 UTC

26 points

12 comments8 min readLW link

Technologies and Terminology: AI isn’t Software, it’s… Deepware?

Davidmanheim and abramdemski

13 Feb 2024 13:37 UTC

40 points

10 comments8 min readLW link

[Question] LessWrong Is Very Wrong: Ultimately All Social Media Platforms Are The Same

Amritesh Kumar13 Feb 2024 6:53 UTC

−16 points

2 comments1 min readLW link

Lsusr’s Rationality Dojo

lsusr13 Feb 2024 5:52 UTC

108 points

19 comments2 min readLW link

[Question] Where is the Town Square?

Gretta Duleba13 Feb 2024 3:53 UTC

46 points

8 comments1 min readLW link

My cover story in Jacobin on AI capitalism and the x-risk debates

garrison12 Feb 2024 23:34 UTC

98 points

5 comments6 min readLW link

(jacobin.com)

What is Ontology?

martinkunev12 Feb 2024 23:01 UTC

4 points

0 comments4 min readLW link

Thank you for triggering me

Cissy12 Feb 2024 20:09 UTC

5 points

2 comments6 min readLW link 1 review

(www.moremyself.xyz)

Interpreting Quantum Mechanics in Infra-Bayesian Physicalism

Yegreg12 Feb 2024 18:56 UTC

35 points

10 comments43 min readLW link 1 review

I played the AI box game as the Gatekeeper — and lost

datawitch12 Feb 2024 18:39 UTC

36 points

55 comments4 min readLW link

The Last Laugh: Exploring the Role of Humor as a Benchmark for Large Language Models

Greg Robison12 Feb 2024 18:34 UTC

4 points

6 comments11 min readLW link

Natural abstractions are observer-dependent: a conversation with John Wentworth

Martín Soto12 Feb 2024 17:28 UTC

40 points

13 comments7 min readLW link

Tort Law Can Play an Important Role in Mitigating AI Risk

Gabriel Weil12 Feb 2024 17:17 UTC

39 points

9 comments5 min readLW link

On the Proposed California SB 1047

Zvi12 Feb 2024 16:40 UTC

46 points

18 comments12 min readLW link

(thezvi.wordpress.com)

Thoughts on “The Offense-Defense Balance Rarely Changes”

Cullen12 Feb 2024 3:26 UTC

46 points

5 comments5 min readLW link

Skepticism About DeepMind’s “Grandmaster-Level” Chess Without Search

Arjun Panickssery12 Feb 2024 0:56 UTC

57 points

13 comments3 min readLW link

[Question] What are the known difficulties with this alignment approach?

tailcalled11 Feb 2024 22:52 UTC

18 points

24 comments1 min readLW link

[Question] What are the deciding factors of human cognitive endurance?

koratkar11 Feb 2024 21:56 UTC

23 points

3 comments1 min readLW link

Carl Shulman On Dwarkesh Podcast June 2023

Moonicker11 Feb 2024 21:02 UTC

18 points

0 comments159 min readLW link

How do you actually obtain and report a likelihood function for scientific research?

Peter Berggren11 Feb 2024 17:42 UTC

55 points

4 comments1 min readLW link

The entropy maxim for binary questions

dkl911 Feb 2024 17:17 UTC

2 points

1 comment1 min readLW link

(dkl9.net)

GPT2XL_RLLMv3 vs. BetterDAN, AI Machiavelli & Oppo Jailbreaks

MiguelDev11 Feb 2024 11:03 UTC

16 points

4 comments14 min readLW link

[Question] What’s the theory of impact for activation vectors?

Chris_Leong11 Feb 2024 7:34 UTC

61 points

12 comments1 min readLW link

Experimenting With Footboard Piezos

jefftk11 Feb 2024 3:00 UTC

11 points

2 comments2 min readLW link

(www.jefftk.com)

The Core Values of Life—A proposal for a universal theory of ethics

Thomas Gjøstøl10 Feb 2024 21:48 UTC

2 points

4 comments18 min readLW link

And All the Shoggoths Merely Players

Zack_M_Davis10 Feb 2024 19:56 UTC

178 points

59 comments12 min readLW link 2 reviews

Sam Altman’s Chip Ambitions Undercut OpenAI’s Safety Strategy

garrison10 Feb 2024 19:52 UTC

198 points

53 comments3 min readLW link 1 review

(garrisonlovely.substack.com)

The lattice of partial updatelessness

Martín Soto10 Feb 2024 17:34 UTC

23 points

5 comments5 min readLW link

A Strange ACH Corner Case

jefftk10 Feb 2024 3:00 UTC

27 points

2 comments2 min readLW link

(www.jefftk.com)

Dreams of AI alignment: The danger of suggestive names

TurnTrout10 Feb 2024 1:22 UTC

109 points

59 comments4 min readLW link

Scenario planning for AI x-risk

Corin Katzke10 Feb 2024 0:14 UTC

24 points

12 comments14 min readLW link

(forum.effectivealtruism.org)

Close the Gates to an Inhuman Future: How and why we should choose to not develop superhuman general-purpose artificial intelligence

aaguirre9 Feb 2024 20:25 UTC

13 points

0 comments1 min readLW link

(arxiv.org)

[Crosspost] Deep Dive: The Coming Technological Singularity—How to survive in a Post-human Era

simulacra.exe9 Feb 2024 18:49 UTC

2 points

2 comments9 min readLW link

The Ideal Speech Situation as a Tool for AI Ethical Reflection: A Framework for Alignment

kenneth myers9 Feb 2024 18:40 UTC

6 points

12 comments3 min readLW link

What’s ChatGPT’s Favorite Ice Cream Flavor? An Investigation Into Synthetic Respondents

Greg Robison9 Feb 2024 18:38 UTC

19 points

4 comments15 min readLW link

OpenAI wants to raise 5-7 trillion

O O9 Feb 2024 16:15 UTC

13 points

29 comments1 min readLW link

(decrypt.co)

[Question] Constituency-sized AI congress?

Nathan Helm-Burger9 Feb 2024 16:01 UTC

11 points

5 comments1 min readLW link

One True Love

Zvi9 Feb 2024 15:10 UTC

34 points

7 comments10 min readLW link

(thezvi.wordpress.com)

[Question] Executive function advice from people who are good at it?

TeaTieAndHat9 Feb 2024 10:11 UTC

7 points

1 comment1 min readLW link

[Question] Do you want to make an AI Alignment song?

Kabir Kumar9 Feb 2024 8:22 UTC

4 points

0 comments1 min readLW link

Skills I’d like my collaborators to have

Raemon9 Feb 2024 8:20 UTC

108 points

9 comments8 min readLW link

Transfer learning and generalization-qua-capability in Babbage and Davinci (or, why division is better than Spanish)

RP and agg

9 Feb 2024 7:00 UTC

50 points

6 comments3 min readLW link

Biden-Harris Administration Announces First-Ever Consortium Dedicated to AI Safety

Ben Smith9 Feb 2024 6:40 UTC

22 points

0 comments1 min readLW link

(www.nist.gov)