All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025

All Jan Feb Mar Apr May Jun JulAugSep Oct Nov Dec

All1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

FarmKind’s Illusory Offer

jefftkAug 9, 2024, 11:30 AM

71 points

5 comments3 min readLW link

(www.jefftk.com)

Please do not use AI to write for you

Richard_KennawayAug 21, 2024, 9:53 AM

69 points

34 comments4 min readLW link

What is it to solve the alignment problem? (Notes)

Joe CarlsmithAug 24, 2024, 9:19 PM

69 points

18 comments53 min readLW link

The Hessian rank bounds the learning coefficient

Lucius BushnaqAug 8, 2024, 8:55 PM

68 points

10 comments4 min readLW link

Showing SAE Latents Are Not Atomic Using Meta-SAEs

Bart Bussmann, Michael Pearce, Patrick Leask, Joseph Bloom, Lee Sharkey and Neel Nanda

Aug 24, 2024, 12:56 AM

68 points

10 comments20 min readLW link

GPT-4o System Card

Zach Stein-PerlmanAug 8, 2024, 8:30 PM

68 points

11 comments2 min readLW link

(openai.com)

AI #79: Ready for Some Football

ZviAug 29, 2024, 1:30 PM

68 points

16 comments32 min readLW link

(thezvi.wordpress.com)

Why Large Bureaucratic Organizations?

johnswentworthAug 27, 2024, 6:30 PM

68 points

52 comments12 min readLW link

The economics of space tethers

harsimonyAug 22, 2024, 4:15 PM

67 points

22 comments7 min readLW link

(splittinginfinity.substack.com)

Fear of centralized power vs. fear of misaligned AGI: Vitalik Buterin on 80,000 Hours

Seth HerdAug 5, 2024, 3:38 PM

66 points

22 comments5 min readLW link

A primer on why computational predictive toxicology is hard

Abhishaike MahajanAug 19, 2024, 5:16 PM

63 points

2 comments12 min readLW link

(www.owlposting.com)

Interdictor Ship

lsusrAug 19, 2024, 4:59 AM

63 points

9 comments7 min readLW link

Outrage Bonding

Jonathan MoregårdAug 9, 2024, 1:46 PM

63 points

12 comments2 min readLW link

(honestliving.substack.com)

Rationalists are missing a core piece for agent-like structure (energy vs information overload)

tailcalledAug 17, 2024, 9:57 AM

62 points

9 comments4 min readLW link

AI #78: Some Welcome Calm

ZviAug 22, 2024, 2:20 PM

61 points

15 comments33 min readLW link

(thezvi.wordpress.com)

Self-explaining SAE features

Dmitrii Kharlapenko, neverix, Neel Nanda and Arthur Conmy

Aug 5, 2024, 10:20 PM

60 points

13 comments10 min readLW link

… Wait, our models of semantics should inform fluid mechanics?!?

johnswentworth and David Lorell

Aug 26, 2024, 4:38 PM

59 points

18 comments4 min readLW link

Announcing the $200k EA Community Choice

Austin ChenAug 14, 2024, 12:39 AM

58 points

8 comments LW link

(manifund.substack.com)

Congressional Insider Trading

Maxwell TabarrokAug 30, 2024, 1:32 PM

57 points

6 comments7 min readLW link

(www.maximum-progress.com)

You’re a Space Wizard, Luke

lsusrAug 18, 2024, 5:35 AM

57 points

6 comments2 min readLW link

Referendum Mechanics in a Marketplace of Ideas

Martin SustrikAug 25, 2024, 8:30 AM

57 points

2 comments5 min readLW link

(250bpm.substack.com)

The Bitter Lesson for AI Safety Research

adamk, Richard Ren, Dan H and Gabe M

Aug 2, 2024, 6:39 PM

57 points

5 comments3 min readLW link

Some Unorthodox Ways To Achieve High GDP Growth

johnswentworth and David Lorell

Aug 8, 2024, 6:58 PM

57 points

6 comments6 min readLW link

John Schulman leaves OpenAI for Anthropic [and then left Anthropic again for Thinking Machines]

SodiumAug 6, 2024, 1:23 AM

57 points

0 comments1 min readLW link

Measuring Structure Development in Algorithmic Transformers

Micurie and Einar Urdshals

Aug 22, 2024, 8:38 AM

56 points

4 comments11 min readLW link

Thiel on AI & Racing with China

Ben PaceAug 20, 2024, 3:19 AM

55 points

10 comments12 min readLW link

Demis Hassabis — Google DeepMind: The Podcast

Zach Stein-PerlmanAug 16, 2024, 12:00 AM

55 points

8 comments3 min readLW link

(www.youtube.com)

Owain Evans on Situational Awareness and Out-of-Context Reasoning in LLMs

Michaël TrazziAug 24, 2024, 4:30 AM

55 points

0 comments5 min readLW link

[LDSL#0] Some epistemological conundrums

tailcalledAug 7, 2024, 7:52 PM

54 points

11 comments10 min readLW link

Provably Safe AI: Worldview and Projects

Ben Goldhaber and Steve_Omohundro

Aug 9, 2024, 11:21 PM

54 points

44 comments7 min readLW link

Calendar feature geometry in GPT-2 layer 8 residual stream SAEs

Patrick Leask, Bart Bussmann and Neel Nanda

Aug 17, 2024, 1:16 AM

53 points

0 comments5 min readLW link

Extended Interview with Zhukeepa on Religion

Ben Pace and zhukeepa

Aug 18, 2024, 3:19 AM

53 points

61 comments119 min readLW link

AI Rights for Human Safety

Simon GoldsteinAug 1, 2024, 11:01 PM

53 points

6 comments1 min readLW link

(papers.ssrn.com)

AI #76: Six Shorts Stories About OpenAI

ZviAug 8, 2024, 1:50 PM

53 points

10 comments48 min readLW link

(thezvi.wordpress.com)

Rewilding the Gut VS the Autoimmune Epidemic

GGDAug 16, 2024, 6:00 PM

51 points

0 comments3 min readLW link

Decision Theory in Space

lsusrAug 18, 2024, 7:02 AM

50 points

18 comments2 min readLW link

Interoperable High Level Structures: Early Thoughts on Adjectives

johnswentworth and David Lorell

Aug 22, 2024, 9:12 PM

49 points

1 comment7 min readLW link

SRE’s review of Democracy

Martin SustrikAug 3, 2024, 7:20 AM

48 points

2 comments3 min readLW link

(250bpm.substack.com)

What’s important in “AI for epistemics”?

Lukas FinnvedenAug 24, 2024, 1:27 AM

48 points

0 comments28 min readLW link

(www.forethought.org)

Trustworthy and untrustworthy models

Olli JärviniemiAug 19, 2024, 4:27 PM

47 points

3 comments8 min readLW link

All The Latest Human tFUS Studies

sarahconstantinAug 9, 2024, 10:20 PM

46 points

2 comments8 min readLW link

(sarahconstantin.substack.com)

Humanity isn’t remotely longtermist, so arguments for AGI x-risk should focus on the near term

Seth Herd12 Aug 2024 18:10 UTC

46 points

10 comments1 min readLW link

We’re not as 3-Dimensional as We Think

silentbob4 Aug 2024 14:39 UTC

46 points

17 comments5 min readLW link

How to hire somebody better than yourself

lemonhope28 Aug 2024 8:12 UTC

46 points

5 comments5 min readLW link

AI #75: Math is Easier

Zvi1 Aug 2024 13:40 UTC

46 points

25 comments72 min readLW link

(thezvi.wordpress.com)

Principled Satisficing To Avoid Goodhart

JenniferRM16 Aug 2024 19:05 UTC

45 points

2 comments8 min readLW link

Startup Roundup #2

Zvi6 Aug 2024 13:30 UTC

45 points

0 comments32 min readLW link

(thezvi.wordpress.com)

Case Study: Interpreting, Manipulating, and Controlling CLIP With Sparse Autoencoders

Gytis Daujotas1 Aug 2024 21:08 UTC

45 points

7 comments7 min readLW link

[Question] “Deception Genre” What Books are like Project Lawful?

Double28 Aug 2024 17:19 UTC

45 points

20 comments1 min readLW link

In defense of technological unemployment as the main AI concern

tailcalled27 Aug 2024 17:58 UTC

44 points

36 comments1 min readLW link