$20K In Bounties for AI Safety Public Materials

Dan H, TW123 and ozhang

Aug 5, 2022, 2:52 AM

71 points

9 comments 6 min read LW link

Paper reading as a Cargo Cult

jem-mosig

Aug 7, 2022, 7:50 AM

70 points

10 comments 5 min read LW link

Building a Bugs List prompts

CFAR!Duncan

Aug 13, 2022, 8:00 AM

69 points

9 comments 2 min read LW link

Jack Clark on the realities of AI policy

Kaj_Sotala

Aug 7, 2022, 8:44 AM

68 points

3 comments 3 min read LW link

(threadreaderapp.com)

The Expanding Moral Cinematic Universe

Raemon

Aug 28, 2022, 6:42 PM

67 points

9 comments 14 min read LW link

In Defense Of Making Money

George3d6

Aug 18, 2022, 2:10 PM

65 points

13 comments 7 min read LW link

(www.epistem.ink)

AI art isn’t “about to shake things up”. It’s already here.

Davis_Kingsley

Aug 22, 2022, 11:17 AM

65 points

19 comments 3 min read LW link

Vingean Agency

abramdemski

Aug 24, 2022, 8:08 PM

63 points

14 comments 3 min read LW link

ACX Meetups Everywhere List

Scott Alexander

Aug 26, 2022, 6:12 PM

63 points

1 comment 41 min read LW link

Encultured AI Pre-planning, Part 1: Enabling New Benchmarks

Andrew_Critch and Nick Hay

Aug 8, 2022, 10:44 PM

63 points

2 comments 6 min read LW link

Steganography in Chain of Thought Reasoning

A Ray

Aug 8, 2022, 3:47 AM

62 points

13 comments 6 min read LW link

Oops It’s Time To Overthrow the Organizer Day!

Screwtape

Aug 18, 2022, 4:40 PM

62 points

5 comments 4 min read LW link

Seeking PCK (Pedagogical Content Knowledge)

CFAR!Duncan

Aug 12, 2022, 4:15 AM

62 points

11 comments 5 min read LW link

Seeking Interns/RAs for Mechanistic Interpretability Projects

Neel Nanda

Aug 15, 2022, 7:11 AM

61 points

0 comments 2 min read LW link

Autonomy as taking responsibility for reference maintenance

Ramana Kumar

Aug 17, 2022, 12:50 PM

61 points

3 comments 5 min read LW link

An Introduction to Current Theories of Consciousness

hohenheim

Aug 28, 2022, 5:55 PM

60 points

43 comments 49 min read LW link

OpenAI’s Alignment Plans

dkirmani

Aug 24, 2022, 7:39 PM

60 points

17 comments 5 min read LW link

(openai.com)

Anti-squatted AI x-risk domains index

plex

Aug 12, 2022, 12:01 PM

59 points

6 comments 1 min read LW link

Finding Goals in the World Model

Jeremy Gillen, JamesH and Thomas Larsen

Aug 22, 2022, 6:06 PM

59 points

8 comments 13 min read LW link

The Pragmascope Idea

johnswentworth

Aug 4, 2022, 9:52 PM

59 points

20 comments 3 min read LW link

My thoughts on direct work (and joining LessWrong)

RobertM

Aug 16, 2022, 6:53 PM

58 points

4 comments 6 min read LW link

How to plan for a radically uncertain future?

Kerry

Aug 30, 2022, 2:14 AM

57 points

35 comments 1 min read LW link

EA & LW Forums Weekly Summary (21 Aug − 27 Aug 22′)

Zoe Williams

Aug 30, 2022, 1:42 AM

57 points

4 comments 12 min read LW link

How and why to turn everything into audio

KatWoods and AmberDawn

Aug 11, 2022, 8:55 AM

57 points

20 comments 5 min read LW link

Refine’s First Blog Post Day

adamShimi

Aug 13, 2022, 10:23 AM

55 points

3 comments 1 min read LW link

[Question] How to bet against civilizational adequacy?

Wei Dai

12 Aug 2022 23:33 UTC

54 points

20 comments 1 min read LW link

All the posts I will never write

Alexander Gietelink Oldenziel

14 Aug 2022 18:29 UTC

54 points

8 comments 8 min read LW link

Brain-like AGI project “aintelope”

Gunnar_Zarncke

14 Aug 2022 16:33 UTC

54 points

2 comments 1 min read LW link

Transformer language models are doing something more general

Numendil

3 Aug 2022 21:13 UTC

53 points

6 comments 2 min read LW link

I missed the crux of the alignment problem the whole time

zeshen

13 Aug 2022 10:11 UTC

53 points

7 comments 3 min read LW link

Using GPT-3 to augment human intelligence

Henrik Karlsson

10 Aug 2022 15:54 UTC

52 points

8 comments 18 min read LW link

(escapingflatland.substack.com)

Variational Bayesian methods

Ege Erdil

25 Aug 2022 20:49 UTC

52 points

2 comments 9 min read LW link

A Data limited future

Donald Hobson

6 Aug 2022 14:56 UTC

52 points

25 comments 2 min read LW link

Turbocharging

CFAR!Duncan

2 Aug 2022 0:01 UTC

52 points

5 comments 9 min read LW link

Announcing Squiggle: Early Access

ozziegooen

3 Aug 2022 19:48 UTC

51 points

7 comments 7 min read LW link

(forum.effectivealtruism.org)

General alignment properties

TurnTrout

8 Aug 2022 23:40 UTC

51 points

2 comments 1 min read LW link

Againstness

CFAR!Duncan

2 Aug 2022 19:29 UTC

50 points

8 comments 9 min read LW link

Polaris, Five-Second Versions, and Thought Lengths

CFAR!Duncan

1 Aug 2022 7:14 UTC

50 points

12 comments 8 min read LW link

On Car Seats as Contraception

Zvi

22 Aug 2022 14:10 UTC

49 points

15 comments 35 min read LW link

(thezvi.wordpress.com)

Six weeks doesn’t make a habit

lynettebye

6 Aug 2022 8:54 UTC

48 points

1 comment 3 min read LW link

AGI Timelines Are Mostly Not Strategically Relevant To Alignment

johnswentworth

23 Aug 2022 20:15 UTC

48 points

34 comments 1 min read LW link

The Shard Theory Alignment Scheme

David Udell

25 Aug 2022 4:52 UTC

47 points

32 comments 2 min read LW link

Gradient descent doesn’t select for inner search

Ivan Vendrov

13 Aug 2022 4:15 UTC

47 points

23 comments 4 min read LW link

Covid 8/18/22: CDC Admits Mistakes

Zvi

18 Aug 2022 14:30 UTC

46 points

9 comments 17 min read LW link

(thezvi.wordpress.com)

Proposal: Consider not using distance-direction-dimension words in abstract discussions

moridinamael

9 Aug 2022 20:44 UTC

46 points

18 comments 5 min read LW link

The Falling Drill

Screwtape

5 Aug 2022 0:08 UTC

46 points

3 comments 2 min read LW link

Review: Amusing Ourselves to Death

L Rudolf L

20 Aug 2022 21:13 UTC

44 points

7 comments 16 min read LW link 1 review

(www.strataoftheworld.com)

Volunteer to host a meetup!

mingyuan

21 Aug 2022 22:43 UTC

44 points

1 comment 1 min read LW link

The Dumbest Possible Gets There First

Artaxerxes

13 Aug 2022 10:20 UTC

44 points

7 comments 2 min read LW link

The Solomonoff prior is malign. It’s not a big deal.

Charlie Steiner

25 Aug 2022 8:25 UTC

43 points

9 comments 7 min read LW link