Talk: “AI Would Be A Lot Less Alarming If We Understood Agents”

johnswentworth 17 Dec 2023 23:46 UTC

58 points

3 comments 1 min read LW link

(www.youtube.com)

∀: a story

Richard_Ngo 17 Dec 2023 22:42 UTC

42 points

1 comment 8 min read LW link

(www.narrativeark.xyz)

Reviving a 2015 MacBook

jefftk 17 Dec 2023 21:00 UTC

13 points

0 comments 1 min read LW link

(www.jefftk.com)

A Common-Sense Case For Mutually-Misaligned AGIs Allying Against Humans

Thane Ruthenis 17 Dec 2023 20:28 UTC

29 points

7 comments 11 min read LW link

The Limits of Artificial Consciousness: A Biology-Based Critique of Chalmers’ Fading Qualia Argument

Štěpán Los 17 Dec 2023 19:11 UTC

−6 points

9 comments 17 min read LW link

What makes teaching math special

Viliam 17 Dec 2023 14:15 UTC

45 points

27 comments 11 min read LW link

The predictive power of dissipative adaptation

dr_s 17 Dec 2023 14:01 UTC

59 points

16 comments 19 min read LW link

Linkpost: Francesca v Harvard

Linch 17 Dec 2023 6:18 UTC

5 points

5 comments 2 min read LW link

(www.francesca-v-harvard.org)

The Serendipity of Density

jefftk 17 Dec 2023 3:50 UTC

40 points

4 comments 1 min read LW link

(www.jefftk.com)

Bounty: Diverse hard tasks for LLM agents

Beth Barnes and Megan Kinniment

17 Dec 2023 1:04 UTC

49 points

31 comments 16 min read LW link

2022 (and All Time) Posts by Pingback Count

Raemon 16 Dec 2023 21:17 UTC

53 points

14 comments 6 min read LW link

“Humanity vs. AGI” Will Never Look Like “Humanity vs. AGI” to Humanity

Thane Ruthenis 16 Dec 2023 20:08 UTC

192 points

34 comments 5 min read LW link

A visual analogy for text generation by LLMs?

Bill Benzon 16 Dec 2023 17:58 UTC

3 points

0 comments 1 min read LW link

cold aluminum for medicine

bhauth 16 Dec 2023 14:38 UTC

42 points

4 comments 4 min read LW link

(www.bhauth.com)

Scalable Oversight and Weak-to-Strong Generalization: Compatible approaches to the same problem

Ansh Radhakrishnan, Buck, ryan_greenblatt and Fabien Roger

16 Dec 2023 5:49 UTC

79 points

4 comments 6 min read LW link 1 review

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

leogao 16 Dec 2023 5:39 UTC

56 points

5 comments 1 min read LW link

Pope Francis shares thoughts on responsible AI development

corruptedCatapillar 16 Dec 2023 3:49 UTC

15 points

4 comments 1 min read LW link

(www.vatican.va)

Current AIs Provide Nearly No Data Relevant to AGI Alignment

Thane Ruthenis 15 Dec 2023 20:16 UTC

132 points

157 comments 8 min read LW link 1 review

Agglomeration of ‘Ought’

DavidAndresBloom 15 Dec 2023 19:07 UTC

1 point

1 comment 11 min read LW link

Predicting the future with the power of the Internet (and pissing off Rob Miles)

Writer 15 Dec 2023 17:37 UTC

23 points

9 comments 4 min read LW link

(youtu.be)

Progress links digest, 2023-12-15: Vitalik on d/acc, $100M+ in prizes, and more

jasoncrawford 15 Dec 2023 15:52 UTC

20 points

0 comments 12 min read LW link

(rootsofprogress.org)

“AI Alignment” is a Dangerously Overloaded Term

Roko 15 Dec 2023 14:34 UTC

108 points

100 comments 3 min read LW link

[Valence series] 4. Valence & Social Status (deprecated)

Steven Byrnes 15 Dec 2023 14:24 UTC

35 points

19 comments 11 min read LW link

Contra Scott on Abolishing the FDA

Maxwell Tabarrok 15 Dec 2023 14:00 UTC

46 points

3 comments 6 min read LW link

(maximumprogress.substack.com)

[Paper] Trajectories through semantic spaces in schizophrenia and the relationship to ripple bursts

bvbvbvbvbvbvbvbvbvbvbv 15 Dec 2023 13:37 UTC

3 points

0 comments 1 min read LW link

(www.pnas.org)

Takeaways from a Mechanistic Interpretability project on “Forbidden Facts”

Tony Wang, Miles Wang and kaivu

15 Dec 2023 11:05 UTC

34 points

8 comments 10 min read LW link

Refinement of Active Inference agency ontology

Roman Leventov 15 Dec 2023 9:31 UTC

17 points

0 comments 5 min read LW link

(arxiv.org)

EU policymakers reach an agreement on the AI Act

tlevin 15 Dec 2023 6:02 UTC

78 points

7 comments 7 min read LW link

Where Does Adversarial Pressure Come From?

quetzal_rainbow 14 Dec 2023 22:31 UTC

17 points

1 comment 2 min read LW link

Epoch wise critical periods, and singular learning theory

Garrett Baker 14 Dec 2023 20:55 UTC

16 points

1 comment 5 min read LW link

OpenAI Superalignment: Weak-to-strong generalization

Dalmert 14 Dec 2023 19:47 UTC

25 points

3 comments 1 min read LW link

(openai.com)

Applications for EA Global are still open!

Eli_Nathan 14 Dec 2023 19:10 UTC

1 point

0 comments 1 min read LW link

Personal Development System: Winning Repeatedly and Growing Effectively With The BIG4

Paul Rohde 14 Dec 2023 18:49 UTC

13 points

0 comments 33 min read LW link

(blog.paul-rohde.com)

Introducing The ‘From Big Ideas To Real-World Results’: A Series for Effective Personal Development

Paul Rohde 14 Dec 2023 18:49 UTC

13 points

1 comment 8 min read LW link

(blog.paul-rohde.com)

Talking With People Who Speak to Congressional Staffers about AI risk

Eneasz 14 Dec 2023 17:55 UTC

32 points

0 comments 1 min read LW link

(www.thebayesianconspiracy.com)

Bayesian Injustice

Kevin Dorst 14 Dec 2023 15:44 UTC

125 points

10 comments 6 min read LW link

(kevindorst.substack.com)

AI #42: The Wrong Answer

Zvi 14 Dec 2023 14:50 UTC

69 points

6 comments 54 min read LW link

(thezvi.wordpress.com)

Some for-profit AI alignment org ideas

Eric Ho 14 Dec 2023 14:23 UTC

93 points

19 comments 9 min read LW link

Mapping the semantic void: Strange goings-on in GPT embedding spaces

mwatkins 14 Dec 2023 13:10 UTC

115 points

31 comments 14 min read LW link

Categorical Organization in Memory: ChatGPT Organizes the 665 Topic Tags from My New Savanna Blog

Bill Benzon 14 Dec 2023 13:02 UTC

0 points

6 comments 2 min read LW link

Moral Mountains

Biff Wiff 14 Dec 2023 10:40 UTC

8 points

10 comments 2 min read LW link

Update on Chinese IQ-related gene panels

Lao Mein 14 Dec 2023 10:12 UTC

70 points

7 comments 1 min read LW link

Red Line Ashmont Train is Now Approaching

jefftk 14 Dec 2023 2:50 UTC

23 points

2 comments 1 min read LW link

(www.jefftk.com)

Various AI doom pathways (and how likely they are)

Logan Zoellner 14 Dec 2023 0:54 UTC

1 point

1 comment 4 min read LW link

(midwitalignment.substack.com)

Are There Examples of Overhang for Other Technologies?

Jeffrey Heninger 13 Dec 2023 21:48 UTC

59 points

50 comments 11 min read LW link

(blog.aiimpacts.org)

Is being sexy for your homies?

Valentine 13 Dec 2023 20:37 UTC

210 points

100 comments 14 min read LW link 2 reviews

How bad is chlorinated water?

bhauth 13 Dec 2023 18:00 UTC

41 points

18 comments 3 min read LW link

(www.bhauth.com)

[Question] Suggestions for net positive LLM research

Cole Wyeth 13 Dec 2023 17:29 UTC

13 points

6 comments 1 min read LW link

AI Control: Improving Safety Despite Intentional Subversion

Buck, Fabien Roger, ryan_greenblatt and Kshitij Sachan

13 Dec 2023 15:51 UTC

241 points

26 comments 10 min read LW link 4 reviews

The Busy Bee Brain

Bill Benzon 13 Dec 2023 13:10 UTC

11 points

0 comments 6 min read LW link