All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 202320242025 2026

All Jan Feb Mar Apr May Jun Jul AugSepOct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 242526 27 28 29 30

An open response to Wittkotter and Yampolskiy

Donald Hobson24 Sep 2024 22:27 UTC

8 points

0 comments4 min readLW link

A Path out of Insufficient Views

Unreal24 Sep 2024 20:00 UTC

45 points

68 comments9 min readLW link 1 review

How to give effectively to US Dems

Hauke Hillebrandt24 Sep 2024 14:38 UTC

2 points

0 comments1 min readLW link

(www.slowboring.com)

[Question] How do you follow AI (safety) news?

peter_hartree24 Sep 2024 13:58 UTC

4 points

2 comments1 min readLW link

Instruction Following without Instruction Tuning

Bogdan Ionut Cirstea24 Sep 2024 13:49 UTC

17 points

0 comments1 min readLW link

(arxiv.org)

Book Review: On the Edge: The Gamblers

Zvi24 Sep 2024 11:50 UTC

35 points

1 comment89 min readLW link

(thezvi.wordpress.com)

Editing at the Take Level

jefftk24 Sep 2024 11:30 UTC

12 points

1 comment1 min readLW link

(www.jefftk.com)

Using LLM’s for AI Foundation research and the Simple Solution assumption

Donald Hobson24 Sep 2024 11:00 UTC

5 points

0 comments2 min readLW link

When to join a respectability cascade

B Jacobs24 Sep 2024 7:54 UTC

10 points

1 comment2 min readLW link

(bobjacobs.substack.com)

Sampling Effects on Strategic Behavior in Supervised Learning Models

Phil Blandfort24 Sep 2024 7:44 UTC

1 point

0 comments6 min readLW link

In Praise of the Beatitudes

Rob Ennals24 Sep 2024 5:08 UTC

9 points

8 comments3 min readLW link

(messyprogress.substack.com)

[Question] What are the best arguments for/against AIs being “slightly ‘nice’”?

Raemon24 Sep 2024 2:00 UTC

103 points

62 comments31 min readLW link

Struggling like a Shadowmoth

Raemon24 Sep 2024 0:47 UTC

197 points

46 comments7 min readLW link 1 review

Bounty for Evidence on Some of Palisade Research’s Beliefs

benwr and Jeffrey Ladish

23 Sep 2024 20:01 UTC

46 points

4 comments2 min readLW link

Predicting Influenza Abundance in Wastewater Metagenomic Sequencing Data

jefftk23 Sep 2024 17:25 UTC

27 points

0 comments4 min readLW link

(naobservatory.org)

A primer on ML in antibody engineering

Abhishaike Mahajan23 Sep 2024 17:03 UTC

11 points

0 comments25 min readLW link

(www.owlposting.com)

[Question] On the subject of in-house large language models versus implementing frontier models

Annapurna23 Sep 2024 15:00 UTC

7 points

1 comment1 min readLW link

A basic systems architecture for AI agents that do autonomous research

Buck23 Sep 2024 13:58 UTC

190 points

17 comments8 min readLW link 1 review

Book Review: On the Edge: The Fundamentals

Zvi23 Sep 2024 13:40 UTC

64 points

3 comments31 min readLW link

(thezvi.wordpress.com)

Switching to a 4GB SD

jefftk23 Sep 2024 11:20 UTC

11 points

1 comment1 min readLW link

(www.jefftk.com)

Model evals for dangerous capabilities

Zach Stein-Perlman23 Sep 2024 11:00 UTC

52 points

11 comments3 min readLW link

Foundations—Why Britain has stagnated [crosspost]

Nathan Young23 Sep 2024 10:43 UTC

23 points

1 comment57 min readLW link

(ukfoundations.co)

Boons and banes

dkl923 Sep 2024 6:18 UTC

7 points

0 comments2 min readLW link

(dkl9.net)

The Sun is big, but superintelligences will not spare Earth a little sunlight

Eliezer Yudkowsky23 Sep 2024 3:39 UTC

218 points

143 comments13 min readLW link

My 10-year retrospective on trying SSRIs

Kaj_Sotala22 Sep 2024 20:30 UTC

81 points

9 comments2 min readLW link

(kajsotala.fi)

Making Eggs Without Ovaries

Niko_McCarty and Metacelsus

22 Sep 2024 17:44 UTC

58 points

5 comments16 min readLW link

(www.asimov.press)

Becket First

jefftk22 Sep 2024 17:10 UTC

9 points

0 comments2 min readLW link

(www.jefftk.com)

On the Role of Proto-Languages

adamShimi22 Sep 2024 16:50 UTC

54 points

1 comment4 min readLW link

(epistemologicalfascinations.substack.com)

I’m creating a deep dive podcast episode about the original Leverage Research—would you like to take part?

spencerg22 Sep 2024 14:03 UTC

38 points

2 comments1 min readLW link

Who Feels More Alone?

marvinscheffold22 Sep 2024 11:54 UTC

−8 points

2 comments39 min readLW link

Another argument against utility-centric alignment paradigms

Fiora Starlight22 Sep 2024 7:28 UTC

69 points

39 comments8 min readLW link

My hopes for YouCongress.com

Nathan Helm-Burger22 Sep 2024 3:20 UTC

14 points

3 comments4 min readLW link

How Often Does Taking Away Options Help?

niplav21 Sep 2024 21:52 UTC

21 points

8 comments2 min readLW link

A Rational Company—Seeking Advisors

AlignmentOptimizer21 Sep 2024 19:51 UTC

0 points

1 comment1 min readLW link

Seeking mentorship

Kevin Afachao21 Sep 2024 16:54 UTC

5 points

0 comments1 min readLW link

Applications of Chaos: Saying No (with Hastings Greer)

Elizabeth21 Sep 2024 16:30 UTC

50 points

16 comments2 min readLW link

(acesounderglass.com)

Investigating an insurance-for-AI startup

L Rudolf L and Florence Hinder

21 Sep 2024 15:29 UTC

72 points

0 comments16 min readLW link

(www.strataoftheworld.com)

An Unmeasured Song of Measurement

jan Sijan21 Sep 2024 15:08 UTC

−3 points

0 comments4 min readLW link

Should Sports Betting Be Banned?

Maxwell Tabarrok21 Sep 2024 14:13 UTC

18 points

2 comments4 min readLW link

(www.maximum-progress.com)

Work with me on agent foundations: independent fellowship

Alex_Altair21 Sep 2024 13:59 UTC

59 points

8 comments4 min readLW link

Electric Mandola

jefftk21 Sep 2024 13:40 UTC

9 points

0 comments1 min readLW link

(www.jefftk.com)

Glitch Token Catalog - (Almost) a Full Clear

Lao Mein21 Sep 2024 12:22 UTC

38 points

3 comments37 min readLW link

The Other Existential Crisis

James Stephen Brown21 Sep 2024 1:16 UTC

9 points

24 comments2 min readLW link

Apply to MATS 7.0!

Ryan Kidd and K Richards

21 Sep 2024 0:23 UTC

32 points

0 comments5 min readLW link

Moscow – ACX Meetups Everywhere Fall 2024

red-hara20 Sep 2024 23:03 UTC

−1 points

0 comments1 min readLW link

Validating / finding alignment-relevant concepts using neural data

Bogdan Ionut Cirstea20 Sep 2024 21:12 UTC

7 points

0 comments1 min readLW link

(docs.google.com)

Augmenting Statistical Models with Natural Language Parameters

jsteinhardt20 Sep 2024 18:30 UTC

34 points

0 comments8 min readLW link

(bounded-regret.ghost.io)

Fun With The Tabula Muris (Senis)

sarahconstantin20 Sep 2024 18:20 UTC

25 points

0 comments8 min readLW link

(sarahconstantin.substack.com)

My Critique of Effective Altruism

Dylan Price20 Sep 2024 17:41 UTC

−10 points

8 comments4 min readLW link

[Question] Why be moral if we can’t measure how moral we are? Is it even possible to measure morality?

Oliver Kuperman20 Sep 2024 17:40 UTC

−2 points

0 comments3 min readLW link