OpenAI Launches Superalignment Taskforce · Zvi · Jul 11, 2023, 1:00 PM · 150 points · 40 comments · 49 min read · LW link (thezvi.wordpress.com)
Critiquing Risks From Learned Optimization, and Avoiding Cached Theories · ProofBySonnet · Jul 11, 2023, 11:38 AM · 1 point · 0 comments · 6 min read · LW link
[UPDATE: deadline extended to July 24!] New wind in rationality’s sails: Applications for Epistea Residency 2023 are now open · Jana Meixnerová and Irena Kotíková · Jul 11, 2023, 11:02 AM · 80 points · 7 comments · 3 min read · LW link
Two Hot Takes about Quine · Charlie Steiner · Jul 11, 2023, 6:42 AM · 17 points · 0 comments · 2 min read · LW link
Disincentivizing deception in mesa optimizers with Model Tampering · martinkunev · Jul 11, 2023, 12:44 AM · 3 points · 0 comments · 2 min read · LW link
Drawn Out: a story · Richard_Ngo · Jul 11, 2023, 12:08 AM · 82 points · 2 comments · 8 min read · LW link
Definitions are about efficiency and consistency with common language. · Nacruno96 · Jul 10, 2023, 11:46 PM · 1 point · 0 comments · 4 min read · LW link
Reframing Evolution—An information wavefront traveling through time · Joshua Clancy · Jul 10, 2023, 10:36 PM · 1 point · 0 comments · 5 min read · LW link (midflip.org)
GPT-7: The Tale of the Big Computer (An Experimental Story) · Justin Bullock · Jul 10, 2023, 8:22 PM · 4 points · 4 comments · 5 min read · LW link
Cost-effectiveness of professional field-building programs for AI safety research · Dan H · Jul 10, 2023, 6:28 PM · 8 points · 5 comments · 18 min read · LW link
Cost-effectiveness of student programs for AI safety research · Dan H · Jul 10, 2023, 6:28 PM · 15 points · 2 comments · 15 min read · LW link
Modeling the impact of AI safety field-building programs · Dan H · Jul 10, 2023, 6:27 PM · 21 points · 0 comments · 7 min read · LW link
I think Michael Bailey’s dismissal of my autogynephilia questions for Scott Alexander and Aella makes very little sense · tailcalled · Jul 10, 2023, 5:39 PM · 46 points · 45 comments · 2 min read · LW link
Incentives from a causal perspective · tom4everitt, James Fox, RyanCarey, mattmacdermott, sbenthall and Jonathan Richens · Jul 10, 2023, 5:16 PM · 27 points · 0 comments · 6 min read · LW link
Is the Endowment Effect Due to Incomparability? · Kevin Dorst · Jul 10, 2023, 4:26 PM · 21 points · 10 comments · 7 min read · LW link (kevindorst.substack.com)
Frontier AI Regulation · Zach Stein-Perlman · Jul 10, 2023, 2:30 PM · 21 points · 4 comments · 8 min read · LW link (arxiv.org)
Why is it so hard to change people’s minds? Well, imagine if it wasn’t... · Celarix · Jul 10, 2023, 1:55 PM · 6 points · 9 comments · 6 min read · LW link
Consider Joining the UK Foundation Model Taskforce · Zvi · Jul 10, 2023, 1:50 PM · 105 points · 12 comments · 1 min read · LW link (thezvi.wordpress.com)
“Reframing Superintelligence” + LLMs + 4 years · Eric Drexler · Jul 10, 2023, 1:42 PM · 118 points · 9 comments · 12 min read · LW link
Open-minded updatelessness · Nicolas Macé, JesseClifton and SMK · Jul 10, 2023, 11:08 AM · 66 points · 21 comments · 12 min read · LW link
Consciousness as a conflationary alliance term for intrinsically valued internal experiences · Andrew_Critch · Jul 10, 2023, 8:09 AM · 212 points · 54 comments · 11 min read · LW link · 2 reviews
The world where LLMs are possible · Ape in the coat · Jul 10, 2023, 8:00 AM · 20 points · 10 comments · 3 min read · LW link
The virtue of determination · Richard_Ngo · Jul 10, 2023, 5:11 AM · 73 points · 6 comments · 4 min read · LW link
Some reasons to not say “Doomer” · Ruby · Jul 9, 2023, 9:05 PM · 46 points · 18 comments · 4 min read · LW link
The Seeker’s Game – Vignettes from the Bay · Yulia · Jul 9, 2023, 7:32 PM · 141 points · 19 comments · 16 min read · LW link
[Question] Why have exposure notification apps been (mostly) discontinued? · VipulNaik · Jul 9, 2023, 7:07 PM · 10 points · 5 comments · 2 min read · LW link
[Question] The Necessity of Privacy: A Condition for Social Change and Experimentation? · Blake · Jul 9, 2023, 6:42 PM · −8 points · 1 comment · 1 min read · LW link
Attempting to Deconstruct “Real” · herschel · Jul 9, 2023, 4:40 PM · 21 points · 23 comments · 2 min read · LW link
Quick proposal: Decision market regrantor using manifund (please improve) · Nathan Young · Jul 9, 2023, 12:49 PM · 10 points · 5 comments · 5 min read · LW link
[Question] Where are the people building AGI in the non-dumb way? · Johannes C. Mayer · Jul 9, 2023, 11:39 AM · 10 points · 19 comments · 2 min read · LW link
[Question] What to read on the “informal multi-world model”? · mishka · Jul 9, 2023, 4:48 AM · 13 points · 23 comments · 1 min read · LW link
Whether LLMs “understand” anything is mostly a terminological dispute · RobertM · Jul 9, 2023, 3:31 AM · 10 points · 1 comment · 1 min read · LW link
“View” · herschel · Jul 8, 2023, 11:19 PM · 6 points · 0 comments · 2 min read · LW link
[Question] H5N1. Just how bad is the situation? · Q Home · Jul 8, 2023, 10:09 PM · 16 points · 8 comments · 1 min read · LW link
A Two-Part System for Practical Self-Care · Jonathan Moregård · Jul 8, 2023, 9:23 PM · 11 points · 0 comments · 3 min read · LW link (honestliving.substack.com)
Really Strong Features Found in Residual Stream · Logan Riggs · Jul 8, 2023, 7:40 PM · 69 points · 6 comments · 2 min read · LW link
Eight Strategies for Tackling the Hard Part of the Alignment Problem · scasper · Jul 8, 2023, 6:55 PM · 42 points · 11 comments · 7 min read · LW link
“Concepts of Agency in Biology” (Okasha, 2023) - Brief Paper Summary · Nora_Ammann · Jul 8, 2023, 6:22 PM · 40 points · 3 comments · 7 min read · LW link
Blanchard’s Dangerous Idea and the Plight of the Lucid Crossdreamer · Zack_M_Davis · Jul 8, 2023, 6:03 PM · 38 points · 135 comments · 72 min read · LW link (unremediatedgender.space)
Continuous Adversarial Quality Assurance: Extending RLHF and Constitutional AI · Benaya Koren · Jul 8, 2023, 5:32 PM · 6 points · 0 comments · 9 min read · LW link
Commentless downvoting is not a good way to fight infohazards · DirectedEvolution · Jul 8, 2023, 5:29 PM · 6 points · 9 comments · 3 min read · LW link
[Question] Why does anxiety (?) make me dumb? · TeaTieAndHat · Jul 8, 2023, 4:13 PM · 18 points · 14 comments · 3 min read · LW link
Economic Time Bomb: An Overlooked Employment Bubble Threatening the US Economy · Glenn Clayton · Jul 8, 2023, 3:19 PM · 4 points · 10 comments · 6 min read · LW link
What is everyone doing in AI governance · Igor Ivanov · Jul 8, 2023, 3:16 PM · 12 points · 0 comments · 5 min read · LW link
LLM misalignment can probably be found without manual prompt engineering · ProgramCrafter · Jul 8, 2023, 2:35 PM · 1 point · 0 comments · 1 min read · LW link
You must not fool yourself, and you are the easiest person to fool · Richard_Ngo · Jul 8, 2023, 2:05 PM · 35 points · 5 comments · 4 min read · LW link
Fixed Point: a love story · Richard_Ngo · Jul 8, 2023, 1:56 PM · 100 points · 2 comments · 7 min read · LW link
Announcing AI Alignment workshop at the ALIFE 2023 conference · rorygreig · Jul 8, 2023, 1:52 PM · 16 points · 0 comments · 1 min read · LW link (humanvaluesandartificialagency.com)
3D Printed Talkbox Cap · jefftk · Jul 8, 2023, 1:00 PM · 9 points · 0 comments · 1 min read · LW link (www.jefftk.com)
Writing this post as rationality case study · Ben Amitay · Jul 8, 2023, 12:24 PM · 10 points · 8 comments · 2 min read · LW link