Archive
Notes from the Qatar Center for Global Banking and Finance 3rd Annual Conference · PixelatedPenguin · Jul 7, 2023, 11:48 PM · 2 points · 0 comments · 1 min read · LW link
Introducing bayescalc.io · Adele Lopez · Jul 7, 2023, 4:11 PM · 115 points · 29 comments · 1 min read · LW link · (bayescalc.io)
Meetup Tip: Ask Attendees To Explain It · Screwtape · Jul 7, 2023, 4:08 PM · 10 points · 0 comments · 4 min read · LW link
Interpreting Modular Addition in MLPs · Bart Bussmann · Jul 7, 2023, 9:22 AM · 20 points · 0 comments · 6 min read · LW link
Internal independent review for language model agent alignment · Seth Herd · Jul 7, 2023, 6:54 AM · 55 points · 30 comments · 11 min read · LW link
[Question] Can LessWrong provide me with something I find obviously highly useful to my own practical life? · agrippa · Jul 7, 2023, 3:08 AM · 32 points · 4 comments · 1 min read · LW link
ask me about technology · bhauth · Jul 7, 2023, 2:03 AM · 23 points · 42 comments · 1 min read · LW link
Apparently, of the 195 Million the DoD allocated in University Research Funding Awards in 2022, more than half of them concerned AI or compute hardware research · mako yass · Jul 7, 2023, 1:20 AM · 41 points · 5 comments · 2 min read · LW link · (www.defense.gov)
What are the best non-LW places to read on alignment progress? · Raemon · Jul 7, 2023, 12:57 AM · 50 points · 14 comments · 1 min read · LW link
Two paths to win the AGI transition · Nathan Helm-Burger · Jul 6, 2023, 9:59 PM · 11 points · 8 comments · 4 min read · LW link
Empirical Evidence Against “The Longest Training Run” · NickGabs · Jul 6, 2023, 6:32 PM · 31 points · 0 comments · 14 min read · LW link
Progress Studies Fellowship looking for members · jay ram · Jul 6, 2023, 5:41 PM · 3 points · 0 comments · 1 min read · LW link
BOUNTY AVAILABLE: AI ethicists, what are your object-level arguments against AI notkilleveryoneism? · Peter Berggren · Jul 6, 2023, 5:32 PM · 18 points · 6 comments · 2 min read · LW link
Layering and Technical Debt in the Global Wayfinding Model · herschel · Jul 6, 2023, 5:30 PM · 14 points · 0 comments · 3 min read · LW link
Localizing goal misgeneralization in a maze-solving policy network · Jan Betley · Jul 6, 2023, 4:21 PM · 37 points · 2 comments · 7 min read · LW link
Jesse Hoogland on Developmental Interpretability and Singular Learning Theory · Michaël Trazzi · Jul 6, 2023, 3:46 PM · 42 points · 2 comments · 4 min read · LW link · (theinsideview.ai)
Progress links and tweets, 2023-07-06: Terraformer Mark One, Israeli water management, & more · jasoncrawford · Jul 6, 2023, 3:35 PM · 18 points · 4 comments · 2 min read · LW link · (rootsofprogress.org)
Towards Non-Panopticon AI Alignment · Logan Zoellner · Jul 6, 2023, 3:29 PM · 7 points · 0 comments · 3 min read · LW link
A Defense of Work on Mathematical AI Safety · Davidmanheim · Jul 6, 2023, 2:15 PM · 28 points · 13 comments · 3 min read · LW link · (forum.effectivealtruism.org)
Understanding the two most common mental health problems in the world · spencerg · Jul 6, 2023, 2:06 PM · 19 points · 0 comments · LW link
Announcing the EA Archive · Aaron Bergman · Jul 6, 2023, 1:49 PM · 13 points · 2 comments · LW link
Agency begets agency · Richard_Ngo · Jul 6, 2023, 1:08 PM · 60 points · 1 comment · 4 min read · LW link
AI #19: Hofstadter, Sutskever, Leike · Zvi · Jul 6, 2023, 12:50 PM · 60 points · 16 comments · 40 min read · LW link · (thezvi.wordpress.com)
Do you feel that AGI Alignment could be achieved in a Type 0 civilization? · Super AGI · Jul 6, 2023, 4:52 AM · −2 points · 1 comment · 1 min read · LW link
Open Thread—July 2023 · Ruby · Jul 6, 2023, 4:50 AM · 11 points · 35 comments · 1 min read · LW link
AI Intermediation · jefftk · Jul 6, 2023, 1:50 AM · 12 points · 0 comments · 1 min read · LW link · (www.jefftk.com)
Announcing Manifund Regrants · Austin Chen · Jul 5, 2023, 7:42 PM · 74 points · 8 comments · LW link
Infra-Bayesian Logic · harfe and Yegreg · Jul 5, 2023, 7:16 PM · 15 points · 2 comments · 1 min read · LW link
[Linkpost] Introducing Superalignment · beren · Jul 5, 2023, 6:23 PM · 175 points · 69 comments · 1 min read · LW link · (openai.com)
If you wish to make an apple pie, you must first become dictator of the universe · jasoncrawford · Jul 5, 2023, 6:14 PM · 27 points · 9 comments · 13 min read · LW link · (rootsofprogress.org)
An AGI kill switch with defined security properties · Peterpiper · Jul 5, 2023, 5:40 PM · −5 points · 6 comments · 1 min read · LW link
The risk-reward tradeoff of interpretability research · JustinShovelain and Elliot Mckernon · Jul 5, 2023, 5:05 PM · 15 points · 1 comment · 6 min read · LW link
(tentatively) Found 600+ Monosemantic Features in a Small LM Using Sparse Autoencoders · Logan Riggs · Jul 5, 2023, 4:49 PM · 60 points · 1 comment · 7 min read · LW link
[Question] What did AI Safety’s specific funding of AGI R&D labs lead to? · Remmelt · Jul 5, 2023, 3:51 PM · 3 points · 0 comments · LW link
AISN #13: An interdisciplinary perspective on AI proxy failures, new competitors to ChatGPT, and prompting language models to misbehave · Dan H · Jul 5, 2023, 3:33 PM · 13 points · 0 comments · LW link
Exploring Functional Decision Theory (FDT) and a modified version (ModFDT) · MiguelDev · Jul 5, 2023, 2:06 PM · 11 points · 11 comments · 15 min read · LW link
Optimized for Something other than Winning or: How Cricket Resists Moloch and Goodhart’s Law · A.H. · Jul 5, 2023, 12:33 PM · 53 points · 26 comments · 4 min read · LW link
Puffer-pope reality check · Neil · Jul 5, 2023, 9:27 AM · 20 points · 2 comments · 1 min read · LW link
Final Lightspeed Grants coworking/office hours before the application deadline · habryka · Jul 5, 2023, 6:03 AM · 13 points · 2 comments · 1 min read · LW link
MXR Talkbox Cap? · jefftk · Jul 5, 2023, 1:50 AM · 9 points · 0 comments · 1 min read · LW link · (www.jefftk.com)
“Reification” · herschel · Jul 5, 2023, 12:53 AM · 11 points · 4 comments · 2 min read · LW link
Dominant Assurance Contract Experiment #2: Berkeley House Dinners · Arjun Panickssery · Jul 5, 2023, 12:13 AM · 51 points · 8 comments · 1 min read · LW link · (arjunpanickssery.substack.com)
Three camps in AI x-risk discussions: My personal very oversimplified overview · Aryeh Englander · Jul 4, 2023, 8:42 PM · 21 points · 0 comments · LW link
Six (and a half) intuitions for SVD · CallumMcDougall · 4 Jul 2023 19:23 UTC · 71 points · 1 comment · 1 min read · LW link
Animal Weapons: Lessons for Humans in the Age of X-Risk · Damin Curtis · 4 Jul 2023 18:14 UTC · 4 points · 0 comments · 10 min read · LW link
Apocalypse Prepping—Concise SHTF guide to prepare for AGI doomsday · prepper · 4 Jul 2023 17:41 UTC · −7 points · 9 comments · 1 min read · LW link · (prepper.i2phides.me)
Ways I Expect AI Regulation To Increase Extinction Risk · 1a3orn · 4 Jul 2023 17:32 UTC · 226 points · 32 comments · 7 min read · LW link
AI labs’ statements on governance · Zach Stein-Perlman · 4 Jul 2023 16:30 UTC · 30 points · 0 comments · 36 min read · LW link
AIs teams will probably be more superintelligent than individual AIs · Robert_AIZI · 4 Jul 2023 14:06 UTC · 3 points · 1 comment · 2 min read · LW link · (aizi.substack.com)
What I Think About When I Think About History · Jacob G-W · 4 Jul 2023 14:02 UTC · 3 points · 4 comments · 3 min read · LW link · (g-w1.github.io)