Archive: July 2023
Taboo Truth · Tomás B. · Jul 8, 2023, 11:23 PM · 36 points · 16 comments · 2 min read · LW link
“View” · herschel · Jul 8, 2023, 11:19 PM · 6 points · 0 comments · 2 min read · LW link
[Question] H5N1. Just how bad is the situation? · Q Home · Jul 8, 2023, 10:09 PM · 16 points · 8 comments · 1 min read · LW link
A Two-Part System for Practical Self-Care · Jonathan Moregård · Jul 8, 2023, 9:23 PM · 11 points · 0 comments · 3 min read · LW link · (honestliving.substack.com)
Really Strong Features Found in Residual Stream · Logan Riggs · Jul 8, 2023, 7:40 PM · 69 points · 6 comments · 2 min read · LW link
Eight Strategies for Tackling the Hard Part of the Alignment Problem · scasper · Jul 8, 2023, 6:55 PM · 42 points · 11 comments · 7 min read · LW link
“Concepts of Agency in Biology” (Okasha, 2023) - Brief Paper Summary · Nora_Ammann · Jul 8, 2023, 6:22 PM · 40 points · 3 comments · 7 min read · LW link
Blanchard’s Dangerous Idea and the Plight of the Lucid Crossdreamer · Zack_M_Davis · Jul 8, 2023, 6:03 PM · 38 points · 135 comments · 72 min read · LW link · (unremediatedgender.space)
Continuous Adversarial Quality Assurance: Extending RLHF and Constitutional AI · Benaya Koren · Jul 8, 2023, 5:32 PM · 6 points · 0 comments · 9 min read · LW link
Commentless downvoting is not a good way to fight infohazards · DirectedEvolution · Jul 8, 2023, 5:29 PM · 6 points · 9 comments · 3 min read · LW link
[Question] Why does anxiety (?) make me dumb? · TeaTieAndHat · Jul 8, 2023, 4:13 PM · 18 points · 14 comments · 3 min read · LW link
Economic Time Bomb: An Overlooked Employment Bubble Threatening the US Economy · Glenn Clayton · Jul 8, 2023, 3:19 PM · 4 points · 10 comments · 6 min read · LW link
What is everyone doing in AI governance · Igor Ivanov · Jul 8, 2023, 3:16 PM · 12 points · 0 comments · 5 min read · LW link
LLM misalignment can probably be found without manual prompt engineering · ProgramCrafter · Jul 8, 2023, 2:35 PM · 1 point · 0 comments · 1 min read · LW link
You must not fool yourself, and you are the easiest person to fool · Richard_Ngo · Jul 8, 2023, 2:05 PM · 35 points · 5 comments · 4 min read · LW link
Fixed Point: a love story · Richard_Ngo · Jul 8, 2023, 1:56 PM · 99 points · 2 comments · 7 min read · LW link
Announcing AI Alignment workshop at the ALIFE 2023 conference · rorygreig · Jul 8, 2023, 1:52 PM · 16 points · 0 comments · 1 min read · LW link · (humanvaluesandartificialagency.com)
3D Printed Talkbox Cap · jefftk · Jul 8, 2023, 1:00 PM · 9 points · 0 comments · 1 min read · LW link · (www.jefftk.com)
Writing this post as rationality case study · Ben Amitay · Jul 8, 2023, 12:24 PM · 10 points · 8 comments · 2 min read · LW link
[Question] What Does LessWrong/EA Think of Human Intelligence Augmentation as of mid-2023? · lukemarks · Jul 8, 2023, 11:42 AM · 84 points · 28 comments · 2 min read · LW link
[Question] Request for feedback—infohazards in testing LLMs for causal reasoning? · DirectedEvolution · Jul 8, 2023, 9:01 AM · 16 points · 0 comments · 2 min read · LW link
Views on when AGI comes and on strategy to reduce existential risk · TsviBT · Jul 8, 2023, 9:00 AM · 133 points · 61 comments · 14 min read · LW link · 1 review
Weekday Evening Beach Picnics · jefftk · Jul 8, 2023, 2:20 AM · 2 points · 4 comments · 1 min read · LW link · (www.jefftk.com)
ACI#4: Seed AI is the new Perpetual Motion Machine · Akira Pyinya · Jul 8, 2023, 1:17 AM · −1 points · 0 comments · 6 min read · LW link
[Question] Links to discussions on social equilibrium and human value after (aligned) super-AI? · Michael Tontchev · Jul 8, 2023, 1:01 AM · 7 points · 3 comments · 1 min read · LW link
Notes from the Qatar Center for Global Banking and Finance 3rd Annual Conference · PixelatedPenguin · Jul 7, 2023, 11:48 PM · 2 points · 0 comments · 1 min read · LW link
Introducing bayescalc.io · Adele Lopez · Jul 7, 2023, 4:11 PM · 115 points · 29 comments · 1 min read · LW link · (bayescalc.io)
Meetup Tip: Ask Attendees To Explain It · Screwtape · Jul 7, 2023, 4:08 PM · 10 points · 0 comments · 4 min read · LW link
Interpreting Modular Addition in MLPs · Bart Bussmann · Jul 7, 2023, 9:22 AM · 20 points · 0 comments · 6 min read · LW link
Internal independent review for language model agent alignment · Seth Herd · Jul 7, 2023, 6:54 AM · 55 points · 30 comments · 11 min read · LW link
[Question] Can LessWrong provide me with something I find obviously highly useful to my own practical life? · agrippa · Jul 7, 2023, 3:08 AM · 32 points · 4 comments · 1 min read · LW link
ask me about technology · bhauth · Jul 7, 2023, 2:03 AM · 23 points · 42 comments · 1 min read · LW link
Apparently, of the 195 Million the DoD allocated in University Research Funding Awards in 2022, more than half of them concerned AI or compute hardware research · mako yass · Jul 7, 2023, 1:20 AM · 41 points · 5 comments · 2 min read · LW link · (www.defense.gov)
What are the best non-LW places to read on alignment progress? · Raemon · Jul 7, 2023, 12:57 AM · 50 points · 14 comments · 1 min read · LW link
Two paths to win the AGI transition · Nathan Helm-Burger · Jul 6, 2023, 9:59 PM · 11 points · 8 comments · 4 min read · LW link
Empirical Evidence Against “The Longest Training Run” · NickGabs · Jul 6, 2023, 6:32 PM · 31 points · 0 comments · 14 min read · LW link
Progress Studies Fellowship looking for members · jay ram · Jul 6, 2023, 5:41 PM · 3 points · 0 comments · 1 min read · LW link
BOUNTY AVAILABLE: AI ethicists, what are your object-level arguments against AI notkilleveryoneism? · Peter Berggren · Jul 6, 2023, 5:32 PM · 18 points · 6 comments · 2 min read · LW link
Layering and Technical Debt in the Global Wayfinding Model · herschel · Jul 6, 2023, 5:30 PM · 14 points · 0 comments · 3 min read · LW link
Localizing goal misgeneralization in a maze-solving policy network · Jan Betley · Jul 6, 2023, 4:21 PM · 37 points · 2 comments · 7 min read · LW link
Jesse Hoogland on Developmental Interpretability and Singular Learning Theory · Michaël Trazzi · Jul 6, 2023, 3:46 PM · 42 points · 2 comments · 4 min read · LW link · (theinsideview.ai)
Progress links and tweets, 2023-07-06: Terraformer Mark One, Israeli water management, & more · jasoncrawford · Jul 6, 2023, 3:35 PM · 18 points · 4 comments · 2 min read · LW link · (rootsofprogress.org)
Towards Non-Panopticon AI Alignment · Logan Zoellner · Jul 6, 2023, 3:29 PM · 7 points · 0 comments · 3 min read · LW link
A Defense of Work on Mathematical AI Safety · Davidmanheim · Jul 6, 2023, 2:15 PM · 28 points · 13 comments · 3 min read · LW link · (forum.effectivealtruism.org)
Understanding the two most common mental health problems in the world · spencerg · Jul 6, 2023, 2:06 PM · 19 points · 0 comments · LW link
Announcing the EA Archive · Aaron Bergman · Jul 6, 2023, 1:49 PM · 13 points · 2 comments · LW link
Agency begets agency · Richard_Ngo · Jul 6, 2023, 1:08 PM · 60 points · 1 comment · 4 min read · LW link
AI #19: Hofstadter, Sutskever, Leike · Zvi · Jul 6, 2023, 12:50 PM · 60 points · 16 comments · 40 min read · LW link · (thezvi.wordpress.com)
Do you feel that AGI Alignment could be achieved in a Type 0 civilization? · Super AGI · Jul 6, 2023, 4:52 AM · −2 points · 1 comment · 1 min read · LW link
Open Thread—July 2023 · Ruby · Jul 6, 2023, 4:50 AM · 11 points · 35 comments · 1 min read · LW link