The ultimate limits of alignment will determine the shape of the long term future

beren Jan 2, 2023, 12:47 PM

34 points

2 comments 6 min read LW link

A kernel of Lie theory

Alok Singh Jan 2, 2023, 9:20 AM

−1 points

8 comments 1 min read LW link

(alok.github.io)

Belief Bias: Bias in Evaluating AGI X-Risks

Remmelt and flandry19

Jan 2, 2023, 8:59 AM

−10 points

1 comment 1 min read LW link

Pacing: inexplicably good

KatjaGrace Jan 2, 2023, 8:30 AM

39 points

7 comments 1 min read LW link

(worldspiritsockpuppet.com)

Alignment, Anger, and Love: Preparing for the Emergence of Superintelligent AI

tavurth Jan 2, 2023, 6:16 AM

2 points

3 comments 1 min read LW link

[Question] How can total world index fund growth outpace money supply growth over the long term?

pando Jan 2, 2023, 5:33 AM

4 points

7 comments 1 min read LW link

My first year in AI alignment

Alex_Altair Jan 2, 2023, 1:28 AM

61 points

10 comments 7 min read LW link

Sail Over Mountains of ICE...

AnthonyRepetto Jan 2, 2023, 12:27 AM

26 points

51 comments 7 min read LW link

Fun math facts about 2023

Adam Scherlis Jan 1, 2023, 11:38 PM

9 points

6 comments 1 min read LW link

The Thingness of Things

TsviBT Jan 1, 2023, 10:19 PM

51 points

35 comments 10 min read LW link

Thoughts On Expanding the AI Safety Community: Benefits and Challenges of Outreach to Non-Technical Professionals

Yashvardhan Sharma Jan 1, 2023, 7:21 PM

4 points

4 comments 7 min read LW link

[Question] Would it be good or bad for the US military to get involved in AI risk?

Grant Demaree Jan 1, 2023, 7:02 PM

50 points

12 comments 1 min read LW link

Better New Year’s Goals through Aligning the Elephant and the Rider

moridinamael Jan 1, 2023, 5:54 PM

20 points

0 comments 2 min read LW link

(guildoftherose.org)

A Löbian argument pattern for implicit reasoning in natural language: Löbian party invitations

Andrew_Critch Jan 1, 2023, 5:39 PM

23 points

8 comments 7 min read LW link

woke offline, anti-woke online

Yair Halberstadt Jan 1, 2023, 8:24 AM

13 points

12 comments 1 min read LW link

Summary of 80k’s AI problem profile

JakubK Jan 1, 2023, 7:30 AM

7 points

0 comments 5 min read LW link

(forum.effectivealtruism.org)

What percent of people work in moral mazes?

Raemon Jan 1, 2023, 4:33 AM

21 points

9 comments 4 min read LW link

Recursive Middle Manager Hell

Raemon Jan 1, 2023, 4:33 AM

224 points

46 comments 11 min read LW link 1 review

Challenge to the notion that anything is (maybe) possible with AGI

Remmelt and flandry19

Jan 1, 2023, 3:57 AM

−27 points

4 comments 1 min read LW link

(mflb.com)

The Roots of Progress’s 2022 in review

jasoncrawford Jan 1, 2023, 2:54 AM

14 points

2 comments 15 min read LW link

(rootsofprogress.org)

Investing for a World Transformed by AI

PeterMcCluskey Jan 1, 2023, 2:47 AM

70 points

24 comments 6 min read LW link 1 review

(bayesianinvestor.com)

Why Free Will is NOT an illusion

Akira Pyinya Jan 1, 2023, 2:29 AM

0 points

16 comments 1 min read LW link

Localhost Security Messaging

jefftk Jan 1, 2023, 2:20 AM

7 points

3 comments 1 min read LW link

(www.jefftk.com)

0 and 1 aren’t probabilities

Alok Singh Jan 1, 2023, 12:09 AM

2 points

4 comments 2 min read LW link

(en.wikipedia.org)

‘simulator’ framing and confusions about LLMs

Beth Barnes Dec 31, 2022, 11:38 PM

104 points

11 comments 4 min read LW link

Monitoring devices I have loved

Elizabeth Dec 31, 2022, 10:51 PM

62 points

13 comments 3 min read LW link 1 review

Slack matters more than any outcome

Valentine Dec 31, 2022, 8:11 PM

164 points

56 comments 19 min read LW link 1 review

To Be Particular About Morality

AGO Dec 31, 2022, 7:58 PM

6 points

2 comments 7 min read LW link

200 COP in MI: Interpreting Algorithmic Problems

Neel Nanda Dec 31, 2022, 7:55 PM

33 points

2 comments 10 min read LW link

The Feeling of Idea Scarcity

johnswentworth Dec 31, 2022, 5:34 PM

251 points

23 comments 5 min read LW link 1 review

Curse of knowledge and Naive realism: Bias in Evaluating AGI X-Risks

Remmelt and flandry19

Dec 31, 2022, 1:33 PM

−7 points

1 comment 1 min read LW link

(www.lesswrong.com)

[Question] What career advice do you give to software engineers?

Antb Dec 31, 2022, 12:01 PM

15 points

4 comments 1 min read LW link

[Question] Are Mixture-of-Experts Transformers More Interpretable Than Dense Transformers?

simeon_c Dec 31, 2022, 11:34 AM

8 points

5 comments 1 min read LW link

[Question] In which cases can ChatGPT be used as an aid for thesis or scientific paper writing?

Bob Guran Dec 31, 2022, 10:50 AM

1 point

1 comment 1 min read LW link

Two Issues with Playing Chicken with the Universe

Chris_Leong Dec 31, 2022, 6:47 AM

4 points

4 comments 2 min read LW link

Extreme risk neutrality isn’t always wrong

Grant Demaree Dec 31, 2022, 4:05 AM

28 points

19 comments 4 min read LW link

Verbal parity: What is it and how to measure it? + an edited version of “Against John Searle, Gary Marcus, the Chinese Room thought experiment and its world”

philosophybear Dec 31, 2022, 3:46 AM

2 points

0 comments 11 min read LW link

Should AI systems have to identify themselves?

Darren McKee Dec 31, 2022, 2:57 AM

2 points

2 comments 1 min read LW link

[Question] What do you imagine, when you imagine “taking over the world”?

johnswentworth Dec 31, 2022, 1:04 AM

22 points

16 comments 1 min read LW link

A few thoughts on my self-study for alignment research

Thomas Kehrenberg Dec 30, 2022, 10:05 PM

6 points

0 comments 2 min read LW link

Christmas Microscopy

jefftk Dec 30, 2022, 9:10 PM

27 points

0 comments 1 min read LW link

(www.jefftk.com)

What “upside” of AI?

False Name Dec 30, 2022, 8:58 PM

0 points

5 comments 4 min read LW link

Evidence on recursive self-improvement from current ML

beren Dec 30, 2022, 8:53 PM

31 points

12 comments 6 min read LW link

[Question] Is ChatGPT TAI?

Amal 30 Dec 2022 19:44 UTC

14 points

5 comments 1 min read LW link

My thoughts on OpenAI’s alignment plan

Orpheus16 30 Dec 2022 19:33 UTC

55 points

3 comments 20 min read LW link

Beyond Rewards and Values: A Non-dualistic Approach to Universal Intelligence

Akira Pyinya 30 Dec 2022 19:05 UTC

10 points

4 comments 14 min read LW link

10 Years of LessWrong

SebastianG 30 Dec 2022 17:15 UTC

73 points

2 comments 4 min read LW link

Chatbots as a Publication Format

derek shiller 30 Dec 2022 14:11 UTC

6 points

6 comments 4 min read LW link

Human sexuality as an interesting case study of alignment

beren 30 Dec 2022 13:37 UTC

39 points

26 comments 3 min read LW link

The Twitter Files: Covid Edition

Zvi 30 Dec 2022 13:30 UTC

32 points

2 comments 10 min read LW link

(thezvi.wordpress.com)