An ML interpretation of Shard Theory

beren · Jan 3, 2023, 8:30 PM
39 points
5 comments · 4 min read · LW link

Talking to God

abramdemski · Jan 3, 2023, 8:14 PM
30 points
7 comments · 2 min read · LW link

My Advice for Incoming SERI MATS Scholars

Johannes C. Mayer · Jan 3, 2023, 7:25 PM
58 points
6 comments · 4 min read · LW link

Touch reality as soon as possible (when doing machine learning research)

LawrenceC · Jan 3, 2023, 7:11 PM
117 points
9 comments · 8 min read · LW link · 1 review

Kolb’s: an approach to consciously get better at anything

jacquesthibs · Jan 3, 2023, 6:16 PM
12 points
1 comment · 6 min read · LW link

[Question] {M|Im|Am}oral Mazes—any large-scale counterexamples?

Dagon · Jan 3, 2023, 4:43 PM
24 points
4 comments · 1 min read · LW link

Effectively self-studying over the Internet

libai · Jan 3, 2023, 4:23 PM
11 points
1 comment · 4 min read · LW link

Set-like mathematics in type theory

Thomas Kehrenberg · Jan 3, 2023, 2:33 PM
5 points
1 comment · 13 min read · LW link

Monthly Roundup #2

Zvi · Jan 3, 2023, 12:50 PM
23 points
3 comments · 23 min read · LW link
(thezvi.wordpress.com)

Whisper’s Wild Implications

Ollie J · Jan 3, 2023, 12:17 PM
19 points
6 comments · 5 min read · LW link

How to eat potato chips while typing

KatjaGrace · Jan 3, 2023, 11:50 AM
45 points
12 comments · 1 min read · LW link
(worldspiritsockpuppet.com)

[Question] I have thousands of copies of HPMOR in Russian. How to use them with the most impact?

Mikhail Samin · Jan 3, 2023, 10:21 AM
26 points
3 comments · 1 min read · LW link

Is recursive self-alignment possible?

No77e · Jan 3, 2023, 9:15 AM
5 points
5 comments · 1 min read · LW link

On the naturalistic study of the linguistic behavior of artificial intelligence

Bill Benzon · Jan 3, 2023, 9:06 AM
1 point
0 comments · 4 min read · LW link

SF Severe Weather Warning

stavros · Jan 3, 2023, 6:04 AM
3 points
3 comments · 1 min read · LW link
(news.ycombinator.com)

Status quo bias; System justification: Bias in Evaluating AGI X-Risks

Jan 3, 2023, 2:50 AM
−11 points
0 comments · 1 min read · LW link

200 COP in MI: Exploring Polysemanticity and Superposition

Neel Nanda · Jan 3, 2023, 1:52 AM
34 points
6 comments · 16 min read · LW link

The need for speed in web frameworks?

Adam Zerner · Jan 3, 2023, 12:06 AM
19 points
2 comments · 8 min read · LW link

[Simulators seminar sequence] #1 Background & shared assumptions

Jan 2, 2023, 11:48 PM
50 points
4 comments · 3 min read · LW link

Linear Algebra Done Right, Axler

David Udell · Jan 2, 2023, 10:54 PM
57 points
6 comments · 9 min read · LW link

MacArthur BART (Filk)

Gordon Seidoh Worley · Jan 2, 2023, 10:50 PM
10 points
1 comment · 1 min read · LW link

Knottiness

abramdemski · Jan 2, 2023, 10:13 PM
43 points
4 comments · 2 min read · LW link

[Question] Default Sort for Shortforms is Very Bad; How Do I Change It?

DragonGod · Jan 2, 2023, 9:50 PM
15 points
0 comments · 1 min read · LW link

MAKE IT BETTER (a poetic demonstration of the banality of GPT-3)

rogersbacon · Jan 2, 2023, 8:47 PM
7 points
2 comments · 5 min read · LW link

Review of “Make People Better”

Metacelsus · Jan 2, 2023, 8:30 PM
10 points
0 comments · 3 min read · LW link
(denovo.substack.com)

Preparing for Less Privacy

jefftk · Jan 2, 2023, 8:30 PM
23 points
1 comment · 2 min read · LW link
(www.jefftk.com)

Large language models can provide “normative assumptions” for learning human preferences

Stuart_Armstrong · Jan 2, 2023, 7:39 PM
29 points
12 comments · 3 min read · LW link

On the Importance of Open Sourcing Reward Models

elandgre · Jan 2, 2023, 7:01 PM
18 points
5 comments · 6 min read · LW link

Prediction Markets for Science

Vaniver · Jan 2, 2023, 5:55 PM
27 points
7 comments · 5 min read · LW link

Why don’t Rationalists use bidets?

Lakin · Jan 2, 2023, 5:42 PM
31 points
33 comments · 2 min read · LW link

Soft optimization makes the value target bigger

Jeremy Gillen · Jan 2, 2023, 4:06 PM
119 points
20 comments · 12 min read · LW link

Results from the AI testing hackathon

Esben Kran · Jan 2, 2023, 3:46 PM
13 points
0 comments · LW link

Induction heads—illustrated

CallumMcDougall · Jan 2, 2023, 3:35 PM
130 points
12 comments · 3 min read · LW link

Opportunity Cost Blackmail

adamShimi · Jan 2, 2023, 1:48 PM
70 points
11 comments · 2 min read · LW link
(epistemologicalvigilance.substack.com)

The ultimate limits of alignment will determine the shape of the long term future

beren · Jan 2, 2023, 12:47 PM
34 points
2 comments · 6 min read · LW link

A kernel of Lie theory

Alok Singh · Jan 2, 2023, 9:20 AM
−1 points
8 comments · 1 min read · LW link
(alok.github.io)

Belief Bias: Bias in Evaluating AGI X-Risks

Jan 2, 2023, 8:59 AM
−10 points
1 comment · 1 min read · LW link

Pacing: inexplicably good

KatjaGrace · Jan 2, 2023, 8:30 AM
39 points
7 comments · 1 min read · LW link
(worldspiritsockpuppet.com)

Alignment, Anger, and Love: Preparing for the Emergence of Superintelligent AI

tavurth · Jan 2, 2023, 6:16 AM
2 points
3 comments · 1 min read · LW link

[Question] How can total world index fund growth outpace money supply growth over the long term?

pando · Jan 2, 2023, 5:33 AM
4 points
7 comments · 1 min read · LW link

My first year in AI alignment

Alex_Altair · Jan 2, 2023, 1:28 AM
61 points
10 comments · 7 min read · LW link

Sail Over Mountains of ICE...

AnthonyRepetto · Jan 2, 2023, 12:27 AM
26 points
51 comments · 7 min read · LW link

Fun math facts about 2023

Adam Scherlis · Jan 1, 2023, 11:38 PM
9 points
6 comments · 1 min read · LW link

The Thingness of Things

TsviBT · Jan 1, 2023, 10:19 PM UTC
51 points
35 comments · 10 min read · LW link

Thoughts On Expanding the AI Safety Community: Benefits and Challenges of Outreach to Non-Technical Professionals

Yashvardhan Sharma · Jan 1, 2023, 7:21 PM UTC
4 points
4 comments · 7 min read · LW link

[Question] Would it be good or bad for the US military to get involved in AI risk?

Grant Demaree · Jan 1, 2023, 7:02 PM UTC
50 points
12 comments · 1 min read · LW link

Better New Year’s Goals through Aligning the Elephant and the Rider

moridinamael · Jan 1, 2023, 5:54 PM UTC
20 points
0 comments · 2 min read · LW link
(guildoftherose.org)

A Löbian argument pattern for implicit reasoning in natural language: Löbian party invitations

Andrew_Critch · Jan 1, 2023, 5:39 PM UTC
23 points
8 comments · 7 min read · LW link

woke offline, anti-woke online

Yair Halberstadt · Jan 1, 2023, 8:24 AM UTC
13 points
12 comments · 1 min read · LW link

Summary of 80k’s AI problem profile

JakubK · Jan 1, 2023, 7:30 AM UTC
7 points
0 comments · 5 min read · LW link
(forum.effectivealtruism.org)