All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

AllJanFeb Mar Apr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 293031

Visualisation of Probability Mass

brookJan 25, 2023, 3:09 PM

7 points

0 comments LW link

When Did EA Start?

jefftkJan 25, 2023, 2:30 PM

37 points

2 comments2 min readLW link

(www.jefftk.com)

Some Thoughts on AI Art

abramdemskiJan 25, 2023, 2:18 PM

74 points

20 comments7 min readLW link

Quick thoughts on “scalable oversight” / “super-human feedback” research

David Scott Krueger (formerly: capybaralet)Jan 25, 2023, 12:55 PM

27 points

9 comments2 min readLW link

Sapir-Whorf for Rationalists

Duncan Sabien (Inactive)Jan 25, 2023, 7:58 AM

155 points

49 comments19 min readLW link

ChatGPT vs the 2-4-6 Task

cwilluJan 25, 2023, 6:59 AM

20 points

4 comments3 min readLW link

Pessimistic Shard Theory

Garrett BakerJan 25, 2023, 12:59 AM

72 points

13 comments3 min readLW link

Thatcher’s Axiom

Edward P. KöningsJan 24, 2023, 10:35 PM

10 points

22 comments4 min readLW link

[Question] Some questions about free will compatibilism

Asking QuestionsJan 24, 2023, 9:54 PM

3 points

21 comments6 min readLW link

Alexander and Yudkowsky on AGI goals

Scott Alexander and Eliezer Yudkowsky

Jan 24, 2023, 9:09 PM

179 points

53 comments26 min readLW link 1 review

[Question] Is _The Age of AI: And Our Human Future_ worth reading

jmhJan 24, 2023, 9:05 PM

4 points

0 comments1 min readLW link

Inverse Scaling Prize: Second Round Winners

Ian McKenzie, Sam Bowman and Ethan Perez

Jan 24, 2023, 8:12 PM

58 points

17 comments15 min readLW link

ChatGPT intimates a tantalizing future; its core LLM is organized on multiple levels; and it has broken the idea of thinking.

Bill BenzonJan 24, 2023, 7:05 PM

5 points

0 comments5 min readLW link

How-to Transformer Mechanistic Interpretability—in 50 lines of code or less!

StefanHexJan 24, 2023, 6:45 PM

47 points

5 comments13 min readLW link

The Cabinet of Wikipedian Curiosities

Sam EnrightJan 24, 2023, 6:22 PM

36 points

5 comments6 min readLW link

(samenright.com)

Explanatory Parsimony, Explanatory Superfluousness and Uselessness of Newton’s First Law

Jimdrix_HendriJan 24, 2023, 5:21 PM

−2 points

7 comments2 min readLW link

Guesstimate: Why and how to use it

brook and chanamessinger

Jan 24, 2023, 4:24 PM

8 points

0 comments3 min readLW link

(forum.effectivealtruism.org)

GWWC Pledge History

jefftkJan 24, 2023, 3:50 PM

15 points

0 comments3 min readLW link

(www.jefftk.com)

Gradient hacking is extremely difficult

berenJan 24, 2023, 3:45 PM

170 points

22 comments5 min readLW link

[Question] What sci-fi books are most relevant to a future with transformative AI?

sidJan 24, 2023, 3:30 PM

2 points

9 comments1 min readLW link

Grant-making in EA should consider peer-reviewing grant applications along the public-sector model

Ben SmithJan 24, 2023, 3:01 PM

0 points

3 comments LW link

“Endgame safety” for AGI

Steven ByrnesJan 24, 2023, 2:15 PM

85 points

10 comments6 min readLW link

Thoughts on hardware / compute requirements for AGI

Steven ByrnesJan 24, 2023, 2:03 PM

63 points

32 comments24 min readLW link

Parameter Scaling Comes for RL, Maybe

1a3ornJan 24, 2023, 1:55 PM

100 points

3 comments14 min readLW link

How to find cool things in a new place

Sam F. BrownJan 24, 2023, 11:20 AM

12 points

0 comments1 min readLW link

[Crosspost] ACX 2022 Prediction Contest Results

Scott Alexander, Eric Neyman and Sam Marks

Jan 24, 2023, 6:56 AM

48 points

6 comments8 min readLW link

The Human-AI Reflective Equilibrium

Allison DuettmannJan 24, 2023, 1:32 AM

22 points

1 comment24 min readLW link

“Status” can be corrosive; here’s how I handle it

Orpheus16Jan 24, 2023, 1:25 AM

71 points

8 comments6 min readLW link

[Question] What area of the digital domain seems safe from AI in the next 5-10 years?

Adrien ChauvetJan 24, 2023, 1:16 AM

11 points

14 comments1 min readLW link

Some of my disagreements with List of Lethalities

TurnTroutJan 24, 2023, 12:25 AM

70 points

7 comments10 min readLW link

Rounding Someone Off

David UdellJan 24, 2023, 12:03 AM

25 points

0 comments5 min readLW link

Life Has a Cruel Symmetry

philhJan 23, 2023, 11:40 PM

21 points

5 comments11 min readLW link

(reasonableapproximation.net)

Highlights and Prizes from the 2021 Review Phase

RaemonJan 23, 2023, 9:41 PM

38 points

14 comments21 min readLW link

[Question] AI safety milestones?

Zach Stein-PerlmanJan 23, 2023, 9:00 PM

7 points

5 comments1 min readLW link

[Question] A post-quantum theory of classical gravity?

Logan ZoellnerJan 23, 2023, 8:39 PM

13 points

5 comments1 min readLW link

Meals For Unclear Dietary Restrictions

jefftkJan 23, 2023, 8:00 PM

17 points

3 comments2 min readLW link

(www.jefftk.com)

It’s ok

stratospherJan 23, 2023, 6:11 PM

1 point

0 comments2 min readLW link

Experimenting with beta.character.ai

svemirskiJan 23, 2023, 5:31 PM

−3 points

5 comments1 min readLW link

This week in fashion

JanJan 23, 2023, 5:23 PM

29 points

7 comments7 min readLW link

(universalprior.substack.com)

Movie Review: Megan

ZviJan 23, 2023, 12:50 PM

60 points

19 comments24 min readLW link

(thezvi.wordpress.com)

[Question] Has private AGI research made independent safety research ineffective already? What should we do about this?

Roman LeventovJan 23, 2023, 7:36 AM

43 points

5 comments5 min readLW link

Deconfusing “Capabilities vs. Alignment”

RobertMJan 23, 2023, 4:46 AM

27 points

7 comments2 min readLW link

What a compute-centric framework says about AI takeoff speeds

Tom DavidsonJan 23, 2023, 4:02 AM

188 points

30 comments16 min readLW link 1 review

Philly Rat Fest

LoganChipkinJan 23, 2023, 4:01 AM

9 points

0 comments1 min readLW link

EA & LW Forum Weekly Summary (16th − 22nd Jan ’23)

Zoe WilliamsJan 23, 2023, 3:46 AM

13 points

0 comments LW link

Consider Trying Dictation

jefftkJan 22, 2023, 10:50 PM

23 points

10 comments2 min readLW link

(www.jefftk.com)

Emotional attachment to AIs opens doors to problems

Igor IvanovJan 22, 2023, 8:28 PM

20 points

10 comments4 min readLW link

What fills a vacuum?

Logan KiellerJan 22, 2023, 7:25 PM

11 points

6 comments2 min readLW link

Gemini modeling

TsviBTJan 22, 2023, 2:28 PM

12 points

8 comments11 min readLW link

Large language models learn to represent the world

gjmJan 22, 2023, 1:10 PM

101 points

20 comments3 min readLW link 1 review