[Question] Where’s the economic incentive for wokism coming from? · Valentine · Dec 8, 2022, 11:28 PM · 12 points · 105 comments · 1 min read · LW link
I Believe we are in a Hardware Overhang · nem · Dec 8, 2022, 11:18 PM · 8 points · 0 comments · 1 min read · LW link
Of pumpkins, the Falcon Heavy, and Groucho Marx: High-Level discourse structure in ChatGPT · Bill Benzon · Dec 8, 2022, 10:25 PM · 2 points · 0 comments · 8 min read · LW link
How Many Lives Does X-Risk Work Save From Nonexistence On Average? · Jordan Arel · Dec 8, 2022, 9:57 PM · 4 points · 5 comments · 14 min read · LW link
AI Safety Seems Hard to Measure · HoldenKarnofsky · Dec 8, 2022, 7:50 PM · 71 points · 6 comments · 14 min read · LW link · (www.cold-takes.com)
Playing shell games with definitions · weverka · Dec 8, 2022, 7:35 PM · 9 points · 3 comments · 1 min read · LW link
Notes on OpenAI’s alignment plan · Alex Flint · Dec 8, 2022, 7:13 PM · 40 points · 5 comments · 7 min read · LW link
Relevant to natural abstractions: Euclidean Symmetry Equivariant Machine Learning—Overview, Applications, and Open Questions · the gears to ascension · Dec 8, 2022, 6:01 PM · 8 points · 0 comments · 1 min read · LW link · (youtu.be)
I’ve started publishing the novel I wrote to promote EA · Timothy Underwood · Dec 8, 2022, 5:30 PM · 10 points · 3 comments · 1 min read · LW link
Neural networks biased towards geometrically simple functions? · DavidHolmes · Dec 8, 2022, 4:16 PM · 16 points · 2 comments · 3 min read · LW link
If Wentworth is right about natural abstractions, it would be bad for alignment · Wuschel Schulz · Dec 8, 2022, 3:19 PM · 29 points · 5 comments · 4 min read · LW link
Covid 12/8/22: Another Winter Wave · Zvi · Dec 8, 2022, 2:40 PM · 23 points · 8 comments · 11 min read · LW link · (thezvi.wordpress.com)
Why I’m Sceptical of Foom · DragonGod · Dec 8, 2022, 10:01 AM · 20 points · 36 comments · 3 min read · LW link
Take 7: You should talk about “the human’s utility function” less. · Charlie Steiner · Dec 8, 2022, 8:14 AM · 50 points · 22 comments · 2 min read · LW link
Machine Learning Consent · jefftk · Dec 8, 2022, 3:50 AM · 38 points · 14 comments · 3 min read · LW link · (www.jefftk.com)
Riffing on the agent type · Quinn · Dec 8, 2022, 12:19 AM · 21 points · 3 comments · 4 min read · LW link
[Question] Looking for ideas of public assets (stocks, funds, ETFs) that I can invest in to have a chance at profiting from the mass adoption and commercialization of AI technology · Annapurna · Dec 7, 2022, 10:35 PM · 15 points · 9 comments · 1 min read · LW link
A Fallibilist Wordview · Toni MUENDEL · Dec 7, 2022, 8:59 PM · −13 points · 2 comments · 13 min read · LW link
Thoughts on AGI organizations and capabilities work · Rob Bensinger and So8res · Dec 7, 2022, 7:46 PM · 102 points · 17 comments · 5 min read · LW link
How to Think About Climate Models and How to Improve Them · clans · Dec 7, 2022, 7:37 PM · 7 points · 0 comments · 2 min read · LW link · (locationtbd.home.blog)
The novelty quotient · River Lewis · Dec 7, 2022, 5:16 PM · 4 points · 7 comments · 2 min read · LW link · (heytraveler.substack.com)
ChatGPT: “An error occurred. If this issue persists...” · Bill Benzon · Dec 7, 2022, 3:41 PM · 5 points · 11 comments · 3 min read · LW link
Take 6: CAIS is actually Orwellian. · Charlie Steiner · Dec 7, 2022, 1:50 PM · 12 points · 8 comments · 2 min read · LW link
Peter Thiel on Technological Stagnation and Out of Touch Rationalists · Matt Goldenberg · Dec 7, 2022, 1:15 PM · 9 points · 26 comments · 1 min read · LW link · (youtu.be)
[Link] Wavefunctions: from Linear Algebra to Spinors · sen · Dec 7, 2022, 12:44 PM · 11 points · 12 comments · 1 min read · LW link · (paperclip.substack.com)
Why I like Zulip instead of Slack or Discord · Alok Singh · Dec 7, 2022, 9:28 AM · 31 points · 10 comments · 1 min read · LW link
Bioweapons, and ChatGPT (another vulnerability story) · Beeblebrox · Dec 7, 2022, 7:27 AM · −5 points · 0 comments · 2 min read · LW link
Where to be an AI Safety Professor · scasper · Dec 7, 2022, 7:09 AM · 31 points · 12 comments · 2 min read · LW link
[Question] Are there any tools to convert LW sequences to PDF or any other file format? · quetzal_rainbow · Dec 7, 2022, 5:28 AM · 2 points · 2 comments · 1 min read · LW link
Manifold Markets community meetup · Sinclair Chen · Dec 7, 2022, 3:25 AM · 4 points · 0 comments · 1 min read · LW link
“Attention Passengers”: not for Signs · jefftk · Dec 7, 2022, 2:00 AM · 27 points · 10 comments · 1 min read · LW link · (www.jefftk.com)
[ASoT] Probability Infects Concepts it Touches · Ulisse Mini · Dec 7, 2022, 1:48 AM · 10 points · 4 comments · 1 min read · LW link
Simple Way to Prevent Power-Seeking AI · research_prime_space · Dec 7, 2022, 12:26 AM · 12 points · 1 comment · 1 min read · LW link
In defense of probably wrong mechanistic models · evhub · Dec 6, 2022, 11:24 PM · 55 points · 10 comments · 2 min read · LW link
AI Safety in a Vulnerable World: Requesting Feedback on Preliminary Thoughts · Jordan Arel · Dec 6, 2022, 10:35 PM · 4 points · 2 comments · 3 min read · LW link
ChatGPT and the Human Race · Ben Reilly · Dec 6, 2022, 9:38 PM · 6 points · 1 comment · 3 min read · LW link
[Question] How do finite factored sets compare with phase space? · Alex_Altair · Dec 6, 2022, 8:05 PM · 15 points · 1 comment · 1 min read · LW link
Mesa-Optimizers via Grokking · orthonormal · Dec 6, 2022, 8:05 PM · 36 points · 4 comments · 6 min read · LW link
Using GPT-Eliezer against ChatGPT Jailbreaking · Stuart_Armstrong and rgorman · Dec 6, 2022, 7:54 PM · 170 points · 85 comments · 9 min read · LW link
The Parable of the Crimp · Phosphorous · Dec 6, 2022, 6:41 PM · 11 points · 3 comments · 3 min read · LW link
The Categorical Imperative Obscures · Gordon Seidoh Worley · Dec 6, 2022, 5:48 PM · 17 points · 17 comments · 2 min read · LW link
MIRI’s “Death with Dignity” in 60 seconds. · Cleo Nardo · Dec 6, 2022, 5:18 PM · 58 points · 4 comments · 1 min read · LW link
Things roll downhill · awenonian · Dec 6, 2022, 3:27 PM · 19 points · 0 comments · 1 min read · LW link
EA & LW Forums Weekly Summary (28th Nov − 4th Dec 22′) · Zoe Williams · Dec 6, 2022, 9:38 AM · 10 points · 1 comment · LW link
Take 5: Another problem for natural abstractions is laziness. · Charlie Steiner · Dec 6, 2022, 7:00 AM · 31 points · 4 comments · 3 min read · LW link
Verification Is Not Easier Than Generation In General · johnswentworth · Dec 6, 2022, 5:20 AM · 73 points · 27 comments · 1 min read · LW link
Shh, don’t tell the AI it’s likely to be evil · naterush · Dec 6, 2022, 3:35 AM · 19 points · 9 comments · 1 min read · LW link
[Question] What are the major underlying divisions in AI safety? · Chris_Leong · Dec 6, 2022, 3:28 AM UTC · 5 points · 2 comments · 1 min read · LW link
[Link] Why I’m optimistic about OpenAI’s alignment approach · janleike · Dec 5, 2022, 10:51 PM UTC · 98 points · 15 comments · 1 min read · LW link · (aligned.substack.com)
The No Free Lunch theorem for dummies · Steven Byrnes · Dec 5, 2022, 9:46 PM UTC · 37 points · 16 comments · 3 min read · LW link