All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 161718 19 20 21 22 23 24 25 26 27 28 29 30 31

Donation offsets for ChatGPT Plus subscriptions

Jeffrey Ladish16 Mar 2023 23:29 UTC

53 points

3 comments3 min readLW link

The algorithm isn’t doing X, it’s just doing Y.

Cleo Nardo16 Mar 2023 23:28 UTC

53 points

43 comments5 min readLW link

Announcing the ERA Cambridge Summer Research Fellowship

Nandini Shiralkar16 Mar 2023 22:57 UTC

11 points

0 comments3 min readLW link

Gradual takeoff, fast failure

Max H16 Mar 2023 22:02 UTC

15 points

4 comments5 min readLW link

Conceding a short timelines bet early

Matthew Barnett16 Mar 2023 21:49 UTC

136 points

17 comments1 min readLW link

Attribution Patching: Activation Patching At Industrial Scale

Neel Nanda16 Mar 2023 21:44 UTC

45 points

10 comments58 min readLW link

(www.neelnanda.io)

[Question] Will 2023 be the last year you can write short stories and receive most of the intellectual credit for writing them?

lc16 Mar 2023 21:36 UTC

20 points

12 comments1 min readLW link

Is it a bad idea to pay for GPT-4?

nem16 Mar 2023 20:49 UTC

24 points

8 comments1 min readLW link

Are AI developers playing with fire?

marcusarvan16 Mar 2023 19:12 UTC

6 points

0 comments10 min readLW link

[Question] When will computer programming become an unskilled job (if ever)?

lc16 Mar 2023 17:46 UTC

36 points

55 comments1 min readLW link

[Appendix] Natural Abstractions: Key Claims, Theorems, and Critiques

LawrenceC, Erik Jenner and Leon Lang

16 Mar 2023 16:38 UTC

48 points

0 comments13 min readLW link

Natural Abstractions: Key Claims, Theorems, and Critiques

LawrenceC, Leon Lang and Erik Jenner

16 Mar 2023 16:37 UTC

251 points

26 comments45 min readLW link 3 reviews

On the Crisis at Silicon Valley Bank

Zvi16 Mar 2023 15:50 UTC

59 points

9 comments41 min readLW link

(thezvi.wordpress.com)

[Question] What literature on the neuroscience of decision making can you recommend?

quetzal_rainbow16 Mar 2023 15:32 UTC

3 points

0 comments1 min readLW link

[Question] What organizations other than Conjecture have (esp. public) info-hazard policies?

David Scott Krueger16 Mar 2023 14:49 UTC

20 points

1 comment1 min readLW link

[Question] Is there an analysis of the common consideration that splitting an AI lab into two (e.g. the founding of Anthropic) speeds up the development of TAI and therefore increases AI x-risk?

tchauvin16 Mar 2023 14:16 UTC

4 points

0 comments1 min readLW link

A chess game against GPT-4

Rafael Harth16 Mar 2023 14:05 UTC

24 points

23 comments1 min readLW link

ChatGPT getting out of the box

qbolec16 Mar 2023 13:47 UTC

6 points

3 comments1 min readLW link

[Question] Are funds (such as the Long-Term Future Fund) willing to give extra money to AI safety researchers to balance for the opportunity cost of taking an “industry” job?

Malleable_shape16 Mar 2023 11:54 UTC

5 points

1 comment1 min readLW link

Three levels of exploration and intelligence

Q Home16 Mar 2023 10:55 UTC

9 points

3 comments21 min readLW link

Here, have a calmness video

Kaj_Sotala16 Mar 2023 10:00 UTC

112 points

15 comments2 min readLW link

(www.youtube.com)

Wittgenstein’s Language Games and the Critique of the Natural Abstraction Hypothesis

Chris_Leong16 Mar 2023 7:56 UTC

17 points

20 comments2 min readLW link

Red-teaming AI-safety concepts that rely on science metaphors

catubc16 Mar 2023 6:52 UTC

5 points

4 comments5 min readLW link

[ASoT] Some thoughts on human abstractions

leogao16 Mar 2023 5:42 UTC

43 points

4 comments5 min readLW link

How I Run Solstice, Step by Step

maia16 Mar 2023 3:23 UTC

46 points

0 comments16 min readLW link

(particularvirtue.blogspot.com)

GPT-4 Multiplication Competition

dandelion416 Mar 2023 3:09 UTC

11 points

7 comments1 min readLW link

Want to predict/explain/control the output of GPT-4? Then learn about the world, not about transformers.

Cleo Nardo16 Mar 2023 3:08 UTC

107 points

26 comments5 min readLW link

[Question] Is it worth avoiding detailed discussions of expectations about agency levels of powerful AIs?

David Johnston16 Mar 2023 3:06 UTC

11 points

6 comments2 min readLW link

Why self-improvement?

Adam Zerner16 Mar 2023 2:49 UTC

12 points

4 comments2 min readLW link

[Question] What is a good comprehensive examination of risks near the Ohio train derailment?

1a3orn16 Mar 2023 0:21 UTC

17 points

0 comments1 min readLW link

Write a Book?

jefftk16 Mar 2023 0:10 UTC

45 points

7 comments3 min readLW link

(www.jefftk.com)

AI Safety − 7 months of discussion in 17 minutes

Zoe Williams15 Mar 2023 23:41 UTC

25 points

0 comments17 min readLW link

How well did Manifold predict GPT-4?

David Chee15 Mar 2023 23:19 UTC

49 points

5 comments2 min readLW link

80k podcast episode on sentience in AI systems

Robbo15 Mar 2023 20:19 UTC

15 points

0 comments13 min readLW link

(80000hours.org)

GPT-4: What we (I) know about it

Robert_AIZI15 Mar 2023 20:12 UTC

40 points

29 comments12 min readLW link

(aizi.substack.com)

Grading on Word Count

Max Niederman15 Mar 2023 19:17 UTC

50 points

11 comments1 min readLW link

(maxniederman.com)

How to Escape From the Simulation (Seeds of Science)

rogersbacon15 Mar 2023 18:46 UTC

1 point

1 comment1 min readLW link

Towards understanding-based safety evaluations

evhub15 Mar 2023 18:18 UTC

164 points

16 comments5 min readLW link

Newcomb’s paradox complete solution.

Augs SMSHacks15 Mar 2023 17:56 UTC

−12 points

13 comments3 min readLW link

The Ethics of Eating Seafood: A Rational Discussion

Jonathan Grant15 Mar 2023 17:55 UTC

1 point

2 comments2 min readLW link

ChatGPT (and now GPT4) is very easily distracted from its rules

dmcs15 Mar 2023 17:55 UTC

180 points

42 comments1 min readLW link

[Question] What happened to the OpenPhil OpenAI board seat?

ChristianKl15 Mar 2023 16:59 UTC

65 points

2 comments1 min readLW link

Nokens: A potential method of investigating glitch tokens

Hoagy15 Mar 2023 16:23 UTC

21 points

0 comments4 min readLW link

The epistemic virtue of scope matching

jasoncrawford15 Mar 2023 13:31 UTC

85 points

15 comments5 min readLW link

(rootsofprogress.org)

POC || GTFO culture as partial antidote to alignment wordcelism

lc15 Mar 2023 10:21 UTC

162 points

17 comments7 min readLW link 2 reviews

Just Pivot to AI: The secret is out

sapphire15 Mar 2023 6:26 UTC

16 points

1 comment2 min readLW link

Bushels Are Commodity-Specific

jefftk15 Mar 2023 2:00 UTC

29 points

0 comments2 min readLW link

(www.jefftk.com)

ARC tests to see if GPT-4 can escape human control; GPT-4 failed to do so

Christopher King15 Mar 2023 0:29 UTC

116 points

22 comments2 min readLW link

Shutting Down the Lightcone Offices

habryka and Ben Pace

14 Mar 2023 22:47 UTC

339 points

103 comments17 min readLW link 2 reviews

[Question] What are some ideas that LessWrong has reinvented?

RomanHauksson14 Mar 2023 22:27 UTC

4 points

13 comments1 min readLW link