All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 171819 20 21 22 23 24 25 26 27 28 29 30 31

[Question] Why Carl Jung is not popular in AI Alignment Research?

MiguelDev17 Mar 2023 23:56 UTC

−3 points

13 comments1 min readLW link

[Event] Join Metaculus for Forecast Friday on March 24th!

ChristianWilliams17 Mar 2023 22:47 UTC

3 points

0 comments1 min readLW link

(www.eventbrite.com)

Meetup Tip: The Next Meetup Will Be. . .

Screwtape17 Mar 2023 22:04 UTC

45 points

0 comments3 min readLW link

The Power of High Speed Stupidity

robotelvis17 Mar 2023 21:41 UTC

33 points

6 comments9 min readLW link 1 review

(messyprogress.substack.com)

Retrospective on ‘GPT-4 Predictions’ After the Release of GPT-4

Stephen McAleese17 Mar 2023 18:34 UTC

26 points

6 comments6 min readLW link

“Carefully Bootstrapped Alignment” is organizationally hard

Raemon17 Mar 2023 18:00 UTC

268 points

23 comments11 min readLW link 1 review

[Question] Are nested jailbreaks inevitable?

judson17 Mar 2023 17:43 UTC

1 point

0 comments1 min readLW link

Ethical AI investments?

Justin wilson17 Mar 2023 17:43 UTC

25 points

15 comments1 min readLW link

New economic system for AI era

ksme sho17 Mar 2023 17:42 UTC

−1 points

1 comment5 min readLW link

On some first principles of intelligence

Macheng_Shen17 Mar 2023 17:42 UTC

−14 points

0 comments4 min readLW link

Essential Behaviorism Terms

Rivka17 Mar 2023 17:41 UTC

16 points

1 comment10 min readLW link

Vector semantics and “Kubla Khan,” Part 2

Bill Benzon17 Mar 2023 16:32 UTC

2 points

0 comments3 min readLW link

Super-Luigi = Luigi + (Luigi—Waluigi)

Alexei17 Mar 2023 15:27 UTC

16 points

9 comments1 min readLW link

Survey on intermediate goals in AI governance

MichaelA and MaxRa

17 Mar 2023 13:12 UTC

25 points

3 comments1 min readLW link

GPT-4 solves Gary Marcus-induced flubs

JakubK17 Mar 2023 6:40 UTC

57 points

29 comments2 min readLW link

(docs.google.com)

[Question] Are the LLM “intelligence” tests publicly available for humans to take?

nim17 Mar 2023 0:09 UTC

7 points

13 comments1 min readLW link

Donation offsets for ChatGPT Plus subscriptions

Jeffrey Ladish16 Mar 2023 23:29 UTC

53 points

3 comments3 min readLW link

The algorithm isn’t doing X, it’s just doing Y.

Cleo Nardo16 Mar 2023 23:28 UTC

53 points

43 comments5 min readLW link

Announcing the ERA Cambridge Summer Research Fellowship

Nandini Shiralkar16 Mar 2023 22:57 UTC

11 points

0 comments3 min readLW link

Gradual takeoff, fast failure

Max H16 Mar 2023 22:02 UTC

15 points

4 comments5 min readLW link

Conceding a short timelines bet early

Matthew Barnett16 Mar 2023 21:49 UTC

134 points

17 comments1 min readLW link

Attribution Patching: Activation Patching At Industrial Scale

Neel Nanda16 Mar 2023 21:44 UTC

45 points

10 comments58 min readLW link

(www.neelnanda.io)

[Question] Will 2023 be the last year you can write short stories and receive most of the intellectual credit for writing them?

lc16 Mar 2023 21:36 UTC

20 points

12 comments1 min readLW link

Is it a bad idea to pay for GPT-4?

nem16 Mar 2023 20:49 UTC

24 points

8 comments1 min readLW link

Are AI developers playing with fire?

marcusarvan16 Mar 2023 19:12 UTC

6 points

0 comments10 min readLW link

[Question] When will computer programming become an unskilled job (if ever)?

lc16 Mar 2023 17:46 UTC

36 points

55 comments1 min readLW link

[Appendix] Natural Abstractions: Key Claims, Theorems, and Critiques

LawrenceC, Erik Jenner and Leon Lang

16 Mar 2023 16:38 UTC

48 points

0 comments13 min readLW link

Natural Abstractions: Key Claims, Theorems, and Critiques

LawrenceC, Leon Lang and Erik Jenner

16 Mar 2023 16:37 UTC

249 points

26 comments45 min readLW link 3 reviews

On the Crisis at Silicon Valley Bank

Zvi16 Mar 2023 15:50 UTC

59 points

9 comments41 min readLW link

(thezvi.wordpress.com)

[Question] What literature on the neuroscience of decision making can you recommend?

quetzal_rainbow16 Mar 2023 15:32 UTC

3 points

0 comments1 min readLW link

[Question] What organizations other than Conjecture have (esp. public) info-hazard policies?

David Scott Krueger (formerly: capybaralet)16 Mar 2023 14:49 UTC

20 points

1 comment1 min readLW link

[Question] Is there an analysis of the common consideration that splitting an AI lab into two (e.g. the founding of Anthropic) speeds up the development of TAI and therefore increases AI x-risk?

tchauvin16 Mar 2023 14:16 UTC

4 points

0 comments1 min readLW link

A chess game against GPT-4

Rafael Harth16 Mar 2023 14:05 UTC

24 points

23 comments1 min readLW link

ChatGPT getting out of the box

qbolec16 Mar 2023 13:47 UTC

6 points

3 comments1 min readLW link

[Question] Are funds (such as the Long-Term Future Fund) willing to give extra money to AI safety researchers to balance for the opportunity cost of taking an “industry” job?

Malleable_shape16 Mar 2023 11:54 UTC

5 points

1 comment1 min readLW link

Three levels of exploration and intelligence

Q Home16 Mar 2023 10:55 UTC

9 points

3 comments21 min readLW link

Here, have a calmness video

Kaj_Sotala16 Mar 2023 10:00 UTC

113 points

15 comments2 min readLW link

(www.youtube.com)

Wittgenstein’s Language Games and the Critique of the Natural Abstraction Hypothesis

Chris_Leong16 Mar 2023 7:56 UTC

17 points

20 comments2 min readLW link

Red-teaming AI-safety concepts that rely on science metaphors

catubc16 Mar 2023 6:52 UTC

5 points

4 comments5 min readLW link

[ASoT] Some thoughts on human abstractions

leogao16 Mar 2023 5:42 UTC

42 points

4 comments5 min readLW link

How I Run Solstice, Step by Step

maia16 Mar 2023 3:23 UTC

46 points

0 comments16 min readLW link

(particularvirtue.blogspot.com)

GPT-4 Multiplication Competition

dandelion416 Mar 2023 3:09 UTC

11 points

7 comments1 min readLW link

Want to predict/explain/control the output of GPT-4? Then learn about the world, not about transformers.

Cleo Nardo16 Mar 2023 3:08 UTC

107 points

26 comments5 min readLW link

[Question] Is it worth avoiding detailed discussions of expectations about agency levels of powerful AIs?

David Johnston16 Mar 2023 3:06 UTC

11 points

6 comments2 min readLW link

Why self-improvement?

Adam Zerner16 Mar 2023 2:49 UTC

12 points

4 comments2 min readLW link

[Question] What is a good comprehensive examination of risks near the Ohio train derailment?

1a3orn16 Mar 2023 0:21 UTC

17 points

0 comments1 min readLW link

Write a Book?

jefftk16 Mar 2023 0:10 UTC

45 points

7 comments3 min readLW link

(www.jefftk.com)

AI Safety − 7 months of discussion in 17 minutes

Zoe Williams15 Mar 2023 23:41 UTC

25 points

0 comments17 min readLW link

How well did Manifold predict GPT-4?

David Chee15 Mar 2023 23:19 UTC

49 points

5 comments2 min readLW link

80k podcast episode on sentience in AI systems

Robbo15 Mar 2023 20:19 UTC

15 points

0 comments13 min readLW link

(80000hours.org)