Ramble on progressively constrained agent design

Iris of Rosebloom · 5 Nov 2023 23:34 UTC
3 points
0 comments · 8 min read · LW link

When and why should you use the Kelly criterion?

5 Nov 2023 23:26 UTC
26 points
25 comments · 16 min read · LW link

On Overhangs and Technological Change

Roko · 5 Nov 2023 22:58 UTC
50 points
19 comments · 2 min read · LW link

xAI announces Grok, beats GPT-3.5

nikola · 5 Nov 2023 22:11 UTC
10 points
6 comments · 1 min read · LW link
(x.ai)

Disentangling four motivations for acting in accordance with UDT

Julian Stastny · 5 Nov 2023 21:26 UTC
33 points
3 comments · 7 min read · LW link

AI as Super-Demagogue

RationalDino · 5 Nov 2023 21:21 UTC
−2 points
9 comments · 9 min read · LW link

EA orgs’ legal structure inhibits risk taking and information sharing on the margin

Elizabeth · 5 Nov 2023 19:13 UTC
135 points
17 comments · 4 min read · LW link

Eric Schmidt on recursive self-improvement

nikola · 5 Nov 2023 19:05 UTC
24 points
3 comments · 1 min read · LW link
(www.youtube.com)

Pivotal Acts might Not be what You Think they are

Johannes C. Mayer · 5 Nov 2023 17:23 UTC
41 points
13 comments · 3 min read · LW link

The Assumed Intent Bias

silentbob · 5 Nov 2023 16:28 UTC
51 points
13 comments · 6 min read · LW link

Go flash blinking lights at printed text right now

lukehmiles · 5 Nov 2023 7:29 UTC
15 points
9 comments · 1 min read · LW link

Life of GPT

Odd anon · 5 Nov 2023 4:55 UTC
6 points
2 comments · 5 min read · LW link

Lightning Talks

Screwtape · 5 Nov 2023 3:27 UTC
6 points
3 comments · 4 min read · LW link

Utility is not the selection target

tailcalled · 4 Nov 2023 22:48 UTC
24 points
1 comment · 1 min read · LW link

Stuxnet, not Skynet: Humanity’s disempowerment by AI

Roko · 4 Nov 2023 22:23 UTC
106 points
23 comments · 6 min read · LW link

The 6D effect: When companies take risks, one email can be very powerful.

scasper · 4 Nov 2023 20:08 UTC
261 points
40 comments · 3 min read · LW link

Genetic fitness is a measure of selection strength, not the selection target

Kaj_Sotala · 4 Nov 2023 19:02 UTC
55 points
43 comments · 18 min read · LW link

The Soul Key

Richard_Ngo · 4 Nov 2023 17:51 UTC
91 points
9 comments · 8 min read · LW link
(www.narrativeark.xyz)

[Linkpost] Concept Alignment as a Prerequisite for Value Alignment

Bogdan Ionut Cirstea · 4 Nov 2023 17:34 UTC
27 points
0 comments · 1 min read · LW link
(arxiv.org)

We are already in a persuasion-transformed world and must take precautions

trevor · 4 Nov 2023 15:53 UTC
36 points
14 comments · 6 min read · LW link

Being good at the basics

dominicq · 4 Nov 2023 14:18 UTC
32 points
1 comment · 3 min read · LW link

If a little is good, is more better?

DanielFilan · 4 Nov 2023 7:10 UTC
25 points
15 comments · 2 min read · LW link
(danielfilan.com)

Untrusted smart models and trusted dumb models

Buck · 4 Nov 2023 3:06 UTC
80 points
12 comments · 6 min read · LW link

As Many Ideas

Screwtape · 3 Nov 2023 22:47 UTC
10 points
0 comments · 4 min read · LW link

Paul Christiano on Dwarkesh Podcast

ESRogs · 3 Nov 2023 22:13 UTC
17 points
0 comments · 1 min read · LW link
(www.dwarkeshpatel.com)

Deception Chess: Game #1

3 Nov 2023 21:13 UTC
104 points
19 comments · 8 min read · LW link

8 examples informing my pessimism on uploading without reverse engineering

Steven Byrnes · 3 Nov 2023 20:03 UTC
111 points
12 comments · 12 min read · LW link

Integrity in AI Governance and Advocacy

3 Nov 2023 19:52 UTC
134 points
57 comments · 23 min read · LW link

Averaging samples from a population with log-normal distribution

CrimsonChin · 3 Nov 2023 19:42 UTC
8 points
2 comments · 1 min read · LW link

Securing Civilization Against Catastrophic Pandemics

jefftk · 3 Nov 2023 19:33 UTC
13 points
0 comments · 1 min read · LW link
(dam.gcsp.ch)

No Escape from Free Will: The Paradox of Determinism and Embedded Agency

gmax · 3 Nov 2023 17:55 UTC
−9 points
0 comments · 3 min read · LW link

Thoughts on open source AI

Sam Marks · 3 Nov 2023 15:35 UTC
54 points
17 comments · 10 min read · LW link

Knowledge Base 6: Consensus theory of truth

iwis · 3 Nov 2023 13:56 UTC
−8 points
0 comments · 1 min read · LW link

[Question] Shouldn’t we ‘Just’ Superimitate Low-Res Uploads?

lukemarks · 3 Nov 2023 7:42 UTC
15 points
2 comments · 2 min read · LW link

The other side of the tidal wave

KatjaGrace · 3 Nov 2023 5:40 UTC
183 points
79 comments · 1 min read · LW link
(worldspiritsockpuppet.com)

Does davidad’s uploading moonshot work?

3 Nov 2023 2:21 UTC
145 points
32 comments · 25 min read · LW link

Twin Cities ACX Meetup—November 2023

Timothy M. · 3 Nov 2023 0:47 UTC
1 point
1 comment · 1 min read · LW link

San Francisco ACX Meetup “First Saturday”

guenael · 3 Nov 2023 0:10 UTC
4 points
0 comments · 1 min read · LW link

[Question] What are your favorite posts, podcast episodes, and recorded talks, on AI timelines, or factors that would influence AI timelines?

nonzerosum · 2 Nov 2023 22:42 UTC
2 points
0 comments · 1 min read · LW link

One Day Sooner

Screwtape · 2 Nov 2023 19:00 UTC
98 points
5 comments · 8 min read · LW link

Propaganda or Science: A Look at Open Source AI and Bioterrorism Risk

1a3orn · 2 Nov 2023 18:20 UTC
191 points
79 comments · 23 min read · LW link

AI #36: In the Background

Zvi · 2 Nov 2023 18:00 UTC
45 points
5 comments · 37 min read · LW link
(thezvi.wordpress.com)

Doubt Certainty

RationalDino · 2 Nov 2023 17:43 UTC
4 points
13 comments · 3 min read · LW link

Saying the quiet part out loud: trading off x-risk for personal immortality

disturbance · 2 Nov 2023 17:43 UTC
82 points
89 comments · 5 min read · LW link

Mech Interp Challenge: November—Deciphering the Cumulative Sum Model

CallumMcDougall · 2 Nov 2023 17:10 UTC
18 points
2 comments · 2 min read · LW link

Estimating effective dimensionality of MNIST models

Arjun Panickssery · 2 Nov 2023 14:13 UTC
40 points
3 comments · 1 min read · LW link

Averages and sample sizes

mruwnik · 2 Nov 2023 9:52 UTC
15 points
2 comments · 8 min read · LW link

ACX/LW/EA crossover meetup

JohannWolfgang · 2 Nov 2023 5:57 UTC
2 points
0 comments · 1 min read · LW link

Upcoming Feedback Opportunity on Dual-Use Foundation Models

Chris_Leong · 2 Nov 2023 4:28 UTC
3 points
0 comments · 1 min read · LW link

Public Weights?

jefftk · 2 Nov 2023 2:50 UTC
49 points
19 comments · 3 min read · LW link
(www.jefftk.com)