All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025

All Jan Feb MarAprMay Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8910 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

[Question] Is there a fundamental distinction between simulating a mind and simulating being a mind? Is this a useful and important distinction?

Thoth Hermes8 Apr 2023 23:44 UTC

−17 points

8 comments2 min readLW link

“warning about ai doom” is also “announcing capabilities progress to noobs”

the gears to ascension8 Apr 2023 23:42 UTC

23 points

5 comments3 min readLW link

Feature Request: Right Click to Copy LaTeX

DragonGod8 Apr 2023 23:27 UTC

18 points

4 comments1 min readLW link

ELCK might require nontrivial scalable alignment progress, and seems tractable enough to try

Alex Lawsen 8 Apr 2023 21:49 UTC

17 points

0 comments2 min readLW link

GPTs are Predictors, not Imitators

Eliezer Yudkowsky8 Apr 2023 19:59 UTC

418 points

100 comments3 min readLW link 3 reviews

4 generations of alignment

qbolec8 Apr 2023 19:59 UTC

1 point

0 comments3 min readLW link

The surprising parameter efficiency of vision models

beren8 Apr 2023 19:44 UTC

81 points

28 comments4 min readLW link

Random Observation on AI goals

FTPickle8 Apr 2023 19:28 UTC

−11 points

2 comments1 min readLW link

[Question] Can we evaluate the “tool versus agent” AGI prediction?

Xodarap8 Apr 2023 18:40 UTC

16 points

7 comments1 min readLW link

Relative Abstracted Agency

Audere8 Apr 2023 16:57 UTC

14 points

6 comments5 min readLW link

The benevolence of the butcher

dr_s8 Apr 2023 16:29 UTC

86 points

33 comments6 min readLW link 1 review

SERI MATS—Summer 2023 Cohort

Aris, Ryan Kidd and Christian Smith

8 Apr 2023 15:32 UTC

71 points

25 comments4 min readLW link

AI Proposals at ‘Two Sessions’: AGI as ‘Two Bombs, One Satellite’?

Derek M. Jones8 Apr 2023 11:31 UTC

5 points

0 comments1 min readLW link

(www.chinatalk.media)

All images from the WaitButWhy sequence on AI

trevor8 Apr 2023 7:36 UTC

73 points

5 comments2 min readLW link

Guidelines for productive discussions

ambigram8 Apr 2023 6:00 UTC

38 points

0 comments5 min readLW link

All AGI Safety questions welcome (especially basic ones) [April 2023]

steven04618 Apr 2023 4:21 UTC

57 points

89 comments2 min readLW link

Bringing Agency Into AGI Extinction Is Superfluous

George3d68 Apr 2023 4:02 UTC

28 points

18 comments5 min readLW link

Lagos, Nigeria—ACX Meetups Everywhere 2023

damola8 Apr 2023 3:55 UTC

1 point

0 comments1 min readLW link

Upcoming Changes in Large Language Models

Andrew Keenan Richardson8 Apr 2023 3:41 UTC

43 points

8 comments4 min readLW link

(mechanisticmind.com)

Consider The Hand Axe

ymeskhout8 Apr 2023 1:31 UTC

143 points

16 comments6 min readLW link

AGI as a new data point

Will Rodgers8 Apr 2023 1:01 UTC

−1 points

0 comments1 min readLW link

Parametrize Priority Evaluations

SilverFlame8 Apr 2023 0:39 UTC

2 points

2 comments6 min readLW link

Pausing AI Developments Isn’t Enough. We Need to Shut it All Down

Eliezer Yudkowsky8 Apr 2023 0:36 UTC

275 points

44 comments12 min readLW link 1 review

Humanitarian Phase Transition needed before Technological Singularity

Dr_What7 Apr 2023 23:17 UTC

−9 points

5 comments2 min readLW link

[Question] Thoughts about Hugging Face?

kwiat.dev7 Apr 2023 23:17 UTC

7 points

0 comments1 min readLW link

[Question] Is it correct to frame alignment as “programming a good philosophy of meaning”?

Util7 Apr 2023 23:16 UTC

2 points

3 comments1 min readLW link

n=3 AI Risk Quick Math and Reasoning

lionhearted (Sebastian Marshall)7 Apr 2023 20:27 UTC

6 points

3 comments4 min readLW link

[Question] What are good alternatives to Predictionbook for personal prediction tracking? Edited: I originally thought it was down but it was just 500 until I though of clearing cookies.

sortega7 Apr 2023 19:18 UTC

4 points

4 comments1 min readLW link

Environments for Measuring Deception, Resource Acquisition, and Ethical Violations

Dan H7 Apr 2023 18:40 UTC

51 points

2 comments2 min readLW link

(arxiv.org)

Superintelligence Is Not Omniscience

Jeffrey Heninger7 Apr 2023 16:30 UTC

16 points

21 comments7 min readLW link

(aiimpacts.org)

An ‘AGI Emergency Eject Criteria’ consensus could be really useful.

tcelferact7 Apr 2023 16:21 UTC

5 points

0 comments1 min readLW link

Reliability, Security, and AI risk: Notes from infosec textbook chapter 1

Orpheus167 Apr 2023 15:47 UTC

34 points

1 comment4 min readLW link

Pre-registering a study

Robert_AIZI7 Apr 2023 15:46 UTC

10 points

0 comments6 min readLW link

(aizi.substack.com)

Live discussion at Eastercon

Douglas_Reay7 Apr 2023 15:25 UTC

5 points

0 comments1 min readLW link

[Question] ChatGTP “Writing ” News Stories for The Guardian?

jmh7 Apr 2023 12:16 UTC

1 point

4 comments1 min readLW link

Storyteller’s convention, 2223 A.D.

plex7 Apr 2023 11:54 UTC

8 points

0 comments2 min readLW link

Stampy’s AI Safety Info—New Distillations #1 [March 2023]

markov7 Apr 2023 11:06 UTC

42 points

0 comments2 min readLW link

(aisafety.info)

Beren’s “Deconfusing Direct vs Amortised Optimisation”

DragonGod7 Apr 2023 8:57 UTC

52 points

10 comments3 min readLW link

Goal alignment without alignment on epistemology, ethics, and science is futile

Roman Leventov7 Apr 2023 8:22 UTC

20 points

2 comments2 min readLW link

Polio Lab Leak Caught with Wastewater Sampling

Cullen7 Apr 2023 1:06 UTC

82 points

3 comments1 min readLW link

(www.eurosurveillance.org)

Catching the Eye of Sauron

Casey_7 Apr 2023 0:40 UTC

221 points

68 comments4 min readLW link

If Alignment is Hard, then so is Self-Improvement

PavleMiha7 Apr 2023 0:08 UTC

21 points

20 comments1 min readLW link

Anthropic is further accelerating the Arms Race?

sapphire6 Apr 2023 23:29 UTC

82 points

22 comments1 min readLW link

(techcrunch.com)

Suggestion for safe AI structure (Curated Transparent Decisions)

Kane Gregory6 Apr 2023 22:00 UTC

5 points

6 comments3 min readLW link

10 reasons why lists of 10 reasons might be a winning strategy

trevor6 Apr 2023 21:24 UTC

110 points

7 comments1 min readLW link

A Defense of Utilitarianism

Pareto Optimal6 Apr 2023 21:09 UTC

−3 points

2 comments5 min readLW link

(paretooptimal.substack.com)

One Does Not Simply Replace the Humans

JerkyTreats6 Apr 2023 20:56 UTC

9 points

3 comments4 min readLW link

(www.lesswrong.com)

[Question] Where to begin in ML/AI?

Jake the Student6 Apr 2023 20:45 UTC

9 points

4 comments1 min readLW link

Misgeneralization as a misnomer

So8res6 Apr 2023 20:43 UTC

128 points

22 comments4 min readLW link

You can use GPT-4 to create prompt injections against GPT-4

WitchBOT6 Apr 2023 20:39 UTC

87 points

9 comments2 min readLW link