Consciousness as recurrence, potential for enforcing alignment?

Foyle

18 Apr 2023 23:05 UTC

−2 points

6 comments · 1 min read · LW link

Encouraging New Users To Bet On Their Beliefs

YafahEdelman

18 Apr 2023 22:10 UTC

49 points

8 comments · 2 min read · LW link

AI Safety Newsletter #2: ChaosGPT, Natural Selection, and AI Safety in the Media

ozhang, Dan H and Orpheus16

18 Apr 2023 18:44 UTC

30 points

0 comments · 4 min read · LW link

(newsletter.safe.ai)

Scientism vs. people

Roman Leventov

18 Apr 2023 17:28 UTC

4 points

4 comments · 11 min read · LW link

Capabilities and alignment of LLM cognitive architectures

Seth Herd

18 Apr 2023 16:29 UTC

88 points

18 comments · 20 min read · LW link

World and Mind in Artificial Intelligence: arguments against the AI pause

Arturo Macias

18 Apr 2023 14:40 UTC

1 point

0 comments · 1 min read · LW link

(forum.effectivealtruism.org)

Slowing AI: Interventions

Zach Stein-Perlman

18 Apr 2023 14:30 UTC

19 points

0 comments · 5 min read · LW link

Cryptographic and auxiliary approaches relevant for AI safety

Allison Duettmann

18 Apr 2023 14:18 UTC

7 points

0 comments · 6 min read · LW link

The Overemployed Via ChatGPT

Zvi

18 Apr 2023 13:40 UTC

58 points

7 comments · 6 min read · LW link

(thezvi.wordpress.com)

[Linkpost] AI Alignment, Explained in 5 Points (updated)

Daniel_Eth

18 Apr 2023 8:09 UTC

10 points

0 comments · 1 min read · LW link

(medium.com)

Argentines LW/SSC/EA/MIRIx—Call to All

daviddelauba

18 Apr 2023 6:37 UTC

1 point

0 comments · 1 min read · LW link

No, really, it predicts next tokens.

simon

18 Apr 2023 3:47 UTC

58 points

55 comments · 3 min read · LW link

The basic reasons I expect AGI ruin

Rob Bensinger

18 Apr 2023 3:37 UTC

187 points

74 comments · 14 min read · LW link

High schoolers can apply to the Atlas Fellowship: $10k scholarship + 11-day program

Ronny Fernandez and Jonas V

18 Apr 2023 2:53 UTC

26 points

0 comments · 3 min read · LW link

Green goo is plausible

anithite

18 Apr 2023 0:04 UTC

67 points

31 comments · 4 min read · LW link · 1 review

AI Impacts Quarterly Newsletter, Jan-Mar 2023

Harlan

17 Apr 2023 22:10 UTC

5 points

0 comments · 3 min read · LW link

(blog.aiimpacts.org)

[Question] How do you align your emotions through updates and existential uncertainty?

VojtaKovarik

17 Apr 2023 20:46 UTC

4 points

10 comments · 1 min read · LW link

AI Alignment Research Engineer Accelerator (ARENA): call for applicants

CallumMcDougall

17 Apr 2023 20:30 UTC

100 points

9 comments · 7 min read · LW link

AI policy ideas: Reading list

Zach Stein-Perlman

17 Apr 2023 19:00 UTC

24 points

7 comments · 4 min read · LW link

NYT: The Surprising Thing A.I. Engineers Will Tell You if You Let Them

Sodium

17 Apr 2023 18:59 UTC

11 points

2 comments · 1 min read · LW link

(www.nytimes.com)

But why would the AI kill us?

So8res

17 Apr 2023 18:42 UTC

142 points

96 comments · 2 min read · LW link

Sama Says the Age of Giant AI Models is Already Over

Algon

17 Apr 2023 18:36 UTC

49 points

12 comments · 1 min read · LW link

(www.wired.com)

Meetup Tip: Conversation Starters

Screwtape

17 Apr 2023 18:25 UTC

20 points

1 comment · 3 min read · LW link

Critiques of prominent AI safety labs: Redwood Research

Omega.

17 Apr 2023 18:20 UTC

4 points

0 comments · 22 min read · LW link

(forum.effectivealtruism.org)

How Large Language Models Nuke our Naive Notions of Truth and Reality

Sean Lee

17 Apr 2023 18:08 UTC

0 points

23 comments · 11 min read · LW link

An alternative of PPO towards alignment

ml hkust

17 Apr 2023 17:58 UTC

2 points

2 comments · 4 min read · LW link

What I learned at the AI Safety Europe Retreat

skaisg

17 Apr 2023 17:40 UTC

28 points

0 comments · 10 min read · LW link

(skaisg.eu)

What is your timelines for ADI (artificial disempowering intelligence)?

Christopher King

17 Apr 2023 17:01 UTC

3 points

3 comments · 2 min read · LW link

[Question] Can we get around Godel’s Incompleteness theorems and Turing undecidable problems via infinite computers?

Noosphere89

17 Apr 2023 15:14 UTC

−11 points

12 comments · 1 min read · LW link

La Crosse, WI Rationality Meetup

Daniel Uebele

17 Apr 2023 15:13 UTC

1 point

0 comments · 1 min read · LW link

Slowing AI: Foundations

Zach Stein-Perlman

17 Apr 2023 14:30 UTC

45 points

11 comments · 17 min read · LW link

Slowing AI: Reading list

Zach Stein-Perlman

17 Apr 2023 14:30 UTC

47 points

3 comments · 4 min read · LW link

Goodhart’s Law inside the human mind

Kaj_Sotala

17 Apr 2023 13:48 UTC

129 points

13 comments · 16 min read · LW link

Prediction: any uncontrollable AI will turn earth into a giant computer

Karl von Wendt

17 Apr 2023 12:30 UTC

11 points

8 comments · 3 min read · LW link

AutoBound on neural network can achieve OOMs lower training loss

Maybe_a

17 Apr 2023 5:20 UTC

10 points

9 comments · 1 min read · LW link

(ai.googleblog.com)

Making Booking.Com less out to get you

Elizabeth

17 Apr 2023 4:04 UTC

21 points

0 comments · 1 min read · LW link

(www.alexcharlton.co)

grey goo is unlikely

bhauth

17 Apr 2023 1:59 UTC

158 points

123 comments · 9 min read · LW link · 2 reviews

(bhauth.com)

AGI Clinics: A Safe Haven for Humanity’s First Encounters with Superintelligence

portr.

17 Apr 2023 1:52 UTC

−5 points

1 comment · 1 min read · LW link

Summaries of top forum posts (27th March to 16th April)

Zoe Williams

17 Apr 2023 0:28 UTC

14 points

1 comment · 12 min read · LW link

AI Takeover Scenario with Scaled LLMs

simeon_c

16 Apr 2023 23:28 UTC

42 points

15 comments · 8 min read · LW link

My experience getting funding for my biological research

Metacelsus

16 Apr 2023 22:53 UTC

78 points

10 comments · 5 min read · LW link

(denovo.substack.com)

Top lesson from GPT: we will probably destroy humanity “for the lulz” as soon as we are able.

Shmi

16 Apr 2023 20:27 UTC

63 points

28 comments · 1 min read · LW link

On urgency, priority and collective reaction to AI-Risks: Part I

Denreik

16 Apr 2023 19:14 UTC

−10 points

15 comments · 5 min read · LW link

Efficient Learning: Memorization

Alvin Ånestrand

16 Apr 2023 17:58 UTC

4 points

2 comments · 5 min read · LW link

(forum.effectivealtruism.org)

Mechanistically interpreting time in GPT-2 small

rgould, Elizabeth Ho and Arthur Conmy

16 Apr 2023 17:57 UTC

68 points

6 comments · 21 min read · LW link

La Crosse, WI Rationality Meetup

Daniel Uebele

16 Apr 2023 17:33 UTC

1 point

0 comments · 1 min read · LW link

The Soul of the Writer (on LLMs, the psychology of writers, and the nature of intelligence)

rogersbacon

16 Apr 2023 16:02 UTC

11 points

1 comment · 3 min read · LW link

(www.secretorum.life)

Possibilizing vs. actualizing

TsviBT

16 Apr 2023 15:55 UTC

31 points

2 comments · 5 min read · LW link

Human Extinction by AI through economic power

ChristianKl

16 Apr 2023 12:15 UTC

8 points

1 comment · 8 min read · LW link

Bit Flip

Charlie Sanders

16 Apr 2023 7:30 UTC

−2 points

11 comments · 11 min read · LW link