Job Board (28 March 2033)

dr_s 28 Mar 2023 22:44 UTC

21 points

1 comment 3 min read LW link

Four lenses on AI risks

jasoncrawford 28 Mar 2023 21:52 UTC

23 points

5 comments 3 min read LW link

(rootsofprogress.org)

Some common confusion about induction heads

Alexandre Variengien 28 Mar 2023 21:51 UTC

65 points

4 comments 5 min read LW link

Draft: The optimization toolbox

Alex_Altair 28 Mar 2023 20:40 UTC

20 points

1 comment 7 min read LW link

Inching “Kubla Khan” and GPT into the same intellectual framework @ 3 Quarks Daily

Bill Benzon 28 Mar 2023 19:50 UTC

5 points

0 comments 3 min read LW link

A rough and incomplete review of some of John Wentworth’s research

So8res 28 Mar 2023 18:52 UTC

177 points

18 comments 18 min read LW link

[Question] How do you manage your inputs?

Mateusz Bagiński 28 Mar 2023 18:26 UTC

15 points

2 comments 1 min read LW link

Chatbot convinces Belgian to commit suicide

Jeroen De Ryck 28 Mar 2023 18:14 UTC

60 points

18 comments 3 min read LW link

(www.standaard.be)

A Primer On Chaos

johnswentworth 28 Mar 2023 18:01 UTC

54 points

9 comments 9 min read LW link

[Question] How likely are scenarios where AGI ends up overtly or de facto torturing us? How likely are scenarios where AGI prevents us from committing suicide or dying?

JohnGreer 28 Mar 2023 18:00 UTC

11 points

4 comments 1 min read LW link

How do we align humans and what does it mean for the new Conjecture’s strategy

Igor Ivanov 28 Mar 2023 17:54 UTC

7 points

4 comments 7 min read LW link

Governing High-Impact AI Systems: Understanding Canada’s Proposed AI Bill. April 15, Carleton University, Ottawa

Liav Koren 28 Mar 2023 17:48 UTC

11 points

1 comment 1 min read LW link

(forum.effectivealtruism.org)

I had a chat with GPT-4 on the future of AI and AI safety

Kristian Freed 28 Mar 2023 17:47 UTC

1 point

0 comments 8 min read LW link

LessWrong Hangout

Raymond Koopmanschap 28 Mar 2023 17:47 UTC

0 points

0 comments 1 min read LW link

Half-baked alignment idea

ozb 28 Mar 2023 17:47 UTC

6 points

27 comments 1 min read LW link

Some of My Current Impressions Entering AI Safety

worse 28 Mar 2023 17:46 UTC

2 points

0 comments 2 min read LW link

[Question] Why do the Sequences say that “Löb’s Theorem shows that a mathematical system cannot assert its own soundness without becoming inconsistent.”?

Thoth Hermes 28 Mar 2023 17:19 UTC

12 points

30 comments 1 min read LW link

Corrigibility, Self-Deletion, and Identical Strawberries

Robert_AIZI 28 Mar 2023 16:54 UTC

9 points

2 comments 6 min read LW link

(aizi.substack.com)

[Question] Why no major LLMs with memory?

Kaj_Sotala 28 Mar 2023 16:34 UTC

42 points

15 comments 1 min read LW link

Response to Tyler Cowen’s Existential risk, AI, and the inevitable turn in human history

Zvi 28 Mar 2023 16:00 UTC

73 points

27 comments 20 min read LW link

(thezvi.wordpress.com)

Adapting to Change: Overcoming Chronostasis in AI Language Models

RationalMindset 28 Mar 2023 14:32 UTC

−1 points

0 comments 6 min read LW link

Feeling Progress as Motivation

Sable 28 Mar 2023 9:11 UTC

4 points

0 comments 3 min read LW link

(affablyevil.substack.com)

Creating a family with GPT-4

Kaj_Sotala 28 Mar 2023 6:40 UTC

23 points

3 comments 10 min read LW link

(kajsotala.fi)

Some 2-4-6 problems

abstractapplic 28 Mar 2023 6:32 UTC

28 points

9 comments 1 min read LW link

(h-b-p.github.io)

[Question] Deep folding docs site?

mcint 28 Mar 2023 6:01 UTC

−1 points

2 comments 1 min read LW link

[Question] Why does advanced AI want not to be shut down?

RedFishBlueFish 28 Mar 2023 4:26 UTC

2 points

19 comments 1 min read LW link

100 Dinners And A Workshop: Information Preservation And Goals

Stephen Fowler 28 Mar 2023 3:13 UTC

8 points

0 comments 7 min read LW link

Demons from the 5&10verse!

Slimepriestess 28 Mar 2023 2:41 UTC

4 points

15 comments 4 min read LW link

(voidgoddess.org)

[Question] Can GPT-4 play 20 questions against another instance of itself?

Nathan Helm-Burger 28 Mar 2023 1:11 UTC

15 points

1 comment 1 min read LW link

(evanthebouncy.medium.com)

Geoffrey Hinton—Full “not inconceivable” quote

WilliamKiely 28 Mar 2023 0:22 UTC

21 points

2 comments 2 min read LW link

An A.I. Safety Presentation at RIT

Nicholas Kross 27 Mar 2023 23:49 UTC

8 points

0 comments 1 min read LW link

(www.youtube.com)

Which AI outputs should humans check for shenanigans, to avoid AI takeover? A simple model

Tom Davidson 27 Mar 2023 23:36 UTC

16 points

3 comments 8 min read LW link

The Prospect of an AI Winter

Erich_Grunewald 27 Mar 2023 20:55 UTC

62 points

24 comments 15 min read LW link

(www.erichgrunewald.com)

[Question] Best arguments against the outside view that AGI won’t be a huge deal, thus we survive.

Noosphere89 27 Mar 2023 20:49 UTC

4 points

7 comments 1 min read LW link

EA & LW Forum Weekly Summary (20th − 26th March 2023)

Zoe Williams 27 Mar 2023 20:46 UTC

4 points

0 comments 6 min read LW link

Three of my beliefs about upcoming AGI

Robert_AIZI 27 Mar 2023 20:27 UTC

6 points

0 comments 3 min read LW link

(aizi.substack.com)

Nobody knows how to reliably test for AI safety

marcusarvan 27 Mar 2023 19:48 UTC

1 point

0 comments 5 min read LW link

New blog: Planned Obsolescence

Ajeya Cotra 27 Mar 2023 19:46 UTC

96 points

7 comments 1 min read LW link

(www.planned-obsolescence.org)

South Bay ACX/SSC Spring Meetups Everywhere

allisona 27 Mar 2023 19:39 UTC

2 points

0 comments 1 min read LW link

[Question] Resources to see how people think/approach mathematics and problem-solving

zef 27 Mar 2023 19:12 UTC

7 points

2 comments 1 min read LW link

Staggering Hunters

Screwtape 27 Mar 2023 19:11 UTC

12 points

2 comments 5 min read LW link

Neurotechnology is Critical for AI Alignment

Milan Cvitkovic 27 Mar 2023 18:27 UTC

10 points

3 comments 1 min read LW link

(milan.cvitkovic.net)

[Question] Best resources to learn philosophy of mind and AI?

Sky Moo 27 Mar 2023 18:22 UTC

1 point

0 comments 1 min read LW link

the tensor is a lonely place

jml6 27 Mar 2023 18:22 UTC

−11 points

0 comments 4 min read LW link

(ekjsgrjelrbno.substack.com)

[Question] Bermudez Interface Problem

Motor Vehicle 27 Mar 2023 18:11 UTC

1 point

2 comments 1 min read LW link

Would you be a better RLHF labeler than GPT-4?

kache 27 Mar 2023 18:10 UTC

1 point

1 comment 1 min read LW link

LLM Powered LW Search

odraode17 27 Mar 2023 18:09 UTC

−1 points

0 comments 1 min read LW link

Announcing the Swiss Existential Risk Initiative (CHERI) 2023 Research Fellowship

Tobias H 27 Mar 2023 16:36 UTC

3 points

0 comments 2 min read LW link

Industrialization/Computerization Analogies

Gordon Seidoh Worley 27 Mar 2023 16:34 UTC

16 points

2 comments 2 min read LW link

Lessons from Convergent Evolution for AI Alignment

Jan_Kulveit and rosehadshar 27 Mar 2023 16:25 UTC

54 points

9 comments 8 min read LW link