All 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 202220232024 2025 2026

All Jan FebMarApr May Jun Jul Aug Sep Oct Nov Dec

All 1 2 3 4 5 6 7 8 9 10 111213 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

The issue of meaning in large language models (LLMs)

Bill Benzon11 Mar 2023 23:00 UTC

1 point

34 comments8 min readLW link

[Linkpost] Scott Alexander reacts to OpenAI’s latest post

Orpheus1611 Mar 2023 22:24 UTC

27 points

0 comments5 min readLW link

(astralcodexten.substack.com)

Compositional language for hypotheses about computations

Vanessa Kosoy11 Mar 2023 19:43 UTC

38 points

6 comments12 min readLW link

Understanding and controlling a maze-solving policy network

TurnTrout, peligrietzer, Ulisse Mini, Monte M and David Udell

11 Mar 2023 18:59 UTC

335 points

28 comments23 min readLW link

[Question] How can we promote AI alignment in Japan?

Shoka Kadoi11 Mar 2023 18:52 UTC

24 points

11 comments1 min readLW link

How to Support Someone Who is Struggling

David Zeller11 Mar 2023 18:52 UTC

76 points

13 comments5 min readLW link

[Question] Given one AI, why not more?

Frank Adk11 Mar 2023 18:52 UTC

7 points

12 comments1 min readLW link

Agents synchronization

Ben Amitay11 Mar 2023 18:41 UTC

12 points

1 comment5 min readLW link

Against Complete Blackout Curtains For Sleep

jp11 Mar 2023 18:29 UTC

19 points

11 comments1 min readLW link

[Question] Counterarguments to Core AI X-Risk Stories?

DavidW11 Mar 2023 17:55 UTC

10 points

2 comments1 min readLW link

The Power of Intelligence—The Animation

Writer11 Mar 2023 16:15 UTC

45 points

3 comments1 min readLW link

(youtu.be)

[Question] Hoarding Gmail-accounts in a post-CAPTCHA world?

Alexander Gietelink Oldenziel11 Mar 2023 16:08 UTC

7 points

3 comments1 min readLW link

[Question] Will the Bitcoin fee market actually work?

TropicalFruit11 Mar 2023 0:02 UTC

10 points

7 comments1 min readLW link

Rationalism and social rationalism

philosophybear10 Mar 2023 23:20 UTC

17 points

5 comments10 min readLW link

(philosophybear.substack.com)

Meetup Tip: Nametags

Screwtape10 Mar 2023 21:00 UTC

18 points

2 comments3 min readLW link

[Question] Is ChatGPT (or other LLMs) more ‘sentient’/’conscious/etc. then a baby without a brain?

M. Y. Zuo10 Mar 2023 19:00 UTC

−5 points

2 comments1 min readLW link

The humanity’s biggest mistake

RomanS10 Mar 2023 16:30 UTC

0 points

1 comment2 min readLW link

Operationalizing timelines

Zach Stein-Perlman10 Mar 2023 16:30 UTC

7 points

1 comment3 min readLW link

[Question] What do you think is wrong with rationalist culture?

tailcalled10 Mar 2023 13:17 UTC

16 points

78 comments1 min readLW link

Dice Decision Making

Bart Bussmann10 Mar 2023 13:01 UTC

21 points

14 comments3 min readLW link

Stop calling it “jailbreaking” ChatGPT

Templarrr10 Mar 2023 11:41 UTC

7 points

9 comments2 min readLW link

Long-term memory for LLM via self-replicating prompt

avturchin10 Mar 2023 10:28 UTC

20 points

3 comments2 min readLW link

Thoughts on the OpenAI alignment plan: will AI research assistants be net-positive for AI existential risk?

Jeffrey Ladish10 Mar 2023 8:21 UTC

58 points

3 comments9 min readLW link

Reflections On The Feasibility Of Scalable-Oversight

Felix Hofstätter10 Mar 2023 7:54 UTC

11 points

0 comments12 min readLW link

Japan AI Alignment Conference

Chris Scammell and Katrina Joslin

10 Mar 2023 6:56 UTC

64 points

7 comments1 min readLW link

(www.conjecture.dev)

Everything’s normal until it’s not

Eleni Angelou10 Mar 2023 2:02 UTC

7 points

0 comments3 min readLW link

Acolytes, reformers, and atheists

lc10 Mar 2023 0:48 UTC

9 points

0 comments4 min readLW link

The hot mess theory of AI misalignment: More intelligent agents behave less coherently

Jonathan Yan10 Mar 2023 0:20 UTC

50 points

22 comments1 min readLW link

(sohl-dickstein.github.io)

Why Not Just Outsource Alignment Research To An AI?

johnswentworth9 Mar 2023 21:49 UTC

161 points

50 comments9 min readLW link 1 review

What’s Not Our Problem

Jacob Falkovich9 Mar 2023 20:07 UTC

22 points

6 comments9 min readLW link

Questions about Conjecure’s CoEm proposal

Orpheus16 and Niki Dupuis

9 Mar 2023 19:32 UTC

51 points

4 comments2 min readLW link

What Jason has been reading, March 2023

jasoncrawford9 Mar 2023 18:46 UTC

12 points

0 comments6 min readLW link

(rootsofprogress.org)

[Question] “Provide C++ code for a function that outputs a Fibonacci sequence of n terms, where n is provided as a parameter to the function

Thembeka999 Mar 2023 18:37 UTC

−21 points

2 comments1 min readLW link

Anthropic: Core Views on AI Safety: When, Why, What, and How

jonmenaster9 Mar 2023 17:34 UTC

17 points

1 comment22 min readLW link

(www.anthropic.com)

Why do we assume there is a “real” shoggoth behind the LLM? Why not masks all the way down?

Robert_AIZI9 Mar 2023 17:28 UTC

64 points

48 comments2 min readLW link

Anthropic’s Core Views on AI Safety

Zac Hatfield-Dodds9 Mar 2023 16:55 UTC

173 points

40 comments2 min readLW link

(www.anthropic.com)

Some ML-Related Math I Now Understand Better

Fabien Roger9 Mar 2023 16:35 UTC

50 points

6 comments4 min readLW link

The Translucent Thoughts Hypotheses and Their Implications

Fabien Roger9 Mar 2023 16:30 UTC

142 points

7 comments19 min readLW link

IRL in General Environments

michaelcohen9 Mar 2023 13:32 UTC

8 points

20 comments1 min readLW link

Utility uncertainty vs. expected information gain

michaelcohen9 Mar 2023 13:32 UTC

13 points

9 comments1 min readLW link

Value Learning is only Asymptotically Safe

michaelcohen9 Mar 2023 13:32 UTC

5 points

19 comments1 min readLW link

Impact Measure Testing with Honey Pots and Myopia

michaelcohen9 Mar 2023 13:32 UTC

17 points

9 comments1 min readLW link

Just Imitate Humans?

michaelcohen9 Mar 2023 13:31 UTC

11 points

72 comments1 min readLW link

Build a Causal Decision Theorist

michaelcohen9 Mar 2023 13:31 UTC

−2 points

14 comments4 min readLW link

ChatGPT explores the semantic differential

Bill Benzon9 Mar 2023 13:09 UTC

7 points

2 comments7 min readLW link

AI #3

Zvi9 Mar 2023 12:20 UTC

55 points

12 comments62 min readLW link

(thezvi.wordpress.com)

The Scientific Approach To Anything and Everything

Rami Rustom9 Mar 2023 11:27 UTC

6 points

5 comments16 min readLW link

Paper Summary: The Effectiveness of AI Existential Risk Communication to the American and Dutch Public

otto.barten9 Mar 2023 10:47 UTC

14 points

6 comments4 min readLW link

Speed running everyone through the bad alignment bingo. $5k bounty for a LW conversational agent

ArthurB9 Mar 2023 9:26 UTC

140 points

32 comments2 min readLW link

Chomsky on ChatGPT (link)

mukashi9 Mar 2023 7:00 UTC

2 points

6 comments1 min readLW link