Roman Leventov

Karma: 1,338

An independent researcher/blogger/philosopher about intelligence and agency (esp. Active Inference), alignment, ethics, interaction of the AI transition with the sociotechnical risks (epistemics, economics, human psychology), collective mind architecture, research strategy and methodology.

Twitter: https://twitter.com/leventov. E-mail: leventov.ru@gmail.com (the preferred mode of communication). I’m open to collaborations and work.

Presentations at meetups, workshops and conferences, some recorded videos.

I’m a founding member of the Gaia Consoritum, on a mission to create a global, decentralised system for collective sense-making and decision-making, i.e., civilisational intelligence. Drop me a line if you want to learn more about it and/or join the consoritum.

You can help to boost my sense of accountability and give me a feeling that my work is valued by becoming a paid subscriber of my Substack (though I don’t post anything paywalled; in fact, on this blog, I just syndicate my LessWrong writing).

For Russian speakers: русскоязычная сеть по безопасности ИИ, Telegram group.

Active Inference as a formalisation of instrumental convergence

Roman Leventov26 Jul 2022 17:55 UTC

12 points

2 comments3 min readLW link

(direct.mit.edu)

AGI-level reasoner will appear sooner than an agent; what the humanity will do with this reasoner is critical

Roman Leventov30 Jul 2022 20:56 UTC

24 points

10 comments1 min readLW link

[Question] Are language models close to the superhuman level in philosophy?

Roman Leventov19 Aug 2022 4:43 UTC

6 points

2 comments2 min readLW link

The problem with the media presentation of “believing in AI”

Roman Leventov14 Sep 2022 21:05 UTC

3 points

0 comments1 min readLW link

The circular problem of epistemic irresponsibility

Roman Leventov31 Oct 2022 17:23 UTC

5 points

2 comments8 min readLW link

[Question] What is our current best infohazard policy for AGI (safety) research?

Roman Leventov15 Nov 2022 22:33 UTC

12 points

2 comments1 min readLW link

The two conceptions of Active Inference: an intelligence architecture and a theory of agency

Roman Leventov16 Nov 2022 9:30 UTC

15 points

0 comments4 min readLW link

Properties of current AIs and some predictions of the evolution of AI from the perspective of scale-free theories of agency and regulative development

Roman Leventov20 Dec 2022 17:13 UTC

33 points

3 comments36 min readLW link

How evolutionary lineages of LLMs can plan their own future and act on these plans

Roman Leventov25 Dec 2022 18:11 UTC

39 points

16 comments8 min readLW link

AI psychology should ground the theories of AI consciousness and inform human-AI ethical interaction design

Roman Leventov8 Jan 2023 6:37 UTC

19 points

8 comments2 min readLW link

Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning

Roman Leventov12 Jan 2023 16:43 UTC

17 points

2 comments2 min readLW link

(arxiv.org)

Critique of some recent philosophy of LLMs’ minds

Roman Leventov20 Jan 2023 12:53 UTC

51 points

8 comments20 min readLW link

[Question] Has private AGI research made independent safety research ineffective already? What should we do about this?

Roman Leventov23 Jan 2023 7:36 UTC

43 points

5 comments5 min readLW link

Temporally Layered Architecture for Adaptive, Distributed and Continuous Control

Roman Leventov2 Feb 2023 6:29 UTC

6 points

4 comments1 min readLW link

(arxiv.org)

A multi-disciplinary view on AI safety research

Roman Leventov8 Feb 2023 16:50 UTC

43 points

4 comments26 min readLW link

Morphological intelligence, superhuman empathy, and ethical arbitration

Roman Leventov13 Feb 2023 10:25 UTC

1 point

0 comments2 min readLW link

The Linguistic Blind Spot of Value-Aligned Agency, Natural and Artificial

Roman Leventov14 Feb 2023 6:57 UTC

6 points

0 comments2 min readLW link

(arxiv.org)

Powerful mesa-optimisation is already here

Roman Leventov17 Feb 2023 4:59 UTC

35 points

1 comment2 min readLW link

(arxiv.org)

Joscha Bach on Synthetic Intelligence [annotated]

Roman Leventov2 Mar 2023 11:02 UTC

9 points

1 comment9 min readLW link

(www.jimruttshow.com)

A reply to Byrnes on the Free Energy Principle

Roman Leventov3 Mar 2023 13:03 UTC

27 points

16 comments14 min readLW link