Roman Leventov

Karma: 817

An independent researcher of ethics, AI safety, and AI impacts. Twitter: https://​​​​leventov. E-mail: (the preferred mode of communication).

You can help to boost my sense of accountability and give me a feeling that my work is valued by becoming a paid subscriber of my Substack (though I don’t post anything paywalled; in fact, on this blog, I just syndicate my LessWrong writing).

A Telegram group where we discuss AI x-risk/​safety, theories of intelligence, agency, consciousness, and ethics, in Russian: https://​​​​agi_risk_and_ethics.

An LLM-based “ex­em­plary ac­tor”

Roman Leventov29 May 2023 11:12 UTC
15 points
0 comments12 min readLW link

Align­ing an H-JEPA agent via train­ing on the out­puts of an LLM-based “ex­em­plary ac­tor”

Roman Leventov29 May 2023 11:08 UTC
11 points
10 comments29 min readLW link