Suppose there’s some information about the world that AIs can glean through vast amounts of experience and reflection, and that they can’t justify except by reference to that experience and reflection. Suppose two AIs make conflicting claims about that information, while agreeing on everything that humans can check.
Well, the AIs will develop track records and reputations.
This is already happening with LLM-based AIs.
And the vast majority of claims will actually be somewhat checkable, at some cost, after some time.
I don’t think this is a particularly bad problem.