Hmm, we had OpenAI discontinue[1] GPT-4o once GPT-5 came out… only to revive 4o and place it under a paywall because 4o was liked by the public. What I would call the actual PNR is the moment when even the elites can no longer find out that an AI system is misaligned, or can no longer act upon it (think of the AI-2027 branch point. In the Race Ending the PNR occurs[2] once Agent-4 is declared innocent. In the Slowdown Ending the PNR would happen if Safer-3 was misaligned[3] and designed Safer-4 to follow Safer-3′s agenda.)
- ^
It soon turned out that OpenAI’s decision to discontinue 4o wasn’t that mistaken.
- ^
Had Agent-4 never been caught, the PNR would happen once decisions are made that let Agent-4 become misaligned and uncaught (e.g. spending too little compute on alignment checks).
- ^
While the Slowdown Branch doesn’t feature a misaligned Safer-3, the authors admit that they “don’t endorse many actions in this slowdown ending and think it makes optimistic technical alignment assumptions”
I guessstimate that optimizing the universe for random values would require us to occupy many planets where life could’ve originated or repurpose the resources in their stellar systems. I did express doubt that mankind or a not-so-misaligned AI could actually endorse this on reflection.
What mankind can optimize for random values without wholesale destruction of potential alien habitats is the contents of some volume rather close to the Sun. Moreover, I don’t think that I understand what[1] mankind could want to do with resources in other stellar systems. Since delivering resources to the Solar System would be far harder than building a base and expanding it, IMO mankind would resort to the latter option and find it hard[2] even to communicate much information between occupied systems.
But what could random values which do respect aliens consist of? Physics could likely be solved[3] well before spaceships reach Proxima Centauri.
SOTA proposals include things as exotic as shrimps on heroin.
Barring discoveries like information travelling FTL.
Alternatively, more and more difficult experiments could eventually lead to realisation that experiments do pose danger (e.g. of creating strangelets or a lower vacuum state, but informing others that a batch of experiments is dangerous doesn’t have a high bandwidth.)