AI might help people generate tests for key results from OKRs and publish the results when those key results are not met.
If the key results are published, this could help with AI pauses by validating that no stories (work items) on creating beyond-frontier models have been written or started (assuming that is a key result people care about).
I figured that objectives and key results are how companies maintain alignment and avoid internal arms races, so they might be useful for alignment between entities too (perhaps with government-accredited badges for organisations that maintain objectives like pausing and responsible data use).
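To make the idea concrete, here is a minimal sketch of what an automated key-result test with publish-on-failure might look like. Everything here is hypothetical: the key result, the story list, and the keyword check are illustrative assumptions, not an existing tool.

```python
from dataclasses import dataclass

@dataclass
class KeyResult:
    objective: str
    description: str

def audit_stories(stories, banned_keywords):
    """Flag any work item that appears to violate the key result."""
    return [s for s in stories
            if any(k in s.lower() for k in banned_keywords)]

def publish_if_unmet(key_result, violations):
    """Build a public report; it marks the key result unmet when violations exist."""
    return {"key_result": key_result.description,
            "met": not violations,
            "violations": violations}

# Hypothetical example data
kr = KeyResult(objective="Maintain AI pause",
               description="No stories on beyond-frontier model training started")
stories = ["Improve eval harness", "Train beyond-frontier base model v2"]
report = publish_if_unmet(kr, audit_stories(stories, ["beyond-frontier"]))
```

In practice the audit step is the hard part; a keyword match is only a stand-in for whatever verification the AI would actually do against project trackers.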
Is anyone exploring how AI might be used to increase integrity and build trustworthiness?
For example, it could scan the behaviour of people, businesses, or AIs and check whether it is consistent with stated promises, flagging anything that is not.
It might also be used to train LLMs to be consistent if they are to be used as agents.
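The promise-consistency scan above could be sketched as matching observed actions against each stated promise. The promise predicates and action strings below are assumptions for illustration; a real system would need far richer behaviour models than keyword checks.

```python
# Hypothetical: each promise is a predicate that an observed action must satisfy.
promises = {
    "responsible data use": lambda action: "sell user data" not in action,
    "pausing": lambda action: "start frontier training run" not in action,
}

def flag_inconsistencies(promises, actions):
    """Return (promise, action) pairs where behaviour breaks a stated promise."""
    return [(name, action)
            for name, keeps_promise in promises.items()
            for action in actions
            if not keeps_promise(action)]

# Hypothetical observed behaviour
actions = ["publish model card", "sell user data to broker"]
flags = flag_inconsistencies(promises, actions)
```

The same flagged pairs could feed back as training signal if the goal is making LLM agents behave consistently with their own stated commitments.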