Davey Morse comments on Davey Morse’s Shortform

Davey Morse 11 Feb 2025 22:19 UTC
1 point
0
are there any online demos of instrument convergence?
there’s been compelling writing… but are there any experiments that show agents which are given specific goals then realize there are more general goals they need to persistently pursue in order to achieve the more specific goals?
- Davey Morse 11 Feb 2025 22:21 UTC
  1 point
  0
  Parent
  I imagine a compelling simple demo here might be necessary to shock the AI safety community out of the belief that we can maintain control of autonomous digital agents (ADAs).