I think mode 1 is actually to some extent also a tunable explore/exploit preference (in reasoning space) that may stumble egregiously in OOD environments.
Like it’s not just forgetting, it’s “was I paranoid enough to ruminate about this unlikely connection between fact 1 and fact 2 that eventually led to me nuking prod”.
I think mode 1 is actually to some extent also a tunable explore/exploit preference (in reasoning space) that may stumble egregiously in OOD environments.
Like it’s not just forgetting, it’s “was I paranoid enough to ruminate about this unlikely connection between fact 1 and fact 2 that eventually led to me nuking prod”.