I’m scared of models getting long term unbounded goals
This is surely scary. I think on some level I’m not worried about that, but maybe because I’m worried enough even about less scary scenarios (“let’s try to deal at least with the easy problems, and hope the hard ones don’t happen”). This feels somewhat similar to my disagreements with Sam here.
Yeah, that makes sense—thx.
This is surely scary. I think on some level I’m not worried about that, but maybe because I’m worried enough even about less scary scenarios (“let’s try to deal at least with the easy problems, and hope the hard ones don’t happen”). This feels somewhat similar to my disagreements with Sam here.
I could get on board with “lets try to deal at least with the easy problems, and
hopeensure the hard ones don’t happen”?That sounds great. I think I’m just a bit less optimistic about our chances at ensuring things : )
Oh, I said try to ensure for a reason. I do think it’s somewhat tractable though