Is it a false positive? We are trying to eval their eval-awareness, here...
AprilSR
To be clear, on the general topic I totally agree that “can this thing be defended from bad actors” is often rather underemphasized!
I’m not entirely sure I’m convinced of the idea that the broad rationalist-EA-AI safety community isn’t a confusing patchwork of metaphorical city states? I suppose the money and power is probably concentrated more than the vague culture is?
Trigger: I encounter a new insight
Action: I make a TAP
I liked the Good Heart tokens because they paid me money and it’s funny to see CFAR when I scroll down in my PayPal.
My cursed proposal is to have a second language model extract the code from the original response.
If I had to guess I think it’s relevant to like, anthropic reasoning, or something.
Okay, this makes sense! It’s not obvious to me exactly how ambitious 2 is, but I get why you might be skeptical.
I’m having a little trouble understanding the whole argument. It’s not obvious to me why exactly this line of reasoning doesn’t prove too much by ruling out human speech? Plenty of human phonemes are like 10ms long?
I have no idea why the duration of an individual click is supposed to be relevant. There’s like, at least 30 milliseconds between clicks (according to Claude), and usually more than that, which seems like the relevant number to me?
Oh, I meant in the category of (topological) vector spaces, which requires the quotient maps to be linear.
I think maybe part of the confusion is that, when you’re working with vector spaces in particular, subspaces and quotient spaces are the same thing.
My intuition is like… you get a topological circle by gluing the two ends of an interval together, but no subspace of the interval is homeomorphic to a circle. I’m not entirely sure that this sort of issue meaningfully impacts neural networks, but I don’t immediately see any reason why it wouldn’t?
Claim 2 sounds very likely false to me.
Surely 28K should be at least 5 points!
I learned that antipsychotic medications have unpleasant side effects that can make people unwilling to get on or stay on them. Once a brain malfunctions that badly, without treatment, it never gets fixed.
I’d consider cobenfy as an option here, maybe? Or sometimes nicotine, allegedly, though that obviously isn’t ideal.
I very much know how continuous functions work and precisely what differentiability is, and I have filed taxes, but I would probably need a refresher on tax brackets to be sure I had everything right...
Yeah I’m pretty sure it’s an idiosyncratic mental technique / human psychology observation, there isn’t technical agent foundations progress here.
Also on this topic: