Some form of ID verification, or (less precisely) IP geo-blocking. This is already done on many websites for other purposes (copyright, local laws, sanctions etc).
cubefox
I think an antonym is a polar opposite, not a negation. (hot --> cold rather than hot --> not hot)
This makes me wonder how the tribes of math will change as a result, as well as the relative status and prestige of subfields.
David Bessis has a long essay on the future of mathematics: The fall of the theorem economy. He argues convincingly that obtaining theorems was never a fundamental goal of mathematics. The real goal was always increasing human understanding of mathematical concepts. He argues that theorems are only valued so highly because they have been the most important (but imperfect) proxy for measuring contributions to mathematical understanding. You had to have deep understanding to come up with a proof. Bessis calls this monomaniacal focus on theorems “Hardy’s curse”, and the dynamic of theorems being the hard social currency of mathematics (citations, academic positions), the “theorem economy”. With the automation of proving theorems, they will lose most of their social prestige due to goodharting, and the theorem economy will break down, while leaving mathematics itself mostly untouched. The essay also touches on a lot of other interesting points.
Because of how it sounds? There is already “überall”/everywhere.
The Apollo program was probably the clearest case of US American “reward hacking”. (Or the moon race between the US and USSR more generally.)
In the short term yes, in the medium term frontier AI companies, forced by the US government, can restrict access to frontier models to US Americans. Which would accelerate the US economy over the rest of the world, while the justification will be “national security”.
I think that it’s good that a government body just demonstrated it’s willing to pull a frontier model offline
The government didn’t pull a frontier model offline, it cut off access for non-Americans.
Some form of “prior uncertainty” can be described with the Beta distribution. If we have a coin (or any other binary event) there is a difference between having observed 0 outcomes (0 heads and 0 tails), having observed one heads and one tails, and having observed 50 heads and 50 tails. All of them can be quantified as evidence for “50 probability for heads”, but the degree of reliability of this estimate varies widely between these cases. The Beta distribution is a subjective probability distribution over possible but unknown objective probability values of an event. If you are very sure the objective probability of your event is around 50%, the Beta distribution has a narrower peak than if you are more uncertain about your probability.
Of course, this doesn’t cover the more general types of prior uncertainty (e.g. rational vs irrational) you are discussing here, but at least it is something.
I’m a different person but I have a similar experience, I don’t use the “Your Feed > For You” section because it shows me subjectively random posts, though perhaps I could tune it to work better. Previously there was a “Top Comments” section on the front page which I personally found more useful.
Another one: Nathan Young, Dear AGI,
Mine is that it’s indeed simply saying the government doesn’t want to work their products, even indirectly.
The first is very different from the second. “Indirectly” is an extremely wide concept, and very few companies are designated supply chain risks. Usually when deciding to buy or not buy some product, you don’t care at all which tools the company uses, or the company from which the first company buys tools, etc.
I assume there is a reason you don’t want New Foundations?
I thought usually ∈ is assumed to be acyclic, then one could identify a set with its ∈ DAG. But if you have a universal set it probably must be an element of itself, so we don’t even have a DAG. (I recently learned that the classical definition of “acyclic” only rules out finite cycles, so even a DAG allows for infinite cycles? Strange.)
So identify a set with its directed ∈ graph? Not sure whether this makes sense.
More to the point, this SEP article might be interesting.
This is cool research. But I want to emphasize that the usage of AUROC for the evaluation of any binary classifier is generally questionable because a high AUROC value only indicates 1) a large true positive rate and 2) a large true negative rate (= low false positive rate).
But a high value of an appropriate statistic for a binary predictor should maximize all four of these values:
P(actually true | predicted true) (= true positive rate, recall, sensitivity)
P(actually false | predicted false) (= true negative rate, specificity, converse of false positive rate)
P(predicted true | actually true) (= positive predictive value, precision)
P(predicted false | actually false) (= negative predictive value)
A high value of the AUROC only means 1 and 2 are large, while 3 and 4 could be very small. In which case we would clearly have a bad classifier despite a high AUROC value. For details see this paper.
There is another statistic which can be used instead of the AUROC while avoiding this problem: the phi coefficient / “MCC”. This is simply the binary version of the standard Pearson correlation.
A high value of the MCC (close to +1) indicates that all four of the above probabilities are large, and a high negative value (close to −1) that all four are small. If the predictor and the measured variable are independent (the classifier guesses randomly) the value of the MCC is 0.
The linked paper above goes so far as to say
In this short study, we explain why the Matthews correlation coefficient should replace the ROC AUC as standard statistic in all the scientific studies involving a binary classification, in all scientific fields.
I consider it coherent to have qualia and be unaware of the fact, insofar as we’re taking qualia to be a good concept.
This would only make sense if they misunderstand the word “qualia”. Which seems quite unlikely for Carl Zimmerman who said he studied philosophy at MIT.
There is a difference between solving intent alignment for instruction following, and full value alignment. The latter would plausiblybbe guaranteed to be “safe”, or even more optimal than merely safe. (Utopia etc.)
Related: The Zombie Preacher of Somerset
So AI papers are currently good enough that they can’t be trivially distinguished from human papers, making Pangram necessary, but not yet good enough to produce AI research that is at least on a human level. From the outside this looks like a sign that RSI fairly close now.
Tangentially, it’s somewhat interesting that Pangram is a twist on Turing’s original test: In the original, it was a human who had to distinguish between a human and an AI based on text, now it is an AI that distinguishes between both, since AIs are apparently better now than humans in distinguishing between humans and AIs. So Pangram is a CAPTCHA, but conventional captchas weren’t better than humans at distinguishing between AIs and humans.
This reminds me of this passage from Richard Jeffrey’s The Logic of Decision:
Which does suggest that both believing and wanting is related to reducing prediction error, because wanting something is wanting it to be true, and wanting something to be true is wanting to increase your degree of belief that it is true. So they are indeed related but they don’t seem hard to distinguish.