DavidHolmes
Neural networks biased towards geometrically simple functions?
Categorial preferences and utility functions
I really liked the content, but I found some of the style (`Sit down!′ etc.) really off-putting, which is why I only actually read the post on my 3rd attempt. Obviously you’re welcome to write in whatever style you want, and probably lots of other people really like it; I just thought it might be useful to mention that a non-empty set of people find it off-putting.
Hi Zack,
Can you clarify something? In the picture you draw, there is a codimension-1 linear subspace separating the parameter space into two halves, with all red points to one side and all blue points to the other. Projecting onto any 1-dimensional subspace orthogonal to this (there is a unique one through the origin) will thus yield a `variable’ which cleanly separates the points into the red and blue categories. So in the illustrated example, it looks just like a problem of bad coordinate choice.
On the other hand, one can easily have much more pathological situations; for example, the red points could all lie inside a certain sphere, and the blue points outside it. Then no choice of linear coordinates will exhibit this, and one has to use more advanced analysis techniques to pick up on it (e.g. persistent homology).
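To make the sphere situation concrete, here is a small sketch of my own (the sampling ranges and thresholds are my choices, not from the post): no linear functional separates points inside the unit circle from points in a surrounding annulus, but the quadratic feature r² separates them perfectly.

```python
import math
import random

random.seed(0)

def sample(radius_lo, radius_hi, n, dim=2):
    """Rejection-sample n points whose radius lies in [radius_lo, radius_hi]."""
    pts = []
    while len(pts) < n:
        p = [random.uniform(-2, 2) for _ in range(dim)]
        r = math.sqrt(sum(c * c for c in p))
        if radius_lo <= r <= radius_hi:
            pts.append(p)
    return pts

red = sample(0.0, 1.0, 200)    # inside the unit circle
blue = sample(1.2, 2.0, 200)   # in an annulus surrounding it

# For any direction w, the projections of the two classes overlap,
# so no linear coordinate separates them.
for _ in range(5):
    w = [random.gauss(0, 1), random.gauss(0, 1)]
    proj = lambda p: sum(wi * pi for wi, pi in zip(w, p))
    assert max(proj(p) for p in red) > min(proj(p) for p in blue)

# But the (nonlinear) feature r^2 separates them cleanly.
r2 = lambda p: sum(c * c for c in p)
assert max(r2(p) for p in red) < min(r2(p) for p in blue)
```

This is of course the simplest nonlinear example; persistent homology would detect the same structure without being handed the radial feature.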
So, to my vague question: do you have only the first situation in mind, or are you also considering the general case, but made the illustrated example extra-simple?
Perhaps this is clarified by your numerical example, I’m afraid I’ve not checked.
I expect most people on LW to be okay being asked their Cheerful Price to have sex with someone.
I find this a surprising assertion. It does not apply to me, probably it does apply to you. Ordinarily I would ask if you had any other data points, but I don’t want to take the conversation in this direction...
Some advice for getting papers accepted on arxiv
As some other comments have pointed out, there is a certain amount of moderation on arXiv. This is a little opaque, so below is an attempt to summarise some things that are likely to make it easier to get your paper accepted. I’m sure the list is very incomplete!
In writing this I don’t want to give the impression that posting things to the arXiv is hard; I currently have 28 papers there, have never had a single problem or delay with moderation, and the submission process generally takes me <15 minutes these days.
- Endorsement. When you first attempt to submit a paper you may need to be endorsed. JanBrauner kindly offered below to help people with endorsements; I might also be able to do the same, but I’ve never posted in the CS part of arXiv, so I’m not sure how effective this would be. However, it is even better to avoid needing endorsement at all. To this end, use an academic email address if you have one; this is quite likely to be enough on its own. Also, see below on subject classes (endorsement requirements depend on which subject class(es) you want to post in).
- Choosing subject classes. Each paper gets one or more subject classes, like CS.AI; see [https://arxiv.org/category_taxonomy] for a list. Some subject classes attract more junk than others, and those that attract more junk are more heavily moderated. In mathematics it is math.GM (General Mathematics) that attracts the most junk, and hence is most heavily moderated. I guess most people here are looking at CS.AI; I don’t know what moderation is like there. But one easy thing is to minimise cross-listing (adding additional subject classes to your paper), since you are then moderated by all of them.
- Write in (La)TeX, and submit the .tex file. You don’t have to do this, but it is standard and preferred by the arXiv, and I suspect it makes it less likely that your paper gets flagged for moderation. It is also an easy way to make sure your paper looks like a serious academic paper.
- It is possible to submit papers on behalf of third parties. I’ve never done this, and I suspect such papers are more heavily moderated.
- If you have multiple authors, it doesn’t really matter who submits. After the submission is posted you are sent a ‘paper password’ allowing coauthors to ‘claim’ the paper; it is then associated with their arXiv account, ORCID etc. (ORCID is optional, but a really good idea, and free).
Finally, a request: please be nice to the moderators! They are generally unpaid volunteers doing a valuable service to the community (e.g. making sure I don’t have to read nonsense proofs of the Riemann hypothesis every morning). Of course it doesn’t feel good if your paper gets held up, but please try not to take it personally.
I think that
provable guarantees on the safety of an FHE scheme that do not rely on open questions in complexity theory such as the difficulty of lattice problems.
is far out of reach at present (in particular, to the extent that no bounty would meaningfully affect people’s likelihood of working on it). It is hard to do much in crypto without assuming some kind of problem to be computationally difficult, and there are very few results proving that a given problem is computationally difficult in an absolute sense (rather than just ‘at least as hard as some other problem we believe to be hard’); cf. P vs NP. Or perhaps I misunderstand your meaning; are you OK with assuming e.g. integer factorisation to be computationally hard?
Personally I also don’t think this is so important; if we could solve alignment modulo assuming e.g. integer factorisation (or some suitable lattice problem) is hard, then I think we should be very happy…
More generally, I’m a bit sceptical of the effectiveness of a bounty here, because the commercial applications of FHE are already so great.
About 10 years ago, when I last talked to people in the area about this, I got the impression that FHE schemes were generally expected to be somewhat less secure than non-homomorphic schemes, just because the extra structure gives an attacker so much more to work with. But I have no idea whether people still believe this.
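As a concrete (and deliberately toy, totally insecure) illustration of the kind of extra structure at issue: textbook RSA with tiny parameters is multiplicatively homomorphic. This sketch is mine, not from the thread; real FHE schemes are lattice-based, but the homomorphic identity below shows the general phenomenon of ciphertexts carrying algebraic structure.

```python
# Textbook RSA with toy primes, purely to illustrate the
# multiplicative homomorphism: Enc(a) * Enc(b) encrypts a*b mod n.
p, q = 61, 53
n = p * q                  # modulus, 3233
phi = (p - 1) * (q - 1)    # Euler totient, 3120
e = 17                     # public exponent, coprime to phi
d = pow(e, -1, phi)        # private exponent (Python 3.8+ modular inverse)

enc = lambda m: pow(m, e, n)
dec = lambda c: pow(c, d, n)

a, b = 7, 12
# Multiplying ciphertexts multiplies the underlying plaintexts.
assert dec(enc(a) * enc(b) % n) == (a * b) % n
```

This very structure is what makes unpadded RSA malleable, which is the flavour of worry I remember people expressing about homomorphic schemes generally.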
I suspect the arXiv might not be keen on an account that posts papers by a range of people (not including the account-owner as coauthor). This might lead to heavier moderation/whatever. But I could be very wrong!
I was about to write approximately this, so thank you! To add one point in this direction: I am sceptical about the value of reducing the expectation that researchers explain what they are doing. My research is in two fields (arithmetic geometry and enumerative geometry). In the first we put a lot of the burden on the writer to explain themselves, while in the latter poor and incomplete explanations are standard. This sometimes allows people in the latter field to move faster, but:
1. it leaves critical foundational gaps, which we can ignore for a while but which eventually cause a lot of pain;
2. sometimes really critical points are hidden in the details, and we just miss these if we don’t write the details down properly.
Disclaimers:
1. while I think a lot of people working in these fields would agree with me that this distinction exists, not so many will agree that it is generally a bad thing;
2. I’m generally criticising lack of rigour rather than lack of explanation. I am not claiming these necessarily have to go together, but in my experience they very often do.
Thank you for the quick reply! I’m thinking about section 5.1 on reparametrising the model, where they write:
every minimum is observationally equivalent to an infinitely sharp minimum and to an infinitely flat minimum when considering nonzero eigenvalues of the Hessian;
If we stick to section 4 (and so don’t allow reparametrisation), I agree there seems to be something more tricky going on. I initially assumed that I could e.g. modify the proof of Theorem 4 to make a sharp minimum flat by taking alpha to be big, but it doesn’t work like that (basically we’re looking at alpha + 1/alpha, which can easily be made big, but not very small). So maybe you are right that we can only make flat minima sharp and not conversely. I’d like to understand this better!
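A quick numerical sanity check of the alpha + 1/alpha point (my own sketch, not from the paper): the quantity is unbounded above but bounded below by 2, with the minimum at alpha = 1, so rescaling can make things arbitrarily sharp but never arbitrarily flat.

```python
# alpha + 1/alpha >= 2 for alpha > 0 (AM-GM), with equality iff alpha = 1,
# so the quantity can be made arbitrarily large but never small.
alphas = [0.01, 0.5, 1.0, 2.0, 100.0]
vals = [a + 1 / a for a in alphas]
assert min(vals) == 2.0    # attained at alpha = 1
assert max(vals) > 100     # blows up as alpha -> 0 or alpha -> infinity
```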
Definitely the antagonistic bits - I enjoyed the casual style! Really just the line ‘Sit down. Sit down. Shut up. Listen. You don’t know nothing yet’ I found quite off-putting - even though in hindsight you were correct!
So the set of worlds, , is the set of functions from to …
I guess the should be a ? Also, you don’t seem to define ; perhaps ?
Bias towards simple functions; application to alignment?
The arXiv really prefers that you upload in TeX. For the author this makes it less likely that your paper will be flagged for moderation etc. (I guess). So if it were possible to export to TeX, I think that for the purposes of uploading to the arXiv this would be substantially better. Of course, I don’t know how much more/less work it is…
Thanks very much for the link!
I’m not sure I agree with interstice’s reading of the ‘sharp minima’ paper. As I understand it, they show that a given function can be made into a sharp or a flat minimum by finding a suitable point in the parameter space mapping to that function. So if one has a sharp minimum that does not generalise (which I think we will agree exists), then one can make the same function into a flat minimum, which will still not generalise, as it is the same function! Sorry I’m 2 years late to the party...
I’m sceptical of your decision to treat tenured and non-tenured faculty alike; as tenured faculty myself, this has long seemed to me perhaps the most important distinction.
More generally, what you write here is not very consistent with my own experience of academia (which is in mathematics and in Europe, though I have friends and collaborators in other countries and fields, so I am not totally clueless about how things work there).
Some points I am not seeing in your post are:
- For many academics, being able to do their own research and work with brilliant students is their primary motivation. Grants etc. are mainly valuable in how they facilitate that. This makes for a confusing situation where ‘losers’ in the original LCS model do the minimum work necessary for their paycheck, whereas ‘losers’ in the academic system (as you seem to be defining them?) do the maximum work that is compatible with their health and personal situation. Not only is this conceptually confusing to me, it also means that, all other things being equal, the more of a `loser’ one is in academia, the more impressive one’s CV will tend to be. Which is, I think, the opposite of the situation in the conventional LCS hierarchy?
- The fact that I ‘perform peer review for nothing at all’ apparently makes me clueless. But this is weird; it does not go on my CV, and I do it because I think it is important to the advancement of science. Surely this makes it a `loser’ activity?
- Acceptance of papers and awarding of grants is decided by people external to your university. This makes a huge difference, and I think you miss it by writing `So we might analyze this system at the department level, at the university level, or at the all-academia level, but it doesn’t make much of a difference.’
Perhaps the above makes it sound as if I view academia as an organisational utopia; this is far from the case! But I do not think this post does a good job of identifying problems. I think a post analysing moral mazes in academia would be interesting, but I’m not convinced that the LCS hierarchy is an appropriate model, and this attempt to apply it does not seem to me to make useful category distinctions.
Sure, in the end we only really care about what comes top, as that’s the thing we choose. My feeling is that information on (relative) strengths of preferences is often available, and when it is available it seems to make sense to use it (e.g. allowing circumvention of Arrow’s theorem).
In particular, I worry that, when we only have ordinal preferences, the outcome of attempts to combine various preferences will depend heavily on how finely we divide up the world; by using information on strengths of preferences we can mitigate this.
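To make the world-division worry concrete, here is a small hypothetical example of my own construction: a purely ordinal rule (Borda count) changes its verdict when one outcome is split into two near-identical copies, whereas summing cardinal utilities is unaffected by the split.

```python
def borda(profiles):
    """Borda scores for a list of rankings, each ordered best-first."""
    scores = {}
    for ranking in profiles:
        for pts, alt in enumerate(reversed(ranking)):
            scores[alt] = scores.get(alt, 0) + pts
    return scores

# Two voters, two worlds: a clean tie under Borda.
coarse = borda([["A", "B"], ["B", "A"]])
assert coarse["A"] == coarse["B"]

# Split world B into near-identical worlds B1, B2: the ordinal
# rule now breaks the tie purely because of how we carved up B.
fine = borda([["A", "B1", "B2"], ["B1", "B2", "A"]])
assert fine["B1"] > fine["A"]

# With cardinal utilities, each voter gives B1 and B2 the same
# utility they gave B, so A and the best B-world stay tied.
u1 = {"A": 1.0, "B1": 0.0, "B2": 0.0}
u2 = {"A": 0.0, "B1": 1.0, "B2": 1.0}
best_B = max(u1[w] + u2[w] for w in ["B1", "B2"])
assert u1["A"] + u2["A"] == best_B
```

The cloning move here is the standard trick behind independence-of-clones failures; using strength-of-preference information is one way to make the aggregate robust to such redescriptions.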
Thanks for pointing me to this updated version :-). This seems a really neat trick for writing down a utility function that is compatible with the given preorder. I thought a bit more about when/to what extent such a utility function will be unique, in particular if you are given not only the data of a preorder, but also some information on the strengths of the preferences. This ended up a bit too long for a comment, so I wrote a few things in outline here:
It may be quite irrelevant to what you’re aiming for here, but I thought it was maybe worth writing down just in case.
Hi Stuart,
I’m working my way through your `Research Agenda v0.9’ post, and am therefore going through various older posts to understand things. I wonder if I could ask some questions about the definition you propose here?
First, the condition that X be contained in R^N for some N seems not so relevant; can I just assume X, Y and Z are some manifolds (C^k for some 0≤k≤∞)? And we are given some partial order ≺ on X, so that we can refer to `being a better world’?
Then, as I understand it, your definition says the following:
Fix X, ≺ and Z. Let Y be a manifold and y+, y−∈Y. Given a local homeomorphism +:Y×Z→X, we say that y+ is partially preferred to y− if for all z∈Z, we have y−+z≺y++z.
I’m not sure which inequalities should be strict, but this seems non-essential for now. On the other hand, the dependence of this definition on the choice of Y seems somewhat subtle and interesting. I will try to illustrate this in what follows.
First, let us make a new definition. Fix X, ≺, and Z as before. Let Y′={y+,y−}, a two-element set equipped with the discrete topology, and let +′:Y′×Z→X be an immersion of C^k-manifolds. We say that y+ is weakly partially preferred to y− if for all z∈Z, we have y−+′z≺y++′z.
First, it is clear that partial preference implies weak partial preference. More formally:
Claim 1: Fix X, ≺ and Z. Suppose we have a manifold Y, points y+, y−∈Y, and a local homeomorphism +:Y×Z→X such that y+ is partially preferred to y−. Setting Y′={y+,y−} with the subspace topology from Y (i.e. discrete), and taking +′ to be the restriction of + from Y×Z to Y′×Z, we have that y+ is weakly partially preferred to y−.
Proof: obvious. □
However, the converse can fail if Z is not contractible. First, let’s prove that the concepts are equivalent for Z contractible:
Claim 2: Fix X, ≺ and Z, and assume that Z is contractible. Suppose we have a two-element set Y′={y+,y−} and a map +′:Y′×Z→X making y+ weakly partially preferred to y−. Then there exist a manifold Y, an injection Y′→Y, and a local homeomorphism +:Y×Z→X whose restriction to Y′×Z is +′, making y+ partially preferred to y−.
Proof: Let’s assume for simplicity of notation that X is equidimensional, say of dimension d_X, and write d_Z for the dimension of Z. Let Y be the disjoint union of two open balls of dimension d_X−d_Z, with Y′→Y the inclusion of the centres of the balls. Then take an ϵ-neighbourhood of Z in X; it is diffeomorphic to Y×Z since the normal bundle to Z in X is trivialisable (cf. https://math.stackexchange.com/questions/857784/product-neighborhood-theorem-with-boundary). □
If we want examples where weak partial preference and partial preference don’t coincide, we should look for an example where Z is not contractible and its normal bundle in X is not trivialisable.
Example 3: Let X be the disjoint union of two Möbius bands, and let Z be a circle. Note that including Z along the centre of either band gives a submanifold whose tubular neighbourhood is not a product. Assume that ≺ is such that one component of X is preferred to the other (and ≺ is indifferent within each connected component). Then take Y′={y+,y−}, and +′:Y′×Z→X to be the inclusion of the two circles along the centres of the two Möbius bands, such that {y+}×Z ends up in the preferred band. This yields a situation where y+ is weakly partially preferred to y−, but the conclusion of Claim 2 fails, i.e. this cannot be extended to a partial preference for y+ over y−.
What conclusion should we draw from this? To me, it suggests that the notion of partial preference is not yet quite as one would want. In the setting of Example 3, where X consists of two Möbius strips, one of which is preferred to the other, surely landing in the preferred strip should be preferred to landing in the un-preferred strip?! And yet the `local homeomorphism from a product’ condition gets in the way. This example is obviously quite artificial, and maybe analogous things cannot occur in reality. But I’m not so happy with this as an answer, since our approaches to AI safety should be (so far as possible) robust against flaws in our understanding of physics.
Apologies for the overly-long comment, and for the imperfect LaTeX (I’ve not used this type of form much before).