algon33

Karma: 82

algon33 22 Aug 2020 16:30 UTC
9 points
on: Epistemic Comparison: First Principles Land vs. Mimesis Land
If there is a mistake deep in the belief of someone
Are they not ideal Bayesians? Also, do they update based off other people’s priors? It could be intresting to make them all ultra-finitists.
Mimemis land is confusing from the outside. I’m not sure how they could avoid stumbling upon “correct” forms of manipulating beliefs, if they persist for long enough and there are large enough stochastic shocks to the communities beliefs. If they also copid succesful people in the past, I feel like this would be even more likely. Unless they happen to be the equivalent of chinese rooms: just an archive of if else clauses.
Anyway, thank you for introducing this delightful style of thought experiments.

algon33 13 Aug 2020 12:57 UTC
8 points
on: Alignment By Default
This came out of the discussion you had with John Maxwell, right? Does he think this is a good presentation of his proposal?
How do we know that the unsupervised learner won’t have learnt a large number of other embeddings closer to the proxy? If it has, then why should we expect human values to do well?
Some rough thoughts on the data type issue. Depending on what types the unsupervised learner provides the supervised, it may not be able to reach the proxy type by virtue of issues with NN learning processes.
Recall that tata types can be viewed as homotopic spaces, and construction of types can be viewed as generating new spaces off the old e.g. tangent spaces or path spaces etc. We can view neural nets as a type corresponding to a particular homotopic space. But getting neural nets to learn certain functions is hard. For example, learning a function which is 0 except in two sub spaces A and B. It has different values on A and B. But A and B are shaped like intelocked rings. In other words, a non-linear classification problem. So plausibly, neural nets have trouble constructing certain types from others. Maybe this depends on architecture or learning algorithm, maybe not.
If the proxy and human values have very different types, it may be the case that the supervised learner won’t be able to get from one type to another. Supposing the unsupervised learner presents it with types “reachable” from human values, then the proxy which optimises performance on the data set is just unavailable to the system even though its relatively simple in comparison.
Because of this, checking which simple homotopies neural nets can move between would be useful. Depending on the results, we could use this as an arguement that unsupervised NNs will never embed the human values type because we’ve found out it has some simple properties it won’t be able to construct de novo. Unless we do something like feed the unsupervised learner human biases/start with an EM and modify it.

algon33 28 Aug 2020 14:18 UTC
6 points
on: Zibbaldone With It All
Epistemic status: unsure
I have a hypothesis about why Zettlekasten provide diminishing returns over time. A corrolary is that others should find even less value in your Zettles. Which ties into some of your points, and shows what is missing from the Zibbaldone. Plus there are some suggestions on how to correct the flaw.
One of the key benefits to the Zettlekasten is that the way you link cards reflects your psyche’s understanding of the ideas. Of course other note-taking systems have this advantage. But this isn’t baked into them like it is with Zettlekasten.
Traversing the Zettlekasten lets you approximate your past state of mind when working on a problem. Which lets you dredge up whatever your subconscious has come up with on the topic. The seemingly random orgranisations of yesterdays Zettles helps this along a little by providing a glimpse into yesterdays self. So when you wake up in the morning and look over yesterdays cards, a flood of relevant houghts arises. When considering where to place them, you glance through your Zettlekasten. Zettles your mind was working on bring new thoughts to the forefront. Often it will feel trivial to combine and play with all these new ideas, generating even more thoughts.
Unfortunately, past a certain size your Zettlekasten contains too many cards for your brain to be processing at once and too many potential states of mind you could have been in. We expect a gradual reduction in value of the system. And less value to others, who have a different understanding of the ideas. Something similair is true for other note taking systems. There’s just a steeper decline in value.
How does this square with peoples’ reports that between different note taking systems provides the same early returns they got with the old one? My guess is that the reduced scope of your notes and the context shift is what does it. Your brain realises it no longer has to keep track of all those ideas and can focus on a few relatively simple ones. Which makes the low hanging fruit all the easier to grab.
Can we fix these issues? Maybe. And I think you’d have to go digital to do it. Consider spaced repetition. A memory’s strength decays exponentially with time. When reminded of them, your memory is strengthened. Spaced repition takes advantage of the fact that there are optimal timings to strengthen the memory. By analogy, we might say there are optimal times to dredge up ideas from your mind. And likewise, there may be optimal timings to link zettles. Perhaps these timings depend on the “distance” between the zettles.
A digital system could provide all this. A useful format would be the zettle to consider, and a graph surrounding it of the zettles you should link it to. When moving to adjacent zettles, you can see all the zettles it links to in order to provide relevant context.

algon33 26 Dec 2020 14:24 UTC
5 points
on: The Best Visualizations on Every Subject
Here’s a visualisation that goes along with Euclid’s elements
A plot of which theorems are used in the proof of each theorem in Euclid’s elements, ordered by the book e.g. the black dots at the bottom say proofs in the book 13 mostly used theorems from all the books bar 7,8,9 and 12.
This was one of many from an article on “The Empirical MetaMathematics of Euclid and Beyond”. It is a long essay on the overarching structure of Euclid’s elements and verifies some claims made about Euclid’s Elements e.g. the proofs were ordered in nearly the most parsimonious way possible. It also finds the most difficult theorems in each book, the greatest possible reductions in proof length, and hints that the network of theorem dependancy has a local 2-d structure. Highly recommend the article.

algon33 14 Nov 2020 14:03 UTC
5 points
on: Covid 11/12: The Winds of Winter
Minor quibble which I hope isn’t breaking a norm: BetFair did seem to pay out last week, or at least some of the bets on who would win the presidency were settled on 07/11/20.
Do you expect we’ll be n the midst of a third wave before the vaccine begins to be doled out? Or just beginning to enter one?

Thanks for the post.

algon33 9 Jan 2021 15:45 UTC
4 points
on: Science in a High-Dimensional World
I am kind of suprised you didn’t reference causal inference here to just gesture at the task in which we “figure out which variables are directly relevant—i.e. which variables mediate the influence of everything else”. Are you pointing to a different sort of idea/do you not feel causal inference is adequate for describing this task?
Also, scenario 1 and 2 seem fairly close to the “linear” and “non-linear” models of innovation Jason Crawford described in his talk “The Non-Linear Model of Innovation.” To be honest, I prefered his description of the models. Though he didn’t cover how miraculous it is that somehow the model can work. That, to a good approximation, the universe is simple and local.

algon33 26 Dec 2020 14:05 UTC
3 points
on: The Best Visualizations on Every Subject
Definetely not a subject, but I’d say that the visualisation of Wolfram’s theory of everything is excellent. Of course there are problems with his theory of everything, like the fact that he hasn’t actually proved his claims that it generates GR field equations or replicates QM. Or shown that his theory evades the critical objection Scott Aaronson raised. but as a visualisation:
1. It is aesthetically pleasing
2. Compactly contains the basic ideas of his T.o.E.
3. Ties the basic concepts together to see how they could generate a theory of physics
So I’d still recommend it.

algon33 23 Dec 2020 9:25 UTC
3 points
on: Wholehearted choices and “morality as taxes”
I am glad you put the quotation marks around “morality as taxes” since what my mind jumped to upon verbalising the title was what you described in the last part of your post: something you’d be glad to evade where possible. In retrospect, its clear that the quotation marks were meant to point to another approach and not the one your thought experiment is meant to represent. Still, I think “Wholehearted choices vs morality as taxes” would be a little clearer as a title.

algon33 5 Nov 2020 15:28 UTC
3 points
on: Covid 11/5: Don’t Mention the War
The post about Sweden’s unusual situation you linked to has updated. The author claims that the reduced death rate is mostly due to younger people getting near all of the covid cases, which is supported by recent data (the figure shows total number of changes between July and 03 Nov). Why that is the case is another issue.
Edit: As always, thanks for the post.

algon33 30 Sep 2020 16:04 UTC
3 points
in reply to: ChristianKl’s comment on: Numeracy neglect—A personal postmortem
Because of the shift in culture in mathematics, wherein the old proofs were considered unrigorous. Analysis ala Weirstrauss put the old statements on firmer footing, everyone migrated there, and infinitesimals were left to langiush until a transfer principle was proven to give them a rigorous founding. But by that time, standard analysis had born such great fruits that it was deeply intertwined with modern mathematics. And of course, there’s been a trend against the infinitary and against the incomputable in the past century.
So there’s both institutional inertia due to historical developments, as well as some philosophical objections which really boil down to whether you’re fine with infinitary mathematics. I make no arguements concerning the latter, I just note that one can reject infinitary mathematics without believing they’re ugly. Now if you’re saying not all infinitary mathematics is ugly, just the hypereals, that’s a different claim. I can get why one might think they’re uglier than e.g. the complex numbers, but I don’t get why they’d be ugly, period. May I ask why you think so?

algon33 15 Aug 2020 0:15 UTC
3 points
in reply to: johnswentworth’s comment on: Alignment By Default
Based off what you’ve said in the comments, I’m guessing you’d say the various forms of corrigibility are natural abstractions. Would you say we can use the strategy you outline here to get “corrigibility by default”?
Regarding iterations, the common objection is that we’re introducing optimisation pressure. So we should expect the usual alignment issues anyway. Under your theory, is this not an issue because of the sparsity of natural abstractions near human values?

algon33 31 Jul 2020 20:22 UTC
3 points
on: “Go west, young man!”—Preferences in (imperfect) maps
Sometimes the cluster in the map a preference is pointing at involves another preference. Which provides a natural resolution mechanism. What happens when there’s two preferences, I’m unsure. I suppose it depends on how your map changes. In which case, I think you should focus on how to make purity coherent you should start off with some “simple” map and various “simple” changes in the map. To make purity coherent relative to your map is both computationally hard, and empathetically hard.
Side-note: It would be interesting to see which resolution mechanisms produce the most varied shifts in preferences for boundedly rational agents with complex utility functions.
Side-note^2: Stuart, I’m writing a review of all the work done on corrigibility. Would you mind if I asked you some questions on your contributions?

algon33 31 Dec 2020 18:57 UTC
2 points
in reply to: Ruby’s comment on: LessWrong/EA New Year’s Ultra Party
Hymn to breaking strain or Hope Eyrie.

algon33 1 Sep 2020 20:19 UTC
2 points
in reply to: nostalgebraist’s comment on: interpreting GPT: the logit lens
IIRC, this also shows a discontinuous flip at the bottom followed by slower change.
Maybe edit the post so you include this? I know I was wondering about this too.

algon33 2 Dec 2020 12:38 UTC
1 point
in reply to: TurnTrout’s comment on: SETI Predictions
My personal reasons:
1. I assumed the question was about the first few decades after “first contact”.
2. A large chunk of my probability mass is on first contact being unintentional, and something neither side can do much about. Or perhaps one “side” is unaware of it. Like if we receive some message directed to no one in particular, or recording the remnants of some extreme cosmic event that seems mighty unatural.
3. It feels like we’re near certain to have created an AGI by then. I am unsure enough about the long term time scales of AGI improvement, and their limits, that I can assign some credence to the AGI we make possessing relatively advanced technology. And so, it may be in a good bargainning position. If we make plenty of AI, maybe they’ll be less powerful individually, but they should still be quite potent in the face of a superior adversary.

algon33 1 Dec 2020 19:13 UTC
1 point
on: SETI Predictions
You should alter questions to make it clear “we” is meant to be humans or whatever we makes that succeeds us.
Also, perhaps a queston on whether “first contact” will be us detecting them without their being aware of it.

algon33 27 Oct 2020 17:24 UTC
1 point
in reply to: Kaj_Sotala’s comment on: Memory reconsolidation for self-affection
Thanks for the reply. Feelings of helplessness sounds about right, and I think you may be right about giving your self the feeling that you are being supported. Only, people with severe chronic pain often suffer from anxiety and depression as well. It seems like it would be a hard battle getting their brains to recognise those aforementioned feelings.

algon33 27 Oct 2020 10:19 UTC
1 point
on: Memory reconsolidation for self-affection
How does this apply for physically painful trauma? I understand that the broader process should work, but I’m curious if you could guess what frame would be the most helpful for such trauma.

algon33 21 Oct 2020 18:02 UTC
1 point
on: Open & Welcome Thread – October 2020
Somewhat urgent: can anyone recommend a good therapist or psychiatrist for anxiety/depression in the UK? Virtual sessions are probably required. Private is fine. Also, they shouldn’t be someone biased towards rationalist types. The person I’m thinking of has nearly no knowledge of these ideas.
Other recommendations that seem relevant welcome.

algon33 1 Oct 2020 10:16 UTC
1 point
in reply to: Richard_Kennaway’s comment on: Numeracy neglect—A personal postmortem
I still disagree. You can use Fermat’s last theorem rigorously without understanding why it works. Same for the four colour theorem. And which mathematics understand why we can classify finite simple groups the way we do? I’d bet fewer than a percent do. Little wonder, if the proof’s 3 volumes long! My point is that there are many theorems a mathematician will use without rigorously knowing why it works. Oh sure, you can tell them a rough story outlining the ideas. But could the prove it themselves? Probably not, without a deep understanding of the area. Yet even without that understanding, they can use these theorems in formal proofs. They can get a machine to check over it.
Now, I admit that’s unsatisfying. I agree that if they don’t, then they don’t have a rigorous understanding of the theorem. Eventually, problems will arise which they cannot resolve without understanding that which they accepted as magic. But is that really so fatal a flaw for teaching students the hyperreals? One only needs a modest amount of logic, perhaps enough for a course or two, to understand why the transfer principle works. Which seems a pretty good investment, given how much model theory sheds light on what we take for grounded.
Now I suppose if you find infinitary mathematics ugly, then is all besides the point. And unfortunately, there’s not much I can say against that beyond the usual arguements and personal aesthetics.