Dalcy

Karma: 1,102

“I always remember, [Hamming] would come into my office and try to solve a problem [...] I had a very big blackboard, and he’d start on one side, write down some integral, say, ‘I ain’t afraid of nothin’, and start working on it. So, now, when I start a big problem, I say, ‘I ain’t afraid of nothin’, and dive into it.”
—Bruce MacLennan

Dalcy 18 Jan 2026 20:42 UTC
21 points
4
on: Darcy’s Shortform
Plot idea: Isekaied from 2026 into some date in the past. Only goal: get cryogenically preserved into the glorious transhumanist singularity. How to influence the trajectory of history into a direction that would broadly enable this kind of future, while setting up the long-lasting infrastructure & incentives that would cryogenically preserve oneself for centuries to come?
I’m not going to write this, but I think this is a very interesting premise & problem (especially if thousands of years into the past) and would love to see someone build on it.
Some thoughts:
- It’s definitely doable if it’s to a century ago. The earliest cryopreserved human still preserved today is James Bedford, born in 1893 and cryopreserved in 1967. So just stay healthy, avoid wars, accumulate wealth, and use the decades to influence the field of cryopreservation to arrive sooner and stronger.
- It gets harder (and more interesting) as the protagonist is transported further into the past, since they will have to bootstrap large parts of the technology pipeline that goes into making reliable cryopreservation possible, and acquire the relevant knowledge as they go. My guess is that there is a somewhat sharp divide in difficulty, depending on whether one isekaies with sufficient portion of their lifespan extending into the industrial revolution, or not—because that determines whether there will naturally exist large chunks of the basic industrial pipeline and science necessary for cryopreservation (eg science (thermodynamics) and practical engineering of heat pump, or just good metallurgy), or whether one will have to counterfactually bootstrap those pipelines into existence.
- Beyond some point, a large chunk of the difficulty will mainly be in having to counterfactually bootstrap the relevant pieces of technology pipeline into existence, so that future generations have the technology and science to make it possible-in-principle to maintain the preservation long-term.
- Besides the technological problem, the most problem is in setting up the infrastructure & incentive structure that would make future generations continue the cryopreservation for centuries to come. The most obvious and reliable solution is to just use a will—accumulate a large amount of wealth (this itself is probably pretty difficult, even with foreknowledge), set up a trust and a foundation that reinvests and grows the wealth and a will specifying its use for the maintenance of cryopreservation. I am imagining the best place to be the UK given their relatively long history of various institutions and succession laws?
- These infrastructural problems will gradually get more difficult further back in time because:
  1. just Rot, i.e. of course wills aren’t actively enforced by the government and someone must actively initiate a legal process to deal with violations of it, and a single mismanagement or simply just forgetting could irreversibly damage the preservation—so the protagonist will have to create some very strong institutional structures & culture (+ acquire resources required for it) that would incentivize the long-term enforcing of the will
  2. periods of instability (eg war). Also includes religion—I don’t imagine a will based on wanting to preserve oneself to cheat death and live forever would go super well.
- And sufficiently further back in time, there won’t be a reasonably stable line of governance and rules of law in any civilization that extends to the modern day to expect any sort of preservation of will that is this complicated—hm. now that I write this, I realize that having counterfactually accelerated the relevant pipelines for enabling cryopreservation pre-industrial-revolution probably already makes predictions of specific historical events based on basic foreknowledge of past events difficult, meaning the specific question of “whether there exists a civilization at the time that the protagonist got isekaied into that has governance structures that lasted (to our present) for centuries” is sort of moot—so they’ll have to extrapolate carefully the kind of impact their acceleration will have in the trajectory of current-to-protagonist-civilizations and strategize accordingly.
- I’m sure it’s still possible even if thousands of years ago—still easier than the problems of Dr. Stone—just Shut up do the impossible! Skills and knowledge like physics, engineering (theory and the practical knowledge to the extent needed to bootstrap large parts of the industrial revolution), charisma and social skill, knowledge of older languages and dialects, history and anthropology, financial skills, etc etc would now all be pretty much necessary (and the amount of foreknowledge that the protagonist brings with them starts to matter especially more).
  - I would imagine being isekaied here would make for the most interesting story & problem.
- All of this is plausibly much, much easier with chemical-based brain preservation, supposing the protagonist brings the relevant knowledge with them.
  - … actually, considering the much more realistic scenario of a protagonist not already being an expert in brain preservation (but perhaps knows some physics & engineering & chemistry & biology) and not knowing, for the case of chemical-based brain preservation, what specific fixative to use, and for the case of cryogenic preservation, what cryopreservant to use, the protagonist will have to acquire this knowledge with Science—and doing so has the common pipeline of bootstrapping sufficient chunks of the industrial pipeline necessary to get things like good microscopes or metals made. Given this scenario, I am unsure which preservation method is easier—though I’m sure our protagonist will pursue them both.
  - (your regular reminder that chemical-based brain preservation is free)

Dalcy 25 Dec 2025 10:33 UTC
3 points
0
in reply to: wingspan’s comment on: The unreasonable deepness of number theory
I also find instances of this phenomenon very interesting. One example I can think of is in differential geometry where statements about manifolds are often easier proved (and in many cases, so far only proved) by routing through Riemannian geometry (since via partitions of unity, a manifold can always be given a Riemannian metric) and manipulating the manifold primarily through its metric structure (eg Ricci Flow) - even though the original statement doesn’t invoke metrics at all!

Dalcy 21 Dec 2025 20:54 UTC
3 points
1
in reply to: wingspan’s comment on: wingspan’s Shortform
In some way this is a great loss for students, being a new source of temptation (like the internet & stackexchange) to just look at the answers instead of actually eating your vegetables. It seems like a harder environment these days to Build Character.

Dalcy 15 Dec 2025 9:57 UTC
2 points
0
in reply to: GenericModel’s comment on: The Axiom of Choice is Not Controversial
Consider my vote to be placed on writing that megasequence, I know next to nothing about large cardinals and would be eager to know more about them!

Dalcy 15 Dec 2025 9:32 UTC
2 points
0
in reply to: GenericModel’s comment on: The Axiom of Choice is Not Controversial
Agree with the last sentence. I think in a majority of the fields, lines of investigation with higher insight-per-effort, in the current margin, are those done with choice (or with even more controversial things like the large cardinal axioms).
Edit: this comment by Terence Tao also expresses a similar perspective:
In general, it seems that infinitary methods are good for “long-range” mathematics, as by ignoring all quantitative issues one can move more rapidly to uncover qualitatively new kinds of results, whereas finitary methods are good for “short-range” mathematics, in which existing “soft” results are refined and understood much better via the process of making them increasingly sharp, precise, and quantitative. I feel therefore that these two methods are complementary, and are both important to deepening our understanding of mathematics as a whole.

Dalcy 14 Dec 2025 18:41 UTC
8 points
7
in reply to: GenericModel’s comment on: The Axiom of Choice is Not Controversial
I think you misinterpreted me—my claim is that working without choice often reveals genuine hidden mathematical structures that AC collapses into one. This isn’t just an exercise in foundations, in the same way that relaxing the parallel postulate to study the resulting inequivalent geometries (which were equivalent, or rather, not allowed under the postulate) isn’t just an “exercise in foundations.”
Insofar as [the activity of capturing natural concepts of reality into formal structures, and investigating their properties] is a core part of mathematics, the choice of working choice-free is just business as usual.

Dalcy 14 Dec 2025 10:50 UTC
24 points
8
on: The Axiom of Choice is Not Controversial
My understanding is that yes, axiom of choice (or more generally non-constructive methods) is convenient and it “works”, and if you naively take definitions and concepts from those realm and see what results / properties hold when removing the axiom of choice (or only use constructive methods), many of the important results / properties no longer hold (as you mentioned: Tychonoff, existence of basis, … ).
But it is often the case that you can redevelop these concepts in a choice-free / constructive context in such a way that it captures the spirit of what those definitions and concepts originally intended to capture, and yes it is harder this way, but 1) doing so often lets one recover the “morally correct” equivalent of those results / properties that do in fact hold in this context, and more importantly, 2) doing so has a lot of conceptual value.
For example, equivalent definitions become non-equivalent (such as finiteness; trying to do computable analysis and make sense of the intermediate value theorem in this context leads to new ideas like locale theory, abstract stone duality, overtness (dual of compactness, which is trivial classically), etc) where each has different and new interpretation, and the role of computability and approximation is made explicit which requires bringing in new / additional mathematical structures, etc. Also, many classical theorems have their choice-free / constructive equivalent, eg Tychonoff’s theorem for locales (arbitrary coproduct of compact frames is compact—no axiom of choice required to prove!) - and all of this gives us new and sometimes deep insight about the concept that would have been overlooked in the classical realm^[1].
To put it differently: Choice turns structure into property. Without choice, we can instead treat those structure as additional data. This lets various theorems re-emerge, often in many non-equivalent forms—and this is good.
See also: Five Stages of accepting Constructive Mathematics, Expanding the domain of discourse reveals structure already there but hidden.
1. ^
  I have only heard of these examples (I am not at all familiar with them) in the context of constructive / computable analysis, but I expect this to be a lesson that holds broadly throughout mathematics (and more narrowly: that it is possible to come up with a “morally correct” choice-free equivalent of the theory that in its current form crucially depends on choice, and that this gives new conceptual insights), and this not having been done already for some subject X is more of an issue of lack-of-mathematician-years put into it.

Dalcy 11 Dec 2025 4:49 UTC
8 points
1
on: Rock Paper Scissors is Not Solved, In Practice
Another solution: Practice dynamic visual acuity and predict the opponent’s move via their hand shape.
The extreme version of this strategy looks like this robot.
The human version of this strategy (source) is to realize that rock is the weakest (since it is easy to recognize as there is no change in hand shape over time, given that the default hand-state is usually rock), and so conclude that the best strategy is to play paper if you recognize no change in hand shape, and play scissor if you recognize any movement (because it means it’s either paper or scissor, and scissor gives win or draw)^[1].
1. ^
  This is of course vulnerable to exploitation once the opponent knows you’re using this and they also have good dynamic visual acuity (eg opponent can randomize the default hand-state, diagonalize against your heuristic by inserting certain twitches to their hand movement, etc).

Dalcy 2 Dec 2025 22:16 UTC
7 points
3
on: Darcy’s Shortform
Update to my last shortform on “Why Homology?”
My current favorite frame for thinking about homology is as fixing Poincare’s initial conception of “counting submanifolds up to cobordism”. (I’ve learned this perspective from this excellent blog post, and I summarize my understanding below.)
In Analysis Situs, Poincare sought to count m-submanifolds of a given a n-manifold up to some equivalence relation—namely, being a boundary of some (m+1)-submanifold, i.e. cobordism. I personally buy cobordism as a concept that is as natural as homotopy for one to have come up with (unlike the initially-seemingly-unmotivated definition of “singular homology”), so I am sold on this as a starting point.
Formally, given a n-manifold $X$ and m-submanifolds (disjoint) $M_{1}, \dots, M_{m}$ , being cobordant means there’s a (m+1)-submanifold $W$ such that $\partial W = M_{1} ⊔ \dots ⊔ M_{m}$ . These may have an orientation, so we can write this relation as a formal sum $\sum_{i = 1}^{m} c_{i} M_{i} \sim 0$ where $c_{i} = \pm 1$ . Now, if there are many such (m+1)-submanifolds for which the $M_{i}$ form a disjoint boundary, we can sum all of these formal sums together to get $\sum_{i = 1}^{m} a_{i} M_{i} \sim 0$ where $a_{i} \in Z$ .
Now, this already looks a lot like homology! For example, above already implies $M_{i}$ themselves have empty boundary (because manifold boundary of manifold boundary is empty, and $M_{i}$ are disjoint). So if we consider two formal sums $\sum_{i = 1}^{m} a_{i} M_{i}$ and $\sum_{i = 1}^{m} b_{i} M_{i}$ to be the same if $\sum_{i = 1}^{m} (a_{i} - b_{i}) M_{i} \sim 0$ , then 1) we are considering formal sums of $M_{i}$ with empty boundary 2) up to being a boundary of a (m+1)-dimensional manifold. This sounds a lot like $ker \partial / im \partial$ - though note that Poincare apparently put none of this in a group theory language.
So Poincare’s “collection of m-submanifolds of $X$ up to cobordism” is the analogue of $H_{m} (X)$ !
But it turns out this construction doesn’t really work for some subtle issues (due to Heegaard). This led Poincare to a more combinatorial alternative to this cobordism idea that didn’t face these issues, which became the birth of the more modern notion of simplicial homology.
(The blog post then describes how Poincare’s initial vision of “counting submanifolds up to cobordism” can still be salvaged (which I plan to read more about in the future), but for my purpose of understanding the motivation behind homology, this is already very insightful!)
What links here?
- Dalcy's comment on Dalcy’s Shortform by Dalcy (2 Dec 2025 21:33 UTC; 4 points)

Dalcy 2 Dec 2025 21:33 UTC
4 points
0
in reply to: Algon’s comment on: Darcy’s Shortform
It was a good idea!

Dalcy 24 Nov 2025 21:48 UTC
7 points
0
in reply to: James Camacho’s comment on: Maybe Insensitive Functions are a Natural Ontology Generator?
Perhaps relevant: An Informational Parsimony Perspective on Probabilistic Symmetries (Charvin et al 2024), on applying information bottleneck approaches to group symmetries:
… the projection on orbits of a symmetry group’s action can be seen as an information-preserving compression, as it preserves the information about anything invariant under the group action. This suggests that projections on orbits might be solutions to well-chosen rate-distortion problems, hence opening the way to the integration of group symmetries into an information-theoretic framework. If successful, such an integration could formalise the link between symmetry and information parsimony, but also (i) yield natural ways to “soften” group symmetries into flexible concepts more relevant to real-world data — which often lacks exact symmetries despite exhibiting a strong “structure” — and (ii) enable symmetry discovery through the optimisation of information-theoretic quantities.

Dalcy 24 Nov 2025 21:37 UTC
17 points
0
on: Maybe Insensitive Functions are a Natural Ontology Generator?
Have you heard of Rene Thom’s work on Structural Stability and Morphogenesis? I haven’t been able to read this book yet^[1], but my understanding^[2] of its thesis is that: “development of form” (i.e. morphogenesis, broadly construed, eg biological or otherwise) depends on information from the structurally stable “catastrophe sets” of the potential driving (or derived from) the dynamics—structurally stable ones, precisely because what is stable under infinitesimal perturbation are the only kind of information observable in nature.
Rene Thom puts all of this in a formal model—and, using tools of algebraic topology, show that these discrete catastrophes (under some conditions, like number of variables) have a finite classification, and thus (in the context of this morphological model) is a sort of finitary “sufficient statistics” of the developmental process.
This seems quite similar to the point you’re making: [insensitive / stable / robust] things are rare, but they organize the natural ontology of things because they’re the only information that survives.
… and there seems to be the more speculative thesis of Thom (presumably; again, I don’t know this stuff), where geometric information about these catastrophes directly correspond to functional / internal-structure information about the system (in Thom’s context, the Organism whose morphogenic process we’re modeling) - this presumably is one of the intellectual predecessors of Structural Bayesianism, the thesis that there is a correspondence between internal structures of Programs or the Learning Machine with the local geometry of some potential.
1. ^
  I don’t think I have enough algebraic topology background yet to productively read this book. Everything in this comment should come with Epistemic Status: Low Confidence.
2. ^
  From discussions and reading distillations of Thom’s work.

Dalcy 20 Nov 2025 21:14 UTC
3 points
0
in reply to: Algon’s comment on: Darcy’s Shortform
Thank you for the suggestion! That sounds like a good idea, this thread seems to have some good recommendations, will check them out.

Dalcy 20 Nov 2025 12:50 UTC
37 points
0
on: Darcy’s Shortform
Learning algebraic topology, homotopy always felt like a very intuitive and natural sort of invariant to attach to a space whereas for homology I don’t think I have anywhere as close of an intuitive handle or sense of naturality of this concept as I do for homotopy. So I tried to collect some frames / results for homology I’ve learned to see if it helps convince my intuition that this concept is indeed something natural in mathspace. I’d be very curious to know if there are any other frames or Deeper Answers to “Why homology?” I’m missing:
1. Measuring “holes” of a space
  - Singular homology: This is the first example I encountered, which will serve as intuition / motivation for the later abstract definitions.
  - Fixing some notations (feel free to skip this bullet point if you’re familiar with the notations):
    Let’s fix some space $X$ , and recall our goal associating to that space an algebraic object invariant under homeomorphism / homotopy equivalence.
    First, a singular $p$ -simplex is a map $σ : Δ^{p} \to X$ , intuitively representing a simplex living inside the space $X$ . So there is a natural $σ^{(i)} : Δ^{p - 1} \to X$ map which represents each of the $i$ faces. Then, it is natural to consider the set ${σ^{(i)}}_{i = 0}^{p}$ as representing the “boundary” of the singular p-simplex $σ$ .
    To make this last idea more precise, we define singular p-chain, which is a free abelian group generated from all the singular p-simplicies of a space, denoted $Δ_{p} (X)$ . In short, its elements look like (finite) formal sums $\sum_{σ : Δ^{p} \to X} n_{σ} σ$ . A singular p-simplex $σ$ is naturally an element of this group via $1 σ \in Δ_{p} (X)$ .
    This construction, again, is motivated by the boundary idea earlier, since we now can define the boundary of a singular p-simplex $σ$ as formal sum $\sum_{i = 0}^{p} σ^{(i)} \in Δ_{p - 1} (X)$ .
    In fact, the boundary of a singular p-simplex $σ$ is actually $\sum_{i = 0}^{p} (- 1)^{i} σ^{(i)} \in Δ_{p - 1} (X)$ .
    Why? Intuition: if we draw these $σ^{(i)}$ of simple shapes like triangles (so $σ : Δ^{2} \to X$ , hence $σ^{(i)} : Δ^{1} \to X$ , which is identified with a (directed) edge), we will note that they are oriented kind of weird, contra our intuition that the “boundary” of a triangle ought to be these directed edges that are oriented consistently, clockwise or counterclockwise. The signs correct this.
    So, we now generally define the boundary of a singular p-chain (not just a simplex) as $\partial_{p} : Δ_{p} (X) \to Δ_{p - 1} (X)$ where $\sum_{σ} n_{σ} σ \mapsto \sum_{σ} n_{σ} \partial σ$ , i.e. the obvious extension of the earlier map as group homs of the free group.
    Also, brute force application of the definitions show that $\partial_{p} \circ \partial_{p + 1} = 0$ , which matches the intuition that a boundary of a boundary should be empty. Yet another motivation for the sign fix earlier!
  - Let’s collect all the objects so far: $(Δ_{*} (X), \partial_{*})$ . Then, abstractly, we associated to a topological space X, a collection of groups and maps inbetween of order 2 (i.e. $\partial_{p} \circ \partial_{p + 1} = 0$ ). We call this object a singular complex.
  - The singular chain groups $Δ_{*} (X)$ are obviously invariant under homeomorphism! But it’s too large to be useful invariants.
  - So the natural thing to do is to take some quotients and make them smaller. Conveniently, note that $\partial_{p} \circ \partial_{p + 1} = 0$ , so $im \partial_{p + 1} \subseteq ker \partial_{p}$ , and because they’re abelian, we can take the quotient.
    This is the part where it’s like measuring holes—this video explains it nice.
  - So our singular complex $(Δ_{*} (X), \partial_{*})$ induces a new collection of groups, $H_{*} (X)$ where $H_{p} (X) := ker \partial_{p} / im \partial_{p + 1}$ . This is the p-th singular homology group of $X$ , also a homotopy invariant.
  - So singular homology has two objects: 1) singular chain complex $(Δ_{*} (X), \partial_{*})$ , and 2) singular homology groups $H_{*} (X)$ .
2. Homological algebra on topological spaces
  - We can abstract the structure of singular homology, and talk of “homology theory” in general as anything with the following data:
    chain complex $(C_{*}, \partial_{*})$ - a collection of some groups indexed by the integers and group homomorphisms between consecutive groups of order 2: $\partial_{p} \circ \partial_{p + 1} = 0$ .
    homology groups $H_{*} := ker \partial_{*} / im \partial_{* + 1}$ (note that the definition of our abstract chain complex above suffices to make this well-defined)
    (Note this doesn’t invoke anything about topology—though these chain complexes often arise from topological spaces, as in the singular homology example.)
  - Homology from this perspective is then basically a functor that assigns, to some object of interest, a “chain” of groups that are connected by maps $\partial$ such that they vanish in order 2 $\partial_{p} \circ \partial_{p + 1} = 0$ , which implies $im \partial_{p + 1} \subseteq ker \partial_{p}$ . But not necessarily exactness, i.e. $im \partial_{p + 1} = ker \partial_{p}$ . Homology, then, measures the failure of exactness via taking quotients.
  - Studying properties of chain complexes and their homology groups on their own as algebraic objects (without caring about where they came from) is called homological algebra, and apparently it shows up in various places in mathematics.
    So given this assumption of homological algebra’s utility, one could expect that it would be useful to do homological algebra on topological spaces by finding a way to construct chain complexes from spaces, and singular complexes for example does that.
    (though from a historical perspective this reason for framing the utility of homology in topology feels like double-counting evidence; though might be a frame that convinces an expert who is already convinced of the utility of homology in other fields but doesn’t know algebraic topology?)
3. Elienberg-Steenrod axioms
  - Going back to “homology” for topological spaces, we can abstract them from singular homology alternatively via an axiomatic approach using the Eilenberg-Steenrod axioms, which are axioms that a Top to Grp should satisfy. Showing that singular homology satisfies these axioms is somewhat difficult (specifically, the Excision axiom), but once this is done, it’s easy to prove various things about singular homologies directly from the axioms.
  - Turns out, for nice topological spaces (CW complex), Eilenberg-Steenrod axioms fully characterize the homology of that space up to isomorphism!!! Singular homology is an example, but if you hand me some other chain complex and homology induced from that space satisfying the axioms, then their homology should match that of singular homology.
    This is quite impressive! It really seems like “homology,” as a concept, isn’t “ad hoc” (as one might feel when first learning about singular homology), but rather something deep & universal about spaces, as homotopy is?
    (going back to 1. Measuring holes, we can then add other examples of chain complexes and the resulting homology groups, of topological spaces, aside from singular homology—the standard ones are: cellular homology, simplicial homology. Why care about multiple homology theories of topological spaces that give you isomorphic homology groups (at least for nice spaces)? Because they have comparative advantages, eg singular: easy to prove things in, cellular: easy for humans to compute with, simplicial: easy for computers to compute with, etc)
4. Homology = (Abelianized? Symmetrized? Linearized?) Homotopy
  - Hurewicz theorem gives a homomorphism between the nth homotopy group $π_{n} (X)$ and the nth homology group $H_{n} (X)$ . In particular, it says that the 1st homology group is an abelianization of the fundamental group (!!!!)
    But homotopy groups are always abelian for $n \geq 2$ , so no hopes of this abelianization connection beyond $n = 1$ (or not?)
  - Dold-Thom theorem: $H_{n} (X) ≅ π_{n} S P (X)$ for CW complex $X$
    $S P (X)$ is the infinite symmetric product space of $X$ (in short, take finite products of $X$ and mod by permutation—and given such collection, take a direct limit).
    This is crazy!
  - Dold-Kan correspondence (?)

Dalcy 19 Nov 2025 23:39 UTC
8 points
0
in reply to: Richard_Ngo’s comment on: ricraz’s Shortform
Not exactly about adversarial error correction, but: there is a construction (Çapuni & Gács 2021) of a (class of) universal 1-tape (!!) Turing machine that can perform arbitrarily long computation subject to random noise in the per-step action. Despite the non-adversarial noise model, naive majority error correction (or at least their construction of it) only fixes bounded & local error bursts—meaning it doesn’t work in the general case, because even though majority vote reduces error probability, the effective error rate is still positive, so something almost surely goes wrong (eg error burst of size greater than what majority vote can handle) as $T \to \infty$ .
Their construction, in fact, looks like a hierarchy of simulated turing machines where the higher-level TM is simulated by a level below it but at a bigger tape scale, such that it can resist larger error bursts—and the overall construction looks like “moving” the “live simulation” of the actual program that we want to execute up the hierarchy over time to coarser and more reliable levels.

Dalcy 2 Nov 2025 18:29 UTC
4 points
0
on: Darcy’s Shortform
Becoming Stronger™ (Oct 13 - Nov 2)
Notes and reflections on the things I’ve learned while Doing Scholarship the last ~~two~~ three weeks (i.e. studying math).
(EDIT (Nov 18): I will post these less frequently, maybe once a month or two, and also make it more self-contained in context, since journal-like content like this probably isn’t all that useful for most people. I will perhaps make a blog for more regular learning updates.)
The past three weeks were busier than usual so I had slower progress this time but here it is:
Chapter 6 continued: Sard’s theorem
- Tubes! I might have thought the fact that you can embed manifolds in $R^{N}$ might have been one of those theorems whose main values are conceptual but not that useful in practice (such as Cayley’s theorem and the famous quote that the fact that any group is a subgroup of a symmetry group never actually made the task of studying & classifying groups easier) - but that is not the case, due to tubular neighborhoods.
  - The main issue with trying to solve problems about smooth maps between manifolds by embedding the codomain manifold in $R^{N}$ is that the resulting construction may not lie in the original manifold. Tubular neighborhoods address this by showing that it is always possible, given an embedding of a manifold in $R^{N}$ , to come up with a tube-like open neighborhood of the manifold, equipped with a retraction map, i.e. a map from this neighborhood to itself such that it is an identity when restricted to the manifold.
  - So conceptually, tubular neighborhoods let us reason about manifold problems by solving them in $R^{N}$ , and bringing it back to manifolds.
- Applications are massive! Might be my favorite concept so far.
  - Whitney approximation theorem (any continuous map is homotopic to a smooth one. If the original continuous map is smooth on a closed subset, the homotopy can be taken relative to it)
    Proof: Prove it for the case when the codomain is $R^{m}$ , and just compose it with the tubular neighborhood retraction.
  - If a compact manifold has a nonzero vector field, then there exists a map to itself without fixed points
    Proof: Embed everything (the manifold, its tangent spaces) in R^m, let the map be the one that takes a point to the direction indicated by the nonzero vector field at some small magnitude $ϵ$ . This will take things outside of the manifold, so compose with the tubular neighborhood retraction (which is allowed if $ϵ$ is small enough).
- Transversality was also introduced in this chapter (though I am already somewhat familiar with this from my difftop class long time ago).
  - In a sense, it generalizes regular values. Preimage of regular values form embedded submanifolds, preimage of submanifolds transversal to the map leads to embedded submanifolds.
  - Two submanifolds X, Y \subseteq M being transverse formalize the notion of them intersecting in a “generic manner.”
  - Transversality Homotopy Theorem: Given any embedded submanifold X \subseteq M, any smooth map is homotopic to another smooth map which is transverse to X.
    This shows that transversality is generic. Very important for the study of stable property of smooth maps.
Chapter 8: Vector Fields
- Vector fields a section $X : T M \to M$ of the projection map of the tangent bundle $π : T M \to M$ . Very elegant definition! Also lets me better appreciate the smooth structure defined over the tangent bundle, which automatically implies that a vector field is smooth iff their coordinate functions are smooth, which is what should morally be true.
Chapter 10: Vector Bundles
- Vector bundles are objects that locally look like product spaces / vector-space “fibers” attached to each point of a manifold. Comb-like picture.
Chapter 11: Cotangent Bundle
- I finally understand what covectors are and what’s their point. Cotangent bundle is just the dual of the tangent bundle. Covector takes a tangent vector and returns a number.
- Applications:
  - Gradient of a smooth map, defined as a vector field where (under a given coordinate chart) the components of a function’s partial derivatives, is not invariant under change of coordinate chart, so it’s ill-defined. But when defined as a covector field, it is well-defined. This makes sense, since a “gradient of a function at a point” is morally an object that you take the inner product with a direction to return a scalar (directional derivative), thus it must be a covector that takes a tangent vector (“direction vector”) and returns a number.
Chapter 13: Riemannian Manifold
- Smooth manifolds are metrizable. Why?
  - Broke: Manifold is locally compact Hausdorff second countable. Locally compact Hausdorff ⇒ completely regular. By Urysohn metrization theorem, completely regular & second countable ⇒ metrizability.
  - Woke: Use the distance metric on the manifold induced by the Riemannian structure, which always exists on manifold. This is much more intuitive.
Then I read some Bredon for Algebraic Topology.
- Turns out, tubular neighborhoods are also useful for algebraic topology: How to show that a sphere $S^{n}$ is not a retract of a disc $D^{n + 1}$ ?
  - Assume such a retract $f : D^{n + 1} \to S^{n}$ exists. Scale it and compose it appropriately to make it a radial projection ( $z \mapsto \frac{z}{| | z | |}$ ) near the boundary $S^{n}$ , and f but rescaled in the inside of the disc (easy to do). Smooth out the map in the inside via the Whitney approximation theorem. By Sard’s theorem, there is a point $z \in S^{n}$ that is a regular value of $f$ . Its preimage $f^{- 1} (z)$ , then, is a 1-dimensional manifold with boundary, where $z$ is the only boundary (via radial projection). This contradicts the classification theorem of compact 1-manifolds which in particular says they have even number of boundary points.
  - From this follows Brouwer’s fixed point theorem ( $D^{n} - > D^{n}$ has fixed point), the fact that the sphere $S^{n}$ is not contractible, etc.
- I should more easily skip sections that I don’t understand. I struggled a bit with the first sub-section of the Fundamental Group chapter because it introduced the general notion of $[S X; Y]_{*}$ (the basepoint-preserving homotopy class of pointed maps $S X \to Y$ where $S X$ refers to the reduced suspension $X \times I / (X \times \partial I \cup {x_{0}} \times I)$ ), for which the nth homotopy group becomes a special case $[S S^{n - 1}; Y]_{*} = [S^{n}; Y]_{*}$ , for which the fundamental group is a special case, which is the only thing that is really needed until like very late in the book.
  - But thanks to muddling through, I think I much better understand this construction and motivation for constructions like the reduced suspension.
- Fundamental group & Covering spaces & Lifting theorems & Deck Transformation
  - Mostly a review. Functors are just really natural objects (no wonder why they came from algebraic topology).
    Specifically, the functor that transforms $Y$ to $[S X; Y]_{*}$ and $g : Y \to Z$ to $g_{#} : [S X; Y]_{*} \to [S X; Z]_{*}$ where $f \mapsto g \circ f$ . Homotopy groups (including the fundamental group) are a special case of this.
  - Covering spaces are an important tool for calculating fundamental groups. Also covering spaces have a general lifting theorem characterizing when maps can be lifted by the covering map.
    Deck transformation and fundamental group are resp right / left actions on the fiber of the covering map, and they are commuting actions.
- (Singular) Homology
  - This is new. Has the advantage of not caring about base-points. Weaker than homotopy.
- Hatcher and Bredon are great complements. I saw a lot of anti-recommendations for Hatcher, but I think they’re great together.
  - Hatcher provides great intuition (geometric and conceptual) that Bredon just never really talked about. eg geometrically visualizing a deformation retract of a shape onto its skeleton by extruding the map in 3d mirrors the mapping cylinder construction, which is something I learned from Bredon but never (so far) learned the motivation for.
    Many such examples and expositional niceness that helped reorganize my ontology (eg motivation-of-concept-wise, deformation retract comes prior to mapping cylinder, which comes prior (and motivates) the more general notion of homotopy and retracts. From this, it becomes intuitively clear why eg deformation retract should imply homotopy equivalence. Bredon taught the concepts the opposite way I think.)

Dalcy 2 Nov 2025 18:21 UTC
3 points
0
in reply to: Lucius Bushnaq’s comment on: Darcy’s Shortform
There’s a couple easy ones, like low rank structure, but I never really managed to get a good argument for why generic symmetries in the data would often be emulatable in real life.
Right, I expect emulability to be a specific condition enabled by a particular class of algorithms that a NN might implement, rather than a generic one that is satisfied by almost all weights of a given NN architecture^[1]. Glad to hear that you’ve thought about this before, I’ve also been trying to find a more general setting to formalize this argument beyond the toy exponential model.
Other related thoughts^[2]:
- Maybe this can help decompose the LLC into finer quantities based on where the degeneracy rises from—eg a given critical point’s LLC might come solely from the degeneracy in the parameter-function map, some from one of the multiple groups that the true distribution is invariant under at order r, other from an interaction of several groups, etc (sort of Mobius-like inversion)
- And perhaps it’s possible to distinguish / measure these LLC components experimentally by measuring how the LLC changes as you perturb the true distribution $q (x)$ by introducing new / destroying existing symmetries (susceptibilites-style).
1. ^
  This is more about how I conceptually think they should be (since my motivation is to use their non-genericity to argue why certain algorithms should be favored over others), and there are probably interesting exceptions of symmetries that are generically emulatable due to properties of the NN architecture (eg depth).
2. ^
  Some of these ideas were motivated following a conversation with Fernando Rosas.

Dalcy 29 Oct 2025 20:36 UTC
5 points
0
in reply to: koanchuk’s comment on: koanchuk’s Shortform
one goal whose attainment is not only impossible to observe
This part doesn’t sound that unique? It’s typical for agents to have goals (or more generally values) that are not directly observable (cf Human values are a function of Humans’ latent variables), and very often they only have indirect evidence about the actualization of those goals / values (which may be indirect evidence for their actualization in the distant future at which the agent may not even exist to even potentially be able to observe) - such as my philanthropic values extending over people I will never meet and whose well-being I will never observe.

Dalcy 23 Oct 2025 17:12 UTC
15 points
6
in reply to: titotal’s comment on: The Doomers Were Right
Doomers predicted that the Y2K bug would cause massive death and destruction. They were wrong.
This seems like a misleading example of doomers being wrong (agree denotationally, disagree connotationally), since I think it’s plausible that Y2K was not a big deal (to such an extent that “most people think it was a myth, hoax, or urban legend”) precisely because of the mitigation efforts stemmed by the doomsayers’ predictions.

Dalcy 13 Oct 2025 4:43 UTC
6 points
0
on: Darcy’s Shortform
Becoming Stronger™ (Sep 28 - Oct 12)
Notes and reflections on the things I’ve learned while Doing Scholarship the last two week (i.e. studying math).
Mostly the past two weeks were on differential geometry (Lee):
- Ch 4 (Submersion, Immersion, Embedding) comments:
  - Conceptually, by the Constant rank theorem, constant rank maps (smooth maps whose differential $d F_{p} : T_{p} M \to T_{F (p)} N$ is constant rank at all $p$ ) are precisely the maps with a linear local coordinate representation (thus are maps well-modeled locally by its differentials).
    Basically a nonlinear version of the linear algebra theorem that any square matrix can be expressed as $[\begin{matrix} I_{r} & 0 0 & 0 \end{matrix}]$ . The proof is much more complicated however: basically a clever choice of coordinate transformation via the inverse function theorem.
  - The point of the chapter is to come up with various characterizations of submersion, immersion, embedding. For example, 1) smooth immersion iff locally smooth embedding, 2) smooth submersion iff every point is an image of a local section, 3) surjective maps ⇒ submersion & injective ⇒ immersion …
    The proof of 3) is a very cool application of the Baire category theorem. Baire category theorem says the countable union of nowhere dense sets has empty interior; this is not very motivating, but reading Bredon^[1] helped clarify its conceptual significance.
    Namely, consider the more illuminating contrapositive statement: countable intersection of dense open sets is dense. Conceptually, the space is some configuration space, and dense open sets represent configurations that satisfy certain generically satisfied constraints (polynomial $p (x)$ being nonzero is a prototypical example, which is a dense & open set). Then, the question is whether the property of a countable number of these constraints being satisfied at the same time is still generic, i.e. dense. The Baire category theorem says this is indeed the case (for locally compact Hausdorff spaces).
  - Sections are just right inverses, and their intuitive geometric content was a bit confusing until I read the wikipedia page: a section of f is an abstraction of a graph by viewing f as a sort of “projection map.” That makes sense! I’m sure this will come up later in the fiber bundle context.
  - The “figure-eight curve” and “dense torus map” as prototypical examples of smooth immersions that isn’t a smooth embedding, due to topological considerations.
- Ch 5 (Submanifold) comments:
  - Similar to Ch 4, many useful characterizations of submanifolds and how to generate them. eg embedded submanifold iff locally a “slice” of the ambient manifold’s coordinate chart. embedded submanifold iff image of smooth embedding, immersed submanifold iff image of smooth immersion. Level sets of a smooth map at a “regular value” are embedded submanifolds …
- Ch 6 (Sard’s theorem) comments:
  - Finally, one of the more fun chapters! Finally learned the proof of the Whitney embedding / immersion theorem that I’ve heard a lot about.
  - The compact case of the Whitney embedding theorem is much more conceptually straightforward:
    Given a $m$ (finite, possible since compact) chart of the $n$ -dim manifold, literally just adjoin them while multiplying them with appropriate partitions of unity to get a $M \to R^{n m}$ map, and adjoin the m partitions of unity (a “chart indicator variable”) to get a $M \to R^{n m + m}$ map. This turns out to be an immersion, and thus an embedding since M is compact.
    Apply the projection map $R^{N} \to R^{N - 1}$ with a 1-dim kernel $R v$ . By Sard’s theorem, this turns out to be an immersion (when restricted to $M$ ) for almost any choice of $v$ , as long as $N > 2 n + 1$ . Repeatedly apply this to the massive codomain $M \to R^{n m + m}$ to get an immersion to $R^{2 n + 1}$ .
    This projection map can in fact be promoted to an embedding, given that the original immersion of M to R^n is an embedding.
  - High-level takeaways:
    The most dumb and obvious way of interpolating coordinate charts into a global map via partitions of unity, with slight modifications, gives a bona fide immersion of a manifold into $R^{N}$ !!
    It was interesting to learn that there was a 1-2 decade period of foundational uncertainty (between the first proposal of the abstract manifold definition and Whitney’s above proof) where people didn’t know whether the abstract manifold definition was actually more general than $R^{N}$ or not.^[2]
    Partitions of unity really is used everywhere. I wonder how the theory of complex analytic manifolds ever do anything when analytic partitions of unity don’t exist.
    Proof strategy of promoting a smooth map to a proper map (at the cost of increased dimensionality of the codomain) by literally adjoining a proper map next to it. Clever!
    I presume this is the main motivation behind exhaustion functions ( $f : M \to R$ s.t. $f^{- 1} ((- \infty, c])$ is compact $\forall c \in R$ ). It’s a proper map, it exists for any manifolds (again, shown by partitions of unity), and has codomain of dimension 1 so it minimally increases the function codomain dimension.
  - More applications on Whitney approximation theorems and transversality arguments.
    The latter, including the transversality homotopy theorem (actually learned this a year ago in my difftop class, though that class used Guillemin’s book where manifolds are always embedded in $R^{N}$ - so it’s good to learn them from a more intrinsic perspective) is very interesting.
    It also ties to one of my motivation for all this math learning, backchaining from trying to do good alignment theory work, which is learning the math of structural stability and its role in the theory of forms (morphogenesis) cf Thom, Structural Stability and Morphogenesis (thank you Dan Murfet for explaining this perspective).
Rabbit holes that I could not afford to pursue:
- The category of smooth manifolds is an idempotent-splitting completion of the category of open subspaces of findim cartesian spaces?!?!?! My mind is blown.
  - So much more elegant than the standard definition via charts and maximal smooth structures and such. Unsure of the utility of this characterization though, lol (read Lawvere’s paper).
- There is a duality between the category of smooth manifolds and the category of R-algebras. Fascinating how such dualities between algebra and geometry seem to be a very common motif throughout different fields, I’m sure this will come up in Vakil’s book later. Also curious about Gelfand’s duality on this for topological spaces.
- “It is better to have a good category with bad objects than a bad category with good objects.”—Grothendieck (probably not). For example, the category of smooth manifolds is not nice, motivating smooth sets, diffeological spaces, and so on.
  - Dichotomy between nice objects and nice categories: in the context of alignment theory, maybe I can view Programs as Singularities as enlarging an instantiation of this idea by enlarging the class of Turing machines.
- I found this intuition for adjoint functors illuminating. Specifically, note set maps $f : X \to Y$ and $g : Y \to X$ being inverses are equivalent to the condition that their graphs are mirrored along the diagonal, i.e. $(x, f (x)) = (g (y), y)$ . Rephrase this using Kronecker delta, $δ (x, g (y)) = δ (f (x), y)$ . Now $δ$ can be seen as expressing a “relation” that could be exhibited by two elements of a set, i.e. equality (1) or inequality (0). But in general categories, objects can exhibit more relations—so replace $δ$ by $H o m$ - you get adjoint functors!
1. ^
  Example of how reading books in parallel improves learning efficiency.
2. ^
  Why that long? The dimensionality reduction by projection is perhaps more nontrivial because of Sard, but the obvious gluing should have been sufficient to construct an immersion at least, albeit at the cost of inefficient codomain dimension. Maybe the historically difficult part was the concept of partition of unity and that it always exist in manifolds?

Dalcy

Becoming Stronger™ (Oct 13 - Nov 2)

Becoming Stronger™ (Sep 28 - Oct 12)