Mateusz Bagiński
I think it might have been kinda the other way around. We wanted to systematize (put on a firm, principled grounding) a bunch of related stuff like care-based ethics, individuality, identity (and the void left by the abandonment of the concept of “soul”), etc., and for that purpose we coined the concept of (phenomenal) consciousness.
My best and only guess is https://www.philiptrammell.com/
I think the parentheses are off here. IIUC, you want to express the equality of divergences, not divergences multiplied by probabilities (which wouldn’t make sense, I think).
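To illustrate what I mean (this is my guess at the intended statement, since I’m reconstructing the formula): I read you as asserting an equality of the form

$$D_{\mathrm{KL}}\big(P \,\|\, Q\big) = D_{\mathrm{KL}}\big(P' \,\|\, Q'\big),$$

whereas the current parenthesization parses as something like $P \cdot D_{\mathrm{KL}}(\ldots)$, i.e., a divergence scaled by a probability, which doesn’t type-check as an equality of divergences.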
Typo: →
In the Alice and Bob example, suppose there is a part (call it X) of the image that was initially painted green but where both Alice and Bob painted purple. Would that mean that the part of the natural latent over images A and B corresponding to X should be purple?
Bug report: I got notified about Joe Carlsmith’s most recent post twice, the second time after ~4 hours
Can you link to what “h/acc” is about/stands for?
Can new traders be “spawned”?
I’d actually love to read a dialogue on this topic between the two of you.
The principle fails even in these simple cases if we carve up the space of outcomes in a more fine-grained way. As a coin or a die falls through the air, it rotates along all three of its axes, landing in a random 3D orientation. The indifference principle suggests that the resting states of coins and dice should be uniformly distributed between zero and 360 degrees for each of the three axes of rotation. But this prediction is clearly false: dice almost never land standing up on one of their corners, for example.
The only way I can parse this is that you are conflating (1) the position of a die/coin when it makes contact with the ground and (2) its position when it stabilizes/[comes to rest]. A die/coin can be in any position when it touches the ground, but the vast majority of those positions are unstable, so it doesn’t remain in them for long.
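To make the distinction concrete, here’s a toy simulation (my own sketch, using a crude “snap to the nearest face” settling rule rather than real physics): touchdown orientations are uniform over the sphere, but the resting states collapse onto the six faces, with probability ~0 of resting on an edge or corner.

```python
# Toy model (my sketch, not from the post): orientations at touchdown are
# uniform, but the die then settles so that the face whose outward normal
# is closest to straight up ends up as the top face.
import numpy as np

rng = np.random.default_rng(0)

# Uniform random "up" directions at the moment of touchdown.
touchdown_up = rng.normal(size=(100_000, 3))
touchdown_up /= np.linalg.norm(touchdown_up, axis=1, keepdims=True)

# Outward normals of the six faces of a cube-shaped die.
face_normals = np.array([
    [1, 0, 0], [-1, 0, 0],
    [0, 1, 0], [0, -1, 0],
    [0, 0, 1], [0, 0, -1],
], dtype=float)

# Crude settling rule: the top face is the one most aligned with "up".
resting_face = np.argmax(touchdown_up @ face_normals.T, axis=1)

# Touchdown angles are uniform; resting states collapse to ~1/6 per face.
print(np.bincount(resting_face, minlength=6) / len(resting_face))
```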
More generally, John Miller and colleagues have found that training performance is an excellent predictor of test performance, even when the test set looks fairly different from the training set, across a wide variety of tasks and architectures.
Counterdatapoint to [training performance being an excellent predictor of test performance]: in this paper, GPT-3 was fine-tuned to multiply “small” (e.g., 3-digit by 3-digit) numbers, which didn’t generalize to multiplying bigger numbers.
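A sketch of the kind of evaluation I have in mind (here `model_predict` is a hypothetical stand-in for querying the fine-tuned model; the paper’s actual prompt format and setup may differ):

```python
# Measure the in-distribution vs. out-of-distribution gap on multiplication.
# model_predict is a hypothetical placeholder for the fine-tuned model.
import random

def eval_multiplication(model_predict, n_digits: int, n_trials: int = 200) -> float:
    """Fraction of random n-digit-by-n-digit products the model gets right."""
    correct = 0
    for _ in range(n_trials):
        a = random.randint(10 ** (n_digits - 1), 10 ** n_digits - 1)
        b = random.randint(10 ** (n_digits - 1), 10 ** n_digits - 1)
        if model_predict(f"{a} * {b} =") == str(a * b):
            correct += 1
    return correct / n_trials

# In-distribution (matches the fine-tuning data) vs. out-of-distribution:
# acc_3 = eval_multiplication(model_predict, n_digits=3)  # high after fine-tuning
# acc_5 = eval_multiplication(model_predict, n_digits=5)  # drops sharply
```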
Yeah, that’s interesting… unlike fetishes and math, this is something other animals should (?) in principle be capable of, but apparently it’s a uniquely human thing.
Nah, IMO it’s a straightforward extrapolation of some subset of normal human values; not that different from what I would do
How do you define/measure “weird” (and strength of “want”, for that matter)?
I don’t have anything more concrete than “seemingly not in the category of things humans tend to intrinsically want”. ¯\_(ツ)_/¯
To try to give a concrete answer, I’d say suicide by an otherwise-healthy human is the weirdest desire I know of.
Yeah, that’s a good example, and it brought to my mind the obvious-in-retrospect [body integrity dysphoria]/xenomelia, where (otherwise seemingly psychologically normal?) people want to get rid of some part of their body. (I haven’t looked into it that much, but AFAIR it’s probably something going somewhat precisely wrong with the body schema?)
I tentatively agree with this view.
This still leaves open the question: “What are some uncommon/peculiar attractor states corresponding to [people seemingly terminally valuing ‘weird’ things]?”
Yeah, I linked a new version of this plot in the OP.
Something like “terminal”/”intrinsic”, i.e. not in service of any other desire.
ETA: the terminal/instrumental distinction probably doesn’t apply cleanly to humans, but think of the difference between Alice, who reads book XYZ because she really likes XYZ (in the typical ways humans like books), and Bob, who reads the same book only to impress Charlie.
[Question] What are the weirdest things a human may want for their own sake?
because in those worlds all computations in the brain are necessary to do a “human mortality”
I think you meant “human morality”
Can you give pointers to where Quine and Nozick talk about this?
I fully agree that something like persistence/[continued existence in ~roughly the same shape] is the most natural/appropriate/joint-carving way to think about whatever-natural-selection-is-selecting-for in its full generality. (At least that’s the best concept I know at the moment.)
(Although there is still some sloppiness in what it means for a thing at time t0 to be “the same” as some other thing at time t1.)
This view is not entirely novel; see, e.g., Bouchard’s PhD thesis (from 2004) or the SEP entry on “Fitness” (Ctrl+F “persistence”).
I also agree that [humans are]/[humanity is] obviously massively successful on that criterion.
I’m very uncertain as to what implications this has for AI alignment.