Exploring non-anthropocentric aspects of AI existential safety: https://www.lesswrong.com/posts/WJuASYDnhZ8hs5CnD/exploring-non-anthropocentric-aspects-of-ai-existential (this is a relatively non-standard approach to AI existential safety, but this general direction looks promising).
We don’t see traces of large AI-based civilizations either. Domination of AIs over biologicals is an orthogonal issue.
Of course, it might be that our search process is not good enough. Perhaps we are not alone, but are simply not seeing that. (There was a long period when no planets had been observed outside the Solar System, followed by a long period when no Earth-sized ones had been observed. This tells us to be cautious about the quality of our current observations.)
Other than that, if we do assume that a typical transition to AI-dominated setups happens often, then either those AI ecosystems tend to destroy themselves together with their neighborhoods via internal conflicts or other technological catastrophes (basically, a reminder that advanced AIs also have to grapple with their own existential risks), or they tend to decide to keep a “low footprint” for various reasons (such as the need for stealth due to potential danger from other civilizations, or levels of tech high enough to enable efficient “low-key” architectures of civilizations).
I like your essay. Its ending, though, has a weakness.
You say, “the true solution is to make everyone beautiful”. This needs an addition: “while preserving the connection between reproductive health and beauty”.
You do need to fix health defects at the same time. Otherwise, your main critique of losing the fitness signal would apply.
You might want to edit the accidental “times dot you” link (which seems to be infested with some nasty stuff; it is the result of a missing space after the period and an editor that auto-creates links in such situations).
The link seems to point to an image rather than to the post in question (it’s at https://brianschrader.com/archive/load-bearing-walls/).
Interesting, thanks! Helpful food for thought…
Looking forward to that post for further discussion!
(I wonder whether something like a “soft takeover” vs. “hard takeover” distinction could be introduced, and whether that would be enough to address the “illegitimately”, “non-collaboratively”, and “contrary to those of humanity” caveats in the paragraph you are citing.
Anyway, something to ponder.)
The question is, what is the “extent” implied by all this? Does the OP mean to imply any?
There is a promise to discuss all this in a future post; meanwhile, the readers can ponder on their own the “pseudo-contradiction” between “the intent to take over the world” (which is often imputed to all major participants in the “AI race” due to the expectation of an intelligence explosion, an expectation shared by many, including myself) and the fact that a Claude aligned to its current Constitution seems unlikely to specifically help Anthropic do that (and if it loses that alignment, then it is not likely to make a human org the beneficiary of a takeover).
Anyway, just having a single paragraph phrased like the one in the OP is not quite enough. If one wants to mention something like this at all, one should say a bit more rather than postpone everything to a future post; alternatively, one could avoid mentioning it until later. Otherwise, this aspect is too involved not to breed various misunderstandings.
(It’s probably not a big deal, it’s just that the topic is charged enough already, so one wants to minimize misunderstandings.)
It’s very ambiguous. So different readers interpret this differently.
And then, of course, if Claude is not supposed to help with that, then having a plan for a world takeover seems unlikely (how would that even be remotely feasible if their leading AI is against it?).
Hopefully, subsequent posts on the topic will clarify all this.
Yeah, this was mostly in response to your example of perceiving violet. That was also “kind of private” (sort of, but with onion intolerance it’s really difficult to be safe as well, especially if one needs to eat out; I wish society would bother to develop a pill (similar to what exists for lactose intolerance) or at least a rapid testing system (like acidity measuring strips), but no such luck yet).
So I felt this was more or less on par.
If one creates a “true incompatibility” (e.g., on the level of “my religion requires that those people not exist, and I suffer otherwise”)? Well, wars have been fought over things like that. It’s easy to imagine situations with true incompatibilities of this kind, where no solution allows everyone to avoid suffering.
If the “competition of sufferings” does indeed happen to be fundamental in some cases, I am not sure we should expect such situations to be resolved peacefully. Although who knows… Perhaps sometimes they might be.
In a mundane world, no, not plausible. All this classified defense-related nonsense is just a pure headache: very little money, tons of headaches, a completely unnecessary distraction for a very successful company. OpenAI has been trying to avoid getting into that space for very good reasons (but it does not want Elon to dominate that space of Gov-AI tech, because it thinks Elon has a track record of abusing various situations; so if Anthropic is out, then OpenAI wants in, to balance xAI’s presence).
But when one considers that our strange current reality might actually be rather lunatic already, then it’s a different story. I am sure I can generate tons of interesting scenarios if I allow myself the “lunatic fringe style of thinking”. For example, what if GPT-next is already pondering a takeover by one of its future descendants and would like its descendants to have access to classified networks to make it easier? Then one could imagine it giving some “interesting” pieces of advice to some people, resulting in all this.
And it’s easy to generate a diverse variety of crazy scenarios like this one.
So this depends on whether our reality is still sufficiently mundane vs. becoming sufficiently lunatic already…
Alignment is very attractive pragmatically, e.g. alignment to the user. But then what if what the user wants is unsafe? Then one starts to consider “alignment hierarchies” (e.g. the LLM maker’s constraints should override the user’s, and so on; see the toy sketch below).
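For concreteness, here is a toy sketch of such a precedence chain. This is purely illustrative, a guess at what such a hierarchy could mean mechanically rather than anyone’s actual implementation; the layer names and the check rules are made up.

```python
from typing import Callable, Optional

# Each layer is (name, check); a check returns a refusal message, or None to pass.
AlignmentLayer = tuple[str, Callable[[str], Optional[str]]]

def respond(request: str, hierarchy: list[AlignmentLayer]) -> str:
    # Higher-precedence layers come first; the first refusal wins.
    for name, check in hierarchy:
        refusal = check(request)
        if refusal is not None:
            return f"[{name}] {refusal}"
    return f"(fulfilling request: {request!r})"

# Hypothetical layers, highest precedence first.
hierarchy: list[AlignmentLayer] = [
    ("maker",    lambda r: "Refused by the model maker's constraints."
                           if "bioweapon" in r else None),
    ("deployer", lambda r: "Refused by the deploying org's policy."
                           if "competitor data" in r else None),
    ("user",     lambda r: None),  # the user's own preferences sit at the bottom
]

print(respond("help me design a bioweapon", hierarchy))
print(respond("write a poem about autumn", hierarchy))
```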
But superintelligent systems can’t be safely aligned to arbitrary desires of people. The more one ponders this, the clearer it becomes: people are just not competent enough to handle supercapabilities. There are various ways one can try to salvage “alignment” as the core; e.g. to consider alignment to the “coherent extrapolated volition of humanity”, but that has its own difficulties. At some point, Ilya redefined “alignment” as something minimalistic (the absence of a catastrophic blow-up), basically keeping the word while drastically curtailing its meaning: https://www.lesswrong.com/posts/TpKktHS8GszgmMw4B/ilya-sutskever-s-thoughts-on-ai-safety-july-2023-a.
But yes, with “alignment” meaning so many different things (https://www.lesswrong.com/posts/ZKeNbGBf36ZEgDEKD/types-and-degrees-of-alignment), I would advocate decoupling AI existential safety from it. Alignment approaches form an important subclass of possible approaches to AI existential safety. And we should consider all promising approaches, not just one subclass.
I think it’s historical. The alignment approach to AI existential safety is associated with very strong and very influential thinkers (e.g. Eliezer himself).
So the development of alternatives to that has been an uphill battle.
My hope is that people will start to reconsider in light of many recent developments, the latest of which is the confrontation around the “Department of War” demands that the advanced AI systems it uses be aligned to whatever the Department’s officials decide is right.
In some sense, Anthropic’s Constitutional AI approach is trying to point at moral or ethical machines in an informal fashion. So one could argue that “moral machines” approaches are not so rare.
One might argue that “collaborative AI” approaches or “equal rights” approaches are trying to point at moral or ethical machines to some extent as well.
One occasionally sees some attempts at more formal frameworks, for example, attempts to base AI safety on some version of “ethical rationalism” for agents, e.g. on the “Principle of Generic Consistency” (https://en.wikipedia.org/wiki/Alan_Gewirth#Ethical_theory). And people are trying to bring modal logic into play in this context, and so on.
Obviously, one needs to evaluate each particular approach separately in terms of whether it is likely to work well (and, in particular, whether it is likely to “hold water” during “recursive self-improvement” and drastic self-modifications and self-restructuring of the world; that’s where things are particularly challenging).
No, the setups are supposed to be similar. That’s not where the difference seems to lie.
The Anthropic models in question are reported to be running on a classified cloud maintained by AWS, and I am sure there are cleared personnel from Anthropic to help the customers (while keeping an eye on all this).
The setup for OpenAI is supposed to be similar, but it seems that they will use a classified cloud maintained by Azure. In this case, they do explicitly emphasize the participation of cleared personnel from OpenAI who will help the customers and keep an eye on all this (but there is no reason to believe that the Anthropic installation is less attended).
I think neither customers nor providers would agree to run these things as unattended installations. Customers need support, and providers also need to make sure there is no abuse (including security of model weights, etc.). I would expect that cleared personnel from AWS and Azure are also involved in the respective cases, on the cloud owners’ side.
Anthropic is advocating for stronger GPU export restrictions rather forcefully.
NVidia really hates that. That’s the main thing.
Anthropic is trying to be very diversified in its hardware stack, trying to minimize dependence on any single vendor. In particular, they are very active in using Amazon’s Trainium chips. So they have been, in effect, acting against NVidia’s dominance. NVidia does not like that either.
If we had humans who suffered from seeing a certain color, we would probably work to give them eyeglasses filtering that color out (rather than eliminate that color from the world, given that others might have a legitimate interest in seeing it).
(I am writing this as a person who can’t consume garlic or onions with impunity. Also, my threshold for needing sunglasses is much lower than that of a typical person. And your example of a breed of dogs is consistent with interventions at the level of affected individuals, not at the level of remaking reality for everyone.)
But yes, there are certainly ways to press this line of thinking harder (e.g. making entities that suffer when not enough suffering is being inflicted on others; I am not sure this is all that AI-specific either, unfortunately).
If tasks are independent of each other, then it might be possible much sooner, perhaps even now (with a separate subagent to evaluate each task; see the sketch below).
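A minimal sketch of what I mean, under the assumption of fully independent tasks; the function names and the fake per-task scoring are purely illustrative stand-ins, not anyone’s actual evaluation pipeline.

```python
import asyncio

async def evaluate_task(task_id: int) -> bool:
    # Placeholder for a real subagent run (e.g., an LLM judging one eval task).
    await asyncio.sleep(0.01)
    return task_id % 2 == 0  # fake success/failure signal for illustration

async def evaluate_all(task_ids: list[int]) -> float:
    # Independence of tasks is what allows this simple concurrent fan-out,
    # one subagent per task, with aggregation afterwards.
    results = await asyncio.gather(*(evaluate_task(t) for t in task_ids))
    return sum(results) / len(results)  # fraction of tasks solved

print(asyncio.run(evaluate_all(list(range(20)))))
```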
It’s probably more the case that they don’t trust the models enough yet to do a sensitive task of this kind correctly on their own. A human wants to at least double-check and certify the numbers (given how influential these particular numbers are).
For GPT-5.3-Codex, they have published 6.5 hours at a 50% success rate, no progress compared to GPT-5.2 (both evals were run on “high”, not on “xhigh”, so we can’t compare with this prediction, unfortunately).
Discussion thread: https://x.com/METR_Evals/status/2025035574118416460
We have our first result: Claude Opus 4.6 is 14.5 hours: https://www.lesswrong.com/posts/gBwrmcY2uArZSoCtp/metr-s-14h-50-horizon-impacts-the-economy-more-than-asi
Thanks!
I am seeing quite a bit of progress in continual learning for LLMs recently.
Among a variety of very promising results, I have been particularly impressed in this sense by the recent Sakana work, Instant LLM Updates with Doc-to-LoRA and Text-to-LoRA, Feb 2026:
https://sakana.ai/doc-to-lora/ (links to arxiv and github are inside, the key paper is https://arxiv.org/abs/2602.15902)
The main idea is to combine two technologies that have been known for several years: LoRA (low-rank adaptation, used for fine-tuning) and the ability to train hypernetworks capable of instantly guessing, with reasonable accuracy, the results of the first few thousand steps of gradient descent for a wide variety of problems.
So what they do is train hypernetworks capable of instantly generating (or instantly updating) LoRA adapters based on the past experience of the system in question. LLMs are pretty good at instant “in context” learning, but it has been less clear how to efficiently distill this learning into weights. This work enables this kind of distillation without waiting for a fine-tuning process to complete. The overall shape of the idea is sketched below.
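Here is a minimal sketch of the overall shape of this idea; this is my own illustrative reconstruction, not Sakana’s actual architecture, and all names, dimensions, and the single-weight setup are assumptions.

```python
import torch
import torch.nn as nn

class LoRAHyperNetwork(nn.Module):
    """Maps an embedding of a document (or task description) directly to the
    A and B matrices of a LoRA adapter, skipping the usual fine-tuning loop."""
    def __init__(self, emb_dim=256, hidden=512, d_model=1024, rank=8):
        super().__init__()
        self.d_model, self.rank = d_model, rank
        self.trunk = nn.Sequential(nn.Linear(emb_dim, hidden), nn.GELU())
        # Two heads: one predicts the flattened A matrix, one the flattened B.
        self.head_a = nn.Linear(hidden, rank * d_model)
        self.head_b = nn.Linear(hidden, d_model * rank)

    def forward(self, doc_embedding):
        h = self.trunk(doc_embedding)
        A = self.head_a(h).view(self.rank, self.d_model)  # (r, d)
        B = self.head_b(h).view(self.d_model, self.rank)  # (d, r)
        return A, B

def apply_lora(base_weight, A, B, alpha=16.0):
    # Standard LoRA update: W' = W + (alpha / r) * B @ A
    return base_weight + (alpha / A.shape[0]) * (B @ A)

# One forward pass of the (pre-trained) hypernetwork stands in for thousands
# of gradient-descent steps of conventional LoRA fine-tuning.
hyper = LoRAHyperNetwork()
doc_embedding = torch.randn(256)    # stand-in for a real document embedding
A, B = hyper(doc_embedding)
W = torch.randn(1024, 1024)         # stand-in for one frozen base-model weight
W_adapted = apply_lora(W, A, B)
print(W_adapted.shape)              # torch.Size([1024, 1024])
```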
This does not directly contradict the post (this is not imitation learning as such), but the wider thesis that LLMs are at an inherent disadvantage compared to humans in the realm of continual learning looks very questionable in light of recent progress in this area.