I.M.J. McInnis

Karma: 507

I.M.J. McInnis 16 May 2026 19:14 UTC
5 points
5
in reply to: StanislavKrym’s comment on: StanislavKrym’s Shortform
I would be interested in hearing from Anthropic employees here, but I imagine it’s the usual reason: they have deemed it inexpedient to do so, either because they believe that they need to “sound normal” or because they in fact don’t want the type of regulation that awareness of x-risk would produce.

I.M.J. McInnis 14 May 2026 2:42 UTC
1 point
0
on: Vibe Excel and the Future of White-Collar Work
It was spooky how much of my own commercial thinking I recognized here. (We just launched! https://www.withai.co/). Am meant to be sending emails right now, so I’ll resist the urge to write my little manifesto, but I’m excited for this era of work (despite being gloomy about superintelligence)

I.M.J. McInnis 13 May 2026 5:19 UTC
1 point
0
in reply to: leogao’s comment on: leogao’s Shortform
hm. it worked for the early Christians...

I.M.J. McInnis 14 Apr 2026 22:39 UTC
1 point
0
on: The frightening future (i.e. present) of AI surveillance
in one case, Claude managed to infer a user’s inner struggles by observing the user’s multiple queries relating to catholic relics and saints, and other groups of queries relating to gay male models
Claude is leaning on stereotypes here. Without queries about the sinfulness-or-not of homosexuality, this may well just be a pious, gay Catholic. I am friends with many!

I.M.J. McInnis 14 Apr 2026 14:57 UTC
10 points
2
on: Americans For Moskovitz
visit americansformoskovitz.com
This currently links to “https://armiercansformoskovitz.com/″, which does not resolve

I.M.J. McInnis 20 Mar 2026 4:37 UTC
3 points
0
on: Subscriber count graphs of popular youtubers on ASI risk
These all look rather… linear. Worrisome! Though it would be better if we could see 0 and see more backhistory.

I.M.J. McInnis 11 Mar 2026 6:42 UTC
8 points
0
in reply to: lc’s comment on: lc’s Shortform
OpenAI models (particularly 5-thinking) have long had a fetish for jargon, cramming their sentences as full of it as the situation allows (and sometimes fuller).

I.M.J. McInnis 5 Mar 2026 4:22 UTC
4 points
0
on: MalcolmMcLeod’s Shortform
A fun riddle I was shocked to see the gippities solve without extended thinking or much yapping, even. I gave up on the third!
Here’s the riddle, as stated to Opus 4.6 on an empty context window. “Consider single-word-name countries. An inclusive pair is when the name of one country is contained in another wholly. There are three such pairs. Find them.”
I was surprised not only by how quickly it solved it, but also by the lack of thinking tokens. Gemini 3.1 Fast also did it. And by the unusual order in which the solutions were produced, in the exact reverse order most people find them.
(Answer below)
.
.
.
.
.
.
.
.

I.M.J. McInnis 24 Feb 2026 22:05 UTC
1 point
0
on: Solving The RAISE Act Like a (fictional) New York Detective
Despite the anemia of this law, it has still enraged the super PACs like nothing else. Imagine how they’ll react when we pass a real law.

I.M.J. McInnis 24 Feb 2026 22:04 UTC
16 points
11
on: Exclusive: Hegseth gives Anthropic until Friday to back down on AI safeguards
This is (instinctively) quite frightening, particularly the “no we NEED to spy on American citizens at scale, and we NEED killer robots.” On the other hand… I guess it shows that the government isn’t always and everywhere in the pocket of AI developers? Or maybe it just further shows that the government is in the pocket of certain other AI developers...

I.M.J. McInnis 24 Feb 2026 21:12 UTC
170 points
96
on: Responsible Scaling Policy v3
If there were strong and broad political will for treating AI like nuclear power and slowing it down arbitrarily much to keep risks low, the situation might be different. But that isn’t the world we’re in now, and I fear that “overreaching” can be costly.
I think it would make a nontrivial contribution to that ‘strong and broad political will’ if Dario were to come out and say “actually, sorry about all that deliberate Overton-window-closing I did in previous writings. In fact, political will is not a totally exogenous oh-well thing, but it is the responsibility of frontier developers to inculcate that political will by telling the public that a pause is possible and desirable, instead of a dumb lame thing not even worth considering. So now we’re saying loud and clear: a pause is possible and desirable, and the world should work toward it as a Plan A!”
I’m being deliberately cartoonish here, but you get the point. If incentives are forcing Anthropic to abandon things that are good for human survival––which occurrence was, no offense, completely obvious from day one––Anthropic should be screaming from the rooftops, Help!! Incentives are forcing us to abandon things that are good for human survival!!
If this is a crux for you––if you/Anthropic think a pause is so undesirable/unlikely that it’s important for the safety of the human race to publicly disparage the possibility of a pause (as Dario opens many of his essays by doing)––please say so! Otherwise, this lily-livered, disingenuous, “oh no, the incentives! it’s a shame incentives can never be changed!” moping will give us all an undignified death.
To be clear, I’m not actually mad about the weakening of the RSP; that was priced in. I suppose I’m glad it’s stated, in case there were still naïfs who thought A Good Guy With An AI could save us. It’s far more virtuous than outright lying, as every other company (to my knowledge) does (more of).
Also, although you seemed to try to answer “What is the point of making commitments if you can revise them any time?” You really just replied “Well, actually these commitments were inconvenient to revise, and in fact they should be more convenient to revise, albeit not arbitrary convenience.” Forgive me if I am not reassured!
I respect your work a lot, Holden. You’ve done great things for humanity. Please don’t lose the forest for the trees.
What links here?
- MichaelDickens's comment on Responsible Scaling Policy v3 by Holden Karnofsky (EA Forum; 26 Feb 2026 3:52 UTC; 70 points)
- Anthropic Responsible Scaling Policy v3: A Matter of Trust by Zvi (1 Apr 2026 18:10 UTC; 65 points)

I.M.J. McInnis 17 Feb 2026 21:26 UTC
2 points
0
on: Persuading Trump of a proper US-China-led AI Treaty
My main concern is that, in a targeted persuasion-of-the-powerful campaign, you want folks who are very good at persuading the powerful. By talent, this seems largely to draw from the “intellectually honest fringe” (where I and my friends live), who (as I understand it) haven’t been successful in “targeted persuasion of fancy, respectable people.” I really like the idea, though! I think this might be one of the most under-resourced causes in AI safety, and I plan to donate to help with the bridge funding. But—what should reassure me about my worries, that y’all are odd and not-well-connected, therefore being ill-suited to persuade all these important people?

I.M.J. McInnis 17 Feb 2026 1:04 UTC
3 points
0
on: What to Do If You Take AGI Seriously
Please flag up front how and how much you used LLMs for the writing. When the prose and structure is this LLMy, I’m forced to assume the ideas are LLMy too, so I don’t read it.

I.M.J. McInnis 9 Feb 2026 19:42 UTC
2 points
0
on: Can Hardware Save Us from Software?
If anyone has influence at a venture capital firm, it would be really really worthwhile to make this one of their big “calls for startups.”

I.M.J. McInnis 27 Jan 2026 20:21 UTC
16 points
2
on: Dario Amodei – The Adolescence of Technology
On the one hand, I am glad to see such awareness and honesty about the risks. On the other hand, I remain furious at Dario’s ceaseless insistence that a pause or slowdown is completely impossible (and so, it is implied) not even worth trying for.

I.M.J. McInnis 26 Jan 2026 22:58 UTC
2 points
1
in reply to: Eye You’s comment on: The Possessed Machines (summary)
It’s stated at the bottom of the webpage. After a few paragraphs at the start I was like… “ok surely this is AI generated” and popped it in a few detectors. i happened to scroll to the bottom to see how long the dang thing was and saw the disclaimer. i wish this former lab insider—who surely has money and at least one eloquent friend—could’ve paid another human being to rewrite it. but alas!

I.M.J. McInnis 26 Jan 2026 1:27 UTC
16 points
8
on: The Possessed Machines (summary)
I am surprised you didn’t mention the fact that the whole thing was paraphrased to preserve anonymity by Opus 4.5. (Which really stood out to me! When I first read it, I assumed it was AI-generated, and I was disconcerted to see such quality of thought coming with such a slopreek to the prose.)
What links here?
- The Possessed Machines (summary) by L Rudolf L (25 Jan 2026 20:47 UTC; 129 points)

I.M.J. McInnis 21 Jan 2026 22:33 UTC
9 points
4
on: MalcolmMcLeod’s Shortform
AI company heads should sign the Superintelligence Statement (by their professed values)
Elon Musk, Demis Hassabis, and Dario Amodei have now all said that they would slow down if they could (or in Dario’s case, “wish we had more time”). They should sign the Statement on Superintelligence—there is not much daylight between their casual remarks in public and what that statement proposes. Employees should pressure their leadership to sign this; we need to make common knowledge so that it’s not so easy for them to say “but oh that will never happen.”
I don’t know how sincere they are about this desire. They sure don’t talk about it much, and when they do, they are so insistent that coordination could never possibly happen, which is an awfully convenient belief. “It is difficult to get a man to understand something when his [billion$ and global status] depends upon his not understanding it.”
Sam Altman and Mark Zuckerberg should sign it too, of course. As should everyone! But it seems much easier to get Musk, Hassabis, and Amodei to do so.
If you’re reading this and can exert any pressure on them to do so (or generally promoting signing it within your company), please please do so.

I.M.J. McInnis 12 Jan 2026 17:04 UTC
1 point
0
on: What potent consumer technologies have long remained inaccessible?
I expect some measure of “speciation” to occur, but not as a result of money-lack. Rather, some cultures will reject these techs, and as the cultures that accept these techs drift biologically farther from “base humans,” the more entrenched the social and cultural divisions will be.

I.M.J. McInnis 5 Jan 2026 14:10 UTC
16 points
3
in reply to: Hastings’s comment on: In My Misanthropy Era
this is the most lesswrong thing i’ve ever seen. never change