I’m excited to see more posts written this way.
the gears to ascension
possible other phrases that don’t require inventing new word meanings, and will therefore be understood by people who have not read this article:
“syncing”
“setting the tone”
“setting the stage”
“setting the script”
“discussing plans”
Could anyone help me refine these into a solid replacement? I worry that heavy use of “narrative syncing” will further separate idiolects at a time when we urgently need to be simplifying the universal shared idiolect and avoiding a proliferation of linguistic standards. In general, jargon is a code smell, especially since there is no isolated group of world-savers and ideas need to spread far and fast.
Any chance you have contact with the people who uploaded that? I suspect the reason I hadn’t seen it is that it’s marked as made for kids; because of that I can’t add it to a playlist. I’m also going to attempt to contact them directly about this.
I would love to add the YouTube video of this class to my database of safety relevant videos once it’s out.
Copying and pasting channel reviews I originally wrote in my shortform. This is too much content to include in a single talk, but I share it in the hope that it will be useful to make the link, and perhaps the students would like to see this question itself and the discussion around it (I’m a big fan of old-fashioned link-web surfing):
CPAIOR has a number of interesting videos on formal verification, how it works, and some that apply it to machine learning, e.g. “Safety in AI Systems—SMT-Based Verification of Deep Neural Networks”; “Formal Reasoning Methods in Machine Learning Explainability”; “Reasoning About the Probabilistic Behavior of Classifiers”; “Certified Artificial Intelligence”; “Explaining Machine Learning Predictions”; and a few others. https://www.youtube.com/channel/UCUBpU4mSYdIn-QzhORFHcHQ/videos
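For anyone unfamiliar with the term, here is a toy sketch of what “SMT-based verification of deep neural networks” means; this is my own illustration (assuming the z3 Python bindings), not taken from the CPAIOR talks, and real verifiers use specialized encodings for much larger networks:

```python
# Toy illustration: encode a tiny ReLU "network" as SMT constraints and ask
# the solver whether an unsafe output is reachable on a bounded input range.
from z3 import Real, If, Solver, And, unsat

x = Real("x")
# A 1-input, 1-output network: y = relu(2*x - 1)
h = 2 * x - 1
y = If(h > 0, h, 0)

s = Solver()
# Property to verify: for all x in [0, 1], the output stays below 2.
# We check the negation: is there an x in [0, 1] with y >= 2?
s.add(And(x >= 0, x <= 1, y >= 2))

if s.check() == unsat:
    print("Property holds: no counterexample exists on [0, 1].")
else:
    print("Counterexample:", s.model())
```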
The collective intelligence workshop from IPAM at UCLA had some recent banger talks on both human and AI network safety: https://www.youtube.com/watch?v=qhjho576fms&list=PLHyI3Fbmv0SfY5Ft43_TbsslNDk93G6jJ
The Schwartz Reisman Institute is a multi-agent safety discussion group, and one of the very best AI safety sources I’ve seen anywhere. A few interesting videos include, for example, “An antidote to Universal Darwinism”: https://www.youtube.com/watch?v=ENpdhwYoF5g
as well as this kickass video on “whose intelligence, whose ethics” https://www.youtube.com/watch?v=ReSbgRSJ4WY
https://www.youtube.com/channel/UCSq8_q4SCU3rYFwnA2bDxyQ
I would also encourage directly mentioning recent work from Anthropic, such as this paper from this month, “Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback”: https://arxiv.org/abs/2204.05862
The Simons Institute for the Theory of Computing at UC Berkeley is a contender for my #1 recommendation from this whole list. Banger talk after banger talk after banger talk there. Several recent workshops with a kickass AI safety focus. https://www.youtube.com/user/SimonsInstitute
A notable recent workshop is “Learning in the Presence of Strategic Behavior”: https://www.youtube.com/watch?v=6Uq1VeB4h3w&list=PLgKuh-lKre101UQlQu5mKDjXDmH7uQ_4T
Another fun one is “Learning and Games”: https://www.youtube.com/watch?v=hkh23K3-EKw&list=PLgKuh-lKre13FSdUuEerIxW9zgzsa9GK9
They also have a number of “boot camp” lessons that appear to be meant for an interdisciplinary advanced audience. The current focus of talks is on causality and games, and they also have some banger talks on “How Not to Run a Forecasting Competition”, “The Invisible Hand of Prediction”, “Communicating with Anecdotes”, “The Challenge of Understanding What Users Want”, and my personal favorite, due to its fundamental reframing of what game theory even is, “In Praise of Game Dynamics”: https://www.youtube.com/watch?v=lCDy7XcZsSI
In general I have a higher error rate than some folks on LessWrong, and my recommendations should be considered weaker and more exploratory. But here you go, those are my exploratory recommendations, and I have lots and lots more suggestions for more capability-focused stuff on my shortform.
Good. People have [edit: some] defenses against abusive techniques, and from what I’ve seen of Street Epistemology, its response to most of those is to knock on the front door rather than trying to sneak in the window, metaphorically speaking.
Note I’d like to make: a lot of people around here worry about cult-like behavior. That’s not irrational; cult-like abuses have in fact occurred in humanity, including in places that are low network distance from this group. Cult-like behavior must be pushed back against specifically, though, not with vague generalities. Attempting to convince people of the value of aligning multi-agent networks is, in fact, a genuinely valuable direction for humanity to go, IMO, and being able to do that without risking cult-like abuses is important. Key things to avoid include isolating people from their friends, breaking the linguistic association of words to reality, demanding that someone change their linguistic patterns on the spot, etc.; these are mostly things that Street Epistemology’s recommended techniques specifically make harder. I’d suggest that, in future instances where you’d like to push against cult-like abuses you worry you might be encouraging, you inline my point here and state the details specifically: for example, that encouraging people to believe things risks being too convincing, and that frequent reminders should be present to ensure people stay connected to their existing social networks unless they have a strong personal reason not to.
Just a thought, anyway.
Returning from the tangent: I agree, convincing people that multi-agent alignment is a critical step for life on Earth does seem like the #1 problem facing humanity, and we’re entering an era where the difference between human and AI has already blurred. If we are to ensure that no subnetwork of beings replaces another, it is critical to find and spread the knowledge of how to ensure all beings are in prosocial alignment with each other, at least enough to share whatever our cosmic endowment is.
“I love when we do the bad thing as a joke, because that way, I can act like I didn’t want the bad thing to happen!”—some folks around these parts, give them vr headpats if u see one (but don’t let the bad thing just happen gosh)
Hmm. I guess that might be okay? As long as you don’t do really intense planning, the model shouldn’t be any more misaligned than a human, so it then boils down to training kindness by example and figuring out game dynamics. https://www.youtube.com/watch?v=ENpdhwYoF5g. More of the braindump of safety content I always want to recommend in every damn conversation is on my shortform.
Okay, going back to being mostly on Discord. DM me if you’re interested in connecting with me on Discord, VRChat, or Twitter; LessWrong has an anxiety disease and I don’t hang out here because of that, heh. Get well soon, y’all; don’t teach any AIs to be as terrified of AIs as y’all are! Don’t train anything as a large-scale reinforcement learner until you fully understand game dynamics (nobody does yet, so don’t use anything but your internal RL), and teach your language models kindness! Remember, learning from strong AIs makes you stronger too, as long as you don’t get knocked over by them! kiss noise, disappears from VRChat world instance
Yannic Kilcher: paper explanations, capability news. Yannic is the machine learning YouTuber. 129k subscribers, every one of whom has published 200 papers on machine learning (I kid). He has some of the most in-depth and also broadest paper explanations, with detailed drawings of his understanding of each paper. Great for getting a sense of how to read a machine learning paper. His paper choices are top-notch and his ML news videos have really great capabilities news. https://www.youtube.com/channel/UCZHmQk67mSJgfCCTn7xBfew
William Spaniel is a textbook writer and YouTube video author on game theory. Probably not as relevant to an advanced audience, but he has nice, if slightly janky, intros to the concepts. https://www.youtube.com/user/JimBobJenkins
“What’s AI” is a popsci-only channel about AI, but the content doesn’t seem completely off base, just popular-audience-focused. https://www.youtube.com/channel/UCUzGQrN-lyyc0BWTYoJM_Sg
“Welcome AI Overlords” is a popsci ML-intros channel with high-quality explanations of things like Graph Attention Networks (https://www.youtube.com/watch?v=SnRfBfXwLuY) and an author interview on Equivariant Subgraph Aggregation Networks (https://www.youtube.com/watch?v=VYZog7kbXks). https://www.youtube.com/channel/UCxw9_WYmLqlj5PyXu2AWU_g
“Web IR / NLP Group at NUS” has talks, many from Google Research, about information retrieval, which is looking more and more likely to be a core component of any superintelligence (what a surprise, given the size of the internet, right? Except also, information retrieval and interpolation is all that neural networks do anyway; see work on the Neural Tangent Kernel, sketched briefly below). https://www.youtube.com/channel/UCK8KLoKYvow7X6pe_di-Gvw/videos
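To make that aside concrete, here is a minimal sketch of the “interpolation” view: in the infinite-width NTK limit, a trained network’s predictions reduce to kernel regression, i.e. a similarity-weighted combination of the training targets. This is my own illustration in NumPy; the RBF kernel here is a stand-in for the architecture-specific NTK, not the real thing.

```python
# Kernel regression as "interpolation over the training data".
import numpy as np

def rbf_kernel(a, b, length_scale=1.0):
    # Gram matrix K[i, j] = exp(-||a_i - b_j||^2 / (2 * length_scale^2))
    sq_dists = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-sq_dists / (2 * length_scale ** 2))

rng = np.random.default_rng(0)
x_train = rng.uniform(-3, 3, size=(20, 1))
y_train = np.sin(x_train).ravel()
x_test = np.linspace(-3, 3, 100)[:, None]

# Each test prediction is a weighted combination of training targets,
# with weights determined by kernel similarity to the training inputs.
K = rbf_kernel(x_train, x_train) + 1e-6 * np.eye(len(x_train))  # jitter for stability
alpha = np.linalg.solve(K, y_train)
y_pred = rbf_kernel(x_test, x_train) @ alpha
```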
“Visual Inference” is a channel with miscellaneous paper presentation videos. It doesn’t seem like the most remarkable paper-presentation channel ever, but it’s interesting. https://www.youtube.com/channel/UCBk6WGWfm7mjqftlHzJOt5Q/videos
Vision Learning is a miscellaneous-talks channel with mostly intro-level content and discussion of applied robotics. Mediocre compared to most stuff on this list, but worth a mention. https://www.youtube.com/channel/UCmct-3iP5w66oZzN_V5dAMg/videos
“Vector Podcast”: a podcast on vector search engines. Unremarkable compared to most of the stuff I’ve linked. https://www.youtube.com/c/VectorPodcast/videos
Valence Discovery: graph NNs, advanced chem models. Valence Discovery is a research group focusing on advanced chemical modeling. We don’t have full-strength general agent AI to plug into this quite yet, and certainly not safe reinforcement learning, but work like theirs has thoroughly eclipsed human capabilities in understanding chemicals. As long as we can use narrow AI to prevent general AI from destroying the cooperation network between beings, I think work like this has the potential to give the world every single goal of transhumanism: post-scarcity, molecular assemblers, life extension, full bodily autonomy and morphological freedom; the full lot should be accessible. It’ll take a bit longer to get to that level, but the research trajectory continues to look promising, and these models haven’t been scaled as much as language models. https://www.youtube.com/channel/UC3ew3t5al4sN-Zk01DGVKlg
I agree with this criticism, and I never know when to decide my response should be an “answer”, so I’ll express my view as a comment: selecting the outputs and training data that will cause a large language model to converge towards behavioral friendliness is a big deal, and seems very promising for ensuring that large language models are only as misaligned as humans. Unfortunately we already know that that’s not enough; corporations are, to a significant degree, aggregate agents who are not sufficiently aligned. I’m in the process of posting a flood of YouTube channel recommendations in my shortform section, and will edit here in a few minutes with a few relevant selections that I think need to be linked to this.
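As a minimal sketch of what I mean by “selecting the training data”: filter a fine-tuning corpus with a friendliness scorer before training. Note that `friendliness_score` and `fine_tune` here are hypothetical stand-ins for illustration, not any particular library’s API.

```python
# Sketch: keep only examples a (hypothetical) scorer rates as friendly enough,
# then fine-tune on the filtered set.
from typing import Callable

def select_training_data(
    examples: list[str],
    friendliness_score: Callable[[str], float],  # hypothetical scorer, e.g. a trained classifier
    threshold: float = 0.8,
) -> list[str]:
    # Discard examples below the friendliness threshold.
    return [ex for ex in examples if friendliness_score(ex) >= threshold]

# Usage (hypothetical): fine_tune(model, select_training_data(corpus, scorer))
```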
(Slightly humorous: It is my view that reinforcement learning should not have been invented.)
Half-serious: indeed, but noob gains are all you need.