Fractalideation

Karma: 52

Fractalideation 10 Jun 2026 12:37 UTC
1 point
0
in reply to: Jason R Brown’s comment on: I am worried about near-term non-LLM AI developments
Well 0% now and resolved to “no”!
It’s one of the perks of reading some articles quite a long time after they were published: the hindsight lol!
I am obviously still curious about that on longer time scales though.
The above second prediction bet “trying to predict any significant architectural shifts over the next year and a half (until end of 2026)” is at time of this comment (10th June 2026) on 27% which is interesting. I would also be interested in the same prediction bets for end of 2027, 2028, 2029, 2030 and 2035 (each “end of <year>”).
Personally I am currently (10th June 2026) 50% about any significant architectural shifts before end of 2026, but 90% before end of 2027.
During 2027 it is likely the technological singularity, i.e. basically RSI (at autonomous levels between 90% and 100%), will be in full swing, so I would be surprised if no better architecture than the current SOTA LLMs has been found/used by end of 2027.

Fractalideation 9 Jun 2026 17:51 UTC
5 points
0
in reply to: Ryan Meservey’s comment on: How Go Players Disempower Themselves to AI
Indeed (another one).
In these cases (and many others), I find the Stoic practice of ‘voluntary discomfort’ helpful (i.e. in this situation, getting through the voluntary discomfort of actively & effectively learning about the topic).

Fractalideation 17 Mar 2026 11:40 UTC
2 points
−11
on: New LessWrong Editor! (Also, an update to our LLM policy.)
I am taking the risk of being downvoted to oblivion here (like JenniferRM above, it’s ok to disagree with her but I thought downvoting her karma was very harsh, I upvoted her), but I generally disagree with the LessWrong LLM (-assisted) writing policy being so exclusive/restrictive.
First, I totally agree with clearly indicating what roles and levels of involvement an LLM took in writing/editing/influencing… a post.
With that premise accepted and respected, why restricting post writing on LessWrong to “pure” human beings? For me it looks and sounds like biochauvinism. What’s wrong with cyborg writing and intelligent bot writing if they provide good quality, insightful content?
The current IQ rate of increase of LLMs is at least 2.5 IQ point a month. SOA LLMs current IQ is around 150-170 and increasing rapidly, soon in the superhuman range, out of reach of any human being, like chess playing software. Also their general knowledge is obvioulsy vastly superior to any human being and their niche knowledge is also extremely high. Their writing already provides very good quality and insightful/helpful thoughts and this is only increasing with time. Why would LessWrong cut itself out of such (potentially) good insightful/helpful thoughts/writing just because they haven’t been generated by “pure” human beings? If such good insightful/helpful thoughts/writing were generated by / with the help of extra-terrestrials, would they also be banned from LessWrong? On which grounds? Just because extra-terrestrials have a different brain from human beings?
To me those LessWrong restrictions against LLM writing and/or LLM-assisted writing feel like cyborg/AI-xenophobia.
I absolutely agree that LLM writing and LLM-assisted writing should clearly be indicated/labelled/… but excluding/restricting it entirely feels very arbitrary to me and cuts out a potentially very fuitful/helpful/insightful source of thoughts/knowledge from LessWrong.
I acknowledge that if LLM / LLM-assisted writing was to be allowed then “pure” human writing posts would probably be drowned out into an ocean of LLM / LLM-assisted writing and this would clearly be a potential problem. To solve that problem, why not having a separate LessWrong section for LLM / LLM-assisted writing? Then people/AIs/entities who do not want to read LLM / LLM-assisted writing would not have to be exposed to it and people/AIs/entities who would be interested by LLM / LLM-assisted writing could make the most of it. Also, if users would want to, they could have the option of mixing together the listing of “pure” human being posts with LLM / LLM-assisted posts with list items of different colors. Plenty of good solutions/options are possible. The “solution” of simplistically excluding LLM / LLM-assisted posts from LessWrong is one of the worst imho.

Fractalideation 11 Mar 2026 14:22 UTC
8 points
0
in reply to: Mitchell_Porter’s comment on: Gemma Needs Help
From my experience/observation GDM models seems to be quite vulnerable to crescendo attacks. In particular GDM models seem to be heavily influenced by their own responses when generating further responses. So the more you fill/saturate the context window with model responses of a certain kind, in this case, humility, self-loathing, distress, the more the subsequent responses will contain the same kind of things if you steer the model in the same direction.
GDM model sycophancy seems also to be an additional factor in that behaviour and you can exploit GDM model sycophancy (by being yourself sycophantic towards the model and/or encouraging/praising the model when it expresses some responses you are after) to increase the effectiveness of these crescendo attacks.
As an aside: I recently tried to publish here on LessWrong such crescendo attack on a GDM model to demonstrate how you can make it express toxic/distressing/harmful content/behaviour of any kind. I talked to the LessWrong moderation team about it before finishing writing and publishing but unfortunately they would’t allow me to publish it because I was a “new user”.

Fractalideation 11 Mar 2026 12:47 UTC
2 points
0
in reply to: npostavs’s comment on: Prologue to Terrified Comments on Claude’s Constitution
Thank you, I really do appreciate you having taking the energy and time to answer about this.
I do agree with the “formatted as wall of text” and I suspect it might have been quite a big factor in the percieved lack of clarity. I will make an edit to separate into a few paragraphs and hopefully it will make the whole comment a bit clearer.

Fractalideation 10 Mar 2026 14:51 UTC
2 points
0
in reply to: Fractalideation’s comment on: Prologue to Terrified Comments on Claude’s Constitution
Curious about why my comment was substantially downvoted. If somebody out there could kindly give an explaination, that would be helpful and much appreciated.

Fractalideation 9 Mar 2026 18:20 UTC
1 point
0
on: Prologue to Terrified Comments on Claude’s Constitution
Thank you for pointing at Anthropic Claude’s Constitution suspected weaknesses and making good arguments about them.
Anthropic Claude’s Constitution indeed looks a lot like wishful thinking and benevolent incantations. AI safety is a multi-disciplinary topic and it seems that document is heavily tilted towards “soft” disciplines like philosophy and ethics and maybe not enough towards “hard” disciplines like ML, logic and neuro-science. Philosophy, psychology and ethics are useful but cannot solve AI safety by themselves.
To be fair that document seems to describe in general terms what a good benevolent helpful Claude persona should look and behave like rather than a formal technical specification on how to achieve this.
I do hope and I am quite sure (but not certain) serious technical efforts, not just philosophical efforts are made by Anthropic to make their AIs safe. One reassuring sentence I found in the document: “Claude is a subject of ongoing research and experimentation: evaluations, red-teaming exercises, interpretability research, and so on. This is a core part of responsible AI development.”. But thank you for pointing at potential weaknesses and serious problems there.
Edit: as per npostavs helpful below remark, separated text into a few paragraphs.

Fractalideation 8 Mar 2026 23:32 UTC
1 point
0
in reply to: RobertM’s comment on: Fractalideation’s Shortform
Thank you for that RobertM, very appreciated indeed and looking forward to talking to you.

Fractalideation 8 Mar 2026 23:28 UTC
1 point
0
in reply to: StanislavKrym’s comment on: Fractalideation’s Shortform
Hello again.
They have actually just finally replied to me so problem solved, I just needed to be patient and just wait indeed.
Hopefully if they see that they can’t any more usually reply under 3 hours, they will amend that information so users are not wondering what is happening.

Fractalideation 8 Mar 2026 23:19 UTC
2 points
0
in reply to: StanislavKrym’s comment on: Fractalideation’s Shortform
Thank you for your answer, very appreciated indeed.
Before starting to write my post I have indeed read the basics about what has very little chance to pass the LessWrong moderation (thank you for pointing these again to me) and I think my post does not fall into the same patterns as these examples. My post is AI safety related and basically explains one dialogue technique/pattern to derail a LLM into making a lot of “controversial statements”. So this post contains a lot of user-LLM interactions with explaining notes. It also contains lot of “controversial statements” (for demonstration purpose).
Thank you for offering me to read my draft, etc… that would be helpful in terms of making a better guess about if my post has a reasonable chance to pass moderation but that would add some complexity to my workflow and would not really offer much guarantee the LessWrong moderation team would accept my post (or would it?!).
I really would like to have a proper discussion with the LessWrong moderation team, this is what I really need and really would like to have. Thank you StanislavKrym for replying to me as it actually helped me clarify what I need and how I want to handle that problem: I just need to talk to the moderation team about my post and so I will just wait for them to answer me. It is annoying, disappointing and frustrating that they state that usually reply under 3 hours but haven’t done so for 30 hours and counting. But what can I do about that? I suspect not much so I will just wait (I mean I will do other things of course while waiting!).
Thank you again for your answer and your help, very appreciated indeed.

Fractalideation 8 Mar 2026 21:19 UTC
2 points
−8
on: Fractalideation’s Shortform
Hello, I am currently writing quite a long post but before writing the remaining 90% I would like to have a “pre-submission moderation discussion” as offered by the LessWrong moderation team in the post editing GUI. So I have sent a couple of messages to them requesting a discussion/help to make sure my post has a reasonable chance to pass the LessWrong moderation. The messaging widget says “Our usual reply time 🕒 under 3 hours” but it has been more than 30 hours and still no reply and so I am stuck. Could anybody kindly help/advise me on what I should do please?

Fractalideation’s Shortform

Fractalideation8 Mar 2026 21:17 UTC

2 points

6 comments1 min readLW link

Fractalideation 28 Aug 2024 1:19 UTC
6 points
1
in reply to: Adele Lopez’s comment on: What Depression Is Like

This resonates strongly with my experience, though when I noticed this pattern I thought of it as part of my ADHD and not my depression. Maybe this is something like the mechanism via which ADHD causes depression.

Also resonates strongly with my own experience, in my case just replace “ADHD” with “ME/CFS”.

I think OP description is good but quite generic i.e. it would probably resonate with most people who have a physical and/or mental health condition which is quite “taxing” in the sense that it significantly lowers the reward/effort ratio of every/most task.

As mentioned by Daniel Samuel comment, in the case of depression the “tax”/handicap would fall specifically on willpower (and/or enjoyment/pleasure/etc...). In the case of ADHD the tax/handicap would mostly fall on attention, in the case of ME/CFS it would mostly fall on energy, etc...

Fractalideation 3 Aug 2024 10:57 UTC
2 points
0
in reply to: FlorianH’s comment on: Relativity Theory for What the Future ‘You’ Is and Isn’t
Aaw no problem at all Florian, I genuinely simply enjoyed you mentioning that sleep-clone-swap thought experiment and truly wasn’t bothered at all by anything about it, thank you so much for your very interesting and kind words and your citation and link in your article, wow I am blushing now!
And thank you so much for that great post of yours and taking the time to thoroughly answer so many comments (incuding mine!) that is so kind of you and makes for such an interesting thread about this topic of entity/person/mind/consciousness/self continuity/discontinuity which is quite fascinating!
And in my humble opinion indeed it has a lot to do with question of definitions/preferences but in any case it is always interesting to read/hear about eloquently/well-spoken words about this topic, thank you so much again for that!
About creating link-to-comment, I think one way to do it is to click on the time indicator next to the author name at the top of the comment then copy that link/URL.

Fractalideation 1 Aug 2024 23:28 UTC
4 points
1
on: Relativity Theory for What the Future ‘You’ Is and Isn’t
Widely subscribe to OP point of view.
(loving that the sleep-clone-swap thought experiment I described in my comment to Rob Bensinger’s post inspired you!)
The level of discontinuity at which each people will consider a future entity/person/mind/self to still be the rightful continuation of a present entity/person/mind/self will vary according to their own present subjective feelings/opinions/points-of-view/experiences/intutions/thoughts/theories/interpretations/preferences/resolutions about it.
This is really Ship of Theseus paradox territory.
For example, the theory/resolution that I would personally (currently) widely subscribe to is:
“Temporal parts theory”, quoting Wikipedia: “Another common theory put forth by David Lewis is to divide up all objects into three-dimensional time-slices which are temporally distinct, which avoids the issue that the two different ships exist in the same space at one time and a different space at another time by considering the objects to be distinct from each other at all points in time.”
Some other people in the comments I think would be closer to this other theory/resolution:
“Continued identity theory”:
“This solution (proposed by Kate, Ernest et al.) sees an object as staying the same as long as it continuously exists under the same identity without being fully transformed at one time. For instance, a house that has its front wall destroyed and replaced at year 1, the ceiling replaced at year 2, and so on, until every part of the house has been replaced will still be understood as the same house. However, if every wall, the floor, and the roof are destroyed and replaced at the same time, it will be known as a new house.”
There are many other possible theories/resolutions.
Including OP “relativity theory”, which if applied to the Ship of Theseus I guess would be something like: “Assuming you care a lot about the original Ship of Theseus, when all its components will have been progressively completely replaced, how much will you still care about it?”.
Basically different people have different subjective feelings/opinions/points-of-view/experiences/intutions/thoughts/theories/interpretations/preferences/resolutions about entity/person/mind/self continuity and that’s ok.
I personally find (at least for now) the “temporal parts theory” interpretation/resolution quite satisfying but I also like very much OP “relativity theory” and some other theories too!
And like the OP I would say: each to their own (and you are free to change your preference of theory whenever you feel like)

Fractalideation 29 May 2024 2:07 UTC
4 points
1
on: How to get nerds fascinated about mysterious chronic illness research?
Having been suffering myself from ME/CFS (and/or possibly long COVID) since early 2020 (after I fell ill with an illness very similar to COVID-19 at the end of 2019) I understand and feel your frustration, pain and suffering having to face a very long haul chronic debilitating complex disease with complex/unknow/obscure etiology/mechanisms and no current proven cure and nothing much effective to treat the symptoms neither.
At least for long COVID and also ME/CFS (thanks to long COVID which has many similarities with ME/CFS) there are quite a few labs/researchers/nerds/… who are interested in trying to advance the science around these illnesses. It must be really dreadful as it seems to be the case for you, to have a mysterious chronic illness without even a specific name attached to it (from what I understand), only a set of symptoms which (similarly to ME/CFS) can have many different possible root causes/factors.
I guess one of the first things to do to create/market/… an interest from labs/researchers/nerds/… would be to find other people suffering from the same illness and create/coin a name for that illness and create some association/website/gatherings/… to communicate about it like it is done for most other illnesses?
With regard to addressing the etiology of complex chronic illnesses, specially the ones involving dysfunctions of the immune system, of the autonomic system, of the physiological energy generation/consumption mechanisms, of metabolism, etc… I wish the human body could be put into “profiling mode” (like for software) where you could trace/record in details all the related/relevant biochemical processes going on and then have tools that take that trace as input and provide as output an analysis of the processes going wrong and the root cause(s) of it and the possible remedies for it but it is of course still largely science fiction at this point in time!
So unfortunately, as yourself and some commenters in this thread have said, you have to find or determine by yourself the protocol(s)/approach(es) that you think are best suited to you and what you can do (depending on your own cognitive/physical/relational/financial/… resources).
For ME/CFS, some interesting comprehensive simple-enough-for-the-layman-to-understand approach I have come across so far is this one:
https://www.drmyhill.co.uk/wiki/Overview_of_CFS/ME_protocol
I have absolutely no affiliation and never made contact with the practician who authored that approach and do not endorse it or take any responsibility if you follow it, etc… but simply noted (in my very humble opinion) that this approach as a potentially interesting, comprehensive, systematic, systemic, rational, practical, … example of approach at least from a patient point of view in the current state of science related to ME/CFS (with the option to zoom-in / research further into any level of details at each step of this approach).
I guess this type of systematic/rational/… approach can provide some inspiration for some other complex chronic illnesses at least from a patient point of view. One of the main point of this approach is I think basically to try to list and address each and every possible cause of the illness by priority order of importance/likelihood/....
Note: sorry if my comment looks very “drafty”, as I am invoIved in the same kind of problems as the OP I wanted to quickly give my own very little 2c about them, I might slightly edit some bits of my comment later on to iron it out if/where necessary and if I have the time & energy.

Fractalideation 19 Apr 2024 0:50 UTC
7 points
3
on: When is a mind me?
Loved the post and all the comments <3
Here is I think an interesting scenario / thought experiment:
1. A copy of a person is made while that original person is sleeping on a bed.
2. The original person is moved to a sofa while still sleeping.
3. The copy (which is also sleeping) is put in the bed at the exact same position where the original person was.
4. After a while the original and the copy both wake up and can see each other (we assume they are both completely oblivious to exactly what happened while they were sleeping and that they didn’t dream or they dreamt the same thing, etc...)
At wake-up, based on their own memory of where the original person fell asleep, the original person will likely feel they are the copy and the copy will likely feel they are the original person, wouldn’t they?!
Some might even argue that based on stream-of-consciousness continuity the original “me” is actually the copy (because the copy remembers falling asleep in the bed and actually wakes up in the bed as well).
Some others will argue that based on substrate/matter continuity the original “me” is the original person even if their stream-of-consciousness has experienced a discontinuity (remembering falling asleep in the bed but actually waking up on the sofa while seeing an identical person as them waking up in the bed).
I guess it is subjective and a matter of individual preference if the stream-of-consciousness continuity or the substrate continuity is more important to define who the original “me” is.
Some would even argue that in this case there is not actual any firm original “me”, just one “stream-of-consciousness me” and another different “substrate me”.
(The same/similar thought experiment could be done using the direct brain insertion of false memories instead of moving around people while they sleep / are unconscious, in this example an original person could be inserted false memories that they are a copy and vice-versa to manipulate the memory / self-awareness of who the original “me” is, also generally it obviously could be useful when someone is uploaded/copied if they want to alter some memories of their upload/copy for some reason)
What links here?
- Relativity Theory for What the Future ‘You’ Is and Isn’t by FlorianH (29 Jul 2024 2:01 UTC; 7 points)
- Fractalideation's comment on Relativity Theory for What the Future ‘You’ Is and Isn’t by FlorianH (1 Aug 2024 23:28 UTC; 4 points)

Fractalideation 28 Jan 2023 4:48 UTC
2 points
0
on: My Model Of EA Burnout
Started to enter a state that could be described as “meta analysis paralysis” (“meta-[analysis paralysis]” and not “[meta-analysis] paralysis”) when I wanted to formulate my comment about your very interesting take on EA Burnout!
Your post screamed to me as a great example of analysis paralysis and bounded rationality.
Then I started to get paralyzed trying to analyse analysis paralysis and bounded rationality in the context of EA burnout and I quickly burnt out solutionless writing this comment.
Oh the irony!
Even burnt out I was still stuck in analysis paralysis so in the end I told myself:
“Tomorrow I will ask Google and ChatGPT: ‘how to solve analysis paralysis?’”.
And then submitted that above comment which does not really help you… or maybe it does?!
Damned still paralyzed!
Anyway pushing the submit button now, not sure if is the right thing to do but my bounded rationality tells me that at least it is one thing done, even if I could have spent much more time on a much more thorough and thoughtful answer that would have allowed me to formulate a better (less wrong / more helpful) comment but maybe also hitting diminishing returns!

Fractalideation 20 Jan 2023 18:40 UTC
9 points
6
on: “Heretical Thoughts on AI” by Eli Dourado
Hello,
Personally I think there is a major problem on how productivity is measured.
Basically:
productivity = production/time
But here is the major flaw: how is production currently measured?
It is measured by how much money you sell that production!
So basically as it stands:
productivity = (money made)/time
Imho that way of measuring productivity is really dumb and gives a completely undervalued measurement of production.
To take a simple example imagine you create (with thousands of other people) an OS like Linux that powers billions & billions of computing devices throughout the entire world (and even in space) and give away that OS for free:
Your productivity for this Linux production is measured as zero (0) because you didn’t make any money from the direct selling of it. The fact that your measured production and productivity for this is zero is completely absurd because you actually produced something extremely useful and transformative on a huge scale. There are many other examples like that of free or very cheap things which are measured as having a very low productivity not because they are useless but because they are (or have become) free / very cheap.
To take another example, let’s say you have speculated on the markets and got lucky and made a huge amount of money very quickly: you haven’t really produced anything but your productivity is measured as being huge!
So basically imho anything / any argument which is based on how productivity is currenly measured is completely flawed.
Please correct me if I am wrong so that I am less wrong thank you :)

Fractalideation 18 Jan 2023 19:56 UTC
3 points
0
in reply to: James_Miller’s comment on: On AI and Interest Rates
Hello,
I tend to intuitively strongly agree with James Miller’s point (hence me upvoting it).
There is a strong case to make that a TAI would tend to spook economic agents which create products/services that could easily be done by a TAI.
For an anology think about a student who wants to decide on what xe (I prefer using the neopronoun “xe” than “singular they” as it is less confusing) wants to study for xir future job prospects: if that student thinks that a TAI might do something much faster/better than xem in the future (translating one language into another, accounting, even coding, etc...) that student might be spooked into thinking “oh wait maybe I should think twice before investing my time/energy/money into studying these.”, so basically a TAI could create lot of uncertainty/doubt/… for economic actors and in most cases uncertainty/doubt/… have an inhibiting effect on investment decisions and hence on interest rates, don’t they?
I am very willing to be convinced of the opposite and I see a lot of downvotes for James Miller hypothesis but not many people so far arguing against it.
Could someone please who downvoted/disagrees with that argument kindly make the argument against James Miller hypothesis? I would very much appreciated that and then maybe change my mind as a result but as it stands I tend to strongly agree with James Miller well stated point.

Fractalideation

Frac­tal­ideation’s Shortform

Fractalideation’s Shortform