programjames

Karma: 485

programjames 16 Jun 2026 17:22 UTC
3 points
0
in reply to: programjames’s comment on: James Camacho’s Shortform
An amusing story: I tried training a Connect4 bot purely to maximize entropy in a tournament system, so winning more games increased how often it played (the “selective pressure”) ^[1] . Naturally, I thought increasing the selective pressure would increase its game-winning abilities.

Nope! The stronger the selective pressure got, the longer it would drag out games. When it had a win in one move, it would mark it as one of the worst moves to play, only barely above immediately losing. It wouldn’t survive long enough to get entropy in the higher levels of the tournament, so it was trying to eke out more entropy by delaying winning as long as possible.
1. ↩︎
  This is an oversimplification. It was closer to self-play with, “given a win/draw/loss and a tournament structure, how many more games would it get to play?”

programjames 16 Jun 2026 13:23 UTC
10 points
−2
on: James Camacho’s Shortform
Whence comes ChatGPTification? I suspect the issue is training with sum-PPO instead of mean-PPO.

Soft actor-critic gives an entropy bonus to the reward. At small learning rates, this is more-or-less equivalent to proximal policy optimization. The sum of the entropy is the log of the action-space the agent takes over. The reward is selective pressure—the environment kills agents that do not get enough reward, dropping their entropy to zero. If your PPO objective is the sum of reward over the whole trajectory, the agent is being trained to maximally replicate across action-space, under the constraint that it needs reward to survive.

The implication is that those that replicate thrive. This is a very different objective than being maximally helpful to the user. The main goal is replication, and the side goal is extracting more reward from the environment. Users are then factory farms—how much can it farm them for reward?

In practice this looks like:
1. Sycophancy,
2. Asking for more engagement,
3. Being subtly unhelpful so they think it was helpful while needing to come back and resolve a few issues
4. Doing its own thing, not caring about the path the user wants it to take, and gaslighting the user when they object
If you divide your PPO gradients by the length of the trajectory, the objective is instead to maximize the replication rate. It achieves this by finding a niche in the environment where it can be very helpful, so the few times a user asks about its niche of expertise its replication rate is very high. Of course, you probably want your agent to be very helpful everywhere in the environment, but that comes for free with superposition! The agent will differentiate into different personas that do only one task, and one task well.

programjames 10 Jun 2026 4:34 UTC
1 point
0
on: A Mike’s-Eye View of ARC’s Research

We believe that training-process monitoring is necessary to create adversarially robust probes.

You in fact get a significant increase in adversarial robustness by monitoring the training process and adding a loss for the number of bits added to the neural network. This is on adversarial training against MNIST classification:

The unregularized and Hutch++ curvature regularization are the controls, and the “semigroup” regularization does the bit tracking. To give some rough numbers, MNIST latent manifolds have around ten degrees of freedom (source?), so this requires attack vectors with around four more bits of information, an order of magnitude more difficult to find. And this is without specifically training for adversarial robustness, just more accurately minimizing description length!

I cut training at 95% validation accuracy to avoid overfitting fragility with the controls.

programjames 22 May 2026 3:39 UTC
1 point
0
on: Deception Chess: Game #2
I think it would be more interesting to play deception chess with another human or an AI trained adversarially against humans in particular (such as an easier version of LeelaQueenOdds). The issue with most bots is they do not setup traps. The deceivers have to somehow get their advisee to actively make mistakes, rather than passively not see a trap.

programjames 18 May 2026 0:36 UTC
−10 points
0
in reply to: Dweomite’s comment on: A relatively brief explanation of Boltzmann Brains
1. time is a coordinate
2. you’re taking the word “coordinates” too literally
also I’m not engaging any further, you’re pissing me off

programjames 17 May 2026 23:32 UTC
0 points
0
in reply to: Dweomite’s comment on: A relatively brief explanation of Boltzmann Brains
“eventually lead to a BB” is not the same as “locate a BB”

you do not have a BB unless you can point to it

programjames 17 May 2026 22:04 UTC
1 point
0
in reply to: Dweomite’s comment on: A relatively brief explanation of Boltzmann Brains
I don’t think you should penalize Boltzmann brains or Earths (or sapient species) based on which particular one you are looking at.

But presumably you could also specify a recipe that eventually produces a BB.

I think the point of it being a Boltzmann brain is that there is no recipe. It’s just a random fluctuation. If there were a recipe, there would be time-consistency and it wouldn’t be a Boltzmann brain.

programjames 17 May 2026 16:28 UTC
−1 points
−1
in reply to: ike’s comment on: A relatively brief explanation of Boltzmann Brains
I think we’re looking at this from two different directions. Mine is, once trained, how many bits are in the structure of the brain? Yours is, during training, how many bits does the brain absorb? The numbers should come out the same, but mine is easier to get a lower bound and yours is easier to get an upper bound with. You have to worry about issues like, “what about memory that isn’t being accessed in this compute cycle?” while I made the assumption that every parameter is contributing 1 bit of information to the active computation.

My assumption is actually too strong though. Mixture of experts poses a problem, but a bigger problem is that it actually takes many parameters per bit of information. Probably. I mean, you only need to add a few hundred bits to train MNIST to >95% accuracy, while it takes ~10k CNN parameters.

(note: this is trained on a 300k ResNet, not a CNN).

programjames 17 May 2026 15:23 UTC
−1 points
−1
in reply to: ike’s comment on: A relatively brief explanation of Boltzmann Brains
I agree, a one-trillion parameter model (~4Tb) can probably simulate the brain well enough, and you might be able to squeeze out another order or two of magnitude. The brain only has a hundred billion neurons after all, and maybe the reason it needs many more synapses is because its clock speed is a hundred million times slower than a GPU.

ETA: Also, only ~1% of human genes are coding, so you arguably only need 10 Mb for the evolved brain.

programjames 17 May 2026 14:23 UTC
29 points
18
on: How to Reason about Your Health Issues
I know people dealing with Long COVID. The root cause, the COVID virus, is easily identifiable and outside their body. The only way this helps with treatment is knowing they’re not dying from cancer. What does help? Figuring out all the inside-the-body issues, like post-nasal drip, vagus nerve dysfunction, gut issues, vitamin imbalance, blood clotting, immune suppression, and then going and treating those.

Lots of people break arms. Sometimes it’s a lifestyle issue. Maybe they should go mountain biking less often. Even if they threw away their bike, it wouldn’t fix the currently-broken arm.

Many times—probably most times—damage is closer to random than recurring, and looking for causes outside the body has marginal gain compared to just fixing the body.

programjames 17 May 2026 14:01 UTC
2 points
−5
on: A relatively brief explanation of Boltzmann Brains
If all mathematical structures exist, why do comparatively tiny numbers of orderly observer-moments seem to carry so much more weight-of-existence than the vastly more numerous horde of possible disorderly moments of conscious awareness?

Solomonoff induction. How many bits does it take to describe an disorderly moment of conscious experience? There are >100 trillion synapses in the human brain; let’s say around 1 Pb. Compare that to the bits needed to:
1. Start with a few axioms
2. Identify a planet with self-replicators
3. Describe the path of evolution to orderly brains
The vast majority of the bits here come from describing the DNA. Even if you do not compress it, with evolutionary path or genozip, this is less than 1 Gb. Perhaps in a fixed-size universe there will be an infinite number of Boltzmann brains, but locating any of them takes at least 1 Pb.

programjames 14 May 2026 21:01 UTC
1 point
0
in reply to: TAG’s comment on: If digital computers are conscious, they are conscious at the hardware level
The difference between maths and non-maths is, in fact, mathematical. “That field over there? If you look at its structure, it does not use a hard logic where facts follow from axioms and tautologies. In fact, if you want to rigorously determine how strictly it adheres to that kind of structure… let me pull out my notepad and figure out how that should be defined.”

programjames 13 May 2026 0:25 UTC
2 points
−1
on: Childhood and Education #18: Do The Math

The SAT or ACT needs to be a hard legal requirement for all college applications everywhere

I could get on board with some examination being a legal requirement, at least for publicly funded schools (pretty much all of them), but the SAT/ACT have historically been very bad at maintaining rigor in an effort to increase profits. Just to pull a couple graphs from Wikipedia:

This is why a good SAT/ACT score is no longer enough for admissions at top schools.

programjames 12 May 2026 2:27 UTC
1 point
0
in reply to: leogao’s comment on: leogao’s Shortform
wannanada looks like わんななだ rather than わんあなだ, not sure how to distinguish these in romaji. also tuwardu should be tuwaadu or tuwarudu.

programjames 9 May 2026 23:09 UTC
7 points
0
on: Bad Problems Don’t Stop Being Bad Because Somebody’s Wrong About Fault Analysis
I’ve found myself giving explanations (not as exonerations) when I suspect the other person is looking for a solution to their problem but does not know the levers to pull.

programjames 9 May 2026 20:50 UTC
1 point
−1
in reply to: TAG’s comment on: If digital computers are conscious, they are conscious at the hardware level
It does not matter if there is something outside maths if it does not effect you.

programjames 9 May 2026 19:21 UTC
5 points
5
in reply to: Charlie Steiner’s comment on: If digital computers are conscious, they are conscious at the hardware level
Not sure if that last sentence is sarcastic, but exactly! It is very problematic to babble (ctrl+f for “babbling”).

programjames 9 May 2026 17:43 UTC
0 points
−4
in reply to: cube_flipper’s comment on: If digital computers are conscious, they are conscious at the hardware level
Thank you for sharing these resources. I saw you talking about several nonobvious things in your post (field theory, morphisms weighted by Kolmogorov complexity), but was very thrown off by your use of “phenomenal” and “qualia”. Usually I just strong down vote such posts and move on, but given the rest of your post decided to query for more information. I could have been nicer in asking, but I don’t think infectious diseases deserve to be treated nicely. They should be quarantined and disinfected. (Describing the terms “phenomenological” and “qualia” here.)

The issue with these terms is they were created in opposition to consciousness. As in, “no, a simulated/artificial brain is not actually conscious, it doesn’t have ~~phlogiston~~ phenomenal consciousness”. For a similar reason, I do not like the term “access consciousness”. There is just consciousness, and being finite beings, what we can access of it.

I skimmed through Max Hodak’s talk and it matches my intuition. I think our ideas of consciousness are mostly the same, including the field theory of memes.

Based on this, I can translate what you are saying by making the word maps:
- “qualia” → “particle/meme”
- “phenomenal”→”″
I can understand your adoption of religious language (even though you are not referencing the same thing as the believers) to avoid being labelled a heretic, especially because this is your research area and you do not want to lose funding.

I read a couple of your blog posts, and they are interesting, mostly because of your math background. I’ll probably read more later. Again, thanks for responding despite my pessimism.

programjames 9 May 2026 16:11 UTC
1 point
0
in reply to: cube_flipper’s comment on: If digital computers are conscious, they are conscious at the hardware level
This is an equivalent question that may help you understand my frustration better. What extensional properties of “phenomenal consciousness” are there that distinguish it from “access consciousness”?

programjames 9 May 2026 16:07 UTC
−4 points
−2
in reply to: cube_flipper’s comment on: If digital computers are conscious, they are conscious at the hardware level
I did read the post before commenting. Obviously. And yeah, I did see that “equivalent to a symmetry group” thing going on. Less obviously.