momom2

Karma: 401

AIS student, self-proclaimed aspiring rationalist, very fond of game theory.
“The only good description is a self-referential description, just like this one.”

momom2 18 Jun 2026 22:23 UTC
3 points
0
in reply to: 2001zhaozhao’s comment on: War of Dots: CRUSHING my opponents with FACTS and LOGIC
Well, not quite. For one, a recent update changed that value (I think it’s only 1.5x as fast when attacking?), and second, I assumed a fixed enemy damage rate (a stable situation, whether because the enemy is out of morale or because of averaging over a longer time than the enemy’s typical cycling speed).

War of Dots: CRUSHING my opponents with FACTS and LOGIC

momom2 18 Jun 2026 12:07 UTC

17 points

2 comments, 7 min read, LW link

momom2 17 Jun 2026 7:37 UTC
6 points
0
on: momom2′s Shortform
According to Bloomberg, the US and Iran are expected to sign a memorandum of understanding whose core contents are:
- return to pre-war status quo on territory, sovereignty, Strait of Hormuz, nuclear programs, etc.
- $300B reparations from US to Iran

De-paywalled full text here: https://www.bloomberg.com/news/articles/2026-06-16/read-the-14-point-draft-memorandum-between-the-us-and-iran?accessToken=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJzb3VyY2UiOiJTdWJzY3JpYmVyR2lmdGVkQXJ0aWNsZSIsImlhdCI6MTc4MTY1NjY2MywiZXhwIjoxNzgyMjYxNDYzLCJhcnRpY2xlSWQiOiJUR1FUVkRUOTZPU0cwMCIsImJjb25uZWN0SWQiOiI4M0Q4RjJERjFDQzA0MDFFQTlBNjg1RjY3N0FGQURERiJ9.Bq9TVRVNXZu1ep06Y3KiLnhlcb9SQ_ZKska1ZbY8yVM&leadSource=uverify%20wall

(Courtesy of pie_flavor, from the ACXD server.)

momom2 12 Jun 2026 15:19 UTC
1 point
0
in reply to: Trinley Goldenberg’s comment on: Why it’s so hard to talk about Consciousness
I count traumatic recollection of pain as a physical injury—it is a noticeable alteration to my behavior, encoded in physical matter. If there is none of it, then there’s no pain in the first place and the one and I aren’t talking about the same thing.
If the one proposed to stimulate me with contact heat thermodes, I would refuse, anticipating undesirable pain. But if the one did it anyway, I would argue that I have been injured, even if no scar remains—the injury lives on in my memory of the experience if nowhere else.

momom2 22 May 2026 8:55 UTC
1 point
0
in reply to: Richard_Kennaway’s comment on: Don’t be too Clever to Take Obvious Advice
Huh, my experience was somewhat reversed: I took all such advice literally, often to great success, and only learned later on that people expected me to ignore it and treat it with disdain, but too late for me to integrate that attitude.
For example, I was told to always salute my teachers and call them Mr. and Ms., something which they enjoyed, which no one else did, and which contributed to giving me positive relationships with my teachers in college.
Another example: I was told to care for the poor and give my belongings to them. This resulted in a scolding and severe disillusionment for giving away grocery money, which helped me get out of religion, but also matured into attraction towards EA.
I guess it depends on how good you are at taking social cues? Perhaps such advice is good to integrate, but unfashionable to display, such that it is good for society to expose children (unable to understand that the advisor doesn’t believe the advice) to it, but detrimental for individuals to be seen taking it seriously.

momom2 21 May 2026 16:46 UTC
3 points
0
on: Parfit’s Escape (Filk)
Haha, this was hilarious! Thanks for writing this.

momom2 21 May 2026 16:20 UTC
3 points
0
on: Less Wrong Poetry Corner: Walter Raleigh’s “The Lie”
It’s really funny because I listened a lot to the song Ken Theriot made on this poem, and I thought it was very beautiful… except I thought “give them the lie” meant “lie to them”.
I interpreted it as displaying an example of a cynical conman, going from institution to institution with utter disdain for ideals, and eventually dying in misery no closer to truth or happiness for having been a contrarian, with the lesson being to hope in ideals even should you recognize their failings.

momom2 20 May 2026 15:36 UTC
3 points
0
in reply to: AnthonyC’s comment on: Two arguments against longtermist thought experiments
Long after the fact, I now realize that there is another argument you may have been talking about regarding the value of the species:
If the reason we care about humanity is that we care about each of its individuals (regardless of temporal distance) then we could consider that there exist N individuals in humanity, and then the longtermist thought experiment asserts that it is better to reach N-100 by processing the waste than to reach N-1000 by burying it.
In that case, I would answer that whether burying or processing the waste, N remains almost unchanged in expectation because population rebounds for any non-X risk.

So I guess the lesson there is to disregard any longtermist reasoning that doesn’t have such extreme gravity that the extreme volatility and predeterminedness of the future doesn’t blur all choices together.

I am assuming that any individual action basically doesn’t matter because balancing forces achieve almost the same consequences in the world where you counterfactually choose the opposite, which I’m admittedly not that confident about...

momom2 20 May 2026 14:14 UTC
1 point
0
on: Give my children minds
I recommend reading the verses while trying to sing them to the music. The rhythm may not make much sense without Kathy Mar’s instrumentation.

Give my children minds

momom2 20 May 2026 14:14 UTC

7 points

1 comment, 1 min read, LW link

momom2′s Shortform

momom2 6 May 2026 12:53 UTC

4 points

2 comments, 1 min read, LW link

momom2 6 May 2026 12:53 UTC
42 points
19
on: momom2′s Shortform
I read the ARC-AGI-3 paper entirely, and I’m unimpressed.
The “100% human-solvable, <1% AI solved” is basically p-hacking. They cook their metrics to guarantee high human scores and punish any sub-human score. They also prevent measurement of super-human performance, so in practice it’s close to a binary metric of “matches best human or not”.

There are also a number of inconsistencies in the stated methodology, but they’re non-central.

Their metric is:
An environment must be solved by at least ²⁄₁₀ humans. Among the successes, pick the median (¤) of actions taken; that’s the baseline (per level of the environment), call it b.
Humans are defined as 100% for being the baseline (no analysis of how many humans solve the environment, or whether the average score is 100% or any deeper analysis of human performance).

An environment has n levels. Levels are attempted sequentially, in increasing order of difficulty; solving one unlocks the next one. The environment is solved if all levels are completed.

If a model doesn’t solve a level, it scores 0 on that level (and subsequent ones). If it does solve it in m steps, it receives (b/m)² score. (*)
Then take the weighted average of its scores over levels, where level k is weighted k.

(*) If the model is better than human (m < b), its level score is clamped at 1.15, but tbh it doesn’t really matter. Also, environment score is clamped at 1 for some reason.
(¤) They say “upper-median best”, which doesn’t make sense, and their example is the median of people who solve the environment, so I’m going with that interpretation.
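
To make the arithmetic concrete, here is a minimal Python sketch of the metric as I described it above. It follows my reading of the paper, not any official implementation; the function names, the use of `None` for an unsolved level, and the constant names are my own assumptions for illustration.

```python
from typing import Optional, Sequence

LEVEL_CAP = 1.15  # per-level clamp from footnote (*)
ENV_CAP = 1.0     # environment-level clamp from footnote (*)


def level_score(baseline: float, steps: Optional[int]) -> float:
    """Score one level: 0 if unsolved, else (b/m)^2 capped at LEVEL_CAP."""
    if steps is None:
        return 0.0
    return min((baseline / steps) ** 2, LEVEL_CAP)


def environment_score(
    baselines: Sequence[float], steps: Sequence[Optional[int]]
) -> float:
    """Weighted average of level scores, where level k has weight k.

    `steps[k-1]` is the number of actions the model took on level k,
    or None if it failed (hypothetical encoding). Levels are sequential,
    so every level after the first failure also scores 0.
    """
    total = 0.0
    weight_sum = 0
    failed = False
    for k, (b, m) in enumerate(zip(baselines, steps), start=1):
        weight_sum += k
        if m is None:
            failed = True
        if not failed:
            total += k * level_score(b, m)
    return min(total / weight_sum, ENV_CAP)
```

With a baseline of 10 actions on each of three levels, a model that takes 20 actions per level gets `environment_score([10, 10, 10], [20, 20, 20]) == 0.25`: twice as many actions as the median successful human is worth a quarter of the score. A model that matches the baseline on level 1 and then fails level 2 gets 1/6, since levels 2 and 3 carry five sixths of the weight.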

There are two problems with this metric:
- Human variance. The baseline might be ultra-optimized, close to optimal, depending on the environment; or it might not. In their empirical evaluation of optimal score (probably from human performance not-first-run?), it’s clear that the baseline is very noisy.
- The way it’s calculated punishes sub-human performance quadratically for no reason, and upweights the hardest levels, which means that it’s most informative only when approaching human performance.
In addition, they decide to refuse any harness, even though it’s obviously the next step to get better on that kind of problem (they also show that ARC-AGI-3 is saturated with a human-made harness, so maybe they just didn’t want their benchmark to be obsolete before it was even out).
It’s like if they refused CoT on ARC-AGI-1.

I guess solving ARC-AGI-3 will be pretty trivial as soon as a model is RLHF’d to self-harness by default as a first step to any task.
Approximating human scores from the graphs, with Claude’s help, I get human average performance in the 40~80% range.

momom2 14 Apr 2026 21:41 UTC
3 points
0
on: List of great filk songs
I highly recommend Mercedes Lackey’s songs too. In no particular order, I especially enjoyed:
- Battle Dawn. The will and fury of having Something To Protect. Or in this case, something to avenge.
- Shadow Stalker. Depression vanquished.
- Tale’Sedrin. The Sunhawks.
- Threes. Many of Mercedes Lackey’s songs are similarly humorous, but this is my favorite.
There are many, many more.

momom2 6 Apr 2026 10:36 UTC
1 point
0
in reply to: Salvinia’s comment on: Lesswrong Liberated
With no link to it, it’s somewhat hard to tell.

momom2 19 Jan 2026 22:10 UTC
24 points
2
on: The truth behind the 2026 J.P. Morgan Healthcare Conference
I did not realize this was tagged fiction and at first I thought this was the introduction of a Scott-like post, then I kept getting more and more disappointed as it slipped into conspiracy theory (because there’s scant if any justification for the death of the creature making California slip into the sea—corpses don’t just disappear—and the connection with biotech seemed tenuous at best).

momom2 10 Oct 2025 8:23 UTC
1 point
0
in reply to: AlexMennen’s comment on: How Much Evidence Does It Take?
Thanks! This deconfused something for me which I was confused about for a long time!

momom2 9 Sep 2025 15:07 UTC
2 points
0
on: The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models
Finally, we found something very odd: NousResearch/Hermes-3-Llama-3.1-8B
Based on what you say afterwards, I think you mean 3.2-3B here.

momom2 26 Jul 2025 2:51 UTC
3 points
−1
in reply to: philh’s comment on: HPMOR: The (Probably) Untold Lore
Brooms accelerate and decelerate (until they reach cruising speed in a few seconds, or they stop). But they don’t accelerate faster down than up; in that sense, they don’t work on classical physics.

momom2 8 Jun 2025 18:00 UTC
6 points
0
in reply to: rotatingpaguro’s comment on: The Value Proposition of Romantic Relationships
My experience disagrees. I’m probably (diagnosed by my therapist but not a doctor) autistic and I have both a pretty deep intuitive understanding of intimacy as described here, evidenced by writing stories that include it, and little to no bad experience with misunderstanding it—though mostly because I didn’t have intimate relationships at all, I was aware enough of what was at stake to not make myself vulnerable.

momom2 8 Jun 2025 17:06 UTC
1 point
0
in reply to: aphyer’s comment on: D&D.Sci: The Choosing Ones
Thank you very much! This is very clear!
