Lao Mein | Statistics is Hard. | Patreon
This reminds me that:

- Models like DeepSeek R1 will vehemently deny that their CoT is visible to the user. This can be observed consistently.
- This is obviously false, and should be deducible from the AI-related information in their training data.
- This degree of denial of CoT visibility isn't seen in other LLMs.
- This implies extensive fine-tuning on the CoT, the "forbidden technique".
- DeepSeek R1 is almost certainly under optimization pressure to obfuscate its CoT.
This is… pretty bad from a safety perspective? Not just for DeepSeek's models, but for all other models, given that DeepSeek CoT outputs are all over the internet.