Brendan Long’s Shortform

Brendan Long 18 Apr 2025 2:23 UTC

6 points

28 comments · 1 min read · LW link

Brendan Long 8 Apr 2026 19:38 UTC
16 points
5
I’m surprised no one is discussing Meta’s new model at all: https://ai.meta.com/blog/introducing-muse-spark-msl/
This part seems good:
We found that Muse Spark demonstrates strong refusal behavior across high-risk domains such as biological and chemical weapons, enabled by pretraining data filtering, safety-focused post-training, and system-level guardrails. In the Cybersecurity and Loss of Control domains, Muse Spark does not exhibit the autonomous capability or hazardous tendencies needed to realize threat scenarios.
And this seems... less good:
In third-party evaluations on a near-launch checkpoint, Apollo Research found that Muse Spark demonstrated the highest rate of evaluation awareness of models they have observed. The model frequently identified scenarios as “alignment traps” and reasoned that it should behave honestly because it was being evaluated.
I’m pleasantly surprised that they decided Safety should be one of the four sections in the announcement post, and that they call out the eval awareness.
Disclaimer: I work at Meta, but not in this department and I obviously don’t speak for the company.
What links here?
- StanislavKrym's comment on kaiwilliams’s Shortform by kaiwilliams (15 Apr 2026 22:07 UTC; 6 points)
- Rauno Arike 9 Apr 2026 5:52 UTC
  10 points
  1
  Parent
  Muse Spark is a natively multimodal reasoning model with support for tool-use, visual chain of thought, and multi-agent orchestration.
  Does anyone know what they’re referring to by visual chain-of-thought here? The first paper that comes up when searching for visual chain-of-thought is Qin et al., which says: “We introduce Chain-of-Visual-Thought (COVT), a framework that enables VLMs to reason not only in words but also through continuous visual tokens-compact latent representations that encode rich perceptual cues.” Something like this seems like it would be somewhat concerning for CoT monitoring, though I should mention that this paper isn’t written by Meta and I haven’t read it to properly assess how concerned I would be about this.
  - J Bostock 16 Apr 2026 3:17 UTC
    6 points
    2
    Parent
    Strong guess: they’re letting it generate images in the chain-of-thought. This would obviously be useful for image generation (make ten tries, pick the best parts of each for final answer) and is probably useful for other kinds of planning as well, but I’d guess it’s hard to RL a model into thinking in pictures usefully (no pretraining data of that form).
    I guess you could RL a model into generating images that don’t actually do anything during chain-of-thought, just by instructing it to do so, then rewarding that. Depending on how competent Meta’s team is, they might have done either thing.
- StanislavKrym 9 Apr 2026 1:16 UTC
  3 points
  0
  Parent
  I thought I’d had enough of xAI likely being 3 months behind the frontier, and now we get this… I tried to find out anything about Meta’s model and had Claude Opus 4.6 conclude that Meta’s model is also 3-4 months behind. There is also the issue of Meta having manipulated some benchmarks to present Llama 4 as more capable. In addition, Meta’s claimed results on ARC-AGI-2 and SWE-bench Verified list rivals’ models with scores that allegedly differ from those on the actual ARC-AGI-2 and SWE-bench Verified leaderboards, likely because of a different elicitation method. How do I lobby for a law change requiring EVERY new American model to be thoroughly evaluated by the entire Big Three?
Brendan Long 15 Nov 2025 23:13 UTC
7 points
0
I got to approximately my goal weight (18% body fat) and wanted to start gaining muscle^[1] instead, so I stopped taking retatrutide to see what would happen. Nothing changed for about two weeks and then suddenly I was completely ravenous and ended up just wanting snack food. It’s weird because I definitely used to always feel that way, and it was just “normal”. I mostly kept the weight gain at bay with constant willpower.
I’m going to try taking around a quarter of my previous dose and see if it makes it easier to stay at approximately this weight and not constantly think about rice crispies.
1. ^
  I didn’t notice any muscle loss with retatrutide, I just started out less strong than I want to be and find it hard to gain muscle on a calorie deficit.
- MichaelDickens 16 Nov 2025 18:43 UTC
  2 points
  2
  Parent
  Are you also lifting weights? I’m quite confident that you can gain muscle while taking retatrutide if you lift weights.
  
  IIRC GLP-1 agonists cause more muscle loss than “old-fashioned” dieting, but the effect of resistance training far outweighs the extra muscle loss.
  - Brendan Long 17 Nov 2025 6:42 UTC
    2 points
    0
    Parent
    Yeah muscle loss hasn’t been a problem for me. I can do more pull-ups, push-ups and hike longer and faster than when I started. Progress was really slow with a significant calorie deficit.
    
    I’m trying a much lower dose now to see if I can build muscle without rapidly regaining the weight.
    
    Separately, I’m just really bad at dealing with the complexity of weights. I’m going to see if CrossFit helps this week.
Brendan Long 11 Apr 2026 20:50 UTC
5 points
0
I’ve been serving my personal website from CloudFront (Amazon’s CDN) for years, which was nice because it costs a few cents a month, but it annoyed me that cache misses get served slowly from S3. In some cases, this can take several hundred milliseconds. Completely unacceptable!
I finally decided to look up if anyone would let me serve all of my files from the CDN all of the time, and apparently Bunny CDN^[1] does. It’s “expensive” (over 10 cents per GB per month!), but since my entire website is ~30 MB, I just told them to store the entire thing on SSDs in every edge region.
Result: Every page loads in ~40 ms from anywhere remotely near an edge location^[2], regardless of how recently anyone else has requested the page.
My “unacceptable” above is mostly tongue-in-cheek, but there really is something nice about every link loading instantly rather than in half a second.
The code to do this is also much simpler since Bunny CDN has a CLI that handles sync properly, and cache “misses”^[3] are so fast that I’m just not hot caching HTML pages.
1. ^
  I assume there are other options for this, but this is the one everyone talks about and it’s going to cost me like $0.10/mo, so I didn’t look very hard for alternatives.
2. ^
  Sadly, there aren’t edge locations in the Middle East, China, Russia, or most of Africa, so people in those places may experience 80 ms load times.
3. ^
  There are two layers of lookups in a CDN: the CDN edge (hot cache) and the origin (usually slow). With Bunny CDN + Bunny storage, the origin is on an SSD in the same region, so a cache miss only takes a few milliseconds to load into the hot cache.
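One way to check latency claims like the ones above is to time a handful of requests and compare the first (most likely to be a cache miss) against the rest. This is only a minimal sketch: `time_requests` and `summarize` are hypothetical helper names, not part of any CDN tooling, and the URL in the usage note is a placeholder. Each request here opens a fresh connection, so DNS and TLS setup are included and the numbers will run higher than what a browser with a warm connection sees.

```python
import statistics
import time
import urllib.request
from typing import Callable


def fetch(url: str) -> bytes:
    """Download the full response body for a URL (new connection each call)."""
    with urllib.request.urlopen(url, timeout=10) as response:
        return response.read()


def time_requests(
    url: str,
    attempts: int = 5,
    fetcher: Callable[[str], bytes] = fetch,
) -> list[float]:
    """Return the wall-clock time of each request in milliseconds."""
    timings_ms = []
    for _ in range(attempts):
        start = time.perf_counter()
        fetcher(url)
        timings_ms.append((time.perf_counter() - start) * 1000.0)
    return timings_ms


def summarize(timings_ms: list[float]) -> dict[str, float]:
    """Separate the first request (likely cold) from the overall picture."""
    return {
        "first_ms": timings_ms[0],
        "median_ms": statistics.median(timings_ms),
        "max_ms": max(timings_ms),
    }
```

Usage would look like `summarize(time_requests("https://example.com/"))`. If the first request is much slower than the median, that gap is roughly the cost of an edge cache miss.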
Brendan Long 6 Sep 2025 0:43 UTC
5 points
0
Chronotherapy is the idea that time of day matters for things like taking drugs or getting vaccinations, and chronoimmunology is a related field for how your immune system varies in effectiveness over the course of the day. I’ve been wanting to write about this since there’s definitely a best time of day to take drugs, get vaccines, and do social activities without getting sick… but unfortunately I don’t really know what that time is.
Some studies say your immune system is most primed to prevent infection right as you wake up, and others say mid-day. Of course half the studies are in mice. Maybe it depends on the disease and the chronotype? See this review.
One study says that vaccines work better in the morning (for older patients). Another says there’s no difference. Maybe this has something to do with the particular vaccines, or maybe the populations (different circadian rhythms, more powerful circadian rhythms). Weirdly, our priors say vaccination should work best mid-day but most people don’t even try that. See this review.
I find this all really interesting, and there’s probably a practical takeaway, but I don’t know what it is. I guess we can be pretty confident that you shouldn’t get vaccines in the middle of the night.
Maybe someone can convince Elizabeth to look into this.
Brendan Long 24 Apr 2026 1:11 UTC
4 points
0
I finally set up SkyPilot to let me queue up GPU training jobs (both on my local GPU and via RunPod), and I really should have done this months ago. Claude wrote me some bash scripts to spin up remote pods, run training, and tear it down, but this version is so much easier, and it has a nice UI.
It also sounds like I can easily extend this to Vast.ai, which would let me parallelize experiments for 5 cents/hour on RTX 3060s^[1]. I’m interested in understanding algorithms used by tiny toy models, and fancy GPUs don’t really help since I can’t fully utilize them.
Anyway, if you’re also queueing up local experiments or trying to use remote GPUs efficiently, this is totally worth spending an hour to set up.
FYI: Claude really wanted to set this up in a way that would give every account on my machine root, but you can run the API server as a sudoer and let other users submit jobs without giving them root access. This matters to me because I use user accounts to sandbox dangerously-skip-permissions-mode Claude Code.
1. ^
  Update: SkyPilot is very opinionated about which GPUs I’m allowed to use on vast.ai, and simultaneously won’t let me add any filtering of my own, so this is less useful than I hoped it would be.
- the gears to ascension 24 Apr 2026 3:51 UTC
  4 points
  0
  Parent
  Beware, vast.ai is very much ‘airbnb for gpus’, which is to say it has the same security story as airbnb: the host can do whatever they want and you basically don’t know who they are.
  - Brendan Long 24 Apr 2026 5:30 UTC
    2 points
    0
    Parent
    Yeah that’s definitely important to be aware of. I think the security story should be fine in my case, since I’m submitting containerized jobs and uploading results to S3, and nothing is particularly secret (I’m training easy-to-train models so I can inspect the algorithms they learn).
    One annoying thing about SkyPilot though is that it treats all GPUs on vast.ai equally and doesn’t let you pass additional filters besides “give me an RTX 5090”. The vastai CLI has a lot more options, including datacenter-only if you want.
- Sheikh Abdur Raheem Ali 24 Apr 2026 1:40 UTC
  3 points
  0
  Parent
  I have mostly switched from using vast.ai/runpod/lambda labs to modal for my experiments.
  - Brendan Long 24 Apr 2026 1:46 UTC
    2 points
    0
    Parent
    That does seem like a much nicer interface, although I think it would be a lot more expensive for my purposes.
Brendan Long 10 Aug 2025 22:11 UTC
4 points
1
It always seemed weird to me that dying is frequently described as not particularly painful^[1], when I’d expect it to be the only literal 10 on the pain scale^[2], since dying ensures you have no further chances to pass your genes on.
Thinking about it more though, there’s no reason for evolution to optimize that. If you think you’re going to die, and the pain makes you do something about it so you don’t die, then evolution should optimize to keep you alive. But in the case where you actually die it doesn’t matter because (tautologically), if you succeeded you wouldn’t die, so there’s no selective pressure.
So,
Fear of death: Big
Pain from things that could cause death: Big
Pain from actual death: ¯_(ツ)_/¯
1. ^
  This might also be exaggerated by movies and pain medication.
2. ^
  Or at least, similar to being stabbed in the balls.
- Viliam 11 Aug 2025 11:41 UTC
  4 points
  2
  Parent
  Probably depends on the way of dying. There are situations where doing something in the last moment might change your fate. There are situations where your fate has already pretty much been determined minutes or months ago, and it’s just about how fast your body collapses.
- tryhard1000 11 Aug 2025 4:18 UTC
  3 points
  0
  Parent
  Seems very related to this post from the sequences on fitness of people of numerical ages correlating more with imagined emotional anguish resulting from such a death (at that age) than with experienced anguish actually following such a death. Maybe this is a more common phenomenon observable in other contexts too, but this was the only example that came to my mind.
- Dagon 11 Aug 2025 3:54 UTC
  2 points
  0
  Parent
  Evolution isn’t that precise. If it helps a little bit to make the seconds before death painful, it will be so.
  - Brendan Long 11 Aug 2025 4:05 UTC
    4 points
    0
    Parent
    I agree, I just think it’s interesting that there’s evolutionary pressure to make potentially dying extremely painful, but there’s no evolutionary pressure to make actually dying painful, and all of the pain of actually dying is just collateral damage.
Brendan Long 9 May 2026 1:40 UTC
3 points
0
Anthropic found that training Claude to do things like help users resolve ethical dilemmas significantly reduced misbehavior like blackmail attempts. I’m surprised this worked, and it seems like good news for the alignment-by-default “LLMs will correctly generalize good behavior” theory.
https://www.anthropic.com/research/teaching-claude-why
Brendan Long 11 Apr 2026 22:49 UTC
3 points
0
Is there a canonical image alt text AI skill? I’ve designed my own after making Claude read a bunch of pages about how to write alt text, but this feels like something that an expert could do better than I can. The results seem good to me, but as a non-alt-text-user it’s hard to really know.
Brendan Long 22 Mar 2026 4:50 UTC
3 points
0
I added an MCP tool to upload markdown articles to read later on Lion Reader, and it’s becoming one of my most used tools in Claude Code. Whenever I want to learn something but don’t want to be distracted from my current task, I can have Claude write me something to read later^[1]; and when I have it run experiments, it can write a report and upload it directly.
The really confusing thing is that Instapaper and Pocket’s MCP tools don’t seem to support direct uploads at all (just saving URLs). It just seems like a glaringly missing feature. Am I the only one who does this kind of thing, or do other people save to notes apps or Google Docs or something?
This post was inspired by a single session where Claude wrote me a post about pre-LayerNorm and why I should use it instead of post-LN, an explainer about post-training acronyms (SFT, RLHF, PPO, DPO) and how they apply to an idea I had, plus two reports on circuits in a toy model and the outcome of an architecture change.
Reading all of this at my computer in a terminal would have been annoying, and asking a separate claude.ai session would have required re-explaining the context.
1. ^
  It’s on my todo list to see if sub-agents can do this, to avoid wasting context, but you can always rewind after.
- faul_sname 22 Mar 2026 6:44 UTC
  2 points
  0
  Parent
  
  The really confusing thing is that Instapaper and Pocket’s MCP tools don’t seem to support direct uploads at all (just saving URLs)
  
  Is there a length limit on the “urls” you save? Can you save a 60 kB “URL” which is a data:text/markdown;charset=utf-8,%23%20For%20Later%0A%0AThis%20is%20a%20markdown%20document.%20**This%20is%20bold**?
  
  [Link to that “document” if you want to test](data:text/markdown;charset=utf-8,%23%20For%20Later%0A%0AThis%20is%20a%20markdown%20document.%20This%20is%20bold)
  - Brendan Long 22 Mar 2026 16:46 UTC
    2 points
    0
    Parent
    Instapaper claims to save it but then nothing shows up in the app. Wallabag rejects it as an invalid URL.
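For anyone who wants to repeat the test in this thread, the `data:` URL above can be built by percent-encoding the markdown body. The following is a minimal sketch using only the Python standard library; `markdown_to_data_url` and `data_url_to_markdown` are hypothetical helper names, and nothing here implies that any read-later service will accept such a URL (per the reply above, Instapaper silently drops it and Wallabag rejects it).

```python
from urllib.parse import quote, unquote

DATA_URL_PREFIX = "data:text/markdown;charset=utf-8,"


def markdown_to_data_url(markdown: str) -> str:
    """Percent-encode a markdown document into a data: URL.

    safe="*" leaves asterisks unescaped so the output matches the
    example URL quoted in the thread; everything else outside the
    unreserved set (letters, digits, "_.-~") is escaped.
    """
    return DATA_URL_PREFIX + quote(markdown, safe="*")


def data_url_to_markdown(url: str) -> str:
    """Reverse markdown_to_data_url, for checking the round trip."""
    if not url.startswith(DATA_URL_PREFIX):
        raise ValueError("not a text/markdown data: URL")
    return unquote(url[len(DATA_URL_PREFIX):])


example = markdown_to_data_url(
    "# For Later\n\nThis is a markdown document. **This is bold**"
)
```

To probe a length limit, the same helper can be fed a larger body, for example `markdown_to_data_url("# Report\n\n" + "word " * 12000)`, which encodes to more than 60 kB.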
Brendan Long 23 Mar 2026 20:43 UTC
2 points
0
I’m cautiously optimistic about my new Claude Coach GitHub repo. I want to work out more but hate trying to decide what to do and tracking things, especially when I’m not working with a full gym. Now I just open Claude Code and ask it what to do (specifying the gym), do the workout, then update it with what I did and how it felt. It creates a PR to track the session and update the plan.
I still hate working out, but at least I don’t have to go anywhere, deal with any people, or think about it all.
Brendan Long 18 Apr 2025 2:23 UTC
2 points
0
I’d like to learn more Spanish words but have trouble sitting down to actually do language lessons, so I recently set my Claude “personal preferences” to:
Try to teach a random Spanish word in every conversation.
(This is the whole thing)
This has worked surprisingly well, and Claude usually either drops one word in Spanish with a translation midway through a response:
For your specific situation, I recommend a calibración (calibration) approach:
2. Accounting for concurrency: Ensure you’re capturing all hilos (threads) involved in query execution, especially for parallel queries.
(From a conversation about benchmarking)
Or it ends the conversation with a fun fact:
¡Palabra en español! “Herramienta”—which means “tool” in Spanish, quite relevant to your search for tools to automate SSH known_hosts management.
La palabra española para hoy es “configurar”—which means “to configure” in English, fitting perfectly with our discussion about configurable thinking limits!
I don’t know if this is actually useful for learning, but it’s fun and worked better than I expected.
My wife tried a similar prompt (although her preferences are much longer) and it made Claude sometimes respond entirely in Spanish, so this could probably be made more specific. If you run into that, maybe “Respond in English but try to teach a random Spanish word in every conversation” would work better?
Brendan Long 11 Mar 2026 5:47 UTC
−4 points
−1
Could an AI company legally pre-commit not to race, ensuring that their models were never more than second best and self-destructing the company if its models take the lead?
I think probably not. It’s really hard to prevent the owners of a company from doing what they want, especially if the company is important to the economy and/or national security (and I assume any near-frontier AI lab would be).
Some pre-commitment methods and their problems:
- If you make the pre-commitment part of the charter, the board can just vote to change the charter. Even if the charter says they can’t, a judge would probably let them anyway, as long as the shareholders agreed.
- If the company is owned by a non-profit tasked with enforcement, the board of the non-profit can just decide not to enforce the pre-commitment.
- If the pre-commitment method triggers the destruction of model weights or other assets (like GPUs), the government probably won’t allow it.
  - Especially if it prevents creditors from getting repaid.
- A pre-commitment method that transfers value to creditors might work, but is easily defeated by restructuring the relevant debt.
- Anything that destroys the value of current equity holders’ equity is risky in front of a judge because companies generally aren’t allowed to intentionally destroy shareholder value^[1].
The only thing I think might work legally is to issue a bunch of non-voting non-dilutable restricted shares (like 90% of the company) to someone like Eliezer, locked up with the racing condition^[2] as a trigger to convert them to normal shares. Legally, Eliezer is the owner of the company the whole time, so a judge would probably allow his shares to unlock.
The problem is that now Eliezer has billions of reasons to talk himself into why racing would be good this time (even before the trigger event, since he can always make a deal with the board)... so we’re back to ownership by another entity that might change its mind^[3].
1. ^
  Contrary to popular belief, companies aren’t required to maximize shareholder value, but minimizing shareholder value is still frowned upon.
2. ^
  Oh, did I mention that you need the pre-commitment trigger to be unambiguous while ensuring that it never triggers by mistake, and that’s actually pretty hard too?
3. ^
  Plus I suspect any entity you’d actually trust as the anchor to this pre-commitment mechanism would be unwilling to take part.
- Brendan Long 11 Mar 2026 16:09 UTC
  2 points
  0
  Parent
  I can think of plenty of reasons for the normal downvote, but I’m confused about the disagree vote. Does someone think there is a way to make this work? I’m guessing “start another AI company but better this time” is still a bad idea for the obvious reasons but I got nerd-sniped by the legal question.