The effects of caffeine consumption do not decay with a ~5 hour half-life
epistemic status: confident in the overall picture, substantial quantitative uncertainty about the relative potency of caffeine and paraxanthine
tldr: The effects of caffeine consumption last longer than many assume. Paraxanthine is sort of like caffeine that behaves the way many mistakenly believe caffeine behaves.
You’ve probably heard that caffeine exerts its psychostimulatory effects by blocking adenosine receptors. That matches my understanding, having dug into this. I’d also guess that, insofar as you’ve thought about the duration of caffeine’s effects, you’ve thought of them as decaying with a ~5 hour half-life. I used to think this, and every effect duration calculator I’ve seen assumes it (even this fancy one based on a complicated model that includes circadian effects). But this part is probably wrong.
Very little circulating caffeine is directly excreted.[1] Instead, it’s converted (metabolized) into other similar molecules (primary metabolites), which themselves undergo further steps of metabolism (into secondary, tertiary, etc. metabolites) before reaching a form where they’re efficiently excreted.
Importantly, the primary metabolites also block adenosine receptors. In particular, more than 80% of circulating caffeine is metabolized into paraxanthine, which has a comparable[2] binding affinity at adenosine receptors to caffeine itself. Paraxanthine then has its own 3-5 hour half-life as it’s metabolized into a handful of other things.
Since paraxanthine is by far the dominant primary metabolite, and its further metabolites are mostly either very short lived or have poor affinity for adenosine receptors, we can fruitfully use a simplified model of Caffeine ⟶ Paraxanthine ⟶ Elimination. The upshot is an effective concentration curve with a broader peak and slower decline—about twice as long to reach half of peak effective concentration, assuming paraxanthine and caffeine are equipotent—than that given by the simple elimination model. When I say effective concentration, I mean the concentration of caffeine that would be needed to produce the same effect.
Below is a simulator that models caffeine and paraxanthine metabolism following ingestion of 100mg caffeine or paraxanthine (link to full version). Note that the relative potency of paraxanthine can be adjusted; I am uncertain about how it compares to caffeine within that 4-fold window (see the section on relative potency).
Paraxanthine supplements
If the above model is correct, paraxanthine itself is sort of like caffeine that behaves the way I mistakenly used to believe caffeine behaves. Directly using paraxanthine as a stimulant would have two major differences/advantages compared to caffeine:
Effects wear off a lot faster, which means you can take it later in the day without affecting sleep as much (or in the morning without affecting a siesta as much)
For a given peak level of stimulation you probably develop less tolerance, since active molecules don’t hang around as long (really just another implication of point 1)[3]
(It might also have some other differences, like a somewhat different profile of effects.)
In the US you can buy paraxanthine itself as a supplement (I believe it only became available recently, in 2022). I’ve been using 100mg capsules intermittently in the last few weeks.[4] Some early impressions:
Taking one or two capsules in the morning has a similar wakefulness-promoting effect to drinking a cup or two of coffee in the morning
I’ve taken one capsule around 4-5pm a handful of times, which did not interfere with going to sleep at 10-11pm
Taking a capsule at 7pm gave me enough energy to do some work in the evening, and did not interfere with going to sleep at 11:30pm
Subjectively the peak effects of 100mg paraxanthine feel similar or weaker than those of 100mg caffeine, while 200mg paraxanthine feels stronger than 100mg caffeine
The effects seem to peak within an hour of dosing (maybe 30-45 minutes)
I sometimes feel a bit of an energy crash 2-3 hours after dosing when I take paraxanthine in the morning, but not in the afternoon/evening
The supplements sold in the US appear to exclusively use enfinity branded paraxanthine, perhaps due to them holding some very broad patents on the use of paraxanthine as a supplement.[5] On their website they emphasize that paraxanthine has a “cleaner” effect than caffeine, is supposedly safer, and has a somewhat shorter half-life that isn’t affected by slow caffeine metabolism.[6] The somewhat shorter half-life undersells this point: it’s effectively much shorter due to the lack of active downstream metabolites. They also don’t say anything about reduced tolerance on the main page (though it’s mentioned in the FAQ).
Exactly how potent is paraxanthine compared to caffeine?
By “how potent”, I mean the binding affinity at adenosine receptors. The binding affinity (Ka) is equivalent to the odds ratio that a ligand will be bound to some particular receptor at some point in time, divided by the concentration of the ligand (since the odds ratio is proportional to ligand concentration).[7] If one substance has twice the affinity as another, you only need half as much to get the same receptor occupancy.
There are four subtypes of adenosine receptors: A1, A2A, A2B, A3. The psychostimulatory effects of caffeine are thought to be mediated by the blocking of A1 and A2A receptors, with A1 receptors probably being more important. I was surprised to learn that the affinities of caffeine and paraxanthine for human A1 and A2A receptors are not well established in the literature. Below are my takeaways from researching this:
Paraxanthine has about 1.5-2x the affinity of caffeine for A1 and A2A in most studies where both have been estimated, across multiple species
Paraxanthine’s affinity for human A1 and A2A receptors has only been measured in one study that I was able to find
This result concurs with paraxanthine having ~2x caffeine’s affinity
This study used a different experimental method for the estimation (adenosine-cAMP dose-response curves) than most other studies (radioligand binding)
The human receptors were expressed by CHO cells (a cell line from hamsters that’s easy to work with)
Human measurements of caffeine’s affinity for A1 and A2A range over a factor of 5 and 20 respectively[8]
The 5x range for A1 does seem to reflect genuine reported measurements, while I’m less sure about the 20x range for A2A (e.g. maybe the most extreme values were misreported in that giant linked table)
From the above we might tentatively guess that paraxanthine is 1.5-2x as potent a stimulant as caffeine, while noting that measurements seem to be all over the place. This is in contrast to my subjective experience so far, where an equal dose of paraxanthine feels similar or weaker than caffeine.
Note that I’m comparing paraxanthine to coffee, which could be confounded by several things such as other stuff in the coffee having effects, placebo, or the coffee pods I was using having more or less caffeine than claimed. A blind comparison between caffeine and paraxanthine capsules would be much better, and in hindsight I wish I’d done that right away.
Some other hypotheses for what could be going on here:
My subjective impressions are unreliable
The relevant receptors behave differently in human brains than in the assay conditions used by these studies
The measurements being all over the place is suggestive of conditions making a big difference
The A1/A2A antagonism model is incomplete
E.g. my impression is due to the lack of other subjective effects of caffeine besides the primary stimulatory effects
E.g. caffeine has additional stimulatory effects mediated by non A1/A2A pathways
Less absorbed paraxanthine reaches A1/A2A receptors in the brain for some reason
Paraxanthine’s peak effects are reduced by slower absorption
I doubt this: subjectively effects seem to peak quickly (maybe 30-45 minutes after dosing), and wear off fairly quickly (a few hours)
The bioavailability (absorption) is worse than that of caffeine
Probably not if we think the absorption is fast, since fast absorption generally implies high bioavailability
There’s less paraxanthine in the capsules than claimed
Concluding thoughts
I recommend trying paraxanthine as a stimulant. For me personally, two use cases stand out so far:
Taking a 100mg capsule in the late afternoon/evening can give me enough energy to work when I’d otherwise be too tired, without noticeably affecting my sleep
Taking 100-200mg after waking can maybe replace coffee for getting me going in the morning, probably with less resulting tolerance buildup
Considering how widely used caffeine is, I was surprised to learn through independent research that paraxanthine considerably extends the duration of its effects, which almost no one seems to understand. I was also surprised by how poorly characterized the interactions of caffeine and paraxanthine with adenosine receptors seem to be. There’s probably a civilizational inadequacy story here.
If such an important point about caffeine—the one ubiquitous nootropic—was unknown to even the nerds, that’s some evidence towards there being other low hanging cognitive enhancement fruit. That is, we expect more low hanging fruit in this world than in the worlds where the nerds already knew about this.
Paraxanthine-based stimulants look to me like a pretty darn low-hanging fruit that took forever to be picked; science has known about caffeine metabolism and paraxanthine’s adenosine receptor antagonism since at least the early 1980s, yet the paraxanthine supplements only became available a few years ago.[9]
- ^
Caffeine can easily cross between the brain and bloodstream. This means that, because of diffusion, the concentration in the brain (where psychostimulatory effects are mediated) closely tracks the concentration in the bloodstream. When I say “excreted” I mean the molecule is removed from circulation, such that its concentration in the bloodstream (and brain) is reduced. This is mostly done by the kidneys, with the molecules ending up in urine.
- ^
Numbers are surprisingly hard to pin down here; see the section on relative potency.
- ^
My guess is that equilibrium tolerance is roughly a function of the (per day) area under the curve (AUC) of equilibrium effective concentration.
- ^
The label says not to exceed 300mg per day. My guess is this comes from the maximum recommended daily caffeine intake of 400mg, which is metabolized to about 300mg paraxanthine in vivo.
- ^
The linked patent seems to claim any use of paraxanthine as a dietary supplement with a dose in the range 2-800mg.
- ^
Might not there be slow paraxanthine metabolizers as well? I suppose enfinity isn’t incentivized to ask this question.
- ^
Note that reported values are usually Ki/Kd, which is the reciprocal of this definition. The meaning of these values is the concentration needed for 50% receptor occupancy by the ligand (i.e. 1:1 odds ratio).
- ^
Why such big discrepancies? I don’t know. One thought is that, since adenosine receptors are G-coupled protein receptors (GPCRs), it might not be reasonable to summarize a ligand’s binding affinity with a single number. This is because GPCRs have two different states, and ligand binding affinities for the states can differ.
- ^
This is also evidence that some circumstances recently changed. For example: a regulatory change, or cheap mass production of paraxanthine being enabled by other tech developments.
I believe there is a more general point to be made here. Namely, the effects of most (if not all) psychoactive substances don’t decay with their half-life. A more useful framing would be that they don’t track blood concentration all that well, and psychoactive effects can last both significantly longer or shorter than looking at the half-life would predict. I don’t do theoretical analyses such as yours, because I don’t expect getting much insight that is not available through (other people’s) empiricism.[1]
Consider a few cases: (half-life numbers found through quick googling)
Caffeine - [2] Looking at your applet, it suggests a ~fourth of (effective, after including paraxantine) the peak concentration after 16 hours. But at the same time, I would predict that taking a bolus caffeine dose after waking up would cause less sleep disturbance than taking a paraxantine dose in the evening to produce the same (effective) blood concentration. I presume you would expect this as well?
Phenibut—half life 5.3 hours. Psychonaut wiki lists the duration of effects as 10-16 hours which basically matches my experience. By any objective/subjective metric I’d predict we’d find the effects to dimish more slowly than with caffeine.
Armodafinil—half life 15 hours. Duration 8-15 hours, which, again, matches my experience. Note, the duration is a bit shorter than with phenibut despite the huge half-life difference.
Nicotine
Nicotine half life 1-2 hours. Duration (from buccal administration [3]which I use) 0.5-1.5 hours.
Psychedelics (in general) - I believe psychedelic trips are largely inexplicable through any exponential decay model (of psychoactive effects). I can explain, if you don’t share this intuition.
I could go on, of course. But my overall point is that regardless of whether one uses receptor affinity and occupancy and dowstream metabolites or more/other variables in one’s model, said model will likely still not be accurate enough to predict subjective effects.
Though I heard about Paraxantine before
Psychonaut wiki lists duration as 2-5 hours which is proably a bit too short, but not by much.
Administration can also alter the duration of effects greatly. Way more than looking at plasma concentrations would predict.
Presumably, there must be research on this?
I understand the general point. Here I wanted to point out a pretty narrow misconception about caffeine specifically. Of course psychological state isn’t purely a function of blood concentration since there are other variables that change throughout the day, like adenosine levels. It’s also possible that there could be a lag time or complicated mapping from receptor occupancy to psychological effects. But there’s a useful intermediate node in the causal graph, effective concentration of receptor antagonists, that looks pretty different depending on whether you model paraxanthine or not.
I don’t think I would expect that, degree of tolerance held constant.
On further thought I would weakly expect more difficulty getting to sleep from paraxanthine if you took it within an hour of going to bed, since there could be some stuff that has to do with the derivative of effective concentration. If you took it several hours before bed to end up with the same effective concentration at bedtime I don’t think I expect a noticeable difference in ability to get to sleep. I also think I’d expect less sleep disturbance overall from paraxanthine, effective concentration at bedtime held constant and barring scenarios where you’re taking it within an hour of bedtime, since the effective concentration resulting from paraxanthine will decay faster than that resulting from caffeine.
The relative potency seems quite important to the overall effect. Like, if paraxanthine is 0.5x as effective as caffeine, it doesn’t seem like a huge deal. More than that and it starts seeming quite significant! Would be great to get more of a sense.
If I understand correctly there is somewhat broad consensus that between two psychoactive substances with comparable mechanism the one with quicker metabolism is generally more addictive, due to a combination of more frequent dosing behavior and sharper decay being a more salient signal.
Wow. Ordered. Looking forward to better sleep.
Where did you get the 100mg version? I could only find 200 and would rather not split them.
Wrt the implications: there are probably more low hanging fruits than you thought. And we’re probably more likely to die from AGI than you thought before noticing this important gaps in group epistemics ;)
Thanks so much for figuring this out and sharing.
here (The serving says 200mg, but that’s two capsules. They’re the only 100mg capsules I was able to find.)
You’re welcome! 🙂
I wanted to see how this compared to a world where caffeine had a 10 hour half-life, and no relevant metabolite.
Here’s what Claude made:
(Did you know that you could embed this directly as a custom widget using the /custom-widget command? The future is here!)
I’m guessing this is assuming that caffeine and paraxanthine both have a half life of 5 hours, and that all caffeine is converted into paraxanthine. My simulator by default uses 4 hours for paraxanthine and 5 hours for caffeine, and a ratio of 0.84 for caffeine → paraxanthine.
Anecdotally speaking, I have avoided caffeine for this reason (as hard as it is to do that these days where I am). It lasts way longer than I want it to, and makes me a little compulsive well into the evening and night even when I take it in the morning. Although my internal reasoning was that I must have some genetic thing that makes caffeine last way longer than it’s supposed to. I also thought there might have been second order effects of alertness that are hard to wind down once they get going.
It’s interesting to read that there was always this alternative third explanation that I hadn‘t thought of. That it just works in a different way than what I originally understood. That it might be because it is metabolized into paraxantine which also blocks adenosine receptors.
Thank you for this post.
I’ll echo this. I always thought I was particularly sensitive to caffeine and so went without it for years until recently. I’ll find that I’m not able to sleep for 12-14 hours after initial administration, and I rarely have more than 50mg in a day. I notice effects mostly wearing off around 20 hours after administration usually.
I do wonder, as some commenters stated, how much the natural accumulation of adenosine throughout the day interplays with this, though, in addition to the dopaminergic and serotonergic effects on mood modulation. I would imagine if one is elevated into a state of higher nervous system arousal, there could be feedback effects long after the direct adenosine receptor antagonism dwindles in efficacy.
I tried it for a few weeks. it does basically nothing for me even at 200 mgs.
How much caffeine had you been consuming prior to that?
100 mg every morning.
In pill form?
1 cup of coffee.
Did you feel anything from 200mg PX at all?
In an unpublished blind comparison between PX and CA I found 100mg CA in pill form surprisingly weak compared to what I expected from drinking coffee. I ended up increasing the dose to 200mg for both. I rated the peak effects of 200mg PX somewhat weaker on average, and napped more on the PX days.
>Did you feel anything from 200mg PX at all?
not really. i can’t write without caffeine/nicotine. and PX/nicotine didn’t work.
Thanks for the data point!
I’m curious whether 100mg caffeine in pill form would work for you.
when you compare caffeine to paraxanthine, are you comparing caffeine pills or coffee? cuz coffee has a bunch of other things in there that could be confounding. its also really hard to isolate eg gut motility, receptor saturation, circadian dynamics in PK, so prolly good to normalize those if comparing.
curious about whether the crash is worse, if anyone caffeine crash prone tries, would be cool to get an update
Good point, I’ve been comparing to coffee. There could also be placebo effects since I associate coffee with stimulation. I’m also not sure how much I should trust the claimed caffeine content of the coffee pods I was using. I’ll update the post to mention these.
with respect to caffeine content of coffee pods: Was able to find this (paywalled) study https://journals.sagepub.com/doi/10.1177/0260106018810941#core-collateral-purchase-access
Their method of analysis seems sound, and they find considerable variability across at least this brand of coffee pods. Caffeine content of resulting coffee varied with serving size on the order of 10-20 mg, max-min varied by 40 or 30 mg though for this I’m pretty sure they considered all 4 measurements for a given type of pod (2 measurements for small size and 2 for large), which is not a lot, so the data is pretty noisy, but it’s enough to see that the caffeine content on the packaging is not to be trusted (unless your brand of coffee pods uses some weird process that doesnt involve caffeine coming from the beans themselves).
Also a note for their comparisons to manufacturer contents—they compare the All means to listed content, which is weird since people probably consume one of the coffee serving sizes, not both, so it’s understating the difference from listed contents.
Not the half life active curve, but people underrate how the curve for decreasing your tolerance as you abstain differes between substances. There isn’t much research into 1. how a substance becomes less effective over time nor 2. How fast you become addicted.
I assume this is 1 reason a lot of people got addicted to nicotine: despite the shorter half life you need a longer time off to reset the effectiveness. Caffiene after 2-7 days your body starts to undo its chemical adaptations and mostly washed out after a few weeks. For nicotine it’s closer to a week to start 12 weeks to wash out an addiction.
From Largely Vibes
Adderall: Proscribed for every day, Scott Alexander recommends 5 days on 2 off
Nicotine: once every week? (Alexey Guzey started planning with once every third day, in 2024 self described as a smoker, but smoking what?). Problem one: since half life is only 2h to be stimulated all day you’d ingest several times likely training yourself to reach for it when tired unless you’re counting.
Caffeine: every other day likely keeps the same effect size?
Modafinil: folk wisdom is once per week else it stops working. This is “taking every day will really reduce effects after 90-180 days”, unclear if this means it’ll be effective over 5 years.
Alchohol: 14 drinks per week for men or 8 for women makes you an alcoholic and will probably cause an addiction over several years, sooner if higher. But the effect size doesn’t change as much. Compare how “holding your liquor” means tolerating maybe 50% more for your weigh, not nicotines pack a day vs 1 cig for irregular smoker or several cups of coffee being 500mg to not have the effect of 100mg did initially.
I’d be curious if anyone has actually looked into for Nootropics
What range of dosing schedule maximizes long term usefulness
What dosing schedule is dangerous and indicates big problems within the next month
I think nicotine is a strong behavioral reinforcer (especially when smoked, since it hits the brain immediately).
My guess is that you build half as much tolerance taking X mg every other day compared to taking X mg daily.
Not my personal experience. Taking 300-400 mg caffeine every other day leads to ~no tolerance, or a very slowly developing one; taking that amount every day maxes out the tolerance very fast. My current schedule is just that, plus a once-a-month stretch where I skip caffeine for three days in a row to wipe out whatever tolerance may have crept up. (But obviously everyone’s biochemistry differs significantly, etc.)
My model says that if you took 400mg EOD:
on the caffeine days you’d feel similar to having taken 200mg in a caffeine-naive state
on the off days you’d feel similar to abstaining from caffeine after having taken 200mg daily for a while
200mg in a caffeine-naive state would feel pretty stimulating for most people I’d think. Do you feel more tired than a caffeine-naive baseline on the off days?
Yup, I expect the same regarding both.
Yup, it is.
Hard to say, but probably somewhat.
Sounds like you don’t actually disagree with “you build half as much tolerance” then?
stop being so similar to me lol
Depends on your model of addiction: is it pavlovian conditioning to generate the idea to smoke or a biochemical change which generates cravings you sooth by smoking? The later latter on, the earlier probably some part of causing smoking more.
All the +400mg/day people I know take it every day. Though I wouldn’t notice 200
Probable “write down your exact dowing schedule and if you feel a craving for more abstain for a long while” works, but how strong a craving do you get? (And doesn’t address reduced efficacy)
Isn’t the point that a dose of caffeine is actually metabolized as a dose of caffeine PLUS paraxanthine metabolites? So you never actually experience caffeine by itself, and can’t really compare the two directly, although you can definitely model it modulo the rate of metabolism.
I’ve been trying to compare the early/peak effects, which for caffeine will still be mostly just caffeine.
Huh, this seems possibly really important. If there are more insights of a similar or greater level already hidden in the existing literature, it seems worthwhile to put a bunch of effort into finding them. I wonder if existing LLM harnesses would be enough if told to look for the right things, or if we would need a new one to effectively sort through and find them?
Does it also give you more of a kick?
I replaced coffee with a guarana mixture for stomach reasons, but I really miss the coffee hit (and the smell, taste, preparation and everything else about coffee but what can I do).
Probably less of a kick if anything, it does seem a bit “smoother”. I also rather enjoy coffee, so not sure I’ll be replacing it long term.
You could try this coffee where they decaffeinate it and then add in paraxanthine
I’m trying drinking decaf along with a paraxanthine pill.
Decaf coffee reputedly has a bunch of health benefits as well as
being delicioussatisfying the addiction association.I stopped drinking coffee regularly because I would always get a headache when I forgot to drink coffee, e. g. on weekends. Do you have experience with withdrawl symptoms? Are they better or worse with paraxanthine? If you don’t, do you have a guess?