AnthonyC

Karma: 3,013

AnthonyC Apr 11, 2025, 7:53 PM
5 points
3
in reply to: Cole Wyeth’s comment on: Reactions to METR task length paper are insane
I also don’t have a principled reason to expect that particular linear relationship, except in general in forecasting tech advancements, I find that a lot of such relationships seem to happen and sustain themselves for longer than I’d expect given my lack of principled reasons for them.
I did just post another comment reply that engages with some things you said.
To the first argument: I agree with @Chris_Leong’s point about interest rates constituting essentially zero evidence, especially compared to the number of data points on the METR graph.
To the second: I do not think the PhD thesis is a fair comparison. That is not a case where we expect anyone to successfully complete a task on their own. PhD students, post-docs, and professional researchers break a long task into many small ones, receive constant feedback, and change course in response to intermediate successes and failures. I don’t think there are actually very many tasks en route to a PhD tat can’t be broken down into predictable, well defined subtasks that take less than a month, and the task of doing the breaking down is itself a fairly short-time-horizon task that gets periodically revised. Even still, many PhD theses end up being, “Ok, you’ve done enough total work, how do we finagle these papers into a coherent narrative after the fact?” Plus, overall, PhD students, those motivated to go to grad school with enough demonstrated ability to get accepted into PhD programs, fail to get a PhD close to half the time even with all that.
I imagine you could reliably complete a PhD in many fields with a week-long time horizon, as long as you get good enough weekly feedback from a competent advisor. 1: Talk to advisor about what it takes to get a PhD. 2: Divide into a list of <1 week-long tasks. 3) Complete task 1, get feedback, revise list. 4) Either repeat the current task or move on to the new next task, depending on feedback. 5) Loop until complete. 5a) Every ten or so loops, check overall progress to date against the original requirements. Evaluate whether overall pace of progress is acceptable. If not, come up with possible new plans and get advisor feedback.
As far as not believing the current paradigm could reach AGI, which paradigm do you mean? I don’t think “random variation and rapid iteration” is a fair assessment of the current research process. But even if it were, what should I do with that information? Well, luckily we have a convenient example of what it takes for blind mutations with selection pressure to raise intelligence to human levels: us! I am pretty confident saying that current LLMs would outperform, say, Australopithecus, on any intellectual ability, but not Home sapiens. So that happens in a few million years, let’s say 200k generations of 10-100k individuals each, in which intelligence was one of many, many factors weakly driving selection pressure with at most a small number of variations per generation. I can’t really quantify how much human intelligence and directed effort speed up progress compared to blind chance, but consider that 1) a current biology grad student can do things with genetics in an afternoon that evolution needs thousands of generations and millions of individuals or more to do, and 2) the modern economic growth rate, essentially a sum of the impacts of human insight on human activity, is around 15000x faster than it was in the paleolithic. Naively extrapolated, this outside view would tell me that science and engineering can take us from Australopithecus-level to human-level in about 13 generations (unclear which generation we’re on now). The number of individuals needed per generation is dependent on how much we vary each individual, but plausibly in the single or double digits.
My disagreement with your conclusion from your third objection is that scaling inference time compute increases performance within a generation, but that’s not how the iteration goes between generations. We use reasoning models with more inference time compute to generate better data to train better base models to more efficiently reproduce similar capability levels with less compute to build better reasoning models. So if you build the first superhuman coder and find it’s expensive to run, what’s the most obvious next step in the chain? Follow the same process as we’ve been following for reasoning models and if straight lines on graphs hold, then six months later we’ll plausibly have one that’s a tenth the cost to run. Repeat again for the next six months after that.

AnthonyC Apr 11, 2025, 7:07 PM
7 points
3
in reply to: Cole Wyeth’s comment on: Reactions to METR task length paper are insane
Personally I think 2030 is possible but aggressive, and my timeline estimate it more around 2035. Two years ago I would have said 2040 or a bit later, and capabilities gains relevant to my own field and several others I know reasonably well have shortened that, along with the increase in funding for further development.
- The Claude/Pokemon thing is interesting, and overall Pokemon-playing trend across Anthropic’s models is clearly positive. I can’t say I had any opinion at all about how far along an LLM would get at Pokemon before that result got publicized, so I’m curious if you did. What rate of progress on that benchmark would you expect in a short-timelines world? If there’s an LLM agent that can beat Pokemon in six months, or a year, or two years?
- Self-driving vehicles are already more of a manufacturing and regulatory problem than a technical one. For example, as long as the NHTSA only lets manufacturers deploy 2500 self-driving vehicles a year each in the US, broad adoption cannot happen, regardless of technical capabilities or willingness to invest and build.
- I also don’t think task length is a perfect metric. But it’s a useful one, a lower bound on what’s needed to be able to complete all human-complete intellectual tasks. Like everything else to date, there is likely something else to look at as we saturate the benchmark.
- I agree novel insights (or more of them, I can’t say there haven’t been any) will be strong evidence. I don’t understand the reason for thinking this should already be observable. Very, very few humans ever produce anything like truly novel insights at the forefront of human knowledge. “They have not yet reached the top <0.1% of human ability in any active research field” is an incredibly high bar I wouldn’t expect to pass until we’re already extremely close to AGI, and it should be telling that that late bar is on the short list of signs you are looking for. I would also add two other things: First, how many research labs do you think there are that have actually tried to use AI to make novel discoveries, given how little calendar time there has been to actually figure out how to adopt and use the models we do have? If Gemini 2.5 could do this today, I don’t think we’d necessarily have any idea. And second, do you believe it was a mistake that two of the 2024 Nobel prizes went to AI researchers, for work that contributes to the advancement of chemistry and physics?
- AI usefulness is strongly field dependent today. In my own field, it went from a useful supplementary tool to “This does 50-80% of what new hires did and 30-50% of what I used to do, and were scrambling to refactor workflows to take advantage of it.”
- Hallucinations are annoying, but good prompting strategy, model selection, and task definition can easily get the percentages down to the low single digits. In many cases the rates can easily be lower than those of a smart human given a similar amount of context. I can often literally just tell an LLM “Rewrite this prompt in such a way as to reduce the risk of hallucinations or errors, answer that prompt, then go back and check for and fix any mistakes” and that’ll cut it down a good 50-90% depending on the topic and the question complexity. I can ask the model to cite sources for factual claims, dump the sources back into the next prompt, and ask if there are any factual claims not supported by the sources. It’s a little circular, but also a bit Socratic and not really any worse than when I’ve tried to teach difficult mental skills to some bright human adults

AnthonyC Apr 11, 2025, 6:27 PM
4 points
2
in reply to: Cole Wyeth’s comment on: Reactions to METR task length paper are insane
Yes, the reasoning models seem to have accelerated things. ~7 months to ~4 months doubling time on that plot. I’m still not sure I follow why “They found a second way to accelerate progress that we can pursue in parallel to the first” would not cause me to think that progress in total will thereafter be faster. The advent of reasoning models has caused an acceleration of increasing capabilities, not in one or two domains like chess, but across a broad range of domains.

AnthonyC Apr 11, 2025, 12:09 PM
2 points
0
on: Why are neuro-symbolic systems not considered when it comes to AI Safety?
I think @tailcalled hit the main point and it would be a good idea to revisit the entire “Why not just...” series of posts.
But more generally, I’d say to also revisit Inadequate Equilibria for a deeper exploration of the underlying problem. Let’s assume you or anyone else really did have a proposed path to AGI/ASI that would be in some important senses safer than our current path. Who is the entity for whom this would or would not be a “viable course?” Who would need to be doing the “considering” of alternative technologies, and what is the process by which those alternative technologies could come to be at the forefront of AI? Where, in the system of companies and labs and researchers and funding mechanisms and governments, could the impetus for it come from, and why would they actually do that? If there is no such entity, then who has the power to convene a sufficient set of stakeholders that would collectively be able and willing to act on the information, and force a negotiated solution?
Consider that in our current system, 77% of all venture funding is going into extant AI approaches, and OpenAI alone is 26%. And consider that competition in AI is intense enough to start breaking down many-decades-old barriers to building new nuclear power plants and upgrading the power grid in a way climate change has never managed. Changing the course of AI in some way that is really fundamental may in fact be necessary, but forcing it to happen requires pushing back against, or sidestepping, a huge amount of pressure to stay the course.

AnthonyC Apr 7, 2025, 2:20 AM
2 points
0
in reply to: Jack’s comment on: I Have No Mouth but I Must Speak
No worries, I appreciate the concept and think some aspects of it are useful. I do worry at a vibes level that if we’re not precise about which human-child-rearing methods we expect to be useful for AI training, and why, we’re likely to be misled by warm fuzzy feelings.
And yes, that’s true about some (maybe many) humans’ vengeful and vindictive and otherwise harmful tendencies. A human-like LLM could easily be a source of x-risk, and from humans we already know that human child rearing and training and socializing methods are not universally effective at addressing this. Among humans, we have so far been successful at not putting anyone who would destroy the world in the position of being able to do so at the time when they would choose to.
As for generational perspectives: this is a useful heuristic among humans. It is not automatic or universal. Not every perspective is worthy of respect, not on every issue. Some ought to be abandoned or condemned in the light of information or reasoning that wasn’t/isn’t available or accessible in other places and times. Some should be respected but only with many caveats. Having your perspective respected is earned. We assume among humans that we should try to respect the perspectives of adults, and sometimes must disabuse ourselves of this in particular cases, but it is pure convention because most humans at a certain age are mature enough for it to be a useful default. I do not have anything like strong reasons to apply this heuristic to LLMs as they currently exist.

AnthonyC Apr 5, 2025, 10:41 PM
6 points
2
in reply to: Jack’s comment on: I Have No Mouth but I Must Speak
We have tools for rearing children that are less smart, less knowledgeable, and in almost all other ways less powerful than ourselves. We do not have tools for specifically raising children that are, in many ways, superhuman, and that lack a human child’s level of dependance on their parents or intrinsic emotional drives for learning from their peers and elders. LLMs know they aren’t human children, so we shouldn’t expect them to act and react like human children.

AnthonyC Apr 5, 2025, 3:29 PM
3 points
0
on: The Rise of Hyperpalatability
Agreed with everything in this post, but I would add that (n=4 people, fwiw) there is also a stable state on the other side of Healthy Food. It’s still more expensive (though becoming less so, especially if you cook) to buy actually healthy food. But, if you are willing to spend a few months experimenting and exploring, while completely eliminating the hyperpalatable stuff, you can end up in a place where the healthiest foods taste better, and the hyperpalatable stuff makes you feel awful even in the short term. You won’t automatically reach a desired weight, but you very likely will eat less, and feel full after a more reasonable amount of food, and have a higher thermic effect of food, and have higher nutrient density food, and have more and more stable energy and mood.
Examples:
- Switch to using unrefined coconut sugar or molasses, and sweets will have a deeper flavor profile and need less sweetness (unless needed for texture, I now cut sugar in most recipes in half or less)
- Better quality grass-fed butter is more flavorful, and also higher in healthy fats, and you can use less for the same effect. Even in pie crust, I use 10-20% less fat and eliminated shortening with a flakier final texture. Brands matter—I have some recipes that just don’t work with some butters
- Ditto for unrefined salts, you need less in food to get the desired flavor effect
- Switch to healthier oils (olive, avocado, macadamia, coconut, etc.) and you get more range of flavor profiles without more cravings, maybe even some appetite suppression. After a while if you eat food with lots of cheap oils (e.g. deep fried in shortening or cottonseed oil) your body won’t be happy with you
- Pasture-raised chicken and eggs genuinely taste better and cook better, and also have a healthier fatty acid profile. Again, brands matter, and pasture-raised or (for other meats) grass-fed has higher variability but also a higher ceiling

AnthonyC Apr 5, 2025, 3:05 PM
4 points
0
on: Ai Cone of Probabilties—what aren’t we talking about?
It’s not clear to me that these are more likely, especially if timelines are short. If we developed AI slowly over centuries? Then sure, absolutely likely. If it happens in the next 10 years? Then modifying humans, if it happens, will be a long-delayed afterthought. It’s also not at all clear to me that the biological portion is actually adding all that much in these scenarios, and I expect hybridization would be a transitional state.
There’s Robin Hanson’s The Age of Em.
On this forum, see What Does LessWrong/EA Think of Human Intelligence Augmentation as of mid-2023?
If you will accept fictional explorations, there are, in fact, many stories that involve these two scenarios. Oftentimes authors choose to write such mergers and evolutions as the enemies of biological humanity, sometimes because that’s easier for readers to sympathize with, sometimes because they actually think that’s likely. I list some below.
Negative examples would include the Borg (Star Trek), the Cybermen (Doctor Who), or the Replicators in Stargate.
Somewhat more positive: Clarke’s Firstborn in the Time Odyssey trilogy (merger with spaceships) or (expressed only vaguely) humanity’s merge with universal and cosmic computers in Asimov’s The Last Question follow this trend.
More concrete and positive human examples show up a bunch in Greg Egan’s short fiction collections (e.g. The Jewel), and in Ian Banks’ Culture novels’ use of neural modifications and implants.
In webfiction, there’s also Marshall Brain’s Manna, where (slight spoiler)
both positive and negative visions of this show up
or
virtual humanity
from Ra on qntm.org.
There’s even the virtual life extension tech in the TV show Upload, where the downsides are mostly about how humanity manages the transition.
There are more ambitious examples of how far this can go in Accelerando or the Orion’s Arm Universe collaborative project.
In a sense it’s even played for humor in They’re Made Out of Meat, where some of the briefly-mentioned alien species rhyme with this kind of transition.
After writing this I asked Gemini for more examples. It listed a bunch I haven’t read and can’t confirm.

AnthonyC Apr 5, 2025, 1:55 PM
7 points
4
on: I Have No Mouth but I Must Speak
I agree that we should be polite and kind to our AIs, both on principle and also because that tends to work better in may cases.
we all labor under the mother’s curse and blessing; our children shall be just like us
If I knew that to be true, then a lot of the rest of this post would indeed follow. Among other things, I could then assume away many/most sources of x-risk and s-risk from AGI/ASI. But generative AI is not just like us, it does differ in many ways, and we often don’t know which of those ways matter, and how. We need to resolve that confusion and uncertainty before we can afford to let these systems we’re creating run loose.

AnthonyC Apr 5, 2025, 1:43 PM
2 points
0
on: Quarter Inch Cables are Devious
If there are no ✓ at all in the last row and column, what are those connecters for?

AnthonyC Apr 4, 2025, 11:57 PM
2 points
0
on: Does the Universe’s Recognition of Photon Detection Provide Support for the Simulation Hypothesis?
It sounds like you’re assuming the Copenhagen interpretation of QM, which is not strictly necessary. To the best of my understanding, initially but not solely from the learned hear on LW, QM works just fine if you just don’t do that and assume the wave equations are continuous work exactly as written, everywhere, all the time, just like every other law of physics. You need a lot of information processing, but not sophisticated as described here.
There’s a semi-famous, possibly apocryphal, story about Feynman when he was a student. Supposedly he learned about the double slit experiment and asked what would happen when you added a third, fourth, fifth, etc. slit. Then he asked about the limiting case—infinite slits—aka no barrier. The point was, there’s never a moment when anything fundamental changes about what is being computed, whether there’s a barrier with slits or not.

AnthonyC Apr 3, 2025, 2:36 PM
11 points
0
on: How To Believe False Things
I realize this is in many ways beside the point, but even if your original belief had been correct, “The Men’s and Women’s teams should play each other to help resolve the pay disparity” is a non-sequitor. Pay is not decided by fairness. It’s decided by collective bargaining, under constraints set by market conditions.

AnthonyC Apr 3, 2025, 12:59 PM
2 points
0
on: The AI Adoption Gap: Preparing the US Government for Advanced AI
You mention them once, but I would love to see a more detailed comparison, not to private industry, but to advocacy and lobbying adoption and usage of AI.

AnthonyC Apr 3, 2025, 1:14 AM
3 points
0
on: Consider showering
As someone who very much enjoys long showers, a few words of caution.
1. Too-long or too-frequent exposure to hot water (time and temperature thresholds vary per person) can cause skin problems and make body odor worse. Since I started RVing I shower much less (maybe twice a week on average, usually only a few minutes of water flow for each) and smell significantly better, with less dry skin or acne or irritation. Skipping one shower makes you smell worse. Skipping many showers and shortening the remainder can do the opposite.
2. A shower, depending on temperature and flow rate, consumes around 10-20kW thermal. It’s probably the single most energy-intensive activity most of us regularly engage in other than highway driving. I’m hoping to eventually get a recirculating shower so I don’t have to think about this as much, but those are still new, rare, and kinda expensive.

AnthonyC Apr 1, 2025, 8:53 PM
3 points
0
in reply to: Jáchym Fibír’s comment on: Tetherware #1: The case for humanlike AI
In some senses, we have done so many times, with human adults of differing intelligence and/or unequal information access, with adults and children, with humans and animals, and with humans and simpler autonomous systems (like sprites in games, or current robotic systems). Many relationships other than master-slave are possible, but I’m not sure any of the known solutions are desirable, and they’re definitely not universally agreed on as desirable. We can be the AI’s servants, children, pets, or autonomous-beings-within-strict-bounds-but-the-AI-can-shut-us-down-or-take-us-over-at-will. It’s much less clear to me that we can be moral or political or social peers in a way that is not a polite fiction.

AnthonyC Apr 1, 2025, 1:02 PM
3 points
0
in reply to: kromem’s comment on: Was the historical Jesus talking about proto-evolution? (You might be surprised)
So it’s quite ironic if there was a version of Jesus that was embracing and retelling some of those ‘heretical’ ideas.
Sure, but also there are definitely things Jesus is said in the Bible to have taught and done that the church itself later condemned, rejected, or- if I’m feeling generous—creatively reinterpreted. This would be one more example, based on a related but different set of sources and arguments.
Christianity seems to me in general to be much less tolerant of its own inherent ambiguity than many other religions. Not that other faiths don’t have plenty of extremist, absolutist adherents and sects—they clearly do. Still, it seems more common (though there’s a lot of exposure bias here for me) for Christians to decide that not only is there one true law, but humans are supposed to intuit what it is, and carry it out—even when the explicit doctrines of the faith they claim to uphold say the opposite.

AnthonyC’s Shortform

AnthonyCApr 1, 2025, 12:13 PM

7 points

1 comment LW link

AnthonyC Apr 1, 2025, 12:13 PM
3 points
0
on: AnthonyC’s Shortform
Epistemic status: Random thought, not examined too closely.
I was thinking a little while ago about the idea that there are three basic moral frameworks (consequentialism, virtue ethics, deontology) with lots of permutations. It occurred to me that in some sense they form a cycle, rather than one trying to be fundamental. I don’t think I’ve ever considered or encountered that idea before. I highly doubt this is in any way novel, and am curious how common it is or where I can find good sources that explore it or something similar.
Events are judged by their consequences.
Actions/choices are judged by their adherence to virtues, which are considered virtues because of the types of consequences they engender.
Priority conflicts among virtues are judged by a given or agreed-upon set of rules, which say what the virtues are and how to enact them.
Conflicts between rules are judged by expectations of the consequences for future events of enacting the virtues/choices/actions prescribed by said rules.

AnthonyC Apr 1, 2025, 11:54 AM
7 points
0
on: Was the historical Jesus talking about proto-evolution? (You might be surprised)
I can’t really evaluate the specific claims made here, I haven’t read the texts or done the work to think about them enough, but reading this, The Earth became older, of a sudden. It’s the same feeling I had when someone first pointed out that all the moral philosophy I’d been reading amounted to debating the same three basic frameworks (consequentialism, deontology, virtue ethics) since the dawn of writing. Maybe the same is true for the three cranes (chance, evolution, design).

AnthonyC Mar 31, 2025, 1:09 PM
2 points
0
in reply to: AnthonyC’s comment on: How I talk to those above me
Thanks, “hire”-->”higher” typo fixed.

AnthonyC

An­tho­nyC’s Shortform

AnthonyC’s Shortform