zeshen

Karma: 410

Feedback welcomed: www.admonymous.co/zeshen

I sometimes write my thoughts here: airisks.substack.com

zeshen 10 Mar 2026 4:22 UTC
1 point
0
in reply to: zeshen’s comment on: zeshen’s Shortform
More broadly speaking, my take is that models will continue to smash benchmarks faster than even the most optimistic expectations, but we won’t see an intelligence explosion that is genuinely existentially threatening in a non-misuse way in the next ten years. Benchmarks will be increasingly disconnected from reality.

zeshen’s Shortform

zeshen 17 Feb 2026 6:46 UTC

4 points

2 comments · 1 min read · LW link

zeshen 17 Feb 2026 6:46 UTC
1 point
0
on: zeshen’s Shortform
Just wanted to register that I’m on the side of Freddie’s bet for all items below except the one on the BLS (because some categories are small enough that losing 50% of jobs in at least one category probably isn’t that unlikely).
Here’s what he says:
For me to win the wager, all of the following must be true on Feb 14, 2029:
Labor Market:
1. The U.S. unemployment rate is equal to or lower than 18%
2. Labor force participation rate, ages 25-54, is equal to or greater than 68%
3. No single BLS occupational category will have lost 50% or more of jobs between now and February 14th 2029
Economic Growth & Productivity:
1. U.S. GDP is within −30% to +35% of February 2026 levels (inflation-adjusted)
2. Nonfarm labor productivity growth has not exceeded 8% in any individual year or 20% for the three-year period
Prices & Markets:
1. The S&P 500 is within −60% to +225% of the February 2026 level
2. CPI inflation averaged over 3 years is between −2% and +18% annually
Corporate & Structural:
1. The Fortune 500 median profit margin is between 2% and 35%
2. The largest 5 companies don’t account for more than 65% of the total S&P 500 market cap
White Collar & Knowledge Workers:
1. “Professional and Business Services” employment, as defined by the Bureau of Labor Statistics, has not declined by more than 35% from February 2026
2. Combined employment in software developers, accountants, lawyers, consultants, and writers, as defined by the Bureau of Labor Statistics, has not declined by more than 45%
3. Median wage for “computer and mathematical occupations,” as defined by the Bureau of Labor Statistics, is not more than 60% lower in real terms than in February 2026
4. The college wage premium (median earnings of bachelor’s degree holders vs high school only) has not fallen below 30%
Inequality:
1. The Gini coefficient is less than 0.60
2. The top 1%’s income share is less than 35%
3. The top 0.1% wealth share is less than 30%
4. Median household income has not fallen by more than 40% relative to mean household income
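
Since the wager is a conjunction, a single failed item loses the whole bet. A minimal Python sketch of that check, assuming hypothetical metric names and covering only a subset of the items above. The thresholds are copied from the list; the observed values are placeholders, not real data, and treating a missing metric as a failure is my own assumption rather than part of the wager:

```python
from typing import Callable, Dict, List

# Thresholds copied from the wager text. The keys are hypothetical metric
# names chosen for illustration; only a subset of the items is encoded.
CONDITIONS: Dict[str, Callable[[float], bool]] = {
    "unemployment_rate_pct": lambda v: v <= 18,
    "prime_age_participation_pct": lambda v: v >= 68,
    "real_gdp_change_pct": lambda v: -30 <= v <= 35,
    "sp500_change_pct": lambda v: -60 <= v <= 225,
    "cpi_3yr_avg_pct": lambda v: -2 <= v <= 18,
    "top5_sp500_share_pct": lambda v: v <= 65,
    "gini": lambda v: v < 0.60,
}


def failed_conditions(observed: Dict[str, float]) -> List[str]:
    """Return the names of conditions that are violated or have no data.

    Assumption: a metric with no observed value counts as failed.
    """
    return [
        name
        for name, holds in CONDITIONS.items()
        if name not in observed or not holds(observed[name])
    ]


def wager_won(observed: Dict[str, float]) -> bool:
    """The wager is won only if every encoded condition holds."""
    return not failed_conditions(observed)


if __name__ == "__main__":
    # Placeholder values for illustration only, not real statistics.
    placeholder = {
        "unemployment_rate_pct": 5.0,
        "prime_age_participation_pct": 80.0,
        "real_gdp_change_pct": 5.0,
        "sp500_change_pct": 20.0,
        "cpi_3yr_avg_pct": 3.0,
        "top5_sp500_share_pct": 30.0,
        "gini": 0.45,
    }
    print(wager_won(placeholder), failed_conditions(placeholder))
```

Returning the list of failed items rather than a bare boolean makes it easy to see which single condition sank the bet, which is the scenario I’m worried about for the BLS item.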

zeshen 2 Sep 2025 22:02 UTC
1 point
0
on: My AI Predictions for 2027
Thanks for writing this up! I also want to register that I agree with all of this, except maybe for the part where AIs can’t tell novel funny jokes. I expect this to be relatively easy, but of course it depends on the definition of ‘novel’.

I struggled to do this exercise myself because when I looked at “AI as Normal Technology” I felt like I basically agreed with most of their thinking, but it was also hard to find concrete differences between their predictions and AI2027, at least in the near term. For example, something like “LLMs are broadly acknowledged to be plateauing” is probably going to be both true and false at once in a way that’s hard to resolve: a lot of people may complain that it’s plateauing while the benchmark scores and the usage stats show otherwise.

zeshen 11 Jul 2025 11:06 UTC
1 point
0
in reply to: Vladimir_Nesov’s comment on: On thinking about AI risks concretely
Yeah, at least “literally everyone dies” has a concrete ending even though it doesn’t have concrete intermediate steps. Gradual disempowerment seems less concrete on both the ending and the intermediate steps, so it becomes even less action-relevant.

On thinking about AI risks concretely

zeshen 11 Jul 2025 0:04 UTC

9 points

4 comments · 4 min read · LW link

zeshen 4 Jul 2025 18:11 UTC
1 point
0
on: Foom & Doom 1: “Brain in a box in a basement”
…But I’m not sure that actual existing efforts towards delaying AGI are helping.
But perhaps actual existing efforts to hype up LLMs are helping? I am sympathetic to François Chollet’s position:
OpenAI basically set back progress towards AGI by quite a few years probably like five to 10 years for two reasons. They caused this complete closing down of Frontier research publishing but also they triggered this initial burst of hype around LLMs and now LLMs have sucked the oxygen out of the room.

zeshen 30 Nov 2024 12:24 UTC
6 points
2
on: (The) Lightcone is nothing without its people: LW + Lighthaven’s big fundraiser
Is there any difference between donating through Manifund or directly via Stripe?

zeshen 30 Nov 2024 6:53 UTC
5 points
0
in reply to: Gordon Seidoh Worley’s comment on: Information vs Assurance
This happened all the time in my line of work. Forecasts become targets and you become responsible for meeting them. So whenever I was asked to provide a forecast, I would either i) ask as many questions as I needed to understand the exact purpose of the request, and produce a forecast that met exactly that intent, or ii) pick a forecast and provide it, but first list all the assumptions and caveats behind it that I could possibly think of. With time, I also got a sense of who I needed to be extra careful with when providing forecasts, because of all the ways they might backfire.

zeshen 14 May 2024 10:38 UTC
5 points
−4
in reply to: Alexander Gietelink Oldenziel’s comment on: Alexander Gietelink Oldenziel’s Shortform
Agreed. I’m also pleasantly surprised that your take isn’t heavily downvoted.

zeshen 10 May 2024 10:10 UTC
3 points
−1
on: We might be missing some key feature of AI takeoff; it’ll probably seem like “we could’ve seen this coming”
There’ll be discussions about how these systems will eventually become dangerous, and safety-concerned groups might even set up testing protocols (“safety evals”).
My impression is that safety evals were deemed irrelevant because a powerful enough AGI, being deceptively aligned, would pass all of them anyway. We didn’t expect the first general-ish AIs to be so dumb, like how GPT-4 was being so blatant and explicit about lying to the TaskRabbit worker.

zeshen 8 May 2024 4:10 UTC
5 points
0
on: Deep Honesty
Scott Alexander talked about explicit honesty (unfortunately paywalled) in contrast with radical honesty. In short, explicit honesty is being completely honest when asked, and radical honesty is being completely honest even without being asked. From what I understand from your post, it feels like deep honesty is about being completely honest about information you perceive to be relevant to the receiver, regardless of whether the information is explicitly being requested.
Scott also links to some cases where radical honesty did not work out well, like this, this, and this. I suspect deep honesty may lead to similar risks, as you have already pointed out.
And with regards to:
“what is kind, true, and useful?”
I think they would form a three-circle Venn diagram. Things within the intersection of all three circles are a no-brainer. The tricky bits are the things that are either true but not kind/useful, or kind/useful but not true. I understood this post as a suggestion to venture more into the former.

zeshen 3 May 2024 3:43 UTC
8 points
3
on: Why is AGI/ASI Inevitable?
Can’t people decide simply not to build AGI/ASI?
Yeah, many people, like the majority of users on this forum, have decided to not build AGI. On the other hand, other people have decided to build AGI and are working hard towards it.

Side note: LessWrong has a feature for posting as a Question; you might want to use it for questions in the future.

zeshen 27 Apr 2024 12:09 UTC
1 point
2
in reply to: JustisMills’s comment on: LLMs seem (relatively) safe
Definitely. Also, my incorrect and exaggerated model of the community is likely based on the minority who have a tendency of expressing those comments publicly, against people who might even genuinely deserve those comments.

zeshen 26 Apr 2024 10:35 UTC
3 points
−2
in reply to: the gears to ascension’s comment on: LLMs seem (relatively) safe
I agree with RL agents being misaligned by default, even more so for the non-imitation-learned ones. I mean, even LLMs trained on human-generated data are misaligned by default, regardless of what definition of ‘alignment’ is being used. But even with misalignment by default, I’m just less convinced that their capabilities would grow fast enough to be able to cause an existential catastrophe in the near-term, if we use LLM capability improvement trends as a reference.

zeshen 26 Apr 2024 9:00 UTC
11 points
0
on: LLMs seem (relatively) safe
Thanks for this post. This is generally how I feel as well, but my (exaggerated) model of the AI alignment community would immediately attack me by saying “if you don’t find AI scary, you either don’t understand the arguments on AI safety or you don’t know how advanced AI has gotten”. In my opinion, a few years ago we were concerned about recursively self-improving AIs, and that seemed genuinely plausible and scary. But somehow, they didn’t really happen (or haven’t happened yet) despite people trying all sorts of ways to make it happen. And instead of an intelligence explosion, what we got was an extremely predictable improvement trend that was a function of only two things, i.e. data and compute. This made me qualitatively update my p(doom) downwards, and I was genuinely surprised that many people went the other way instead, updating upwards as LLMs got better.

zeshen 22 Apr 2024 9:13 UTC
1 point
0
on: Modern Transformers are AGI, and Human-Level
I’ve gotten push-back from almost everyone I’ve spoken with about this
I had also expected this reaction, and I always thought I was the only one who thinks we have basically achieved AGI since ~GPT-3. But looking at the upvotes on this post I wonder if this is a much more common view.

zeshen 19 Mar 2024 8:56 UTC
3 points
0
on: Using axis lines for good or evil
My first impression was also that axis lines are a matter of aesthetics. But then I browsed The Economist’s visual style guide and realized they also do something similar, i.e. omit the y-axis line (in fact, they omit the y-axis line on basically all their line and scatter plots, but almost always keep the gridlines).
Here’s also an article they ran about their errors in data visualization, albeit probably fairly introductory for the median LW reader.

zeshen 7 Mar 2024 12:53 UTC
3 points
2
on: Good taxonomies of all risks (small or large) from AI?
I’m pretty sure you have come across this already, but just in case you haven’t:
https://incidentdatabase.ai/taxonomy/gmf/

zeshen 23 Dec 2023 10:37 UTC
4 points
1
on: Funding case: AI Safety Camp
Strong upvoted. I was a participant in AISC8, on the team that went on to launch AI Standards Lab, which I think would not have been launched if not for AISC.
