Michaël Trazzi
In a blog post published yesterday, Sam Altman writes about the possibility of needing to coordinate with governments and other labs before proceeding further (emphasis mine):
We expect there will be periods where we need to collaborate with governments, international agencies, and other AGI efforts to ensure that we have sufficiently solved serious alignment, safety, or societal problems before proceeding further with our work.
This comes one month after Stop The AI Race’s March 21st protest in front of OpenAI, Anthropic, and xAI (which I organized), which asked Sam Altman (alongside other CEOs) to make a statement on conditionally pausing frontier AI development, and after a follow-up direct message on March 25th asking Sam Altman to clarify his take on conditionally pausing AI. (The Musk v. Altman trial also begins today, which may be relevant to the timing.)
Other parts of the blogpost also point towards more coordination with other labs and governments:
“we need to ensure that key decisions about AI are made via democratic processes and with egalitarian principles, and not just made by AI labs.”
“AI will introduce new risks, and we will work with other companies, ecosystems, governments, and society to solve them.”
“No AI lab can ensure a good future alone. For an obvious example, there may be extremely capable models that make it easier to create a new pathogen, and we need a society-wide approach to defend against this with pathogen-agnostic countermeasures.”
Overall, I’ve been pretty impressed with ControlAI’s team & their ability to talk to many policymakers in the UK & US.
At the moment, we are cautiously optimistic: in the past 5 months, with ~1 staff member,[16] we’ve managed to personally meet with and brief 18 members of Congress, as well as over 90 Congressional offices.
Footnote 16 says:
1 member for most of this period; the 2nd member joined in the past month.
How successful do you expect the third, fourth, fifth, etc. person you hire to be at getting those meetings?
Yeah, some people who have been flyering for this have noticed that most people just take a picture of the flyer & don’t bother to actually RSVP to the protest (sometimes for privacy reasons). We’ll see how many people end up coming!
More people show up on weekends, yeah.
Removed the “quietly” and linked to Holden’s post, thanks!
In two days (March 21st, 12-4pm), about 140 of us (event link) will be marching on Anthropic, OpenAI, and xAI in SF, asking the CEOs to make statements on whether they would stop developing new frontier models if every other major lab in the world credibly does the same. This comes after Anthropic removed its commitment to pause development from its RSP.
We’ll be starting at 500 Howard St, San Francisco (Anthropic’s office; full schedule and more info here). This is shaping up to be the biggest US AI Safety protest to date, with a coalition including Nate Soares (MIRI), David Krueger (Evitable), Will Fithian (Berkeley professor), and folks representing PauseAI, QuitGPT, and Humans First.
METR’s 14h 50% Horizon Impacts The Economy More Than ASI Timelines
36,000 AI Agents Are Now Speedrunning Civilization
Some questions I have:
1. Compute bottleneck
The model says experiment compute becomes the binding constraint once coding is fast. But are frontier labs actually compute-bottlenecked on experiments right now? Anthropic runs inference for millions of users while training models. With revenue growing, more investment coming in, and datacenters being built, couldn’t they allocate e.g. 2x more to research compute this year if they wanted?
2. Research taste improvement rate
The model estimates AI research taste improvement based on how quickly AIs have improved on a variety of metrics.
But researchers at a given taste level can now run many more experiments because Claude Code removes the coding bottleneck.
More experiment output means faster feedback, which in turn means faster taste development. So the rate at which human researchers develop taste should itself be accelerating. Does your model capture this? Or does it assume taste improvement is only a function of effective compute, not of experiment throughput?
3. Low-value code
Ryan’s argument (from his October post) is that AI makes it cheap to generate code, so people generate more low-value code they wouldn’t have otherwise written.
But here’s my question: if the marginal code being written is “low-value” in the sense of “wouldn’t have been worth a human’s time before,” isn’t that still a real productivity gain, if, say, researchers can now run a bunch of Claude Code agent instances to run experiments instead of having to interface with a bunch of engineers?
4. What AIs Can’t Do
The model treats research taste as qualitatively different from coding ability. But what exactly is the hard thing AIs can’t do? If it’s “generating novel ideas across disciplines” or “coming up with new architectures”, these seem like capabilities that scale with knowledge and reasoning, both of which are improving. IIRC there’s some anecdotal evidence of novel discoveries, e.g. an LLM solving an Erdős problem, and someone in Scott Aaronson’s sphere discussing AI contributions to something like quantum physics problems? Not sure.
If it’s “making codebases more efficient”, AIs already beat humans at competitive programming. I’ve seen some posts on LW where people timed themselves against an AI on a task the AI should be able to do, and they beat the AI. But intuitively it does seem to me that models are getting better at the general “optimizing codebases” thing, even if they’re not quite best-human-level yet.
5. Empirical basis for β (diminishing returns)
The shift from AI 2027 to the new model seems to come partly from “taking into account diminishing returns”, aka the Jones model assumption that ideas get harder to find. What data did you use to estimate β? And given we’re now in a regime with AI-assisted research, why should historical rates of diminishing returns apply going forward?
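For reference, the functional form I have in mind for “ideas get harder to find” is a Jones-style law of motion (a rough sketch of the standard semi-endogenous setup, not necessarily exactly what your model uses):

$$\frac{\dot{A}}{A} = \delta \, S^{\lambda} A^{-\beta}$$

where $A$ is the level of (software) capability, $S$ is research input, $\lambda$ captures parallelization penalties, and $\beta > 0$ controls how quickly returns diminish as $A$ grows. My question is essentially how $\beta$ (and $\lambda$) were estimated, and whether that estimate should transfer to an AI-assisted-research regime.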
Demis Hassabis finally agreed that he would pause if everyone else also paused.
https://x.com/emilychangtv/status/2013726877706313798?s=20
The Hunger Strike To Stop The AI Race
I’m not sure whether it will result in a wake-up call for AI researchers.
See this thread yesterday between Chris Painter and Dean Ball:
Chris: I have the sense that, as of the last 6 months, a lot of tech now thinks an intelligence explosion is more plausible than they did previously. But I don’t feel like I’m hearing a lot about that changing people’s minds on the importance of alignment and control research.
Dean: do people really think alignment and control research is unimportant? it seems like a big part of why opus is so good is the approach ant took to aligning it, and like basically everyone recognizes this?
Chris: I’m not sure they think it’s unimportant. It’s more that around a year ago a lot of people would’ve said something like “Well, some people are really nervous about alignment and control research and loss of control etc, but that’s because they have this whole story of AI foom and really dramatic self-improvement. I think that story is way overstated, these models just don’t speed me up that much today, and I think we’ll have issues with autonomy for a long time, it’s really hard.” So, they often stated their objection in a way that made it sound like rapid progress on AI R&D automation would change their mind. To be clear, I think there are stronger objections that they could have raised and could still raise, like “we will then move to hardware bottlenecks, which will require AI proliferation for a true speed up to materialize”. Also, sorry if it’s rough for me to not be naming specific names, would just take time to pull examples.
Using @ryan_greenblatt’s updated 5-month doubling time: we reach the 1-month horizon from AI 2027 in ~5 doublings (Jan 2028) at 50% reliability, and ~8 doublings (Apr 2029) at 80% reliability. If I understand correctly, your model uses 80% reliability while also requiring 30x cheaper and faster than humans. It does seem like if the trend holds, by mid-2029 the models wouldn’t be much more expensive or slower. But I agree that if a lab tried to demonstrate “superhuman coder” on METR by the end of next year using expensive scaffolding / test-time compute (similar to o1 on ARC-AGI last year), it would probably exceed 30x human-cost, even if already 30x faster.
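Here’s the back-of-envelope behind those doubling counts (a rough sketch in code; the ~4.8h starting 50%-horizon and the ~167-hour “work month” are my assumed inputs, and the 80%-reliability case would start from a correspondingly lower horizon):

```python
from math import log2

def doublings_needed(current_horizon_hours: float, target_horizon_hours: float) -> float:
    """How many doublings it takes to grow the time horizon from current to target."""
    return log2(target_horizon_hours / current_horizon_hours)

# Assumed inputs (illustrative, not exact figures):
current_50 = 4.8          # ~4h49m 50%-reliability horizon (roughly Opus 4.5 per METR)
target = 167.0            # ~1 work month, in hours
doubling_time_months = 5  # Ryan's updated doubling-time estimate

n = doublings_needed(current_50, target)   # ~5.1 doublings
months_away = n * doubling_time_months     # ~26 months, i.e. around Jan 2028
print(f"{n:.1f} doublings, ~{months_away:.0f} months away")
```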
Are We In A Coding Overhang?
Fixed
Claude Opus 4.5 Achieves 50%-Time Horizon Of Around 4 hrs 49 Mins
Any updates on the 2025 numbers for Lighthaven? (cf. this table from last year’s fundraiser)
My guess at what’s happening here: for the first iterations of MATS (think MATS 2.0 at the Lightcone WeWork), you would have folks who had already been into AI Safety for quite a long time and were interested in doing some form of internship-like thing for a summer. But as you run more cohorts (and make the cohorts bigger), the density of people who have been interested in safety for a long time naturally decreases (because all the people who had been interested in safety for years already applied to previous iterations).
Congrats on the launch!
I would add the main vision for this (from the website) directly in the post as quoted text, so that people can understand what you’re doing (& discuss).
I do agree that “asserts” was too strong. Changed to “writes about the possibility of needing to coordinate with governments and other labs before proceeding further” to stay closer to the quote.
That said, I still think reading the “before” part as a statement about pausing (even for a short time) is a reasonable interpretation.