Glad to see we’re basically agreed. However, how would you take safety precautions around your own work on such algorithms, given that our last big similar breakthrough (transformers for language modelling) basically instantly got co-opted for RL to be “agentified”? Unless you’re literally doing this alone (with a very strong will), wouldn’t that be the natural path for any company/group once the simulator is finished?
“Share the dual-use stuff only with specific people who are known to properly understand the AGI risk, can avoid babbling about it in public, and would be useful contributors” seems like the straightforward approach here.
Like, groups of people are able to maintain commercial secrets. This is not unlike that, except with somewhat higher stakes.
I mean, AI people are notoriously bad at doing these kinds of things xD I would expect the people running OpenAI or Anthropic to have said similar things (when their orgs were just starting out). So I hope you can see why I wanted to ask this. None of this is to cast any doubt on your ability or motives, just noting the minefield that is unfortunately next to the park where we’re having this conversation.
For what it’s worth, I’m painfully aware of all the skulls lying around, yep.