In your mind, what are the biggest bottlenecks/issues in “making fast, philosophically competent alignment researchers”?
[low effort list] Bottlenecks/issues/problems
- philosophy has worse short feedback loops than e.g. ML engineering → in all sorts of processes like MATS or PIBBSS admissions it is harder to select for philosophical competence, and also harder to self-improve
- incentives: obviously, stuff like being an actual expert in pretraining can get you a lot of money and respect in some circles; even many prosaic AI safety / dual-use skills like mech interpretability can get you maybe less money than pretraining, but still a lot of money if you work at AGI companies, and also a decent amount of status in the ML community and the AI safety community; improving philosophical competence may get you some recognition, but only among a relatively small and weird group of people
- the issue Wei Dai is commenting on in the original post: founder effects persist to this day, and there is also some philosophy-negative prior in STEM. Idk, lack of curiosity? LLMs have read it all, so it’s easy to check whether there is some existing thinking on a topic
Do you have your own off-the-cuff guesses about how you’d tackle the short-feedback-loops problem?
Also, is it more like we don’t know how to do short feedback loops, or more like we don’t even know how to do long/expensive loops?
There’s a deeper problem: how do we know there is a feedback loop?
I’ve never actually seen a worked-out proof of, well, any complex claim on this site using standard logical notation… (beyond pure math and trivial tautologies).
At most there’s a feedback loop on each other’s hand-wavy arguments that are claimed to be proofs of this or that. But nobody ever actually delivers the goods, so to speak, such that they can be verified.
(Putting the previous Wei Dai answer to “What are the open problems in Human Rationality?” here for easy reference, since it seemed like it might contain relevant stuff.)
AI doing philosophy = AI generating hands, plus the fact that philosophy is heavily corrupted by postmodernism, to the point where two authors wrote books dedicated to criticizing postmodernism PRECISELY because their parodies got published.
I think I meant a more practical / next-steps-generating answer.
I don’t think “academia is corrupted” is a bottleneck for a rationalist Get Gud At Philosophy project. We can just route around academia.
The sorts of things I was imagining might be things like “figure out how to teach a particular skill” (or “identify particular skills that need teaching”, or “figure out how to test whether someone has a particular skill”), or “solve some particular unsolved conceptual problem(s) that you expect to unlock much easier progress.”