[Question] Suggestions for net positive LLM research

I am starting a PhD in computer science, focusing on agent foundations so far, which is great. I intend to continue devoting at least half my time to agent foundations.

However, for several reasons, it seems to be important for me to do some applied work, particularly with LLMs:

  1. I believe I’m otherwise pretty well positioned to get an impactful job at Google DeepMind, but apparently some impressive machine learning engineering creds are necessary to get on even the safety team currently.

  2. My PhD supervisor is pushing for me to work on LLMs, though he seems to be pretty flexible about the details.

  3. As much fun as math is, I also value developing my practical skills (in fact, half the fun of learning things is becoming more powerful in the real world).

  4. LLM experts seem likely to be in high demand right now, though I am not sure how long that will last.

Now, I’ve spent the last couple of years mostly studying AIXI and its foundations. I’m pretty comfortable with standard deep learning algorithms and libraries and I have some industry experience with machine learning engineering, but I am not an expert on NLP, LLMs, or prosiac alignment. Therefore, I am looking for suggestions from the community about LLM related research projects that would satisfy as many as possible of the following criteria:

  1. Is not directly focused on improving frontier model capabilities (for ethical reasons; though my timelines seem to be longer than the average lesswronger, I’m not able to accept the risk that I am wrong).

  2. Produces mundane utility. I find it much more fulfilling to work on things that I can see becoming useful to people, and I also want a measure of my success which is as concrete as possible.

  3. Contributes to prosiac alignment. It would be particularly nice if the experimental/​engineering work involved is likely to inform my ideas for mathematical alignment research.

  4. Machine learning engineering/​research creds.

Any suggestions are appreciated. I may also link this question to a Manifold market in the future (probably “conditional on working full time at DeepMind within 18 months of graduation, which areas of research did my PhD thesis include”) or something along those lines. Thanks!

No comments.