Something I learned today that might be relevant: OpenAI was not the first organization to train transformer language models with search engine access to the internet. Facebook AI Research released their own paper on the topic six months before WebGPT came out, though, surprisingly, the WebGPT paper does not cite it.
Generally I agree that hooking language models up to the internet is terrifying, despite the potential improvements in factual accuracy. Paul’s arguments seem more detailed on this and I’m not sure what I would think if I considered them more carefully. But the fact that OpenAI was following rather than leading the field would be some evidence against WebGPT having accelerated timelines.
I did not know!
However, I don’t think this is really the same reference class in terms of risk. The search engine access in the Facebook case looks much more limited: the system essentially appends a number of relevant documents to the query, rather than the model itself issuing commands such as starting new searches and clicking on links.
It does generate the query itself, though:
> A search query generator: an encoder-decoder Transformer that takes in the dialogue context as input, and generates a search query. This is given to the black-box search engine API, and N documents are returned.
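Read that way, the pipeline is a fixed one-shot loop rather than an open-ended browsing agent. A minimal sketch of that shape, with every function name and all internals as illustrative stand-ins (this is not the paper’s actual code or search API):

```python
# Illustrative sketch of the FAIR-style pipeline: one generated query,
# N documents appended to the context, one response. The model never
# clicks links or issues follow-up commands.

def generate_search_query(dialogue_context: str) -> str:
    """Stand-in for the encoder-decoder query generator, which maps the
    dialogue context to a single search query."""
    # A real system would run a finetuned seq2seq model here.
    return dialogue_context[:64]

def search_engine(query: str, n: int = 5) -> list[str]:
    """Stand-in for the black-box search engine API, returning N documents."""
    return [f"document {i} matching {query!r}" for i in range(n)]

def respond(dialogue_context: str) -> str:
    """Single retrieval step: query once, append the documents, answer."""
    query = generate_search_query(dialogue_context)
    docs = search_engine(query)
    augmented_context = "\n".join(docs + [dialogue_context])
    # A real system would run the response generator on augmented_context;
    # here we just report what it would condition on.
    return f"response conditioned on {len(docs)} documents"

print(respond("Who trained the first internet-augmented language model?"))
```

The contrast with WebGPT is that the control flow here lives outside the model: the model fills in one slot (the query) and then conditions on whatever comes back, rather than deciding which actions to take next.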
Does the model itself generate the query, or is that a separate trained system? I was a bit confused about this in the paper.
You’d think they’d train the same model weights and just make it multi-task with the appropriate prompting, but no: that phrasing implies it’s a separate finetuned model, to the extent that that matters. (I don’t think it particularly matters, because whether it’s one model or several, the system as a whole still has most of the same behaviors and feedback loops once it gets more access to data or starts being trained on previous dialogues/sessions. How many systems are in your system? Probably a lot, depending on your level of analysis. Nevertheless...)
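For contrast, the “same weights, multi-task via prompting” setup I’d have expected could look something like the following sketch; the task prefixes and routing are purely hypothetical and are not what the FAIR paper does:

```python
# Hypothetical sketch: one model multi-tasked via prompting, rather than
# separate finetuned models per stage. The task prefixes are invented
# for illustration.

def shared_model(prompt: str) -> str:
    """Stand-in for a single set of finetuned weights serving every stage,
    with the task selected by a prefix in the prompt."""
    if prompt.startswith("[generate_query] "):
        context = prompt.removeprefix("[generate_query] ")
        return f"search query for: {context}"
    if prompt.startswith("[generate_reply] "):
        context = prompt.removeprefix("[generate_reply] ")
        return f"reply grounded in: {context}"
    raise ValueError("unknown task prefix")

# The same weights handle both stages, switched only by the prompt:
print(shared_model("[generate_query] why is the sky blue?"))
print(shared_model("[generate_reply] docs + why is the sky blue?"))
```

Either way the outer loop is the same, which is why the one-model-or-many distinction matters less than the feedback loops the whole system sits in.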