Background: AI Master Student, some practice in RL
I don’t think there is a fundamental reason that we can’t but it’s rather that no one did it. I don’t know a definitive answer as to why but here are some options:
too obscure (‘no-one has thought of it’, or ‘no-one thought it was a good idea, it’s only assembly-like code after all’)
high barrier of entry (you need to write an RL environment that you can query fast, and you need a lot of compute)
this makes it harder for individuals or small teams to do this, and larger players like DeepMind and OpenAI might have different priorities
now that we have Codex (and soon^TM its newer, allegedly much better version), there might not be any (economic or scientific) reason to do this
What’s the value you’d get out of a code wars expert model? I glanced at the Wikipedia page. How would you convert its outputs to a useful program, that does more than gobble up all your memory?
Background: AI Master Student, some practice in RL
I don’t think there is a fundamental reason that we can’t but it’s rather that no one did it. I don’t know a definitive answer as to why but here are some options:
too obscure (‘no-one has thought of it’, or ‘no-one thought it was a good idea, it’s only assembly-like code after all’)
high barrier of entry (you need to write an RL environment that you can query fast, and you need a lot of compute)
this makes it harder for individuals or small teams to do this, and larger players like DeepMind and OpenAI might have different priorities
now that we have Codex (and soon^TM its newer, allegedly much better version), there might not be any (economic or scientific) reason to do this
What’s the value you’d get out of a code wars expert model? I glanced at the Wikipedia page. How would you convert its outputs to a useful program, that does more than gobble up all your memory?