Hello! I recently finished a draft on a version of RL that maybe able to streamline an LLM’s situational awareness and match our world models. If you are interested send me a message.=)
Hello! I recently finished a draft on a version of RL that maybe able to streamline an LLM’s situational awareness and match our world models. If you are interested send me a message.=)