RSS

Mouhssine Rifaki

Karma: 9

Reinforcement learning, mostly. The parts I keep circling back to are multi-agent learning, environment design, and the question of when an agent should stop gathering information and commit.