The agent's goal is to collect as many rewards as possible with fewer steps and decisions to avoid/fight/jump obstacles and enemies. Structured base on Malmo's framework.