The agent in the Sokoban example (https://github.com/raven-ml/raven/tree/main/fehu/examples/05-sokoban) currently learns to move to an edge and stay there instead of solving the puzzle. We need to tune the training parameters, and possibly fix underlying issues, so that the agent trains correctly.
cc @lukstafi