Optimising Grid Topology Reconfiguration Using Reinforcement Learning