Avoiding Failure States During Reinforcement Learning