Hamilton-Jacobi Reachability Estimation in Reinforcement Learning