Reinforcement Learning in R