STATISTICALLY EFFICIENT REINFORCEMENT LEARNING