[2402.00313] Control in Stochastic Environment with Delays: A Model-based Reinforcement Learning Approach