Fleet Planning Under Demand And Fuel Price Uncertainty Using Actor-Critic Reinforcement Learning