A Q-learning Algorithm for Two-Stage Hybrid Flow Shop Scheduling