FPMCO decomposes multi-constraint RL into KL-projection sub-problems, achieving higher reward with lower computing than second-order rivals on the new SCIG robotics benchmark.
Abstract: Job-dependent tool switching is necessary in many batch processing systems (BPSs). Heterogeneous tool demand and extra time consumption for tool switches bring great challenges for ...