Table of Links
- Similar Work
- Methodology
- Results
2 Deep Reinforcement Learning
The NN is trained by minimizing the difference between its output and the target value. The objective function for iteration 𝑖 is given by
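As an illustration of this kind of objective, the sketch below assumes a mean-squared-error loss between the network outputs and the target values. The function name `mse_objective` and the sample numbers are hypothetical, and the paper's exact objective may differ.

```python
from typing import Sequence


def mse_objective(outputs: Sequence[float], targets: Sequence[float]) -> float:
    """Mean squared difference between network outputs and target values.

    Assumption: a squared-error loss averaged over the samples, which is a
    common choice but is not confirmed by the text above.
    """
    if len(outputs) != len(targets):
        raise ValueError("outputs and targets must have the same length")
    if len(outputs) == 0:
        raise ValueError("at least one sample is required")
    # Average the squared gap between each target and the matching output.
    return sum((t - o) ** 2 for o, t in zip(outputs, targets)) / len(outputs)


# Hypothetical outputs and targets for a single training iteration.
outputs = [1.0, 2.0, 4.0]
targets = [1.0, 3.0, 2.0]
loss = mse_objective(outputs, targets)  # squared gaps are 0, 1 and 4
```

Training would then adjust the network weights to reduce this value, typically by gradient descent; the loss is zero only when every output matches its target.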
Authors:
(1) Reilly Pickard, Department of Mechanical and Industrial Engineering, University of Toronto, Toronto, Canada;
(2) F. Wredenhagen, Ernst & Young LLP, Toronto, ON, M5H 0B3, Canada;
(3) Y. Lawryshyn, Department of Chemical Engineering, University of Toronto, Toronto, Canada.
This paper is