- 1 ADVANTAGE UPDATING
- 2 RESIDUAL GRADIENT ALGORITHMS
- 3 THE SIMULATION
- 3.1 GAME DEFINITION
- 3.2 THE BELLMAN RESIDUAL AND UPDATE EQUATIONS
- 4 RESULTS
- 4.1 RESIDUAL GRADIENT ADVANTAGE UPDATING RESULTS
- 4.2 COMPARATIVE RESULTS
- 4.2.1 Experiment Set 1
- 4.2.2 Experiment Set 2
- 5 Conclusion
- Acknowledgments
- References