- ABSTRACT
- TABLE OF CONTENTS
- LIST OF FIGURES
- LIST OF TABLES
- ACKNOWLEDGMENTS
- 1. INTRODUCTION
- 2. REINFORCEMENT LEARNING SYSTEMS
- 3. THE ADVANTAGE UPDATING ALGORITHM
- Advantage Updating
- 4. A LINEAR QUADRATIC REGULATOR PROBLEM
- 5. Q-LEARNING WITH SMALL TIME STEPS
- 6. SIMULATION RESULTS
- 7. CONVERGENCE OF ADVANTAGE UPDATING
- 8. IMPLEMENTATION ISSUES
- 9. CONCLUSION
- 10. BIBLIOGRAPHY
- APPENDIX A: NOTATION
- APPENDIX B: LQR CONSTANTS