- REINFORCEMENT LEARNING IN CONTINUOUS TIME:
- ADVANTAGE UPDATING
- ABSTRACT
- REINFORCEMENT LEARNING
- Q-LEARNING
- ADVANTAGE UPDATING
- The Advantage Updating Algorithm
- LINEAR QUADRATIC REGULATOR
- SIMULATION RESULTS
- CONCLUSION
- ACKNOWLEDGMENTS
- REFERENCES
- APPENDIX: LQR CONSTANTS