Session | ADP and RL in Real-time Feedback Systems |
Chair | Xin Xu |
Co-Chair | Haibo He |
Exponential Moving Average Q-Learning Algorithm | |
Real-Time Tracking on Adaptive Critic Design with Uniformly Ultimately Bounded Condition | |
A Novel Approach for Constructing Basis Functions in Approximate Dynamic Programming for Feedback Control | |
A Combined Hierarchical Reinforcement Learning Based Approach for Multi-Robot Cooperative Target Searching in Complex Unknown Environments | |
The Second Order Temporal Difference Error for Sarsa(λ) |