| Session | ADP and RL in Real-time Feedback Systems |
| Chair | Xin Xu |
| Co-Chair | Haibo He |
| Exponential Moving Average Q-Learning Algorithm |
| Real-Time Tracking on Adaptive Critic Design with Uniformly Ultimately Bounded Condition |
| A Novel Approach for Constructing Basis Functions in Approximate Dynamic Programming for Feedback Control |
| A Combined Hierarchical Reinforcement Learning Based Approach for Multi-Robot Cooperative Target Searching in Complex Unknown Environments |
| The Second Order Temporal Difference Error for Sarsa(λ) |

