Abu-Khalaf, Murad
Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics
AL Tamimi, Asma
Discrete-time nonlinear HJB solution using Approximate dynamic programming: Convergence Proof
Anderson, Stuart
Identifying Trajectory Classes in Dynamic Tasks
Antos, Andras
Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory
Atkeson, Christopher
Knowledge Transfer using Local Features
Randomly Sampling Actions In Dynamic Programming
Bakker, Bram
Reinforcement learning by backpropagation through an LSTM model/critic
Barty, Kengy
Q-learning algorithm with continuous state spaces and finite decision set
Bhattacharyya, Siddhartha
Leader-Follower semi-Markov Decision Problems: Theoretical Framework and Approximate Solution
Bowling, Michael
Dual Representations for Dynamic Programming and Reinforcement Learning
Cai, Chen
An Approximate Dynamic Programming Strategy for Responsive Traffic Signal Control
Carnieri, Ricardo
Strategy Generation with Cognitive Distance in Two-Player Games
Chapel, Laetitia
SVM Viability Controller Active Learning: Application to Bike Control
Coppin, Gilles
Dynamic optimization of the strength ratio during a terrestrial conflict
Coquelin, Pierre-Arnaud
A Dynamic Programming Approach to Viability Problems
Csaba, Szepesvari
Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory
Dagli, Cihan
Short-term Stock Market Timing Prediction under Reinforcement Learning Schemes
Davy, Manuel
Sparse Temporal Difference Learning using LASSO
De Jong, Edwin
Computing Optimal Stationary Policies for Multi-Objective Markov Decision Processes
Deffuant, Guillaume
SVM Viability Controller Active Learning: Application to Bike Control
Duan, Haibin
Hybrid Ant Colony Optimization Using Memetic Algorithm for Traveling Salesman Problem
Elhanany, Itamar
A Scalable Recurrent Neural Network Framework for Model-Free POMDPs
Enke, David
Short-term Stock Market Timing Prediction under Reinforcement Learning Schemes
Fernandez, Emmanuel
Frazier, Peter
The Knowledge Gradient Policy for Offline Learning with Independent Normal Rewards
Fukuda, Toshio
Strategy Generation with Cognitive Distance in Two-Player Games
Fullana, Rogelio
Gabel, Thomas
On a Successful Application of Multi-Agent Reinforcement Learning to Operations Research Benchmarks
Girardeau, Pierre
Q-learning algorithm with continuous state spaces and finite decision set
Huang, Ting
Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks
Hudry, Olivier
Dynamic optimization of the strength ratio during a terrestrial conflict
Ilin, Roman
Jagannathan, Sarangapani
Online Reinforcement Learning Neural Network Controller Design for Nanomanipulation
Jin, Ning
Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks
Jung, Tobias
Kamel, Mohamed S.
Opposition-Based Q(λ) with Non-Markovian Update
Kaymak, Uzay
A Theoretical Analysis of Cooperative Behavior in Multi-agent Q-learning
Kozma, Robert
Kuchen, Benjamin
Kulkarni, Nilesh
Reinforcement-Learning-based Magneto-Hydrodynamic Control of Hypersonic Flows
Lewis, Frank L.
Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics
Lewis, Frank
Discrete-time nonlinear HJB solution using Approximate dynamic programming: Convergence Proof
Li, Baohua
Li, Hailin
Short-term Stock Market Timing Prediction under Reinforcement Learning Schemes
Lin, Wei-Song
DHP Adaptive Critic Motion Control of Autonomous Wheeled Mobile Robot
Liu, Derong
Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks
Particle Swarm Optimized Adaptive Dynamic Programming
Liu, Zhenzhen
A Scalable Recurrent Neural Network Framework for Model-Free POMDPs
Loth, Manuel
Sparse Temporal Difference Learning using LASSO
Mahootchi, Masoud
Opposition-Based Reinforcement Learning in the Management of Water Resources
Martin, Sophie
A Dynamic Programming Approach to Viability Problems
Mollaie Emamzadeh, Mohammad
Munos, Remi
A Dynamic Programming Approach to Viability Problems
Nascimento, Juliana
An Optimal ADP Algorithm for a High-Dimensional Stochastic Control Problem
Nowe, Ann
The Effect of Bootstrapping in Multi-Automata Reinforcement Learning
Pang, Zhongyu
Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks
Patino, Daniel
Peeters, Maarten
The Effect of Bootstrapping in Multi-Automata Reinforcement Learning
Peters, Jan
Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark
Using Reward-weighted Regression for Reinforcement Learning of Task Space Control
Phan, Minh
Reinforcement-Learning-based Magneto-Hydrodynamic Control of Hypersonic Flows
Polani, Daniel
Ponnambalam, K.
Opposition-Based Reinforcement Learning in the Management of Water Resources
Powell, Warren
The Knowledge Gradient Policy for Offline Learning with Independent Normal Rewards
An Optimal ADP Algorithm for a High-Dimensional Stochastic Control Problem
Preux, Philippe
Sparse Temporal Difference Learning using LASSO
Prokhorov, Danil
Toward effective combination of off-line and on-line training in ADP framework
Pucheta, Julian
Ramirez-Hernandez, Jose
Remi, Munos
Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory
Riedmiller, Martin
Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark
On a Successful Application of Multi-Agent Reinforcement Learning to Operations Research Benchmarks
Roy, Jean Sebastien
Q-learning algorithm with continuous state spaces and finite decision set
Sadati, Nasser
Schaal, Stefan
Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark
Using Reward-weighted Regression for Reinforcement Learning of Task Space Control
Schaefer, Anton
A Recurrent Control Neural Network for Data Efficient Reinforcement Learning
Schugurensky, Carlos
Schuurmans, Dale
Dual Representations for Dynamic Programming and Reinforcement Learning
Sekiyama, Kosuke
Strategy Generation with Cognitive Distance in Two-Player Games
Shokri, Maryam
Opposition-Based Q(λ) with Non-Markovian Update
Si, Jennie
Srinivasa, Siddhartha
Identifying Trajectory Classes in Dynamic Tasks
Stolle, Martin
Knowledge Transfer using Local Features
Strehl, Alexander
Model-Based Reinforcement Learning in Factored-State MDPs
Strugarek, Cyrille
Q-learning algorithm with continuous state spaces and finite decision set
Sztykgold, Alexandre
Dynamic optimization of the strength ratio during a terrestrial conflict
Tham, Chen-Khong
Coordinated Reinforcement Learning for Decentralized Optimal Control
Tharakunnel, Kurian
Leader-Follower semi-Markov Decision Problems: Theoretical Framework and Approximate Solution
Timmer, Stephan
Tizhoosh, Hamid R.
Opposition-Based Q(λ) with Non-Markovian Update
Opposition-Based Reinforcement Learning in the Management of Water Resources
Udluft, Steffen
A Recurrent Control Neural Network for Data Efficient Reinforcement Learning
van Hasselt, Hado
Convergence of Model-Based Temporal Difference Learning for Control
Reinforcement Learning in Continuous Action Spaces
Two Novel On-policy Reinforcement Learning Algorithms based on TD(λ)-methods
Verbeeck, Katja
The Effect of Bootstrapping in Multi-Automata Reinforcement Learning
Vrabie, Draguna
Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics
Waltman, Ludo
A Theoretical Analysis of Cooperative Behavior in Multi-agent Q-learning
Wang, Tao
Dual Representations for Dynamic Programming and Reinforcement Learning
Wang, Youyi
Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics
Werbos, Paul
Using ADP to Understand and Replicate Brain Intelligence: the Next Level Design
Wiering, Marco
Computing Optimal Stationary Policies for Multi-Objective Markov Decision Processes
Convergence of Model-Based Temporal Difference Learning for Control
Reinforcement Learning in Continuous Action Spaces
Two Novel On-policy Reinforcement Learning Algorithms based on TD(λ)-methods
Yagan, Daniel
Coordinated Reinforcement Learning for Decentralized Optimal Control
Yang, Ping-Chieh
DHP Adaptive Critic Motion Control of Autonomous Wheeled Mobile Robot
Yang, Qinmin
Online Reinforcement Learning Neural Network Controller Design for Nanomanipulation
Yi, Jianqiang
Particle Swarm Optimized Adaptive Dynamic Programming
Yu, Xiufen
Hybrid Ant Colony Optimization Using Memetic Algorithm for Traveling Salesman Problem
Zhao, Dongbin
Particle Swarm Optimized Adaptive Dynamic Programming
Zimmermann, Hans-Georg
A Recurrent Control Neural Network for Data Efficient Reinforcement Learning