IEEE Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007)

Author Index

A

Abu-Khalaf, Murad

Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics

Al-Tamimi, Asma

Discrete-time nonlinear HJB solution using Approximate dynamic programming: Convergence Proof

Anderson, Stuart

Identifying Trajectory Classes in Dynamic Tasks

Antos, Andras

Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory

Atkeson, Christopher

Knowledge Transfer using Local Features

Randomly Sampling Actions In Dynamic Programming

B

Bakker, Bram

Reinforcement learning by backpropagation through an LSTM model/critic

Barty, Kengy

Q-learning algorithm with continuous state spaces and finite decision set

Bhattacharyya, Siddhartha

Leader-Follower semi-Markov Decision Problems: Theoretical Framework and Approximate Solution

Bowling, Michael

Dual Representations for Dynamic Programming and Reinforcement Learning

C

Cai, Chen

An Approximate Dynamic Programming Strategy for Responsive Traffic Signal Control

Carnieri, Ricardo

Strategy Generation with Cognitive Distance in Two-Player Games

Chapel, Laetitia

SVM Viability Controller Active Learning: Application to Bike Control

Coppin, Gilles

Dynamic optimization of the strength ratio during a terrestrial conflict

Coquelin, Pierre-Arnaud

A Dynamic Programming Approach to Viability Problems

Csaba, Szepesvari

Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory

D

Dagli, Cihan

Short-term Stock Market Timing Prediction under Reinforcement Learning Schemes

Davy, Manuel

Sparse Temporal Difference Learning using LASSO

De Jong, Edwin

Computing Optimal Stationary Policies for Multi-Objective Markov Decision Processes

Deffuant, Guillaume

SVM Viability Controller Active Learning: Application to Bike Control

Duan, Haibin

Hybrid Ant Colony Optimization Using Memetic Algorithm for Traveling Salesman Problem

E

Elhanany, Itamar

A Scalable Recurrent Neural Network Framework for Model-Free POMDPs

Enke, David

Short-term Stock Market Timing Prediction under Reinforcement Learning Schemes

F

Fernandez, Emmanuel

An Approximate Dynamic Programming Approach for Job Releasing and Sequencing in a Reentrant Manufacturing Line

Frazier, Peter

The Knowledge Gradient Policy for Offline Learning with Independent Normal Rewards

Fukuda, Toshio

Strategy Generation with Cognitive Distance in Two-Player Games

Fullana, Rogelio

Approximate Optimal Control-Based Neurocontroller with a State Observation System for Seedlings Growth in Greenhouse

G

Gabel, Thomas

On a Successful Application of Multi-Agent Reinforcement Learning to Operations Research Benchmarks

Girardeau, Pierre

Q-learning algorithm with continuous state spaces and finite decision set

H

Huang, Ting

Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks

Hudry, Olivier

Dynamic optimization of the strength ratio during a terrestrial conflict

I

Ilin, Roman

Efficient Learning in Cellular Simultaneous Recurrent Neural Networks - The Case of Maze Navigation Problem

J

Jagannathan, Sarangapani

Online Reinforcement Learning Neural Network Controller Design for Nanomanipulation

Jin, Ning

Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks

Jung, Tobias

Kernelizing LSPE(λ)

K

Kamel, Mohamed S.

Opposition-Based Q(λ) with Non-Markovian Update

Kaymak, Uzay

A Theoretical Analysis of Cooperative Behavior in Multi-agent Q-learning

Kozma, Robert

Efficient Learning in Cellular Simultaneous Recurrent Neural Networks - The Case of Maze Navigation Problem

Kuchen, Benjamin

Approximate Optimal Control-Based Neurocontroller with a State Observation System for Seedlings Growth in Greenhouse

Kulkarni, Nilesh

Reinforcement-Learning-based Magneto-Hydrodynamic Control of Hypersonic Flows

L

Lewis, Frank L.

Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics

Discrete-time nonlinear HJB solution using Approximate dynamic programming: Convergence Proof

Li, Baohua

Robust Dynamic Programming for Discounted Infinite-Horizon Markov Decision Processes with Uncertain Stationary Transition Matrices

Li, Hailin

Short-term Stock Market Timing Prediction under Reinforcement Learning Schemes

Lin, Wei-Song

DHP Adaptive Critic Motion Control of Autonomous Wheeled Mobile Robot

Liu, Derong

Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks

Particle Swarm Optimized Adaptive Dynamic Programming

Liu, Zhenzhen

A Scalable Recurrent Neural Network Framework for Model-Free POMDPs

Loth, Manuel

Sparse Temporal Difference Learning using LASSO

M

Mahootchi, Masoud

Opposition-Based Reinforcement Learning in the Management of Water Resources

Martin, Sophie

A Dynamic Programming Approach to Viability Problems

Mollaie Emamzadeh, Mohammad

A Novel Fuzzy Reinforcement Learning Approach in Two-Level Intelligent Control of 3-DOF Robot Manipulators

Munos, Remi

A Dynamic Programming Approach to Viability Problems

N

Nascimento, Juliana

An Optimal ADP Algorithm for a High-Dimensional Stochastic Control Problem

Nowe, Ann

The Effect of Bootstrapping in Multi-Automata Reinforcement Learning

P

Pang, Zhongyu

Discrete-Time Approximate Dynamic Programming using Wavelet Basis Function Neural Networks

Patino, Daniel

Approximate Optimal Control-Based Neurocontroller with a State Observation System for Seedlings Growth in Greenhouse

Peeters, Maarten

The Effect of Bootstrapping in Multi-Automata Reinforcement Learning

Peters, Jan

Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark

Using Reward-weighted Regression for Reinforcement Learning of Task Space Control

Phan, Minh

Reinforcement-Learning-based Magneto-Hydrodynamic Control of Hypersonic Flows

Polani, Daniel

Kernelizing LSPE(λ)

Ponnambalam, K.

Opposition-Based Reinforcement Learning in the Management of Water Resources

Powell, Warren

The Knowledge Gradient Policy for Offline Learning with Independent Normal Rewards

An Optimal ADP Algorithm for a High-Dimensional Stochastic Control Problem

Preux, Philippe

Sparse Temporal Difference Learning using LASSO

Prokhorov, Danil

Toward effective combination of off-line and on-line training in ADP framework

Pucheta, Julian

Approximate Optimal Control-Based Neurocontroller with a State Observation System for Seedlings Growth in Greenhouse

R

Ramirez-Hernandez, Jose

An Approximate Dynamic Programming Approach for Job Releasing and Sequencing in a Reentrant Manufacturing Line

Remi, Munos

Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory

Riedmiller, Martin

Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark

Fitted Q Iteration with CMACs

On a Successful Application of Multi-Agent Reinforcement Learning to Operations Research Benchmarks

Roy, Jean Sebastien

Q-learning algorithm with continuous state spaces and finite decision set

S

Sadati, Nasser

A Novel Fuzzy Reinforcement Learning Approach in Two-Level Intelligent Control of 3-DOF Robot Manipulators

Schaal, Stefan

Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark

Using Reward-weighted Regression for Reinforcement Learning of Task Space Control

Schaefer, Anton

A Recurrent Control Neural Network for Data Efficient Reinforcement Learning

Schugurensky, Carlos

Approximate Optimal Control-Based Neurocontroller with a State Observation System for Seedlings Growth in Greenhouse

Schuurmans, Dale

Dual Representations for Dynamic Programming and Reinforcement Learning

Sekiyama, Kosuke

Strategy Generation with Cognitive Distance in Two-Player Games

Shokri, Maryam

Opposition-Based Q(λ) with Non-Markovian Update

Si, Jennie

Robust Dynamic Programming for Discounted Infinite-Horizon Markov Decision Processes with Uncertain Stationary Transition Matrices

Srinivasa, Siddhartha

Identifying Trajectory Classes in Dynamic Tasks

Stolle, Martin

Knowledge Transfer using Local Features

Strehl, Alexander

Model-Based Reinforcement Learning in Factored-State MDPs

Strugarek, Cyrille

Q-learning algorithm with continuous state spaces and finite decision set

Sztykgold, Alexandre

Dynamic optimization of the strength ratio during a terrestrial conflict

T

Tham, Chen-Khong

Coordinated Reinforcement Learning for Decentralized Optimal Control

Tharakunnel, Kurian

Leader-Follower semi-Markov Decision Problems: Theoretical Framework and Approximate Solution

Timmer, Stephan

Fitted Q Iteration with CMACs

Tizhoosh, Hamid R.

Opposition-Based Q(λ) with Non-Markovian Update

Opposition-Based Reinforcement Learning in the Management of Water Resources

U

Udluft, Steffen

A Recurrent Control Neural Network for Data Efficient Reinforcement Learning

V

van Hasselt, Hado

Convergence of Model-Based Temporal Difference Learning for Control

Reinforcement Learning in Continuous Action Spaces

Two Novel On-policy Reinforcement Learning Algorithms based on TD(λ)-methods

Verbeeck, Katja

The Effect of Bootstrapping in Multi-Automata Reinforcement Learning

Vrabie, Draguna

Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics

W

Waltman, Ludo

A Theoretical Analysis of Cooperative Behavior in Multi-agent Q-learning

Wang, Tao

Dual Representations for Dynamic Programming and Reinforcement Learning

Wang, Youyi

Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics

Werbos, Paul

Efficient Learning in Cellular Simultaneous Recurrent Neural Networks - The Case of Maze Navigation Problem

Using ADP to Understand and Replicate Brain Intelligence: the Next Level Design

Wiering, Marco

Computing Optimal Stationary Policies for Multi-Objective Markov Decision Processes

Convergence of Model-Based Temporal Difference Learning for Control

Reinforcement Learning in Continuous Action Spaces

Two Novel On-policy Reinforcement Learning Algorithms based on TD(λ)-methods

Y

Yagan, Daniel

Coordinated Reinforcement Learning for Decentralized Optimal Control

Yang, Ping-Chieh

DHP Adaptive Critic Motion Control of Autonomous Wheeled Mobile Robot

Yang, Qinmin

Online Reinforcement Learning Neural Network Controller Design for Nanomanipulation

Yi, Jianqiang

Particle Swarm Optimized Adaptive Dynamic Programming

Yu, Xiufen

Hybrid Ant Colony Optimization Using Memetic Algorithm for Traveling Salesman Problem

Z

Zhao, Dongbin

Particle Swarm Optimized Adaptive Dynamic Programming

Zimmermann, Hans-Georg

A Recurrent Control Neural Network for Data Efficient Reinforcement Learning