Kybernetika 34 no. 2, 217-234, 1998

Adaptive control for discrete-time Markov processes with unbounded costs: Discounted criterion

Evgueni I. Gordienko and J. Adolfo Minjárez-Sosa

Abstract:

We study the adaptive control problem for discrete-time Markov control processes with Borel state and action spaces and possibly unbounded one-stage costs. The processes are given by the recurrence equations $x_{t+1}=F(x_t,a_t,\xi_t)$, $t=0,1,\ldots$, with i.i.d. $\Re^k$-valued random vectors $\xi_t$ whose density $\rho$ is unknown. Assuming observability of $\xi_t$, we propose a procedure for the statistical estimation of $\rho$ that allows us to prove discounted asymptotic optimality of two types of adaptive policies used earlier for processes with bounded costs.
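For orientation, the discounted criterion referred to in the title can be sketched as follows; the notation below is the standard one for discounted Markov control problems and is not quoted from the paper itself:

$$
V_\alpha(\pi,x) \;=\; \mathbb{E}_x^{\pi}\!\left[\sum_{t=0}^{\infty} \alpha^{t}\, c(x_t,a_t)\right], \qquad 0<\alpha<1,
$$

where $c$ is the (possibly unbounded) one-stage cost and $\pi$ a control policy. In this setting an adaptive policy is asymptotically optimal if its discounted cost approaches the optimal value $V_\alpha^*(x):=\inf_\pi V_\alpha(\pi,x)$ as the estimates of $\rho$ built from the observed $\xi_t$ improve.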