<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1665-6423</journal-id>
<journal-title><![CDATA[Journal of applied research and technology]]></journal-title>
<abbrev-journal-title><![CDATA[J. appl. res. technol]]></abbrev-journal-title>
<issn>1665-6423</issn>
<publisher>
<publisher-name><![CDATA[Universidad Nacional Autónoma de México, Instituto de Ciencias Aplicadas y Tecnología]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1665-64232011000200002</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Mixed Acceleration Techniques for Solving Quickly Stochastic Shortest-Path Markov Decision Processes]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[García-Hernández]]></surname>
<given-names><![CDATA[M. de G.]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ruiz-Pinales]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Onaindía]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ledesma-Orozco]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Aviña-Cervantes]]></surname>
<given-names><![CDATA[J. G.]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Alvarado-Méndez]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Reyes-Ballesteros]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<xref ref-type="aff" rid="A03"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,University of Guanajuato  ]]></institution>
<addr-line><![CDATA[Salamanca Guanajuato]]></addr-line>
<country>México</country>
</aff>
<aff id="A02">
<institution><![CDATA[,Universitat Politécnica de Valéncia  ]]></institution>
<addr-line><![CDATA[Valencia ]]></addr-line>
<country>España</country>
</aff>
<aff id="A03">
<institution><![CDATA[,Electrical Research Institute  ]]></institution>
<addr-line><![CDATA[Cuernavaca Morelos]]></addr-line>
<country>México</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>08</month>
<year>2011</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>08</month>
<year>2011</year>
</pub-date>
<volume>9</volume>
<numero>2</numero>
<fpage>129</fpage>
<lpage>144</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1665-64232011000200002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1665-64232011000200002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1665-64232011000200002&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[In this paper we propose the combination of accelerated variants of value iteration mixed with improved prioritized sweeping for the fast solution of stochastic shortest-path Markov decision processes. Value iteration is a classical algorithm for solving Markov decision processes, but this algorithm and its variants are quite slow for solving considerably large problems. In order to improve the solution time, acceleration techniques such as asynchronous updates, prioritization and prioritized sweeping have been explored in this paper. A topological reordering algorithm was also compared with static reordering. Experimental results obtained on finite state and action-space stochastic shortest-path problems show that our approach achieves a considerable reduction in the solution time with respect to the tested variants of value iteration. For instance, the experiments showed in one test a reduction of 5.7 times with respect to value iteration with asynchronous updates.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[En este documento proponemos la combinación de variantes aceleradas del algoritmo de iteración de valor combinadas con el algoritmo de barrido priorizado mejorado para la rápida solución de los procesos de decisión de Markov de ruta estocástica más corta. Iteración de valor es un algoritmo clásico para resolver a los procesos de decisión de Markov, pero este algoritmo y sus variantes son lentos para resolver problemas considerablemente grandes. Con el objeto de mejorar el tiempo de solución de este algoritmo, en este documento se han explorado técnicas de aceleración tales como actualizaciones asíncronas, priorización y barrido priorizado. Un algoritmo de reordenamiento topológico también fue comparado con uno de reordenamiento estático. Los resultados experimentales obtenidos en un problema de ruta estocástica más corta con espacios de estados-acciones finitos; muestran que nuestro enfoque logra una considerable reducción en el tiempo de solución con respecto a las variantes de iteración de valor probadas. Por ejemplo, los experimentos mostraron en una prueba una reducción de 5.7 veces con respecto a iteración de valor usando actualizaciones asíncronas.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Markov decision processes]]></kwd>
<kwd lng="en"><![CDATA[acceleration techniques]]></kwd>
<kwd lng="en"><![CDATA[prioritization]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="center"><font face="verdana" size="4"><b>Mixed Acceleration Techniques for Solving Quickly Stochastic Shortest&#150;Path Markov Decision Processes</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="2"><b>M. de G. Garc&iacute;a&#150;Hern&aacute;ndez*<sup>1</sup>, J. Ruiz&#150;Pinales<sup>1</sup>, E. Onaind&iacute;a<sup>2</sup>, S. Ledesma&#150;Orozco<sup>1</sup>, J. G. Avi&ntilde;a&#150;Cervantes<sup>1</sup>, E. Alvarado&#150;M&eacute;ndez<sup>1</sup>, A. Reyes&#150;Ballesteros<sup>3</sup></b></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><i><sup>1</sup> University of Guanajuato, Comunidad de Palo Blanco s/n, C.P. 36885, Salamanca, Guanajuato, M&eacute;xico.</i> (<a href="mailto:garciag@ugto.mx">garciag@ugto.mx</a>, <a href="mailto:pinales@ugto.mx">pinales@ugto.mx</a>, <a href="mailto:selo@ugto.mx">selo@ugto.mx</a>, <a href="mailto:avina@ugto.mx">avina@ugto.mx</a>, <a href="mailto:ealvarad@ugto.mx">ealvarad@ugto.mx</a>)</font></p>     <p align="justify"><font face="verdana" size="2"><i><sup>2</sup> Universitat Polit&eacute;cnica de Val&eacute;ncia, DSIC, Camino de Vera s/n, 46022, Valencia, Espa&ntilde;a,</i> <a href="mailto:onaindia@dsic.upv.es">onaindia@dsic.upv.es</a></font></p>     <p align="justify"><font face="verdana" size="2"><i><sup>3</sup> Electrical Research Institute, Reforma 113, C.P. 62490, Temixco, Cuernavaca, Morelos, M&eacute;xico,</i> <a href="mailto:areyes@iie.org.mx">areyes@iie.org.mx</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>ABSTRACT</b></font></p>     <p align="justify"><font face="verdana" size="2">In this paper we propose the combination of accelerated variants of value iteration mixed with improved prioritized sweeping for the fast solution of stochastic shortest&#150;path Markov decision processes. Value iteration is a classical algorithm for solving Markov decision processes, but this algorithm and its variants are quite slow for solving considerably large problems. In order to improve the solution time, acceleration techniques such as asynchronous updates, prioritization and prioritized sweeping have been explored in this paper. A topological reordering algorithm was also compared with static reordering. Experimental results obtained on finite state and action&#150;space stochastic shortest&#150;path problems show that our approach achieves a considerable reduction in the solution time with respect to the tested variants of value iteration. For instance, the experiments showed in one test a reduction of 5.7 times with respect to value iteration with asynchronous updates.</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Keywords: </b>Markov decision processes, acceleration techniques, prioritization.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>RESUMEN</b></font></p>     <p align="justify"><font face="verdana" size="2">En este documento proponemos la combinaci&oacute;n de variantes aceleradas del algoritmo de iteraci&oacute;n de valor combinadas con el algoritmo de barrido priorizado mejorado para la r&aacute;pida soluci&oacute;n de los procesos de decisi&oacute;n de Markov de ruta estoc&aacute;stica m&aacute;s corta. Iteraci&oacute;n de valor es un algoritmo cl&aacute;sico para resolver a los procesos de decisi&oacute;n de Markov, pero este algoritmo y sus variantes son lentos para resolver problemas considerablemente grandes. Con el objeto de mejorar el tiempo de soluci&oacute;n de este algoritmo, en este documento se han explorado t&eacute;cnicas de aceleraci&oacute;n tales como actualizaciones as&iacute;ncronas, priorizaci&oacute;n y barrido priorizado. Un algoritmo de reordenamiento topol&oacute;gico tambi&eacute;n fue comparado con uno de reordenamiento est&aacute;tico. Los resultados experimentales obtenidos en un problema de ruta estoc&aacute;stica m&aacute;s corta con espacios de estados&#150;acciones finitos; muestran que nuestro enfoque logra una considerable reducci&oacute;n en el tiempo de soluci&oacute;n con respecto a las variantes de iteraci&oacute;n de valor probadas. Por ejemplo, los experimentos mostraron en una prueba una reducci&oacute;n de 5.7 veces con respecto a iteraci&oacute;n de valor usando actualizaciones as&iacute;ncronas.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><a href="/pdf/jart/v9n2/v9n2a2.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b><i>References</i></b></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; Boutilier, C., Dean, T. and Hanks, S., Decision&#150;theoretic planning: structural assumptions and computational leverage, Journal of Artificial Intelligence Research, 11, 1999, pp 1&#150;94.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856860&pid=S1665-6423201100020000200001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; Bellman, R. E., The theory of dynamic programming, Bull. Amer. Math. Soc., 60, 1954, pp 503&#150;516.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856862&pid=S1665-6423201100020000200002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;3&#93; Puterman, M. L., Markov Decision Processes, Wiley Editors, New York, USA, 1994.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856864&pid=S1665-6423201100020000200003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;4&#93; Kuter, U., Hu, J., Computing and Using Lower and Upper Bounds for Action Elimination in MDP Planning, </font><font face="verdana" size="2">Proceedings of the Symposium on Abstraction, Reformulation and Approximation, SARA, 2007.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856866&pid=S1665-6423201100020000200004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;5&#93; Dean, T., Kaelbling, L. P., Kirman, J. and Nicholson, A., Planning under Time Constraints in Stochastic Domains, Artificial Intelligence, 76 (1&#150;2), July 1995, pp 35&#150;74.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856868&pid=S1665-6423201100020000200005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; Boutilier, C., Dearden, R. and Goldszmidt, M., Stochastic Dynamic Programming with Factored Representations, Artificial Intelligence, 121 (1&#150;2), 2000, pp 49&#150;107.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856870&pid=S1665-6423201100020000200006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; Givan, R., Dean T. and Greig, M., Equivalence Notions and Model Minimization in MDPs, Artificial Intelligence, 147 (1&#150;2), 2003, pp 163&#150;233.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856872&pid=S1665-6423201100020000200007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; Tsitsiklis, J. N. and Van Roy, B., Feature&#150;based methods for large&#150;scale dynamic programming, Machine Learning, 22, 1996, pp 59&#150;94.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856874&pid=S1665-6423201100020000200008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;9&#93; De Farias, D. P. and Van Roy, B., The linear programming approach to approximate dynamic programming, Operations Research, 51 (6), 2003, pp 850&#150;865.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856876&pid=S1665-6423201100020000200009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; Bonet, B. and Geffner, H., Labeled RTDP: Improving the Convergence of Real&#150;Time Dynamic Programming, International Conference on Automated Planning and Scheduling, ICAPS, 2003, pp 12&#150;21, Trento, Italy.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856878&pid=S1665-6423201100020000200010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; Hansen, E. A. and Zilberstein, S., LAO: A Heuristic Search Algorithm that finds solutions with Loops, Artificial Intelligence, 129, 2001, pp 35&#150;62.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856880&pid=S1665-6423201100020000200011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;12&#93; Chang, H. S., Fu, M. C., Hu, J. and Marcus, S. I., An Adaptive sampling algorithm for solving MDPs, Operations Research, 53 (1), 2005, pp 126&#150;139.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856882&pid=S1665-6423201100020000200012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;13&#93; Gardiol, N. and Kaelbling, L. P., Envelope&#150;based Planning in Relational MDP's, Neural Information Processing Systems NIPS, 16, 2003, Vancouver, B. C.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856884&pid=S1665-6423201100020000200013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;14&#93; Gardiol, N., Relational Envelope&#150;based Planning, PhD Thesis, MIT, MA, USA, February 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856886&pid=S1665-6423201100020000200014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;15&#93; McMahan, H. B. and Gordon, G., Fast Exact Planning in Markov Decision Processes, 15th International Conference on Automated Planning and Scheduling (Monterey, CA, USA, 2005a).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856888&pid=S1665-6423201100020000200015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;16&#93; Dai, P. and Goldsmith, J., Topological Value Iteration Algorithm for Markov Decision Processes, 20<sup>th </sup>International Joint Conference on Artificial Intelligence, IJCAI, 2007, pp 1860&#150;1865, Hyderabad, India</font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856890&pid=S1665-6423201100020000200016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;17&#93; Dibangoye, J. S., Chaib&#150;draa, B., Mouaddib, A., A Novel   Prioritization  Technique  for  Solving Markov Decision Processes, 21<sup>st</sup> International FLAIRS Conference, Association for the Advancement of Artificial Intelligence, Florida, USA, 2008.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856891&pid=S1665-6423201100020000200017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;18&#93; Puterman, M. L., Markov Decision Processes, Wiley Interscience Editors, New York, USA, 2005.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856893&pid=S1665-6423201100020000200018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;19&#93; Russell, S., Artificial Intelligence: A Modern Approach, 2n<sup>d</sup> Edition, Making Complex Decisions (Ch&#150;17), Pearson Prentice Hill Ed., USA, 2004.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856895&pid=S1665-6423201100020000200019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;20&#93; Chang, I. and Soo, H., Simulation&#150;based algorithms for Markov decision processes, Communications and Control Engineering, Springer Verlag London Limited, 2007.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856897&pid=S1665-6423201100020000200020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;21&#93; Agrawal, S. and Roth, D., Learning a Sparse Representation for Object Detection, Proc. 7<sup>th</sup> European Conference on Computer Vision (Copenhagen, Denmark, 2002), pp. 1&#150;15.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856899&pid=S1665-6423201100020000200021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;22&#93; Kirk, W. A., Khamsi, M. A., An Introduction to Metric Spaces and Fixed Point Theory, John Wiley, New York, USA, 2001.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856901&pid=S1665-6423201100020000200022&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;23&#93; Tijms, H. C., A First Course in Stochastic Models, Wiley Ed., Discrete&#150;Time Markov Decision Processes (Ch&#150;6), UK, 2003.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856903&pid=S1665-6423201100020000200023&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;24&#93; Littman, M. L., Dean, T. L. and Kaelbling, L. P., On the Complexity of Solving Markov Decision Problems, 11<sup>th</sup> International Conference on Uncertainty in Artificial Intelligence, 1995, pp 394&#150;402, Montreal, Quebec.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856905&pid=S1665-6423201100020000200024&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;25&#93; Wingate, D. and Seppi, K. D., Prioritization Methods for Accelerating MDP Solvers, Journal of Machine Learning Research, 6, 2005, pp 851&#150;881.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856907&pid=S1665-6423201100020000200025&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;26&#93; Li, L., A Unifying Framework for Computational Reinforcement Learning Theory, PhD Thesis, The State University of New Jersey (New Brunswick, NJ, USA, October 2009).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856909&pid=S1665-6423201100020000200026&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;27&#93; Vanderbei, R. J., Optimal Sailing Strategies, Statistics and Operations Research Program, University of Princeton, USA, (<a href="http://orfe.princeton.edu/%7Ervdb/sail/sail.html" target="_blank">http://orfe.princeton.edu/&#126;rvdb/sail/sail.html</a>), 1996.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856911&pid=S1665-6423201100020000200027&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;28&#93; Reyes, A., Ibarguengoytia, P., Sucar, L. E. and Morales, E., Abstraction and Refinement for Solving Continuous Markov Decision Processes, 3<sup>rd</sup> European Workshop on Probabilistic Graphical Models, 2006, pp 263&#150;270, Prague, Czech Republic.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856913&pid=S1665-6423201100020000200028&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;29&#93; Hinderer, K. and Waldmann, K. H., The critical discount factor for Finite Markovian Decision Processes with an absorbing set, Mathematical Methods of Operations Research, Springer Verlag, 57, 2003, pp 1&#150;19.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856915&pid=S1665-6423201100020000200029&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;30&#93; Garey, M. R. and Johnson, D. S., Computers and Intractability, A Guide to the Theory of NP&#150;Completeness, Appendix A: List of NP&#150;Complete Problems, W. H. Freeman, 1990.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856917&pid=S1665-6423201100020000200030&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;31&#93; Reyes, A., Sucar, L. E., Ibarg&uuml;engoytia, P., Power Plant Operator Assistant, Bayesian Modeling Applications Workshop in the 19th Conference on Uncertainty in Artificial Intelligence UAI&#150;2003, August 2003.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4856919&pid=S1665-6423201100020000200031&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Boutilier]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Dean]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Hanks]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Decision-theoretic planning: structural assumptions and computational leverage]]></article-title>
<source><![CDATA[Journal of Artificial Intelligence Research]]></source>
<year>1999</year>
<volume>11</volume>
<page-range>1-94</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bellman]]></surname>
<given-names><![CDATA[R. E.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The theory of dynamic programming]]></article-title>
<source><![CDATA[Bull. Amer. Math. Soc.]]></source>
<year>1954</year>
<volume>60</volume>
<page-range>503-516</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Puterman]]></surname>
<given-names><![CDATA[M. L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Markov Decision Processes]]></source>
<year>1994</year>
<publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Wiley]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kuter]]></surname>
<given-names><![CDATA[U]]></given-names>
</name>
<name>
<surname><![CDATA[Hu]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Computing and Using Lower and Upper Bounds for Action Elimination in MDP Planning]]></article-title>
<source><![CDATA[Proceedings of the Symposium on Abstraction, Reformulation and Approximation]]></source>
<year>2007</year>
<publisher-name><![CDATA[SARA]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dean]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Kaelbling]]></surname>
<given-names><![CDATA[L. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Kirman]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Nicholson]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Planning under Time Constraints in Stochastic Domains]]></article-title>
<source><![CDATA[Artificial Intelligence]]></source>
<year>1995</year>
<volume>76</volume>
<numero>1-2</numero>
<issue>1-2</issue>
<page-range>35-74</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Boutilier]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Dearden]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Goldszmidt]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Stochastic Dynamic Programming with Factored Representations]]></article-title>
<source><![CDATA[Artificial Intelligence]]></source>
<year>2000</year>
<volume>121</volume>
<numero>1-2</numero>
<issue>1-2</issue>
<page-range>49-107</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Givan]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Dean]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Greig]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Equivalence Notions and Model Minimization in MDPs]]></article-title>
<source><![CDATA[Artificial Intelligence]]></source>
<year>2003</year>
<volume>147</volume><volume>1-2</volume>
<page-range>163-233</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tsitsiklis]]></surname>
<given-names><![CDATA[J. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Van Roy]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
</person-group>
<source><![CDATA[Feature-based methods for large-scale dynamic programming, Machine Learning]]></source>
<year>1996</year>
<volume>22</volume>
<page-range>59-94</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[De Farias]]></surname>
<given-names><![CDATA[D. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Van Roy]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[The linear programming approach to approximate dynamic programming]]></article-title>
<source><![CDATA[Operations Research]]></source>
<year>2003</year>
<volume>51</volume>
<numero>6</numero>
<issue>6</issue>
<page-range>850-865</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bonet]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Geffner]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<source><![CDATA[Labeled RTDP: Improving the Convergence of Real-Time Dynamic Programming]]></source>
<year></year>
<conf-name><![CDATA[ International Conference on Automated Planning and Scheduling]]></conf-name>
<conf-date>2003</conf-date>
<conf-loc> </conf-loc>
<page-range>12-21</page-range><publisher-loc><![CDATA[Trento ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hansen]]></surname>
<given-names><![CDATA[E. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Zilberstein]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[LAO: A Heuristic Search Algorithm that finds solutions with Loops]]></article-title>
<source><![CDATA[Artificial Intelligence]]></source>
<year>2001</year>
<volume>129</volume>
<page-range>35-62</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chang]]></surname>
<given-names><![CDATA[H. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Fu]]></surname>
<given-names><![CDATA[M. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Hu]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Marcus]]></surname>
<given-names><![CDATA[S. I.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An Adaptive sampling algorithm for solving MDPs]]></article-title>
<source><![CDATA[Operations Research]]></source>
<year>2005</year>
<volume>53</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>126-139</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gardiol]]></surname>
<given-names><![CDATA[N]]></given-names>
</name>
<name>
<surname><![CDATA[Kaelbling]]></surname>
<given-names><![CDATA[L. P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Envelope-based Planning in Relational MDP's]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gardiol]]></surname>
<given-names><![CDATA[N]]></given-names>
</name>
</person-group>
<source><![CDATA[Relational Envelope-based Planning]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[McMahan]]></surname>
<given-names><![CDATA[H. B.]]></given-names>
</name>
<name>
<surname><![CDATA[Gordon]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Fast Exact Planning in Markov Decision]]></article-title>
<source><![CDATA[Processes, 15th International Conference on Automated Planning and Scheduling]]></source>
<year>2005</year>
<publisher-loc><![CDATA[Monterey^eCA CA]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dai]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Goldsmith]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<source><![CDATA[Topological Value Iteration Algorithm for Markov Decision Processes]]></source>
<year></year>
<conf-name><![CDATA[20 International Joint Conference on Artificial Intelligence]]></conf-name>
<conf-date>2007</conf-date>
<conf-loc> </conf-loc>
<page-range>1860-1865</page-range><publisher-loc><![CDATA[Hyderabad ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dibangoye]]></surname>
<given-names><![CDATA[J. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Chaib-draa]]></surname>
<given-names><![CDATA[B]]></given-names>
</name>
<name>
<surname><![CDATA[Mouaddib]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A Novel Prioritization Technique for Solving Markov Decision Processes]]></article-title>
<source><![CDATA[]]></source>
<year></year>
<conf-name><![CDATA[21 International FLAIRS Conference]]></conf-name>
<conf-date>2008</conf-date>
<conf-loc>Florida Florida</conf-loc>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Puterman]]></surname>
<given-names><![CDATA[M. L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Markov Decision Processes]]></source>
<year>2005</year>
<publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Wiley Interscience]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Russell]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<source><![CDATA[Artificial Intelligence: A Modern Approach]]></source>
<year>2004</year>
<edition>2</edition>
<publisher-name><![CDATA[Pearson Prentice Hill]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chang]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[Soo]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<source><![CDATA[Simulation-based algorithms for Markov decision processes, Communications and Control Engineering]]></source>
<year>2007</year>
<publisher-name><![CDATA[Springer Verlag London Limited]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Agrawal]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Roth]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
</person-group>
<source><![CDATA[Learning a Sparse Representation for Object Detection]]></source>
<year></year>
<conf-name><![CDATA[ Proc. 7th European Conference on Computer Vision]]></conf-name>
<conf-date>2002</conf-date>
<conf-loc>Copenhagen </conf-loc>
<page-range>1-15</page-range></nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kirk]]></surname>
<given-names><![CDATA[W. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Khamsi]]></surname>
<given-names><![CDATA[M. A.]]></given-names>
</name>
</person-group>
<source><![CDATA[An Introduction to Metric Spaces and Fixed Point Theory]]></source>
<year>2001</year>
<publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[John Wiley]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tijms]]></surname>
<given-names><![CDATA[H. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[A First Course in Stochastic Models]]></source>
<year>2003</year>
<publisher-name><![CDATA[WileDiscrete-Time Markov Decision Processes (Ch-6)]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Littman]]></surname>
<given-names><![CDATA[M. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Dean]]></surname>
<given-names><![CDATA[T. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Kaelbling]]></surname>
<given-names><![CDATA[L. P.]]></given-names>
</name>
</person-group>
<source><![CDATA[On the Complexity of Solving Markov Decision Problems]]></source>
<year></year>
<conf-name><![CDATA[11 International Conference on Uncertainty in Artificial Intelligence]]></conf-name>
<conf-date>1995</conf-date>
<conf-loc> </conf-loc>
<page-range>394-402</page-range><publisher-loc><![CDATA[MontrealQuebec ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wingate]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Seppi]]></surname>
<given-names><![CDATA[K. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Prioritization Methods for Accelerating MDP Solvers]]></article-title>
<source><![CDATA[Journal of Machine Learning Research]]></source>
<year>2005</year>
<volume>6</volume>
<page-range>851-881</page-range></nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
</person-group>
<source><![CDATA[A Unifying Framework for Computational Reinforcement Learning Theory]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vanderbei]]></surname>
<given-names><![CDATA[R. J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Optimal Sailing Strategies, Statistics and Operations Research Program]]></source>
<year>1996</year>
<publisher-name><![CDATA[University of Princeton]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B28">
<label>28</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Reyes]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Ibarguengoytia]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Sucar]]></surname>
<given-names><![CDATA[L. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Morales]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
</person-group>
<source><![CDATA[Abstraction and Refinement for Solving Continuous Markov Decision Processes]]></source>
<year></year>
<conf-name><![CDATA[3 European Workshop on Probabilistic Graphical Models]]></conf-name>
<conf-date>2006</conf-date>
<conf-loc> </conf-loc>
<page-range>263-270</page-range><publisher-loc><![CDATA[Prague ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B29">
<label>29</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hinderer]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
<name>
<surname><![CDATA[Waldmann]]></surname>
<given-names><![CDATA[K. H.]]></given-names>
</name>
</person-group>
<source><![CDATA[The critical discount factor for Finite Markovian Decision Processes with an absorbing set, Mathematical Methods of Operations Research]]></source>
<year>2003</year>
<volume>57</volume>
<page-range>1-19</page-range><publisher-name><![CDATA[Springer Verlag]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B30">
<label>30</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Garey]]></surname>
<given-names><![CDATA[M. R.]]></given-names>
</name>
<name>
<surname><![CDATA[Johnson]]></surname>
<given-names><![CDATA[D. S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Computers and Intractability, A Guide to the Theory of NP-Completeness, Appendix A: List of NP-Complete Problems]]></source>
<year>1990</year>
<publisher-name><![CDATA[W. H. Freeman]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B31">
<label>31</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Reyes]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Sucar]]></surname>
<given-names><![CDATA[L. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Ibargüengoytia]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
</person-group>
<source><![CDATA[ower Plant Operator Assistant]]></source>
<year></year>
<conf-name><![CDATA[19th Conference on Uncertainty in Artificial Intelligence]]></conf-name>
<conf-date>2003</conf-date>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
