<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462011000400006</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Recognition-free Retrieval of Old Arabic Document Images]]></article-title>
<article-title xml:lang="es"><![CDATA[Recuperación de documentos árabes antiguos a partir de imágenes sin usar reconocimiento de caracteres]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Sari]]></surname>
<given-names><![CDATA[Toufik]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Kefali]]></surname>
<given-names><![CDATA[Abderrahmane]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,University Badji Mokhtar Laboratoire de Gestion Electronique de Documents ]]></institution>
<addr-line><![CDATA[Annaba ]]></addr-line>
<country>Algeria</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2011</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2011</year>
</pub-date>
<volume>15</volume>
<numero>2</numero>
<fpage>195</fpage>
<lpage>208</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462011000400006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462011000400006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462011000400006&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Searching of old document images is a relevant issue today. In this paper, we tackle the problem of old Arabic document images retrieval which form a good part of our heritage and possess an inestimable scientific and cultural richness. We propose an approach for indexing and searching degraded document images without recognizing the textual patterns in order to avoid the high cost and the difficult effort of the optical character recognition (OCR). Our basic idea consists in casting the problem of document images retrieval from the field of document analysis to the field of information retrieval. Thus, we can combine symbolic notation and semic representation and exploit techniques from the two fields, in particular, the techniques of suffix trees and approximate string matching. Each document of the collection is assigned an ASCII file of word codes. Words are represented by their topological features, namely, ascenders, descenders, etc. So, instead of searching in the image, we look for word codes in the corresponding file code. The tests performed on two types of documents, Arabic historical documents and Algerian postal envelopes, have showed good performance of the proposed approach.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[La búsqueda en imágenes de documentos antiguos es en la actualidad un tema relevante. En este artículo abordamos el problema de recuperación de documentos árabes antiguos a partir de imágenes sin usar el reconocimiento de caracteres (OCR). Dichos documentos forman una buena parte de nuestra herencia y poseen una riqueza científica y cultural invaluable. Nosotros proponemos un enfoque para indexar y buscar imágenes degradadas de documentos sin recurrir al reconocimiento de patrones textuales para así evitar el esfuerzo considerable y el alto costo que conlleva el OCR. La idea básica consiste en migrar el problema de la recuperación de estos documentos, desde el campo del análisis de documentos hacia el campo de la recuperación de información. Así, podemos combinar la notación simbólica y la representación sémica y explotar las técnicas que provienen de ambos campos de investigación, particularmente, las técnicas de árboles de sufijos y búsqueda aproximada de cadenas. A cada documento de la colección se le asigna un archivo en ASCII con códigos de palabras. Las palabras son representadas por sus características topológicas; ej. ascendientes, descendientes, etc. De esta forma, en vez de buscar en la imagen, nosotros buscamos en los códigos de palabra dentro del archivo de códigos correspondiente. Las pruebas se realizan en dos tipos de documentos: documentos históricos árabes y sobres postales argelinos. El enfoque propuesto muestra un buen rendimiento.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Document retrieval]]></kwd>
<kwd lng="en"><![CDATA[Arabic handwriting recognition]]></kwd>
<kwd lng="en"><![CDATA[approximate string matching]]></kwd>
<kwd lng="en"><![CDATA[document analysis]]></kwd>
<kwd lng="es"><![CDATA[Recuperación de documentos]]></kwd>
<kwd lng="es"><![CDATA[reconocimiento de manuscrito árabe]]></kwd>
<kwd lng="es"><![CDATA[búsqueda aproximada de cadenas]]></kwd>
<kwd lng="es"><![CDATA[análisis de documento]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="justify"><font face="verdana" size="4">Art&iacute;culos</font></p>     <p align="justify"><font face="verdana" size="4">&nbsp;</font></p>     <p align="center"><font face="verdana" size="4"><b>Recognition&#150;free Retrieval of Old Arabic Document Images</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="3"><b>Recuperaci&oacute;n de documentos &aacute;rabes antiguos a partir de im&aacute;genes sin usar reconocimiento de caracteres</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="2"><b>Toufik Sari and Abderrahmane Kefali</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><i>Laboratoire de Gestion Electronique de Documents (LabGED), University Badji Mokhtar, Annaba, Algeria. E&#150;mail:</i> <a href="mailto:sari@labged.net">sari@labged.net</a>, <a href="mailto:kefali@labged.net">kefali@labged.net</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">Article received on 11/15/2010.    <br> Accepted 05/06/2011.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>     <p align="justify"><font face="verdana" size="2">Searching of old document images is a relevant issue today. In this paper, we tackle the problem of old Arabic document images retrieval which form a good part of our heritage and possess an inestimable scientific and cultural richness. We propose an approach for indexing and searching degraded document images without recognizing the textual patterns in order to avoid the high cost and the difficult effort of the optical character recognition (OCR). Our basic idea consists in casting the problem of document images retrieval from the field of document analysis to the field of information retrieval. Thus, we can combine symbolic notation and semic representation and exploit techniques from the two fields, in particular, the techniques of suffix trees and approximate string matching. Each document of the collection is assigned an ASCII file of word codes. Words are represented by their topological features, namely, ascenders, descenders, etc. So, instead of searching in the image, we look for word codes in the corresponding file code. The tests performed on two types of documents, Arabic historical documents and Algerian postal envelopes, have showed good performance of the proposed approach.</font></p>     <p align="justify"><font face="verdana" size="2"><b>Keywords: </b>Document retrieval, Arabic handwriting recognition, approximate string matching, document analysis.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Resumen</b></font></p>     <p align="justify"><font face="verdana" size="2">La b&uacute;squeda en im&aacute;genes de documentos antiguos es en la actualidad un tema relevante. En este art&iacute;culo abordamos el problema de recuperaci&oacute;n de documentos &aacute;rabes antiguos a partir de im&aacute;genes sin usar el reconocimiento de caracteres (OCR). Dichos documentos forman una buena parte de nuestra herencia y poseen una riqueza cient&iacute;fica y cultural invaluable. Nosotros proponemos un enfoque para indexar y buscar im&aacute;genes degradadas de documentos sin recurrir al reconocimiento de patrones textuales </font><font face="verdana" size="2">para as&iacute; evitar el esfuerzo considerable y el alto costo que conlleva el OCR. La idea b&aacute;sica consiste en migrar el problema de la recuperaci&oacute;n de estos documentos, desde el campo del an&aacute;lisis de documentos hacia el campo de la recuperaci&oacute;n de informaci&oacute;n. As&iacute;, podemos combinar la notaci&oacute;n simb&oacute;lica y la representaci&oacute;n s&eacute;mica y explotar las t&eacute;cnicas que provienen de ambos campos de investigaci&oacute;n, particularmente, las t&eacute;cnicas de &aacute;rboles de sufijos y b&uacute;squeda aproximada de cadenas. A cada documento de la colecci&oacute;n se le asigna un archivo en ASCII con c&oacute;digos de palabras. Las palabras son representadas por sus caracter&iacute;sticas topol&oacute;gicas; ej. ascendientes, descendientes, etc. De esta forma, en vez de buscar en la imagen, nosotros buscamos en los c&oacute;digos de palabra dentro del archivo de c&oacute;digos correspondiente. Las pruebas se realizan en dos tipos de documentos: documentos hist&oacute;ricos &aacute;rabes y sobres postales argelinos. El enfoque propuesto muestra un buen rendimiento.</font></p>     <p align="justify"><font face="verdana" size="2"><b>Palabras clave: </b>Recuperaci&oacute;n de documentos, reconocimiento de manuscrito &aacute;rabe, b&uacute;squeda aproximada de cadenas, an&aacute;lisis de documento.</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><a href="/pdf/cys/v15n2/v15n2a6.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>1. Adamek, T., O'Connor, N.E. &amp; Smeaton, A.F. (2007). </b>Word matching using single closed contours for indexing handwritten historical documents, <i>IJDAR, </i>9, 153&#150;161.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054690&pid=S1405-5546201100040000600001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>2. Bai, S., Li, L. &amp; Tam, C.L. (2009). </b>Keyword Spotting in Document Images through Word Shape Coding. <i>10<sup>th</sup> International Conference on Document Analysis and Recognition ICDAR, </i>Barcelona, Spain.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054692&pid=S1405-5546201100040000600002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>3. Baird, H.S. (2004). </b>Difficult and Urgent Open Problems in Document Image Analysis for Libraries. <i>Third International Workshop on Document Image Analysis for Libraries DIAL.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054694&pid=S1405-5546201100040000600003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>4. Boyer, R.S &amp; Moore, J.S. (1977). </b>A fast string searching algorithm. <i>Communications of the ACM, </i>20(10), 762&#150;772.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054696&pid=S1405-5546201100040000600004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>5. Camillerapp, J., Pasquer, L. &amp; Co&uuml;asnon, B. (2004). </b>Indexation automatique de formulaires anciens par reconnaissance du patronyme manuscrit. <i>Reconnaissance des Formes et Intelligence Artificielle RFIA, </i>Toulouse, France, 1493&#150;1502.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054698&pid=S1405-5546201100040000600005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>6. Chen, F. &amp; Bloomberg, D. (1998). </b>Summarization of imaged documents without OCR. <i>Computer Vision and Image understanding, </i>70(3).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054700&pid=S1405-5546201100040000600006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>7. Kefali, A., Sari, T. &amp; Sellami, M, (2009). </b>Impl&eacute;mentation de plusieurs techniques de seuillage d'images de documents arabes anciens, <i>5<sup>th</sup> International Symposium Images Multim&eacute;dias Applications Graphiques et Environnements IMAGE, </i>Biskra, Algeria, 123&#150;134.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054702&pid=S1405-5546201100040000600007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>8. Khurshid, K., Faure, C. &amp; Vincent, N. (2008). </b>Recherche de mots dans des images de documents par appariement de caract&egrave;res. <i>10<sup>th</sup> Colloque International Francophone sur l'&Eacute;crit ET le Document CIFED.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054704&pid=S1405-5546201100040000600008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>9. Khurshid, K., Siddiqi, A., Faure, C. &amp; Vincent, N. (2009). </b>Comparison of Niblack inspired Binarization methods for ancient documents. <i>16<sup>th</sup> Document Recognition and Retrieval Conference DRR, </i>USA.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054706&pid=S1405-5546201100040000600009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>10. Knuth, D.E., Morris, J.H. &amp; Pratt, V.R. (1974). </b><i>Fast pattern matching in strings. </i>TR CS&#150;74&#150;440, Stanford University, ford, California.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054708&pid=S1405-5546201100040000600010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>11. Leedham, G., Varma, S., Patankar A. &amp; Govindaraju,  V.  (2002).  </b>Separating  Text and Background in Degraded Documents Images &#150; A Comparison of Global Thresholding Techniques <i>for Multi&#150;Stage Thresholding, Proc. Eighth IWFHR, </i>Niagara&#150;on&#150;the&#150;Lake, 244&#150;249.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054710&pid=S1405-5546201100040000600011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>12. Mahmoud, A.S. (1994). </b>Arabic Character Recognition Using Fourier Descriptors and Character Contour Encoding. <i>Pattern Recognition, </i>27(6), 815&#150;824.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054712&pid=S1405-5546201100040000600012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>13. Manmatha, R., Han, C. &amp; Risemen, E. (1996). </b>Word spotting: a new approach to indexing handwriting. <i>IEEE Conference on Computer Vision and Pattern Recognition CVPR </i>96, 631&#150;637, 1996.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054714&pid=S1405-5546201100040000600013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>14. McCreight, E.M. (1976). </b>A Space&#150;Economical Suffix Tree Construction Algorithm. <i>Journal ACM, </i>23(2), 262&#150;272.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054716&pid=S1405-5546201100040000600014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>15. Mitra, M. &amp; Chaudhuri, B.B. (2000). </b>Information Retrieval from Documents: A Survey. <i>Information Retrieval, </i>Kluwer Academic Publishers, 2, 141&#150;163.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054718&pid=S1405-5546201100040000600015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>16. Navarro, G. (2001). </b>A guided tour to approximate string matching. <i>ACM Computing Surveys, </i>33(1), 31&#150;88.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054720&pid=S1405-5546201100040000600016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>17. Plamondon, R. &amp; Srihari, S.N. (2000). </b>On&#150;line and off&#150;line handwriting recognition: A comprehensive survey. <i>IEEE Transactions on Pattern Analysis and Machine Intelligence, </i>22(1), 63&#150;84.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054722&pid=S1405-5546201100040000600017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>18. Pujari, A.K., Naidu, C.D. &amp; Jinaga, B.C. (2002). </b>An adaptive character recogniser for Telugu scripts using multiresolution analysis and associative memory. <i>3<sup>rd</sup> Indian Conference on Computer Vision, Graphics and Image Processing ICVGIP, </i>Ahmadabad, India.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054724&pid=S1405-5546201100040000600018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>19. Ramel, J.Y. (2007). </b>User driven page layout analysis of historical printed books. <i>International Journal of Document Analysis and Recognition IJDAR, </i>05&#150;21.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054726&pid=S1405-5546201100040000600019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>20. Rath, T.M. &amp; Manmatha, R. (2003). </b>Features for Word Spotting in Historical Manuscripts. <i>Seventh International Conference on Document Analysis and Recognition ICDAR.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054728&pid=S1405-5546201100040000600020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>21. Rath, T.M. &amp; Manmatha, R. (2007). </b>Word Spotting for historical documents. <i>International Journal of Document Analysis and Recognition, </i>9, pp. 139&#150;152.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054730&pid=S1405-5546201100040000600021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>22. Sari, T. &amp; Sellami, M. (2007). </b>State of the art of Offline Arabic Handwriting Segmentation, <i>International Journal of Computer Processing of Oriental Languages.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054732&pid=S1405-5546201100040000600022&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></i></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>23. Sari, T. &amp; Kefali, A. (2008V </b>A search engine for Arabic documents. Proc. <i>10<sup>th</sup> Colloque International Francophone sur l'&Eacute;crit et le Document CIFED, </i>Rouen, France, 97&#150;102.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054734&pid=S1405-5546201100040000600023&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>24. Smeaton, A.F. &amp; Spitz, A. (1997). </b>Using character shape  coding  for information  retrieval, <i>Fourth 208 Toufik Sari and Abderrahmane Kefali International Conference on Document Analysis and Recognition </i>97, IEEE Computer Society Press, 974&#150;978.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054736&pid=S1405-5546201100040000600024&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>25. Spitz, A. (1995). </b>Using character shape codes for word spotting in document images. Dori, D. &amp; Bruckstein A. (Eds.), Shape, <i>Structure and Pattern Recognition, World Scientific </i>95, Singapore, 382&#150;389.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054738&pid=S1405-5546201100040000600025&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>26. Ukkonen, E. (1985). </b>Finding approximate patterns in strings. <i>Journal of Algorithms, </i>6, 132&#150;137.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054740&pid=S1405-5546201100040000600026&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>27. Weiner, P. (1973). </b>Linear pattern matching algorithm. 14<sup>th</sup> <i>IEEE Symposium on Switching and Automata Theory </i>1973, 1&#150;11.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054742&pid=S1405-5546201100040000600027&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2"><b>28. Winkler, W.E. (1999). </b><i>The state of record linkage and current research problems. </i>Technical report, Statistics of Income Division, Internal Revenue Service Publication R99/04.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2054744&pid=S1405-5546201100040000600028&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     ]]></body>
<body><![CDATA[ ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Adamek]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[O'Connor]]></surname>
<given-names><![CDATA[N.E.]]></given-names>
</name>
<name>
<surname><![CDATA[Smeaton]]></surname>
<given-names><![CDATA[A.F.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Word matching using single closed contours for indexing handwritten historical documents]]></article-title>
<source><![CDATA[IJDAR]]></source>
<year>2007</year>
<volume>9</volume>
<page-range>153-161</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bai]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Tam]]></surname>
<given-names><![CDATA[C.L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Keyword Spotting in Document Images through Word Shape Coding]]></article-title>
<source><![CDATA[]]></source>
<year>2009</year>
<conf-name><![CDATA[1 International Conference on Document Analysis and Recognition ICDAR]]></conf-name>
<conf-loc>Barcelona </conf-loc>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Baird]]></surname>
<given-names><![CDATA[H.S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Difficult and Urgent Open Problems in Document Image Analysis for Libraries]]></article-title>
<source><![CDATA[]]></source>
<year>2004</year>
<conf-name><![CDATA[Third International Workshop on Document Image Analysis for Libraries DIAL]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Boyer]]></surname>
<given-names><![CDATA[R.S]]></given-names>
</name>
<name>
<surname><![CDATA[Moore]]></surname>
<given-names><![CDATA[J.S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A fast string searching algorithm]]></article-title>
<source><![CDATA[Communications of the ACM]]></source>
<year>1977</year>
<volume>20</volume>
<numero>10</numero>
<issue>10</issue>
<page-range>762-772</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Camillerapp]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Pasquer]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Coüasnon]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Indexation automatique de formulaires anciens par reconnaissance du patronyme manuscrit]]></article-title>
<source><![CDATA[]]></source>
<year>2004</year>
<conf-name><![CDATA[ Reconnaissance des Formes et Intelligence Artificielle RFIA]]></conf-name>
<conf-loc>Toulouse </conf-loc>
<page-range>1493-1502</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Bloomberg]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Summarization of imaged documents without OCR]]></article-title>
<source><![CDATA[Computer Vision and Image understanding]]></source>
<year>1998</year>
<volume>70</volume>
<numero>3</numero>
<issue>3</issue>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kefali]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Sari]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Sellami]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
</person-group>
<article-title xml:lang="fr"><![CDATA[Implémentation de plusieurs techniques de seuillage d'images de documents arabes anciens]]></article-title>
<source><![CDATA[]]></source>
<year>2009</year>
<conf-name><![CDATA[5 International Symposium Images Multimédias Applications Graphiques et Environnements IMAGE]]></conf-name>
<conf-loc>Biskra </conf-loc>
<page-range>123-134</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Khurshid]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Faure]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Vincent]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<article-title xml:lang="fr"><![CDATA[Recherche de mots dans des images de documents par appariement de caractères]]></article-title>
<source><![CDATA[]]></source>
<year>2008</year>
<conf-name><![CDATA[10 Colloque International Francophone sur l'Écrit ET le Document CIFED]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Khurshid]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Siddiqi]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Faure]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Vincent]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Comparison of Niblack inspired Binarization methods for ancient documents]]></article-title>
<source><![CDATA[]]></source>
<year>2009</year>
<conf-name><![CDATA[16 Document Recognition and Retrieval Conference DRR]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Knuth]]></surname>
<given-names><![CDATA[D.E.]]></given-names>
</name>
<name>
<surname><![CDATA[Morris]]></surname>
<given-names><![CDATA[J.H.]]></given-names>
</name>
<name>
<surname><![CDATA[Pratt]]></surname>
<given-names><![CDATA[V.R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Fast pattern matching in strings]]></source>
<year>1974</year>
<publisher-loc><![CDATA[ford^eCalifornia California]]></publisher-loc>
<publisher-name><![CDATA[Stanford University]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Leedham]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Varma]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Patankar]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Govindaraju]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Separating Text and Background in Degraded Documents Images - A Comparison of Global Thresholding Techniques for Multi-Stage Thresholding]]></article-title>
<source><![CDATA[Proc. Eighth IWFHR, Niagara-on-the-Lake]]></source>
<year>2002</year>
<page-range>244-249</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mahmoud]]></surname>
<given-names><![CDATA[A.S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Arabic Character Recognition Using Fourier Descriptors and Character Contour Encoding]]></article-title>
<source><![CDATA[Pattern Recognition]]></source>
<year>1994</year>
<volume>27</volume>
<numero>6</numero>
<issue>6</issue>
<page-range>815-824</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Manmatha]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Han]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Risemen]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Word spotting: a new approach to indexing handwriting]]></article-title>
<source><![CDATA[]]></source>
<year>1996</year>
<conf-name><![CDATA[ IEEE Conference on Computer Vision and Pattern Recognition]]></conf-name>
<conf-date>96</conf-date>
<conf-loc> </conf-loc>
<page-range>631-637</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[McCreight]]></surname>
<given-names><![CDATA[E.M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A Space-Economical Suffix Tree Construction Algorithm]]></article-title>
<source><![CDATA[Journal ACM]]></source>
<year>1976</year>
<volume>23</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>262-272</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mitra]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Chaudhuri]]></surname>
<given-names><![CDATA[B.B.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Information Retrieval from Documents: A Survey]]></article-title>
<source><![CDATA[Information Retrieval]]></source>
<year>2000</year>
<volume>2</volume>
<page-range>141-163</page-range><publisher-name><![CDATA[Kluwer Academic Publishers]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Navarro]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A guided tour to approximate string matching]]></article-title>
<source><![CDATA[ACM Computing Surveys]]></source>
<year>2001</year>
<volume>33</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>31-88</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Plamondon]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Srihari]]></surname>
<given-names><![CDATA[S.N.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[On-line and off-line handwriting recognition: A comprehensive survey]]></article-title>
<source><![CDATA[IEEE Transactions on Pattern Analysis and Machine Intelligence]]></source>
<year>2000</year>
<volume>22</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>63-84</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pujari]]></surname>
<given-names><![CDATA[A.K.]]></given-names>
</name>
<name>
<surname><![CDATA[Naidu]]></surname>
<given-names><![CDATA[C.D.]]></given-names>
</name>
<name>
<surname><![CDATA[Jinaga]]></surname>
<given-names><![CDATA[B.C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An adaptive character recogniser for Telugu scripts using multiresolution analysis and associative memory]]></article-title>
<source><![CDATA[]]></source>
<year>2002</year>
<conf-name><![CDATA[3 Indian Conference on Computer Vision, Graphics and Image Processing ICVGIP]]></conf-name>
<conf-loc>Ahmadabad </conf-loc>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ramel]]></surname>
<given-names><![CDATA[J.Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[User driven page layout analysis of historical printed books]]></article-title>
<source><![CDATA[International Journal of Document Analysis and Recognition]]></source>
<year>2007</year>
<page-range>05-21</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rath]]></surname>
<given-names><![CDATA[T.M.]]></given-names>
</name>
<name>
<surname><![CDATA[Manmatha]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Features for Word Spotting in Historical Manuscripts]]></article-title>
<source><![CDATA[]]></source>
<year>2003</year>
<conf-name><![CDATA[Seventh International Conference on Document Analysis and Recognition ICDAR]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rath]]></surname>
<given-names><![CDATA[T.M.]]></given-names>
</name>
<name>
<surname><![CDATA[Manmatha]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Word Spotting for historical documents]]></article-title>
<source><![CDATA[International Journal of Document Analysis and Recognition]]></source>
<year>2007</year>
<volume>9</volume>
<page-range>139-152</page-range></nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sari]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Sellami]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[State of the art of Offline Arabic Handwriting Segmentation]]></article-title>
<source><![CDATA[International Journal of Computer Processing of Oriental Languages]]></source>
<year>2007</year>
</nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sari]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Kefali]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[2008V A search engine for Arabic documents]]></article-title>
<source><![CDATA[Proc. 10th Colloque International Francophone sur l'Écrit et le Document CIFED]]></source>
<year></year>
<page-range>97-102</page-range><publisher-loc><![CDATA[Rouen ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Smeaton]]></surname>
<given-names><![CDATA[A.F.]]></given-names>
</name>
<name>
<surname><![CDATA[Spitz]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Using character shape coding for information retrieval]]></article-title>
<source><![CDATA[]]></source>
<year>1997</year>
<conf-name><![CDATA[Fourth Toufik Sari and Abderrahmane Kefali International Conference on Document Analysis and Recognition]]></conf-name>
<conf-date>97</conf-date>
<conf-loc> </conf-loc>
<page-range>974-978</page-range><publisher-name><![CDATA[IEEE Computer Society Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Spitz]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Using character shape codes for word spotting in document images]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Dori]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Bruckstein]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Shape, Structure and Pattern Recognition]]></source>
<year>1995</year>
<page-range>382-389</page-range><publisher-loc><![CDATA[Singapore ]]></publisher-loc>
<publisher-name><![CDATA[World Scientific 95]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ukkonen]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Finding approximate patterns in strings]]></article-title>
<source><![CDATA[Journal of Algorithms]]></source>
<year>1985</year>
<volume>6</volume>
<page-range>132-137</page-range></nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Weiner]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Linear pattern matching algorithm]]></article-title>
<source><![CDATA[]]></source>
<year>1973</year>
<conf-name><![CDATA[14 IEEE Symposium on Switching and Automata Theory]]></conf-name>
<conf-date>1973</conf-date>
<conf-loc> </conf-loc>
<page-range>1-11</page-range></nlm-citation>
</ref>
<ref id="B28">
<label>28</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Winkler]]></surname>
<given-names><![CDATA[W.E.]]></given-names>
</name>
</person-group>
<source><![CDATA[The state of record linkage and current research problems]]></source>
<year>1999</year>
<publisher-name><![CDATA[Statistics of Income Division, Internal Revenue Service Publication R99/04]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
