<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1665-6423</journal-id>
<journal-title><![CDATA[Journal of applied research and technology]]></journal-title>
<abbrev-journal-title><![CDATA[J. appl. res. technol]]></abbrev-journal-title>
<issn>1665-6423</issn>
<publisher>
<publisher-name><![CDATA[Universidad Nacional Autónoma de México, Instituto de Ciencias Aplicadas y Tecnología]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1665-64232010000100004</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[A pattern recognition based esophageal speech enhancement system]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Mantilla-Caeiros]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Nakano-Miyatake]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Perez-Meana]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Instituto Tecnológico y de Estudios Superiores de Monterrey Campus Ciudad de México ]]></institution>
<addr-line><![CDATA[Mexico ]]></addr-line>
</aff>
<aff id="A02">
<institution><![CDATA[,Instituto Politécnico Nacional (IPN)  ]]></institution>
<addr-line><![CDATA[Mexico ]]></addr-line>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>04</month>
<year>2010</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>04</month>
<year>2010</year>
</pub-date>
<volume>8</volume>
<numero>1</numero>
<fpage>56</fpage>
<lpage>70</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1665-64232010000100004&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1665-64232010000100004&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1665-64232010000100004&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[A system for improving the intelligibility and quality of alaryngeal speech based on the replacement of voiced segments of alaryngeal speech with the equivalent segments of normal speech is proposed. To this end, the system proposed identifies the voiced segments of the alaryngeal speech signal by using isolate speech recognition methods, and replaces them by their equivalent voiced segments of normal speech, keeping the silence and unvoiced segments without change. Evaluation results using objective and subjective evaluation methods show that the proposed system proposed provides a fairly good improvement of the quality and intelligibility of alaryngeal speech signals.]]></p></abstract>
<abstract abstract-type="short" xml:lang="es"><p><![CDATA[Este artículo propone un sistema para mejorar la calidad e inteligibilidad de la voz de personas laringetomizadas, el cual se basa en el reemplazo de segmentos vocalizados de voz laringetomizada por segmentos equivalentes de voz normal. Con esta finalidad el sistema identifica los segmentos vocalizados de voz laringetomizada usando técnicas de reconocimiento de comandos aislados de voz, y las reemplaza por los segmentos equivalentes de voz normal, conservando sin cambio los segmentos y los no-vocalizados. Resultados obtenidos usando métodos de evaluación tanto subjetivos como objetivos muestran que el sistema propuesto proporciona una mejoría importante tanto en la calidad como en la inteligibilidad de señales de voz laringetomizada.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Speech enhancement]]></kwd>
<kwd lng="en"><![CDATA[esophageal speech]]></kwd>
<kwd lng="en"><![CDATA[electronic larynx]]></kwd>
<kwd lng="en"><![CDATA[multilayer perceptron]]></kwd>
<kwd lng="en"><![CDATA[voiced and unvoiced segments detection]]></kwd>
<kwd lng="en"><![CDATA[speech synthesis]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="center"><font face="verdana" size="4"><b>A pattern recognition based esophageal speech enhancement system</b></font></p> 	    <p align="center"><font face="verdana" size="2">&nbsp;</font></p> 	    <p align="center"><font face="verdana" size="2"><b>A. Mantilla&#150;Caeiros<sup>1</sup>, M. Nakano&#150;Miyatake<sup>2</sup>, H. Perez&#150;Meana<sup>*2</sup></b></font></p> 	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p> 	    <p align="justify"><font face="verdana" size="2"><i><sup>1</sup> Instituto Tecnol&oacute;gico y de Estudios Superiores de Monterrey, Campus Ciudad de M&eacute;xico Calle del Puente 222, Ejidos de Huipulco, Tlalpan 14380 Mexico City.</i></font></p> 	    <p align="justify"><font face="verdana" size="2"><i><sup>2 </sup>ESIME Culhuac&aacute;n, Instituto Polit&eacute;cnico Nacional Av. Santa Ana 1000, Col, San Francisco Culhuac&aacute;n, 04430 Mexico City. *Email <a href="mailto:hmperezm@ipn.mx">hmperezm@ipn.mx</a></i></font></p> 	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p> 	    <p align="justify"><font face="verdana" size="2"><b>ABSTRACT</b></font></p> 	    <p align="justify"><font face="verdana" size="2">A system for improving the intelligibility and quality of alaryngeal speech based on the replacement of voiced segments of alaryngeal speech with the equivalent segments of normal speech is proposed. To this end, the system proposed identifies the voiced segments of the alaryngeal speech signal by using isolate speech recognition methods, and replaces them by their equivalent voiced segments of normal speech, keeping the silence and unvoiced segments without change. Evaluation results using objective and subjective evaluation methods show that the proposed system proposed provides a fairly good improvement of the quality and intelligibility of alaryngeal speech signals.</font></p> 	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b>Keywords:</b> Speech enhancement, esophageal speech, electronic larynx, multilayer perceptron, voiced and unvoiced segments detection, speech synthesis.</font></p> 	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p> 	    <p align="justify"><font face="verdana" size="2"><b>RESUMEN</b></font></p> 	    <p align="justify"><font face="verdana" size="2">Este art&iacute;culo propone un sistema para mejorar la calidad e inteligibilidad de la voz de personas laringetomizadas, el cual se basa en el reemplazo de segmentos vocalizados de voz laringetomizada por segmentos equivalentes de voz normal. Con esta finalidad el sistema identifica los segmentos vocalizados de voz laringetomizada usando t&eacute;cnicas de reconocimiento de comandos aislados de voz, y las reemplaza por los segmentos equivalentes de voz normal, conservando sin cambio los segmentos y los no&#150;vocalizados. Resultados obtenidos usando m&eacute;todos de evaluaci&oacute;n tanto subjetivos como objetivos muestran que el sistema propuesto proporciona una mejor&iacute;a importante tanto en la calidad como en la inteligibilidad de se&ntilde;ales de voz laringetomizada.</font></p> 	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p> 	    <p align="justify"><font face="verdana" size="2"><a href="/pdf/jart/v8n1/v8n1a4.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p> 	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p> 	    <p align="justify"><font face="verdana" size="2"><b><i>Acknowledgments</i></b></font></p> 	    <p align="justify"><font face="verdana" size="2">We thank the Consejo Nacional de Ciencia y Tecnolog&iacute;a (CONACyT) for the support provided during the realization of this research. Also, we would like to thank Dr. Xochiquetzal Hernandez from the Instituto de la Comunicaci&oacute;n Humana of the Centro Nacional de la Rehabilitaci&oacute;n of Mexico for her assistance during the subjective system evaluation.</font></p> 	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p> 	    ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2"><b><i>References</i></b></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; Barney H., Hawork H. &amp; Dunn F., An experimental transitorized artifcial larynx,.Bell System Technical Journal, Vol. 38, 1959, pp. 1337&#150;1356.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848386&pid=S1665-6423201000010000400001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; Aguilar G., Nakano&#150;Miyatake M. &amp; Perez&#150;Meana H., Alaryngeal Speech Enhancement Using Pattern Recognition Techniques, IEICE Trans. Inf. &amp; Syst. Vol. E88&#150;D, No. 7, 2005, pp. 1618&#150;1622.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848388&pid=S1665-6423201000010000400002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;3&#93; Espy&#150;Wilson, C., Chari V. &amp; Huang C., Enhancement of alaryngeal speech by adaptive filtering, Technical report, Boston University, Boston, MA, 2000.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848390&pid=S1665-6423201000010000400003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;4&#93; Becerril H., Nakano&#150;Miyatake M. &amp; Perez&#150;Meana H., Development of an adaptive system for voice enhancement in persons with artificial larynx using DSP, Cientifica, Vol. 8, No. 2, April 2004, pp. 12&#150;20.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848392&pid=S1665-6423201000010000400004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;5&#93; Cole D., Sridharan S. &amp; Geva M., Application of noise reduction techniques for alaryngeal speech enhancement, IEEE TECON Speech and Image Processing for Computing and Telecommunications, 1997, pp. 491&#150;494.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848394&pid=S1665-6423201000010000400005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; K. Matsui and N. Hara, Enhancement of esophageal speech using format synthesis, IEEE International Conference on Acoustic, Speech and Signal Pprocessing, Vo1. 1, 1999, pp. 81&#150;84.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848396&pid=S1665-6423201000010000400006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; Gorrits M. &amp; Valiere J. , Low&#150;band extension of telephone&#150;band speech, IEEE International Conference on Acoustic, Speech and Signal Processing, 2000, pp. 1851&#150;1854.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848398&pid=S1665-6423201000010000400007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; Bi N. &amp; Qi Y., Speech conversion and its application to alaryngeal speech enhancement, Proc. of The International Conference on Signal Processing, 1997, pp. 1586&#150;1589.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848400&pid=S1665-6423201000010000400008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;9&#93; Bi N. &amp; Qi Y., Application of speech conversion to alaryngeal speech enhancement, IEEE Trans. Speech and Audio Processing, Vol. 5, No. 2, March 1997, pp. 97&#150;105.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848402&pid=S1665-6423201000010000400009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; Aguilar G., Perez&#150;Meana H., Nakano&#150;Miyatake M. &amp; Becerril H., Speech enhancement of voice produced by an electronic larynx, IEEE Midwest Symposium on Circuit and Systems, Vol. III, August 2004, pp. 37&#150;40.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848404&pid=S1665-6423201000010000400010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; Rabiner L. &amp; Gold B., Digital processing of speech signals, Prentice Hall, Englewood Cliffs NJ, 1975.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848406&pid=S1665-6423201000010000400011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;12&#93; Rabiner L. &amp; Juang B., Fundamentals of Speech Recognition, Prentice Hall, Piscataway, USA, 1993.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848408&pid=S1665-6423201000010000400012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;13&#93; Rabiner L., Juang B. &amp; Lee C., An Overview of Automatic Speech Recognition, in Automatic Speech and Speaker Recognition: Advanced Topics, C. H. Lee, F. K. Soong and K. K. Paliwal editors, Kluwer Academic Publisher, 1996, pp. 1&#150;30.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848410&pid=S1665-6423201000010000400013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;14&#93; Junqua J. &amp; Halton J., Robustness in Automatic Speech Recognition, Kluwer Academic Publishers, Norwell MA, 1996.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848412&pid=S1665-6423201000010000400014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;15&#93; Suarez&#150;Guerra S. &amp; Oropeza&#150;Rodriguez J., Introduction to Speech Recognition, in Advances in Audio and Speech Signal Processing; Technologies and Applications, H Perez&#150;Meana editor, Idea Group Publishing, 2007, pp. 325&#150;347.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848414&pid=S1665-6423201000010000400015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;16&#93; Mantilla&#150;Caeiros A., Nakano&#150;Miyatake M. &amp; Perez&#150;Meana H., A New Wavelet Function for Audio and Speech Processing, IEEE Midwest Symposium on Circuit and Systems, August 2007, pp. 101&#150;104.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848416&pid=S1665-6423201000010000400016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;17&#93; Zhang X., Heinz M., Bruce I. &amp; Carney L., A phenomenological model for the responses of auditory&#150;nerve fibers: I. Nonlinear tuning with compression and suppression, Acoustical Society of America, Vol. 109, No.2, 2001, pp. 648&#150;670.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848418&pid=S1665-6423201000010000400017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;18&#93; Mantilla&#150;Caeiros A., Nakano. MIyatake M. &amp; Perez&#150;Meana H., Isolate speech recognition based on time&#150;frequency analysis methods, Lecture Notes in Computer Science, vol. LNCS 5856, pp. 297&#150;304.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848420&pid=S1665-6423201000010000400018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;19&#93; Rao R. &amp; Bopardikar A., Wavelets Transforms, Introduction to Theory and Applications, Addison Wesley, New York, 1998.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848422&pid=S1665-6423201000010000400019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;20&#93; Schroeder M., "Objective measure of certain speech signal degradations based on masking properties of the human auditory perception", Frontiers of Speech Communication Research, Academic Press, New York, 1979.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848424&pid=S1665-6423201000010000400020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p> 	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;21&#93; Wang S., Sekey A. &amp; Gersho A., "An objective measure for predicting subjective quality of speech coders," IEEE Journal on Selected Areas in Comm., Vol. 10, No. 3, June 1992, pp. 819&#150;829.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=4848426&pid=S1665-6423201000010000400021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Barney]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Hawork]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Dunn]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An experimental transitorized artifcial larynx]]></article-title>
<source><![CDATA[Bell System Technical Journal]]></source>
<year>1959</year>
<volume>38</volume>
<page-range>1337-1356</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Aguilar]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Nakano-Miyatake]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Perez-Meana]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Alaryngeal Speech Enhancement Using Pattern Recognition Techniques]]></article-title>
<source><![CDATA[IEICE Trans. Inf. & Syst.]]></source>
<year>2005</year>
<numero>7</numero>
<issue>7</issue>
<page-range>1618-1622</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Espy-Wilson]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Chari]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Huang]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Enhancement of alaryngeal speech by adaptive filtering]]></source>
<year>2000</year>
<publisher-loc><![CDATA[Boston^eMA MA]]></publisher-loc>
<publisher-name><![CDATA[Boston University]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Becerril]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Nakano-Miyatake]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Perez-Meana]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Development of an adaptive system for voice enhancement in persons with artificial larynx using DSP]]></article-title>
<source><![CDATA[Cientifica]]></source>
<year>Apri</year>
<month>l </month>
<day>20</day>
<volume>8</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>12-20</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cole]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Sridharan]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Geva]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Application of noise reduction techniques for alaryngeal speech enhancement, IEEE TECON Speech and Image Processing for Computing and Telecommunications]]></source>
<year>1997</year>
<page-range>491-494</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Matsui]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Hara]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Enhancement of esophageal speech using format synthesis, IEEE International Conference on Acoustic, Speech and Signal Pprocessing]]></source>
<year>1999</year>
<volume>1</volume>
<page-range>81-84</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gorrits]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Valiere]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Low-band extension of telephone-band speech, IEEE International Conference on Acoustic, Speech and Signal Processing]]></source>
<year>2000</year>
<page-range>1851-1854</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bi]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Qi]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Speech conversion and its application to alaryngeal speech enhancement, Proc. of The International Conference on Signal Processing]]></source>
<year>1997</year>
<page-range>1586-1589</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bi]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Qi]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Application of speech conversion to alaryngeal speech enhancement]]></article-title>
<source><![CDATA[IEEE Trans. Speech and Audio Processing]]></source>
<year>Marc</year>
<month>h </month>
<day>19</day>
<volume>5</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>97-105</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Aguilar]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Perez-Meana]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Nakano-Miyatake]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Becerril]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Speech enhancement of voice produced by an electronic larynx, IEEE Midwest Symposium on Circuit and Systems]]></source>
<year>Augu</year>
<month>st</month>
<day> 2</day>
<volume>III</volume>
<page-range>37-40</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rabiner]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Gold]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Digital processing of speech signals]]></source>
<year>1975</year>
<publisher-loc><![CDATA[Englewood Cliffs^eNJ NJ]]></publisher-loc>
<publisher-name><![CDATA[Prentice Hall]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rabiner]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Juang]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Fundamentals of Speech Recognition]]></source>
<year>1993</year>
<publisher-loc><![CDATA[Piscataway ]]></publisher-loc>
<publisher-name><![CDATA[Prentice Hall]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rabiner]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Juang]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[C. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Soong]]></surname>
<given-names><![CDATA[F. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Paliwal]]></surname>
<given-names><![CDATA[K. K.]]></given-names>
</name>
</person-group>
<source><![CDATA[An Overview of Automatic Speech Recognition, in Automatic Speech and Speaker Recognition: Advanced Topics]]></source>
<year>1996</year>
<page-range>1-30</page-range><publisher-name><![CDATA[Kluwer Academic Publisher]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Junqua]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Halton]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Robustness in Automatic Speech Recognition]]></source>
<year>1996</year>
<publisher-loc><![CDATA[Norwell^eMA MA]]></publisher-loc>
<publisher-name><![CDATA[Kluwer Academic Publishers]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Suarez-Guerra]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Oropeza-Rodriguez]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Perez-Meana]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to Speech Recognition, in Advances in Audio and Speech Signal Processing; Technologies and Applications]]></source>
<year>2007</year>
<page-range>325-347</page-range><publisher-name><![CDATA[Idea Group Publishing]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mantilla-Caeiros]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Nakano-Miyatake]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Perez-Meana]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[A New Wavelet Function for Audio and Speech Processing, IEEE Midwest Symposium on Circuit and Systems]]></source>
<year>Augu</year>
<month>st</month>
<day> 2</day>
<page-range>101-104</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Heinz]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Bruce]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Carney]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A phenomenological model for the responses of auditory-nerve fibers: I. Nonlinear tuning with compression and suppression]]></article-title>
<source><![CDATA[Acoustical Society of America]]></source>
<year>2001</year>
<volume>109</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>648-670</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mantilla-Caeiros]]></surname>
<given-names><![CDATA[A., Nakano]]></given-names>
</name>
<name>
<surname><![CDATA[MIyatake]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Perez-Meana]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Isolate speech recognition based on time-frequency analysis methods]]></source>
<year></year>
<page-range>297-304</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rao]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Bopardikar]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Wavelets Transforms, Introduction to Theory and Applications]]></source>
<year>1998</year>
<publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Addison Wesley]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schroeder]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Objective measure of certain speech signal degradations based on masking properties of the human auditory perception]]></source>
<year>1979</year>
<publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Academic Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Sekey]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gersho]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[An objective measure for predicting subjective quality of speech coders]]></article-title>
<source><![CDATA[IEEE Journal on Selected Areas in Comm.]]></source>
<year>June</year>
<month> 1</month>
<day>99</day>
<volume>10</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>819-829</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
