<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462019000200461</article-id>
<article-id pub-id-type="doi">10.13053/cys-23-2-2977</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Application of Different Statistical Tests for Validation of Synthesized Speech Parameterized by Cepstral Coefficients and LSP]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Franco-Galván]]></surname>
<given-names><![CDATA[Carlos]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Herrera-Camacho]]></surname>
<given-names><![CDATA[Abel]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Escalante-Ramírez]]></surname>
<given-names><![CDATA[Boris]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Benemérita Universidad Autónoma de Puebla Facultad de Artes ]]></institution>
<addr-line><![CDATA[Puebla ]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,Laboratorio de Tecnologías del Lenguaje  ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af3">
<institution><![CDATA[,Universidad Nacional Autónoma de México Facultad de Ingeniería ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Mexico</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2019</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2019</year>
</pub-date>
<volume>23</volume>
<numero>2</numero>
<fpage>461</fpage>
<lpage>467</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462019000200461&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462019000200461&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462019000200461&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: The following document tries out different statistical norms to validate the quality of synthesized voices applied to an HTS-based Spanish synthesizer, which uses LSP and Cepstral Coefficients parameterizations. Standard MOS tests were carried out. Nevertheless, other types of quality tests were performed to reinforce the MOS results. Such as: MUSHRA, ABX and CCR. The subjective test PESQ was also applied. To validate intelligibility a SUS test was used.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Speech synthesis]]></kwd>
<kwd lng="en"><![CDATA[voice parameterization]]></kwd>
<kwd lng="en"><![CDATA[line spectral pair]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tokuda]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Nankaku]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Toda]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Zen]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Yamagishi]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Oura]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Speech Synthesis Based on Hidden Markov Models]]></article-title>
<source><![CDATA[Proc. IEEE]]></source>
<year>2013</year>
<volume>101</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>1234-52</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Herrera-Camacho]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Del Rio-Avila]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Development of a Mexican Spanish Synthetic Voice Using Synthesizer Modules of Festival Speech and HTSStraight]]></article-title>
<source><![CDATA[Int. J. Comput. Electr. Eng.]]></source>
<year>2013</year>
<volume>5</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>36-9</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Franco]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Del Rio-Avila]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Herrera]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Speech Synthesis of Central Mexico Spanish using Hidden Markov Models]]></article-title>
<source><![CDATA[ATINER Conference Paper Series]]></source>
<year>2016</year>
<page-range>1-12</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nakatani]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Yamamoto]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Matsumoto]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Mel-LSP Parameterization for HMM-based Speech Synthesis]]></article-title>
<source><![CDATA[Eurasip Proc. (SPECOM'06)]]></source>
<year>2006</year>
<page-range>261-2264</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Franco]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Herrera]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Escalante]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Speech Synthesis in Mexican Spanish using LSP as voice parameterization]]></article-title>
<source><![CDATA[IIISCi. ORG]]></source>
<year>2017</year>
<volume>15</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>72-5</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="">
<collab>TU-T</collab>
<source><![CDATA[Recommendation ITU-T P.800.1 : Mean opinion score (MOS) terminology]]></source>
<year>2016</year>
</nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[King]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Karaiskos]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<source><![CDATA[The Blizzard Challenge 2016]]></source>
<year>2016</year>
<publisher-name><![CDATA[Blizzard Challenge workshop]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<collab>Itu-BS.1534</collab>
<source><![CDATA[Method for the subjective assessment of intermediate quality level of audio systems Policy on Intellectual Property Right (IPR) Series of ITU-R Recommendations]]></source>
<year>2015</year>
<page-range>1-34</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Munson]]></surname>
<given-names><![CDATA[W.A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gardner]]></surname>
<given-names><![CDATA[M.B.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Standardizing Auditory Tests]]></article-title>
<source><![CDATA[J. Acoust. Soc. Am.]]></source>
<year>1950</year>
<volume>22</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>675</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<collab>ITU-T</collab>
<source><![CDATA[T-REC-P.800-1996]]></source>
<year>1996</year>
<volume>800</volume>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="">
<collab>ITU-T</collab>
<source><![CDATA[ITU-T Recommendation P.862 -PESQ measure]]></source>
<year>2001</year>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Beerends]]></surname>
<given-names><![CDATA[J.G.]]></given-names>
</name>
<name>
<surname><![CDATA[Hekstra]]></surname>
<given-names><![CDATA[A.P.]]></given-names>
</name>
<name>
<surname><![CDATA[Rix]]></surname>
<given-names><![CDATA[A.W.]]></given-names>
</name>
<name>
<surname><![CDATA[Hollier]]></surname>
<given-names><![CDATA[M.P.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Perceptual Evaluation of Speech Quality (PESQ) The New ITU Standard for End-to-End Speech Quality Assessment Part II: Psychoacoustic Model]]></article-title>
<source><![CDATA[J. Audio Eng. Soc]]></source>
<year>2002</year>
<volume>50</volume>
<numero>10</numero>
<issue>10</issue>
<page-range>765-78</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cernak]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Rusko]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[An evaluation of synthetic speech using the PESQ measure]]></article-title>
<source><![CDATA[Proc. of European Congress on Acoustics]]></source>
<year>2005</year>
<page-range>2725-8</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Benoit]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Grice]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Hazan]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The SUS test: A method for the assessment of text-to-speech synthesis intelligibility using Semantically Unpredictable Sentences]]></article-title>
<source><![CDATA[Speech Commun.]]></source>
<year>1996</year>
<volume>18</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>381-92</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
