<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462022000401421</article-id>
<article-id pub-id-type="doi">10.13053/cys-26-4-4271</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Application of Auditory Filter-Banks in Polyphonic Music Transcription]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Velázquez López]]></surname>
<given-names><![CDATA[Omar]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Oropeza Rodríguez]]></surname>
<given-names><![CDATA[José Luis]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Suárez Guerra]]></surname>
<given-names><![CDATA[Sergio]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación, Laboratorio de Procesamiento Digital de Señales]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Mexico</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2022</year>
</pub-date>
<volume>26</volume>
<numero>4</numero>
<fpage>1421</fpage>
<lpage>1428</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462022000401421&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462022000401421&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462022000401421&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: We present a frame-level transcription system for polyphonic piano music based on nonnegative matrix factorization (NMF). The system represents the musical signal as a Fourier spectrogram and enhances it with an auditory filter-bank derived from a new cochlear frequency-position equation, which was obtained by solving a biomechanical cochlea model without the need for physiological or psychoacoustic experiments. Auditory filter-banks are currently used in music transcription, and this paper focuses on that line of research. We evaluate the system on a set of polyphonic piano pieces, comparing it against the same system without filtered spectrograms and against another state-of-the-art system; in both cases the proposed method achieves higher precision.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Automatic music transcription]]></kwd>
<kwd lng="en"><![CDATA[auditory filter-bank]]></kwd>
<kwd lng="en"><![CDATA[nonnegative matrix factorization]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Benetos]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Dixon]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Duan]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Ewert]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Automatic Music Transcription: An Overview]]></article-title>
<source><![CDATA[IEEE Signal Processing Magazine]]></source>
<year>2019</year>
<volume>36</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>20-30</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Seung]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Learning the parts of objects by non-negative matrix factorization]]></article-title>
<source><![CDATA[Nature]]></source>
<year>1999</year>
<volume>401</volume>
<page-range>788-91</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Smaragdis]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Brown]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Non-negative matrix factorization for polyphonic music transcription]]></source>
<year>2003</year>
<conf-name><![CDATA[IEEE Workshop on Applications of Signal Processing to Audio and Acoustics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>177-80</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bertin]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Badeau]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Richard]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Blind signal decompositions for automatic transcription of polyphonic music: NMF and K-SVD on the Benchmark]]></source>
<year>2007</year>
<conf-name><![CDATA[IEEE International Conference on Acoustics, Speech and Signal Processing ICASSP'07]]></conf-name>
<conf-loc> </conf-loc>
<page-range>I-65&#8211;I-68</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Oropeza]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Guerra]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Cochlear mechanical models used in automatic speech recognition tasks]]></article-title>
<source><![CDATA[Computación y Sistemas]]></source>
<year>2019</year>
<volume>23</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>1099-114</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Greenwood]]></surname>
<given-names><![CDATA[D. D.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A cochlear frequency-position function for several species-29 years later]]></article-title>
<source><![CDATA[The Journal of the Acoustical Society of America]]></source>
<year>1990</year>
<volume>87</volume>
<numero>6</numero>
<issue>6</issue>
<page-range>2592-605</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jiménez]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Distance-frequency relation in a two dimensional cochlear model by mechanical resonance]]></source>
<year>2018</year>
<conf-name><![CDATA[International Conference on Electronics, Communications and Computers]]></conf-name>
<conf-loc> </conf-loc>
<page-range>106-9</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Neely]]></surname>
<given-names><![CDATA[S. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Kim]]></surname>
<given-names><![CDATA[D. O.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A model for active elements in cochlear biomechanics]]></article-title>
<source><![CDATA[The Journal of the Acoustical Society of America]]></source>
<year>1986</year>
<volume>79</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>1472-80</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Emery]]></surname>
<given-names><![CDATA[M. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Elliott]]></surname>
<given-names><![CDATA[S. J.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Statistics of instabilities in a state space model of the human cochlea]]></article-title>
<source><![CDATA[The Journal of the Acoustical Society of America]]></source>
<year>2008</year>
<volume>124</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>1068-79</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Santos-Perdigão]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Vieira de Sá]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Modelo computacional da cóclea humana]]></source>
<year>1998</year>
<conf-name><![CDATA[Acústica'98 Congreso Ibérico de Acústica]]></conf-name>
<conf-loc>Lisbon </conf-loc>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Slaney]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Seltzer]]></surname>
<given-names><![CDATA[M. L.]]></given-names>
</name>
</person-group>
<source><![CDATA[The influence of pitch and noise on the discriminability of filterbank features]]></source>
<year>2014</year>
<conf-name><![CDATA[INTERSPEECH'14]]></conf-name>
<conf-loc> </conf-loc>
<page-range>2263-7</page-range></nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Slaney]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Auditory Toolbox]]></source>
<year>1998</year>
<page-range>1-52</page-range><publisher-name><![CDATA[Interval Research Corporation Technical]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[López-Serrano]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Dittmar]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Özer]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Müller]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[NMF Toolbox: Music Processing Applications of Nonnegative Matrix Factorization]]></source>
<year>2019</year>
<conf-name><![CDATA[International Conference on Digital Audio Effects DAFx'19]]></conf-name>
<conf-loc> </conf-loc>
<page-range>2-6</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="">
<collab>MuseScore</collab>
<source><![CDATA[]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="">
<collab>Midi Sheet Music</collab>
<source><![CDATA[]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="">
<collab>MIREX</collab>
<source><![CDATA[Music Information Retrieval Evaluation eXchange (MIREX)]]></source>
<year>2021</year>
</nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Marolt]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A connectionist approach to automatic transcription of polyphonic piano music]]></article-title>
<source><![CDATA[IEEE Transactions on Multimedia]]></source>
<year>2004</year>
<volume>6</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>439-49</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="">
<collab>SONIC System</collab>
<source><![CDATA[]]></source>
<year>2022</year>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
