<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462018000401367</article-id>
<article-id pub-id-type="doi">10.13053/cys-22-4-3073</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Towards Product Attributes Extraction in Indonesian e-Commerce Platform]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Rif&#8217;at]]></surname>
<given-names><![CDATA[Muhammad]]></given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Mahendra]]></surname>
<given-names><![CDATA[Rahmad]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Budi]]></surname>
<given-names><![CDATA[Indra]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Wibowo]]></surname>
<given-names><![CDATA[Haryo Akbarianto]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Universitas Indonesia Faculty of Computer Science ]]></institution>
<addr-line><![CDATA[Depok ]]></addr-line>
<country>Indonesia</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<volume>22</volume>
<numero>4</numero>
<fpage>1367</fpage>
<lpage>1375</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462018000401367&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462018000401367&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462018000401367&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: Product attribute extraction is an important task in e-commerce domain. Extracting pairs of attribute label and value from free-text product descriptions can be useful for many tasks, such as product matching, product categorization, faceted product search, and product recommendation. In this paper, we present a study of attribute extraction from Indonesian e-commerce product titles. We annotate 1,721 product titles with 16 attribute labels. We apply supervised learning technique using CRF algorithm. We propose combination of lexical, word embedding, and dictionary features to learn the attribute using joint extraction model. Our model achieves F1-measure 47.30% and 68.49% respectively for full match and partial match evaluation. Based on the experiment, we find that doing attributes extraction on more various number and diverse attributes simultaneously does not necessarily give worse result compared to extraction on less number of attributes.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Attributes extraction]]></kwd>
<kwd lng="en"><![CDATA[e-commerce]]></kwd>
<kwd lng="en"><![CDATA[product title]]></kwd>
<kwd lng="en"><![CDATA[named entity recognition]]></kwd>
<kwd lng="en"><![CDATA[Indonesian language]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ghani]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Probst]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Krema]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Fano]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Text mining for product attribute extraction]]></article-title>
<source><![CDATA[SIGKDD Explor. Newsl.]]></source>
<year>2006</year>
<volume>8</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>41-8</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Joshi]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Hart]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Vogel]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ruvini]]></surname>
<given-names><![CDATA[J.-D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Distributed word representations improve ner for e-commerce]]></source>
<year>2015</year>
<conf-name><![CDATA[ VS@ HLT-NAACL]]></conf-name>
<conf-loc> </conf-loc>
<page-range>160-7</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kannan]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Givoni]]></surname>
<given-names><![CDATA[I. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Agrawal]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Fuxman]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Matching unstructured product offers to structured product specifications]]></source>
<year>2011</year>
<conf-name><![CDATA[ 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD &#8217;11]]></conf-name>
<conf-loc>New York, NY, USA </conf-loc>
<page-range>404-12</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Köpcke]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Thor]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Thomas]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Rahm]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Tailoring entity resolution for matching product offers]]></source>
<year>2012</year>
<conf-name><![CDATA[ 15th International Conference on Extending Database Technology]]></conf-name>
<conf-loc> </conf-loc>
<page-range>545-50</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kozareva]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[Q.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhai]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Guo]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<source><![CDATA[Recognizing salient entities in shopping queries]]></source>
<year>2016</year>
<volume>2</volume>
<conf-name><![CDATA[ 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016]]></conf-name>
<conf-date>August 7-12, 2016</conf-date>
<conf-loc>Berlin, Germany </conf-loc>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lafferty]]></surname>
<given-names><![CDATA[J. D.]]></given-names>
</name>
<name>
<surname><![CDATA[McCallum]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Pereira]]></surname>
<given-names><![CDATA[F. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Conditional random fields: Probabilistic models for segmenting and labeling sequence data]]></source>
<year>2001</year>
<conf-name><![CDATA[ Eighteenth International Conference on Machine Learning, ICML &#8217;01]]></conf-name>
<conf-loc>San Francisco, CA, USA </conf-loc>
<page-range>282-9</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mauge]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Rohanimanesh]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Ruvini]]></surname>
<given-names><![CDATA[J.-D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Structuring e-commerce inventory]]></source>
<year>2012</year>
<conf-name><![CDATA[ 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1]]></conf-name>
<conf-loc> </conf-loc>
<page-range>805-14</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Melli]]></surname>
<given-names><![CDATA[G]]></given-names>
</name>
</person-group>
<source><![CDATA[Shallow semantic parsing of product offering titles (for better automatic hyperlink insertion)]]></source>
<year>2014</year>
<conf-name><![CDATA[ 20th ACM SIGKDD international conference on Knowledge discovery and data mining]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1670-8</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mikolov]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Sutskever]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrado]]></surname>
<given-names><![CDATA[G. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Dean]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Distributed representati-ons of words and phrases and their compositionality]]></article-title>
<source><![CDATA[Advances in neural information processing systems]]></source>
<year>2013</year>
<page-range>3111-9</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[More]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
</person-group>
<source><![CDATA[Attribute extraction from product tit-les in ecommerce]]></source>
<year>2016</year>
</nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Petrovski]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Bryl]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Bizer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Learning regular expressions for the extraction of product attributes from e-commerce microdata]]></source>
<year>2014</year>
<conf-name><![CDATA[ Second International Conference on Lin-ked Data for Information Extraction-Volume 1267]]></conf-name>
<conf-loc> </conf-loc>
<page-range>45-54</page-range></nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Petrovski]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Primpeli]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Meusel]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Bizer]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The wdc gold standards for product feature extraction and product matching]]></article-title>
<source><![CDATA[International Conference on Electronic Commerce and Web Technologies]]></source>
<year>2016</year>
<page-range>73-86</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Putthividhya]]></surname>
<given-names><![CDATA[D. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Hu]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Bootstrapped named entity recognition for product attribute extraction]]></source>
<year>2011</year>
<conf-name><![CDATA[ Conference on Empirical Methods in Natural Language Processing]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1557-67</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Radhakrishnan]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Gupta]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Varma]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Modeling the evolution of product entities]]></source>
<year>2014</year>
<conf-name><![CDATA[ 37th International ACM SIGIR Conference on Research; Development in Information Retrieval, SIGIR &#8217;14]]></conf-name>
<conf-loc>New York, NY, USA </conf-loc>
<page-range>923-6</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shinzato]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Sekine]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Unsupervised extraction of attributes and their values from product description]]></source>
<year>2013</year>
<conf-name><![CDATA[ Sixth International Joint Conference on Natural Language Processing, IJCNLP 2013]]></conf-name>
<conf-date>October 14-18, 2013</conf-date>
<conf-loc>Nagoya, Japan </conf-loc>
<page-range>1339-47</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Xu]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Pan]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Content-based recommendation in e-commerce]]></article-title>
<source><![CDATA[Computational Science and Its Applications - ICCSA 2005]]></source>
<year>2005</year>
<page-range>946-55</page-range><publisher-loc><![CDATA[Berlin, Heidelberg ]]></publisher-loc>
<publisher-name><![CDATA[Springer Berlin Heidelberg]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
