<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442009000200006</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Improving Named Entity Extraction Accuracy using Unlabeled Data and Several Extractors]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Iwakura]]></surname>
<given-names><![CDATA[Tomoya]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Okamoto]]></surname>
<given-names><![CDATA[Seishi]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Fujitsu Laboratories Ltd.  ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Japan</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2009</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2009</year>
</pub-date>
<numero>40</numero>
<fpage>29</fpage>
<lpage>38</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442009000200006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442009000200006&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442009000200006&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[This paper proposes feature augmentation methods using unlabeled data and several Named Entity (NE) extractors. We collect NE-related information of each word (which we call NE-related labels) from unlabeled data by using NE extractors. NE-related labels which we collect include candidate NE class labels of each word and NE class labels of co-occurring words. To accurately collect the NE-related labels from unlabeled data, we consider methods to collect NE-related labels by using outputs of several NE extractors. We use NE-related labels as additional features for creating new NE extractors. We apply our NE extraction methods using the NE-related labels to IREX Japanese NE extraction task. The experimental results show better accuracy than the previous results obtained with NE extractors using handcrafted resources.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Named entity recognition]]></kwd>
<kwd lng="en"><![CDATA[unlabeled data]]></kwd>
<kwd lng="en"><![CDATA[combination of extractors]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ <p align="justify"><font face="verdana" size="4">Special section: Information Retrieval and Natural Language Processing</font></p>     <p align="justify"><font face="verdana" size="4">&nbsp;</font></p>     <p align="center"><font face="verdana" size="4"><b>Improving Named Entity Extraction Accuracy using Unlabeled Data and Several Extractors</b></font></p>     <p align="center"><font face="verdana" size="2">&nbsp;</font></p>     <p align="center"><font face="verdana" size="2"><b>Tomoya Iwakura and Seishi Okamoto</b></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><i>Fujitsu Laboratories Ltd., 1&#150;1, Kamikodanaka 4&#150;chome, Nakahara&#150;ku, Kawasaki 211&#150;8588, Japan.</i> (<a href="mailto:iwakura.tomoya@jp.fujitsu.com">iwakura.tomoya@jp.fujitsu.com</a>, <a href="mailto:seishi@jp.fujitsu.com">seishi@jp.fujitsu.com</a>)</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2">Manuscript received November 4, 2008.     <br> Manuscript accepted for publication August 25, 2009.</font></p>     ]]></body>
<body><![CDATA[<p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>     <p align="justify"><font face="verdana" size="2">This paper proposes feature augmentation methods using unlabeled data and several Named Entity (NE) extractors. We collect NE&#150;related information of each word (which we call NE&#150;related labels) from unlabeled data by using NE extractors. NE&#150;related labels which we collect include candidate NE class labels of each word and NE class labels of co&#150;occurring words. To accurately collect the NE&#150;related labels from unlabeled data, we consider methods to collect NE&#150;related labels by using outputs of several NE extractors. We use NE&#150;related labels as additional features for creating new NE extractors. We apply our NE extraction methods using the NE&#150;related labels to IREX Japanese NE extraction task. The experimental results show better accuracy than the previous results obtained with NE extractors using handcrafted resources.</font></p>     <p align="justify"><font face="verdana" size="2"><b>Key words: </b>Named entity recognition, unlabeled data, combination of extractors.</font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><a href="/pdf/poli/n40/n40a6.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>     <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>     <p align="justify"><font face="verdana" size="2"><b>REFERENCES</b></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; Y. Takemoto, T. Fukushima, and H. Yamada, "A Japanese named entity extraction system based on building a large&#150;scale and high quality dictionary and pattern&#150;matching rules (in Japanese)," in <i>IPSJ Journal, 42(6), </i>2001, pp. 1580&#150;1591.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042871&pid=S1870-9044200900020000600001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; M.     Collins    and    Y.     Singer, "Unsupervised    models for named   entity   classification,"   in   <i>Proc.   of the   Joint SIGDAT Conference    on    Empirical    Methods    in    Natural Language Processing and Very Large Corpora,   </i>1999.   &#91;Online&#93;. Available: <a href="https://citeseer.ist.psu.edu/myciteseer/login" target="_blank">citeseer.ist.psu.edu/collins99unsupervised.html</a> </font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042873&pid=S1870-9044200900020000600002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;3&#93; K. Uchimoto, Q. Ma, M. Murata, H. Ozaku, M. Utiyama, and H. Isahara, "Named entity extraction based on a maximum entropy model and transformati on rules." in <i>Proc. of the ACL 2000, </i>2000, pp. 326&#150;335.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042874&pid=S1870-9044200900020000600003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;4&#93; H. Yamada, T. Kudoh, and Y. Matsumoto, "Japanese named entity extraction using Support Vector Machine (in Japanese)," in <i>IPSJ Journal, 43(1), </i>2002, pp. 44&#150;53.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042876&pid=S1870-9044200900020000600004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;5&#93; X. Carreras, L. M&agrave;rques, and L. Padr&oacute;, "Named entity extraction using adaboost," in <i>Proc. of CoNLL&#150;2002. </i>Taipei, Taiwan, 2002, pp. 167&#150;170.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042878&pid=S1870-9044200900020000600005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; H. Isozaki and H. Kazawa, "Speeding up named entity recognition based on Support Vector Machines (in Japanese)," in <i>IPSJ SIG notes NL&#150;149&#150;1, </i>2002, pp. 1&#150;8.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042880&pid=S1870-9044200900020000600006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; R. Florian, A. Ittycheriah, H. Jing, and T. Zhang, "Named entity recognition through classifier combination," in <i>Proc. of CoNLL&#150;2003, </i>2003, pp. 168&#150;171.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042882&pid=S1870-9044200900020000600007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; M. Asahara and Y. Matsumoto, "Japanese named entity extraction with redundant morphological analysis," in <i>Proc. of HLT&#150;NAACL 2003, </i>2003, pp. 8&#150;15.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042884&pid=S1870-9044200900020000600008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;9&#93; K. Nakano and Y. Hirai, "Japanese named entity extraction with bunsetsu features (in Japanese)," in <i>IPSJ Journal, 45(3), </i>2004, pp. 934&#150;941.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042886&pid=S1870-9044200900020000600009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; S. Miller, J. Guinness, and A. Zamanian, "Name tagging with word clusters and discriminative training." in <i>HLT&#150;NAACL, </i>2004, pp. 337-342.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042888&pid=S1870-9044200900020000600010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; D. Freitag, "Trained named entity recognition using distributional clusters," in <i>Proc. of EMNLP 2004.    </i>Association for Computational Linguistics, July 2004, pp. 262&#150;269.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042890&pid=S1870-9044200900020000600011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;12&#93; R. Ando and T. Zhang, "A high&#150;performance semi&#150;supervised learning method for text chunking," in <i>Proc. of the 43rd Annual Meeting of the Association for Computational Linguistics. </i>Ann Arbor, Michigan: Association for Computational Linguistics, June 2005, pp. 1&#150;9. &#91;Online&#93;. Available: <a href="http://www.aclweb.org/anthology/P/P05/P05-1001" target="_blank">http://www.aclweb.org/anthology/P/P05/P05&#150;1001</a> </font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042892&pid=S1870-9044200900020000600012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;13&#93; D.  Yarowsky, "Unsupervised word sense disambiguation rivaling supervised methods," in <i>Proc. of ACL&#150;1995, </i>1995, pp. 189&#150;196.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042893&pid=S1870-9044200900020000600013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;14&#93; E. Riloff and R. Jones, "Learning dictionaries for information extraction by multi&#150;level bootstrapping," in <i>AAAI/IAAI,  </i>1999, pp. 474&#150;479. &#91;Online&#93;. Available: <a href="https://citeseer.ist.psu.edu/myciteseer/login" target="_blank">citeseer.ist.psu.edu/article/riloff99learning.html</a> </font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042895&pid=S1870-9044200900020000600014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;15&#93; A. Blum and T. Mitchell, "Combining labeled and unlabeled data with co&#150;training," in <i>Proc. of the 11th COLT, </i>1998, pp. 92&#150;100.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042896&pid=S1870-9044200900020000600015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;16&#93; R. K. Ando, "Semantic lexicon construction: Learning from unlabeled data via spectral analysis," in <i>Proc. of CoNLL&#150;2004.    </i>Boston, MA, USA, 2004, pp. 9&#150;16.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042898&pid=S1870-9044200900020000600016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;17&#93; C. IREX, <i>Proc. of the IREX workshop, </i>1999.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042900&pid=S1870-9044200900020000600017&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;18&#93; L. Ramshaw and M. Marcus, "Text chunking using transformation&#150;based learning," in <i>Proc. of the Third Workshop on Very Large Corpora. </i>Association for Computational Linguistics, 1995, pp. 82&#150;94. &#91;Online&#93;. Available: <a href="https://citeseer.ist.psu.edu/myciteseer/login" target="_blank">citeseer.ist.psu.edu/article/ramshaw95text.html</a> </font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042902&pid=S1870-9044200900020000600018&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;19&#93; E. Tjong Kim Sang and J. Veenstra, "Representing text chunks." in <i>Proc. of EACL '99, </i>Bergen, Norway, 1999. &#91;Online&#93;. Available: <a href="http://www.cnts.ua.ac.be/Publications/1999/TV99" target="_blank">http://www.cnts.ua.ac.be/Publications/1999/TV99</a> </font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042903&pid=S1870-9044200900020000600019&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;20&#93; T. Kudo and Y. Matsumoto, "Chunking with Support Vector Machines," in <i>Proc. of NAACL 2001, </i>2001.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042904&pid=S1870-9044200900020000600020&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;21&#93; &#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;&#150;, "Fast methods for kernel&#150;based text analysis," in <i>Proc. of ACL&#150;</i><i>2003, </i>2003, pp. 24&#150;31.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042906&pid=S1870-9044200900020000600021&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;22&#93; V. Vapnik, <i>Statistical Learning Theory.   </i>John Wiley &amp; Sons, 1998.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042908&pid=S1870-9044200900020000600022&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;23&#93; J. C. Platt, <i>Probabilities for SV machines, </i>A. J. Smola, P. L. Bartlett, B. Sch&uml;olkopf, and D. Schuurmans, Eds.   MIT Press, 2000.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042910&pid=S1870-9044200900020000600023&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;24&#93; T. Utsuro, M. Sassano, and K. Uchimoto, "Combining outputs of multiple Japanese named entity chunkers by stacking," in <i>Proc. of </i><i>EMNLP 2002, </i>2002, pp. 281&#150;288.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042912&pid=S1870-9044200900020000600024&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;25&#93; R. Sasano and S. Kurohashi, "Japanese named entity recognition using structural natural language processing," in <i>Proc. of IJCNLP'08, </i>2008, pp. 607&#150;612.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042914&pid=S1870-9044200900020000600025&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;26&#93; J. Kazama and K. Torisawa, "Inducing gazetteers for named entity recognition by large&#150;scale clustering of dependency relations," in <i>Proc. </i><i>of ACL&#150;08: HLT, </i>2008, pp. 407&#150;415.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042916&pid=S1870-9044200900020000600026&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>     <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;27&#93; S. Ikehara, M. Miyazaki, S. Shirai, A. Yokoo, H. Nakaiwa, K. Ogura, Y. Ooyama, and Y. Hayashi, <i>Goi&#150;Taikei &#150;A Japanese Lexicon CDROM. </i>Iwanami Shoten, 1999.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6042918&pid=S1870-9044200900020000600027&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Takemoto]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Fukushima]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Yamada]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A Japanese named entity extraction system based on building a large-scale and high quality dictionary and pattern-matching rules (in Japanese)]]></article-title>
<source><![CDATA[IPSJ Journal]]></source>
<year>2001</year>
<volume>42</volume>
<numero>6</numero>
<issue>6</issue>
<page-range>1580-1591</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Collins]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Singer]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Unsupervised models for named entity classification]]></article-title>
<source><![CDATA[Proc. of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora]]></source>
<year>1999</year>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Uchimoto]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Ma]]></surname>
<given-names><![CDATA[Q.]]></given-names>
</name>
<name>
<surname><![CDATA[Murata]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ozaku]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Utiyama]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Isahara]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Named entity extraction based on a maximum entropy model and transformati on rules]]></article-title>
<source><![CDATA[Proc. of the ACL 2000]]></source>
<year>2000</year>
<page-range>326-335</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yamada]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Kudoh]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Matsumoto]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Japanese named entity extraction using Support Vector Machine (in Japanese)]]></article-title>
<source><![CDATA[IPSJ Journal]]></source>
<year>2002</year>
<volume>43</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>44-53</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carreras]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Màrques]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Padró]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Named entity extraction using adaboost]]></article-title>
<source><![CDATA[Proc. of CoNLL-2002]]></source>
<year>2002</year>
<page-range>167-170</page-range><publisher-loc><![CDATA[Taipei ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Isozaki]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Kazawa]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Speeding up named entity recognition based on Support Vector Machines (in Japanese)]]></article-title>
<source><![CDATA[IPSJ SIG notes]]></source>
<year>2002</year>
<page-range>1-8</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Florian]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Ittycheriah]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Jing]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Named entity recognition through classifier combination]]></article-title>
<source><![CDATA[Proc. of CoNLL-2003]]></source>
<year>2003</year>
<page-range>168-171</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Asahara]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Matsumoto]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Japanese named entity extraction with redundant morphological analysis]]></article-title>
<source><![CDATA[Proc. of HLT-NAACL 2003]]></source>
<year>2003</year>
<page-range>8-15</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nakano]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Hirai]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Japanese named entity extraction with bunsetsu features (in Japanese)]]></article-title>
<source><![CDATA[IPSJ Journal]]></source>
<year>2004</year>
<volume>45</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>934-941</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Miller]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Guinness]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Zamanian]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Name tagging with word clusters and discriminative training]]></article-title>
<source><![CDATA[HLT-NAACL]]></source>
<year>2004</year>
<page-range>337-342</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Freitag]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Trained named entity recognition using distributional clusters]]></article-title>
<source><![CDATA[Proc. of EMNLP 2004]]></source>
<year>July</year>
<month> 2</month>
<day>00</day>
<page-range>262-269</page-range><publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ando]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A high-performance semi-supervised learning method for text chunking]]></article-title>
<source><![CDATA[Proc. of the 43rd Annual Meeting of the Association for Computational Linguistics]]></source>
<year>June</year>
<month> 2</month>
<day>00</day>
<page-range>1-9</page-range><publisher-loc><![CDATA[Ann Arbor^eMichigan Michigan]]></publisher-loc>
<publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yarowsky]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Unsupervised word sense disambiguation rivaling supervised methods]]></article-title>
<source><![CDATA[Proc. of ACL-1995]]></source>
<year>1995</year>
<page-range>189-196</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Riloff]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Jones]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Learning dictionaries for information extraction by multi-level bootstrapping]]></article-title>
<source><![CDATA[AAAI/IAAI]]></source>
<year>1999</year>
<page-range>474-479</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Blum]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Mitchell]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Combining labeled and unlabeled data with co-training]]></article-title>
<source><![CDATA[Proc. of the 11th COLT]]></source>
<year>1998</year>
<page-range>92-100</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ando]]></surname>
<given-names><![CDATA[R. K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Semantic lexicon construction: Learning from unlabeled data via spectral analysis]]></article-title>
<source><![CDATA[Proc. of CoNLL-2004]]></source>
<year>2004</year>
<page-range>9-16</page-range><publisher-loc><![CDATA[Boston^eMA MA]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="">
<collab>C. IREX</collab>
<source><![CDATA[Proc. of the IREX workshop]]></source>
<year>1999</year>
</nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ramshaw]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Marcus]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Text chunking using transformation-based learning]]></article-title>
<source><![CDATA[Proc. of the Third Workshop on Very Large Corpora]]></source>
<year>1995</year>
<page-range>82-94</page-range><publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tjong Kim Sang]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Veenstra]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Representing text chunks]]></article-title>
<source><![CDATA[Proc. of EACL '99]]></source>
<year>1999</year>
<publisher-loc><![CDATA[Bergen ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kudo]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Matsumoto]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Chunking with Support Vector Machines]]></article-title>
<source><![CDATA[Proc. of NAACL 2001]]></source>
<year>2001</year>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kudo]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Fast methods for kernel-based text analysis]]></article-title>
<source><![CDATA[Proc. of ACL-2003]]></source>
<year>2003</year>
<page-range>24-31</page-range></nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vapnik]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Statistical Learning Theory]]></source>
<year>1998</year>
<publisher-name><![CDATA[John Wiley & Sons]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Platt]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Smola]]></surname>
<given-names><![CDATA[A. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Bartlett]]></surname>
<given-names><![CDATA[P. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Sch¨olkopf]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Schuurmans]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Probabilities for SV machines]]></source>
<year>2000</year>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Utsuro]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Sassano]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Uchimoto]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Combining outputs of multiple Japanese named entity chunkers by stacking]]></article-title>
<source><![CDATA[Proc. of EMNLP 2002]]></source>
<year>2002</year>
<page-range>281-288</page-range></nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sasano]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Kurohashi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Japanese named entity recognition using structural natural language processing]]></article-title>
<source><![CDATA[Proc. of IJCNLP'08]]></source>
<year>2008</year>
<page-range>607-612</page-range></nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kazama]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Torisawa]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations]]></article-title>
<source><![CDATA[Proc. of ACL-08: HLT]]></source>
<year>2008</year>
<page-range>407-415</page-range></nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ikehara]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Miyazaki]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Shirai]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Yokoo]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Nakaiwa]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Ogura]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Ooyama]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Hayashi]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Goi-Taikei -A Japanese Lexicon CDROM]]></source>
<year>1999</year>
<publisher-name><![CDATA[Iwanami Shoten]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
