<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462014000300013</article-id>
<article-id pub-id-type="doi">10.13053/CyS-18-3-2034</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Entity Extraction in Biochemical Text using Multiobjective Optimization]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Sikdar]]></surname>
<given-names><![CDATA[Utpal Kumar]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[Asif]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Saha]]></surname>
<given-names><![CDATA[Sriparna]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Indian Institute of Technology Patna Department of Computer Science and Engineering ]]></institution>
<addr-line><![CDATA[Patna Bihar]]></addr-line>
<country>India</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>09</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>09</month>
<year>2014</year>
</pub-date>
<volume>18</volume>
<numero>3</numero>
<fpage>591</fpage>
<lpage>602</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462014000300013&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462014000300013&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462014000300013&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[In this paper we propose a multiobjective modified differential evolution based feature selection and classifier ensemble approach for biochemical entity extraction. The algorithm performs in two layers. The first layer concerns with determining an appropriate set of features for the task within the framework of a supervised statistical classifier, namely, Conditional Random Field (CRF). This produces a set of solutions, a subset of which is used to construct an ensemble in the second layer. The proposed approach is evaluated for entity extraction in chemical texts, which involves identification of IUPAC and IUPAC-like names and classification of them into some predefined categories. Experiments that were carried out on a benchmark dataset show the recall, precision and F-measure values of 86.15%, 91.29% and 88.64%, respectively.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Multiobjective modified differential evolution (MODE)]]></kwd>
<kwd lng="en"><![CDATA[feature selection]]></kwd>
<kwd lng="en"><![CDATA[ensemble learning]]></kwd>
<kwd lng="en"><![CDATA[conditional random field (CRF)]]></kwd>
<kwd lng="en"><![CDATA[named entity (NE)]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[  	    <p align="justify"><font face="verdana" size="4">Art&iacute;culos regulares</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="4"><b>Entity Extraction in Biochemical Text using Multiobjective Optimization</b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="center"><font face="verdana" size="2"><b>Utpal Kumar Sikdar, Asif Ekbal, and Sriparna Saha</b></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><i>Department of Computer Science and Engineering, Indian Institute of Technology, Patna, India.</i> <a href="mailto:utpal.sikdar@iitp.ac.in">utpal.sikdar@iitp.ac.in</a>, <a href="mailto:asif@iitp.ac.in">asif@iitp.ac.in</a>, <a href="mailto:sriparna@iitp.ac.in">sriparna@iitp.ac.in</a>.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2">Article received on 18/01/2014.    ]]></body>
<body><![CDATA[<br> 	Accepted on 01/02/2014.</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>  	    <p align="justify"><font face="verdana" size="2">In this paper we propose a multiobjective modified differential evolution based feature selection and classifier ensemble approach for biochemical entity extraction. The algorithm performs in two layers. The first layer concerns with determining an appropriate set of features for the task within the framework of a supervised statistical classifier, namely, Conditional Random Field (CRF). This produces a set of solutions, a subset of which is used to construct an ensemble in the second layer. The proposed approach is evaluated for entity extraction in chemical texts, which involves identification of IUPAC and IUPAC&#45;like names and classification of them into some predefined categories. Experiments that were carried out on a benchmark dataset show the recall, precision and F&#45;measure values of 86.15%, 91.29% and 88.64%, respectively.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>Keywords:</b> Multiobjective modified differential evolution (MODE), feature selection, ensemble learning, conditional random field (CRF), named entity (NE).</font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><a href="/pdf/cys/v18n3/v18n3a13.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>  	    <p align="justify"><font face="verdana" size="2">&nbsp;</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>References</b></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>1. Ekbal, A. &amp; Saha, S. (2010).</b> Classifier ensemble selection using genetic algorithm for named entity recognition. <i>Research on Language and Computation,</i> 8, 73&#45;99.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2067870&pid=S1405-5546201400030001300001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>2. Ekbal, A. &amp; Saha, S. (2010).</b> Weighted vote based classifier ensemble selection using genetic algorithm for named entity recognition. In <i>Proceedings of the Natural language processing and information systems,</i> NLDB'10, pp. 256&#45;267.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2067872&pid=S1405-5546201400030001300002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>3. Ekbal, A. &amp; Saha, S. (2011).</b> Weighted vote&#45;based classifier ensemble for named entity recognition: A genetic algorithm&#45;based approach. <i>ACM Trans. Asian Lang. Inf. Process.,</i> 10(2).    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2067874&pid=S1405-5546201400030001300003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>4. Ekbal, A. &amp; Saha, S. (2012).</b> Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition. <i>IJDAR,</i> 15(2), 143&#45;166.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2067876&pid=S1405-5546201400030001300004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <p align="justify"><font face="verdana" size="2"><b>5. Lafferty, J. D., McCallum, A., &amp; Pereira, F. C. N. (2001).</b> Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. In <i>ICML,</i> pp. 282&#45;289.</font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>6. Liu, H. &amp; Motoda, H. (1998).</b> <i>Feature Selection for Knowledge Discovery and Data Mining.</i> Kluwer Academic Publishers, Norwell, MA, USA.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2067879&pid=S1405-5546201400030001300005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2"><b>7. Liu, H. &amp; Yu, L. (2005).</b> Toward integrating feature selection algorithms for classification and clustering. <i>IEEE Trans. on Knowl. and Data Eng.,</i> 17(4), 491-502. doi: <a href="http://dx.doi.org/10.1109/TKDE.2005.66" target="_blank">http://dx.doi.org/10.1109/TKDE.2005.66</a>.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2067881&pid=S1405-5546201400030001300006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>  	    <p align="justify"><font face="verdana" size="2"><b>8. Sikdar, U. K., Ekbal, A., &amp; Saha, S. (2012).</b> Differential evolution based feature selection and classifier ensemble for named entity recognition. In <i>COLING,</i> pp. 2475&#45;2490.</font></p>  	    <p align="justify"><font face="verdana" size="2"><b>9. Sikdar, U. K., Ekbal, A., &amp; Saha, S. (2014).</b> Modified differential evolution for biochemical name recognizer. In <i>CICLing,</i> pp. 225&#45;236.</font></p>  	    <!-- ref --><p align="justify"><font face="verdana" size="2"><b>10. Storn, R. &amp; Price, K. (1997).</b> Differential evolution &#151; a simple and efficient heuristic for global optimization over continuous spaces. <i>J. of Global Optimization,</i> 11(4), 341&#45;359. doi: 10.1023/A:1008202821328.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=2067885&pid=S1405-5546201400030001300007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>      ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Saha]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Classifier ensemble selection using genetic algorithm for named entity recognition]]></article-title>
<source><![CDATA[Research on Language and Computation]]></source>
<year>2010</year>
<volume>8</volume>
<page-range>73-99</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Saha]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Weighted vote based classifier ensemble selection using genetic algorithm for named entity recognition]]></article-title>
<source><![CDATA[Proceedings of the Natural language processing and information systems, NLDB'10]]></source>
<year>2010</year>
<page-range>256-267</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Saha]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Weighted vote-based classifier ensemble for named entity recognition: A genetic algorithm-based approach]]></article-title>
<source><![CDATA[ACM Trans. Asian Lang. Inf. Process]]></source>
<year>2011</year>
<volume>10</volume>
<numero>2</numero>
<issue>2</issue>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ekbal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Saha]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Multiobjective optimization for classifier ensemble and feature selection: an application to named entity recognition]]></article-title>
<source><![CDATA[IJDAR]]></source>
<year>2012</year>
<volume>15</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>143-166</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>6</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Motoda]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Feature Selection for Knowledge Discovery and Data Mining]]></source>
<year>1998</year>
<publisher-loc><![CDATA[Norwell ]]></publisher-loc>
<publisher-name><![CDATA[Kluwer Academic Publishers]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Yu]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Toward integrating feature selection algorithms for classification and clustering]]></article-title>
<source><![CDATA[IEEE Trans. on Knowl. and Data Eng.]]></source>
<year>2005</year>
<volume>17</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>491-502</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Storn]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Price]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Differential evolution - a simple and efficient heuristic for global optimization over continuous spaces]]></article-title>
<source><![CDATA[J. of Global Optimization]]></source>
<year>1997</year>
<volume>11</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>341-359</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
