<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442014000100005</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Combining Active and Ensemble Learning for Efficient Classification of Web Documents]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Schnitzer]]></surname>
<given-names><![CDATA[Steffen]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Schmidt]]></surname>
<given-names><![CDATA[Sebastian]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Rensing]]></surname>
<given-names><![CDATA[Christoph]]></given-names>
</name>
<xref ref-type="aff" rid="A01"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Harriehausen-Miihlbauer]]></surname>
<given-names><![CDATA[Bettina]]></given-names>
</name>
<xref ref-type="aff" rid="A02"/>
</contrib>
</contrib-group>
<aff id="A01">
<institution><![CDATA[,Technische Universitat Darmstadt Multimedia Communications Lab ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Germany</country>
</aff>
<aff id="A02">
<institution><![CDATA[,University of Applied Sciences  ]]></institution>
<addr-line><![CDATA[Darmstadt ]]></addr-line>
<country>Germany</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2014</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2014</year>
</pub-date>
<numero>49</numero>
<fpage>39</fpage>
<lpage>46</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442014000100005&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442014000100005&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442014000100005&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Classification of text remains a challenge. Most machine learning based approaches require many manually annotated training instances for a reasonable accuracy. In this article we present an approach that minimizes the human annotation effort by interactively incorporating human annotators into the training process via active learning of an ensemble learner. By passing only ambiguous instances to the human annotators the effort is reduced while maintaining a very good accuracy. Since the feedback is only used to train an additional classifier and not for re-training the whole ensemble, the computational complexity is kept relatively low.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Text classification]]></kwd>
<kwd lng="en"><![CDATA[active learning]]></kwd>
<kwd lng="en"><![CDATA[user feedback]]></kwd>
<kwd lng="en"><![CDATA[ensemble learning]]></kwd>
</kwd-group>
</article-meta>
</front><body><![CDATA[ 
	    <p align="center"><font face="verdana" size="4"><b>Combining Active and Ensemble Learning for Efficient Classification of Web Documents</b></font></p>
	    <p align="center">&nbsp;</p>
	    <p align="center"><font face="verdana" size="2"><b>Steffen Schnitzer<sup>1</sup>, Sebastian Schmidt<sup>1</sup>, Christoph Rensing<sup>1</sup>, and Bettina Harriehausen&#45;Miihlbauer<sup>2</sup></b></font></p>

	    <p align="justify">&nbsp;</p>
	    <p align="justify"><font face="verdana" size="2"><sup><i>1 </i></sup><i>Multimedia Communications Lab, Technische Universitat Darmstadt, Germany</i> (e&#45;mail: <a href="mailto:Steffen.Schnitzer@kom.tu&#45;darmstadt.de">Steffen.Schnitzer@kom.tu&#45;darmstadt.de</a>, <a href="mailto:Sebastian.Schmidt@kom.tu&#45;darmstadt.de">Sebastian.Schmidt@kom.tu&#45;darmstadt.de</a>, <a href="mailto:Christoph.Rensing@kom.tu&#45;darmstadt.de">Christoph.Rensing@kom.tu&#45;darmstadt.de</a>).</font></p>
        <p align="justify"><font face="verdana" size="2"><sup><i>2 </i></sup><i>University of Applied Sciences, Darmstadt, Germany</i> (e&#45;mail: <a href="mailto:Bettina.Harriehausen@h&#45;da.de">Bettina.Harriehausen@h&#45;da.de</a>). </font>	</p>
        <p align="justify">&nbsp;</p>
	    <p align="justify"><font face="verdana" size="2">Manuscript received on December 17, 2013    <br>
    Accepted for publication on February 6, 2014.</font></p>
	    ]]></body>
<body><![CDATA[<p align="justify">&nbsp;</p>
	    <p align="justify"><font face="verdana" size="2"><b>Abstract</b></font></p>
	    <p align="justify"><font face="verdana" size="2">Classification of text remains a challenge. Most machine learning based approaches require many manually annotated training instances for a reasonable accuracy. In this article we present an approach that minimizes the human annotation effort by interactively incorporating human annotators into the training process via active learning of an ensemble learner. By passing only ambiguous instances to the human annotators the effort is reduced while maintaining a very good accuracy. Since the feedback is only used to train an additional classifier and not for re&#45;training the whole ensemble, the computational complexity is kept relatively low.</font></p>

	    <p align="justify"><font face="verdana" size="2"><b>Key words: </b>Text classification, active learning, user feedback, ensemble learning.</font></p>
	    <p align="justify">&nbsp;</p>
	    <p align="justify"><font face="verdana" size="2"><a href="/pdf/poli/n49/n49a5.pdf" target="_blank">DESCARGAR ART&Iacute;CULO EN FORMATO PDF</a></font></p>
	    <p align="justify">&nbsp;</p>
    <p align="justify"><font face="verdana" size="2"><b>Acknowledgements</b></font></p>

	    <p align="justify"><font face="verdana" size="2">The work presented in this paper was partly funded by the German Federal Ministry of Education and Research (BMBF) under grant no. 01IS12054 and partially funded in the framework of Hessen Modell Projekte, financed with funds of LOEWE&#45;State Offensive for the Development of Scientific and Economic Excellence (HA project no. 292/11&#45;37). The responsibility for the contents of this publication lies with the authors. We thank kimeta GmbH for the essential help assisting with building the evaluation corpus.</font></p>
	    <p align="justify">&nbsp;</p>
	    ]]></body>
<body><![CDATA[<p align="justify"><font size="2" face="verdana"><b>References</b></font></p>
	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;1&#93; Netcraft, "November 2013 web server survey," <a href="http://news.netcraft.com/archives/2013/11/01/november&#45;2013&#45;web&#45;server&#45;survey.html" target="_blank">http://news.netcraft.com/archives/2013/11/01/november-2013-web-server-survey.html</a>, year 2013, &#91;Online; accessed 18&#45;November&#45;2013&#93;    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068621&pid=S1870-9044201400010000500001&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref -->.</font></p>

	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;2&#93; C. D. Manning, P. Raghavan, and H. Schutze, <i>Introduction to information retrieval.</i> Cambridge University Press Cambridge, 2008, vol. 1.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068623&pid=S1870-9044201400010000500002&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>

	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;3&#93; G. Salton and C. Buckley, "Term weighting approaches in automatic text retrieval," <i>Information Processing Management,</i> vol. 24, no. 5, pp. 513&#45;523, 1988.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068625&pid=S1870-9044201400010000500003&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>

	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;4&#93; T. Joachims, "A statistical learning learning model of text classification for support vector machines," in <i>Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval,</i> 2001, pp. 128&#45;136. &#91;Online&#93;. Available: <a href="http://dl.acm.org/citation.cfm?id=383974" target="_blank">http://dl.acm.org/citation.cfm?id=383974</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068627&pid=S1870-9044201400010000500004&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;5&#93; N. Tripathi, M. Oakes, and S. Wermter, "A fast subspace text categorization method using parallel classifiers," in <i>Computational Linguistics and Intelligent Text Processing.</i> Springer, 2012, pp. 132&#45;143. &#91;Online&#93;. Available: <a href="http://link.springer.com/chapter/10.1007/978&#45;3&#45;642&#45;28601&#45;8_12" target="_blank">http://link.springer.com/chapter/10.1007/978-3-642-28601-8_12</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068628&pid=S1870-9044201400010000500005&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;6&#93; F. Fukumoto, Y. Suzuki, and S. Matsuyoshi, "Text classification from positive and unlabeled data using misclassified data correction," in <i>Proceedings of the the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013),</i> 2013, pp. 474&#151;478.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068629&pid=S1870-9044201400010000500006&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>

	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;7&#93; I. H. Witten and E. Frank, <i>Data Mining: Practical machine learning </i><i>tools and techniques.</i> Morgan Kaufmann, 2011.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068631&pid=S1870-9044201400010000500007&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>
	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;8&#93; C. C. Aggarwal, <i>Mining text data.</i> Springer, 2012.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068633&pid=S1870-9044201400010000500008&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --> </font></p>
    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;9&#93; B. Settles, M. Craven, and L. Friedland, "Active learning with real annotation costs," in <i>Proceedings of the NIPS Workshop </i><i>on Cost&#45;Sensitive Learning,</i> 2008, pp. 1&#45;10. &#91;Online&#93;. Available: <a href="http://dl.acm.org/citation.cfm?id=1557119" target="_blank">http://dl.acm.org/citation.cfm?id=1557119</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068635&pid=S1870-9044201400010000500009&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;10&#93; Y. Fu, X. Zhu, and B. Li, "A survey on instance selection for active learning," <i>Knowledge and Information Systems,</i> vol. 35, no. 2, pp. 249&#45;283, May 2013. &#91;Online&#93;. Available: <a href="http://link.springer.com/article/10.1007/s10115&#45;012&#45;0507&#45;8" target="_blank">http://link.springer.com/article/10.1007/s10115-012-0507-8</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068636&pid=S1870-9044201400010000500010&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;11&#93; B. Yang, J.&#45;T. Sun, T. Wang, and Z. Chen, "Effective multi&#45;label active learning for text classification," in <i>Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,</i> ser. KDD '09. New York, NY, USA: ACM, 2009, pp. 917&#45;926. &#91;Online&#93;. Available: <a href="http://doi.acm.org/10.1145/1557019.1557119" target="_blank">http://doi.acm.org/10.1145/1557019.1557119</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068637&pid=S1870-9044201400010000500011&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;12&#93; B. Settles, "Active learning literature survey," <i>University of Wisconsin on Active Learning, Madison,</i> 2010.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068638&pid=S1870-9044201400010000500012&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>

	    ]]></body>
<body><![CDATA[<!-- ref --><p align="justify"><font face="verdana" size="2">&#91;13&#93; J. Zhu and M. Ma, "Uncertainty&#45;based active learning with instability estimation for text classification," <i>ACM Trans. Speech Lang. Process.,</i> vol. 8, no. 4, pp. 5:1&#45;5:21, Feb. 2012. &#91;Online&#93;. Available: <a href="http://doi.acm.org/10.1145/2093153.2093154" target="_blank">http://doi.acm.org/10.1145/2093153.2093154</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068640&pid=S1870-9044201400010000500013&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><!-- ref --><p align="justify"><font face="verdana" size="2">&#91;14&#93; X. Li and C. G. Snoek, "Classifying tag relevance with relevant positive and negative examples," in <i>Proceedings of the 21st ACM International Conference on Multimedia,</i> ser. MM '13. New York, NY, USA: ACM, 2013, pp. 485&#45;488. &#91;Online&#93;. Available: <a href="http://doi.acm.org/10.1145/2502081.2502129" target="_blank">http://doi.acm.org/10.1145/2502081.2502129</a><a href="http://doi.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068641&pid=S1870-9044201400010000500014&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref -->acm.org/10.1145/2502081.2502129"></a></font></p>

	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;15&#93; S. Schnitzer, "Effective classification of ambiguous web documents incorporating human feedback efficiently," Master's thesis, University of Applied Sciences Darmstadt, Faculty of Computer Science, Darmstadt, Germany, 2013.    &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068643&pid=S1870-9044201400010000500015&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --></font></p>

	    <!-- ref --><p align="justify"><font face="verdana" size="2">&#91;16&#93; J. Platt, "Fast training of support vector machines using sequential minimal optimization," in <i>Advances in Kernel Methods &#45; Support Vector Learning,</i> B. Schoelkopf, C. Burges, and A. Smola, Eds. MIT Press, 1998. &#91;Online&#93;. Available: <a href="http://dl.acm.org/citation.cfm?id=299105" target="_blank">http://dl.acm.org/citation.cfm?id=299105</a></font>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;[&#160;<a href="javascript:void(0);" onclick="javascript: window.open('/scielo.php?script=sci_nlinks&ref=6068645&pid=S1870-9044201400010000500016&lng=','','width=640,height=500,resizable=yes,scrollbars=1,menubar=yes,');">Links</a>&#160;]<!-- end-ref --><p align="justify">&nbsp;</p>
	    <p align="justify"><font face="verdana" size="2"><b>Note</b></font></p>
	    <p align="justify"><font face="verdana" size="2">The first two authors contributed equally to this work.     <br>
    <sup>1</sup> <a href="http://www.google.com" target="_blank">http://www.google.com</a><sup>    ]]></body>
<body><![CDATA[<br>
2 </sup><a href="http://www.bing.com" target="_blank">http://www.bing.com</a></font></p>
     ]]></body><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="">
<collab>Netcraft</collab>
<source><![CDATA[November 2013 web server survey]]></source>
<year>2013</year>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
<name>
<surname><![CDATA[Raghavan]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Schutze]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to information retrieval]]></source>
<year>2008</year>
<volume>1</volume>
<publisher-name><![CDATA[Cambridge University Press Cambridge]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Salton]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Buckley]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Term weighting approaches in automatic text retrieval]]></article-title>
<source><![CDATA[Information Processing Management]]></source>
<year>1988</year>
<volume>24</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>513-523</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Joachims]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A statistical learning learning model of text classification for support vector machines]]></article-title>
<source><![CDATA[Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval]]></source>
<year>2001</year>
<page-range>128-136</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tripathi]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Oakes]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Wermter]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A fast subspace text categorization method using parallel classifiers]]></article-title>
<source><![CDATA[Computational Linguistics and Intelligent Text Processing]]></source>
<year>2012</year>
<page-range>132-143</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fukumoto]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Suzuki]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Matsuyoshi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Text classification from positive and unlabeled data using misclassified data correction]]></article-title>
<source><![CDATA[Proceedings of the the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013)]]></source>
<year>2013</year>
<page-range>474-478</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Witten]]></surname>
<given-names><![CDATA[I. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Frank]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data Mining: Practical machine learning tools and techniques]]></source>
<year>2011</year>
<publisher-name><![CDATA[Morgan Kaufmann]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Aggarwal]]></surname>
<given-names><![CDATA[C. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mining text data]]></source>
<year>2012</year>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Settles]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Craven]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Friedland]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Active learning with real annotation costs]]></article-title>
<source><![CDATA[Proceedings of the NIPS Workshop on Cost-Sensitive Learning]]></source>
<year>2008</year>
<page-range>1-10</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Fu]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[A survey on instance selection for active learning]]></article-title>
<source><![CDATA[Knowledge and Information Systems]]></source>
<year>May </year>
<month>20</month>
<day>13</day>
<volume>35</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>249-283</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Sun]]></surname>
<given-names><![CDATA[J.-T.]]></given-names>
</name>
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Effective multi-label active learning for text classification]]></article-title>
<source><![CDATA[Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining]]></source>
<year>2009</year>
<page-range>917-926</page-range><publisher-loc><![CDATA[New York^eNY NY]]></publisher-loc>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Settles]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Active learning literature survey]]></article-title>
<source><![CDATA[University of Wisconsin on Active Learning, Madison]]></source>
<year>2010</year>
</nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Ma]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Uncertainty-based active learning with instability estimation for text classification]]></article-title>
<source><![CDATA[ACM Trans. Speech Lang. Process.]]></source>
<year>Feb.</year>
<month> 2</month>
<day>01</day>
<volume>8</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>5</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Snoek]]></surname>
<given-names><![CDATA[C. G.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Classifying tag relevance with relevant positive and negative examples]]></article-title>
<source><![CDATA[Proceedings of the 21st ACM International Conference on Multimedia]]></source>
<year>2013</year>
<page-range>485-488</page-range><publisher-loc><![CDATA[New York^eNY NY]]></publisher-loc>
<publisher-name><![CDATA[ACM]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schnitzer]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Effective classification of ambiguous web documents incorporating human feedback efficiently]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Platt]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang="en"><![CDATA[Fast training of support vector machines using sequential minimal optimization]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Schoelkopf]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Burges]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Smola]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Advances in Kernel Methods - Support Vector Learning]]></source>
<year>1998</year>
<publisher-name><![CDATA[MIT Press]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
