<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462016000300505</article-id>
<article-id pub-id-type="doi">10.13053/cys-20-3-2464</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[A Framework that Uses the Web for Named Entity Class Identification: Case Study for Indian Classical Music Forums]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ross]]></surname>
<given-names><![CDATA[Joe Cheri]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Joshi]]></surname>
<given-names><![CDATA[Aditya]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Bhattacharyya]]></surname>
<given-names><![CDATA[Pushpak]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Indian Institute of Technology Bombay, Dept. of Computer Science & Engg.]]></institution>
<addr-line><![CDATA[Mumbai]]></addr-line>
<country>India</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>09</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>09</month>
<year>2016</year>
</pub-date>
<volume>20</volume>
<numero>3</numero>
<fpage>505</fpage>
<lpage>513</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462016000300505&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462016000300505&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462016000300505&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Identification of the named entity (NE) class (semantic class) is crucial for NLP problems like coreference resolution, where semantic compatibility between entity mentions is imperative to the coreference decision. Short and noisy text containing the entity makes it challenging to extract the NE class of the entity from the context. We introduce a framework that uses the web to identify the named entity class of a given entity when the entity boundaries are known. The proposed framework will be beneficial for specialized domains where data and class label challenges exist. We demonstrate the benefit of our framework through a case study of Indian classical music forums. Apart from person and location, which are included in standard semantic classes, we also consider raga, song, instrument and music concept. Our baseline approach follows a heuristic-based method that makes use of Freebase, a structured web repository. The search-engine-based approaches acquire context from the web for an entity and perform named entity class identification. These approaches improve on the baseline performance, and the results improve further when hierarchical classification is introduced. In summary, our framework is a first-of-its-kind validation of the viability of the web for NE class identification.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Named Entity Recognition]]></kwd>
<kwd lng="en"><![CDATA[Named Entity Class Identification]]></kwd>
<kwd lng="en"><![CDATA[Music Data]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bhagyalekshmy]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Ragas in Carnatic music]]></source>
<year>1990</year>
<publisher-name><![CDATA[South Asia Books]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Latent Dirichlet allocation]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Blei]]></surname>
<given-names><![CDATA[D. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ng]]></surname>
<given-names><![CDATA[A. Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Jordan]]></surname>
<given-names><![CDATA[M. I.]]></given-names>
</name>
</person-group>
<source><![CDATA[Journal of Machine Learning Research]]></source>
<year>2003</year>
<volume>3</volume>
<page-range>993-1022</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bollacker]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Evans]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Paritosh]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Sturge]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Taylor]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Freebase: a collaboratively created graph database for structuring human knowledge]]></source>
<year>2008</year>
<conf-name><![CDATA[ ACM SIGMOD international conference on Management of data]]></conf-name>
<conf-date>2008</conf-date>
<conf-loc> </conf-loc>
<page-range>1247-50</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Unsupervised named-entity extraction from the web: An experimental study]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Etzioni]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Cafarella]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Downey]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Popescu]]></surname>
<given-names><![CDATA[A.-M.]]></given-names>
</name>
<name>
<surname><![CDATA[Shaked]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Soderland]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Weld]]></surname>
<given-names><![CDATA[D. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Yates]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Artificial Intelligence]]></source>
<year>2005</year>
<volume>165</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>91-134</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Isozaki]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Kazawa]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Efficient support vector classifiers for named entity recognition]]></source>
<year>2002</year>
<conf-name><![CDATA[ 19th international conference on Computational linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-7</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Karaa]]></surname>
<given-names><![CDATA[W. B. A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Named entity recognition using web document corpus]]></source>
<year>2011</year>
</nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kazama]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Makino]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Ohta]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Tsujii]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Tuning support vector machines for biomedical named entity recognition]]></source>
<year>2002</year>
<conf-name><![CDATA[ ACL-02 workshop on Natural language processing in the biomedical domain]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-8</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[McCallum]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<source><![CDATA[Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons]]></source>
<year>2003</year>
<conf-name><![CDATA[ seventh conference on Natural language learning at HLT-NAACL 2003]]></conf-name>
<conf-loc> </conf-loc>
<page-range>188-91</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nadeau]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Turney]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Matwin]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Unsupervised named-entity recognition: Generating gazetteers and resolving ambiguity]]></source>
<year>2006</year>
</nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="">
<collab>rasikas</collab>
<source><![CDATA[Rasikas.org]]></source>
<year>2005</year>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ratinov]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Roth]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Design challenges and misconceptions in named entity recognition]]></source>
<year>2009</year>
<conf-name><![CDATA[ Thirteenth Conference on Computational Natural Language Learning]]></conf-name>
<conf-loc> </conf-loc>
<page-range>147-55</page-range></nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sekine]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Eriguchi]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Japanese named entity extraction evaluation: analysis of results]]></source>
<year>2000</year>
<conf-name><![CDATA[ 18th conference on Computational linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1106-10</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="">
<collab>statisticbrain</collab>
<source><![CDATA[statisticbrain.com]]></source>
<year>2016</year>
</nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Whitelaw]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Kehlenbeck]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Petrovic]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Ungar]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Web-scale named entity recognition]]></source>
<year>2008</year>
<conf-name><![CDATA[ 17th ACM conference on Information and knowledge management]]></conf-name>
<conf-loc> </conf-loc>
<page-range>123-32</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Winkler]]></surname>
<given-names><![CDATA[W. E.]]></given-names>
</name>
</person-group>
<source><![CDATA[The state of record linkage and current research problems]]></source>
<year>1999</year>
<publisher-name><![CDATA[Statistical Research Division, US Census Bureau]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhou]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Su]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Named entity recognition using an HMM-based chunk tagger]]></source>
<year>2002</year>
<conf-name><![CDATA[ 40th Annual Meeting on Association for Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>473-80</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
