<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462018000401329</article-id>
<article-id pub-id-type="doi">10.13053/cys-22-4-3065</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Construction of Paraphrase Graphs as a Means of News Clusters Extraction]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Yagunova]]></surname>
<given-names><![CDATA[Elena]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Pronoza]]></surname>
<given-names><![CDATA[Ekaterina]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Kochetkova]]></surname>
<given-names><![CDATA[Nataliya]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,St.-Petersburg State University Department of Informational Systems in Arts and Humanities ]]></institution>
<addr-line><![CDATA[St.-Petersburg ]]></addr-line>
<country>Russian Federation</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,National Research University Higher School of Economics School of Computer Engineering ]]></institution>
<addr-line><![CDATA[St.-Petersburg ]]></addr-line>
<country>Russian Federation</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<volume>22</volume>
<numero>4</numero>
<fpage>1329</fpage>
<lpage>1336</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462018000401329&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462018000401329&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462018000401329&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: In this paper, we construct paraphrase graphs for news text collections (clusters). Our aims are, first, to prove that paraphrase graph construction method can be used for news clusters identification and, second, to analyze and compare stylistically different news collections. Our news collections include dynamic, static and combined (dynamic and static) texts. Their respective paraphrase graphs reflect their main characteristics. We also automatically extract the most informationally important linked fragments of news texts, and these fragments characterize news texts as either informative, conveying some information, or publicistic ones, trying to affect the readers emotionally.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[News cluster]]></kwd>
<kwd lng="en"><![CDATA[paraphrase graph]]></kwd>
<kwd lng="en"><![CDATA[paraphrase extraction]]></kwd>
<kwd lng="en"><![CDATA[linked text segments]]></kwd>
<kwd lng="en"><![CDATA[text analysis]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Azzopardi]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Staff]]></surname>
<given-names><![CDATA[Ch.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Incremental Clustering of News Reports]]></article-title>
<source><![CDATA[Algorithms]]></source>
<year>2012</year>
<volume>5</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>364-78</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bora]]></surname>
<given-names><![CDATA[N. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Mishra]]></surname>
<given-names><![CDATA[B. S. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Dehuri]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Heuristic Frequent Term-Based Clustering of News Headlines]]></article-title>
<source><![CDATA[Procedia Technology]]></source>
<year>2012</year>
<volume>6</volume>
<page-range>436-43</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Daudaravi&#269;ius]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
<name>
<surname><![CDATA[Marcinkevi&#269;ien&#279;]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Gravity Counts for the Boundaries of Collocations]]></article-title>
<source><![CDATA[International Journal of Corpus Linguistics]]></source>
<year>2004</year>
<volume>9</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>321-48</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pronoza]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Yagunova]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Kochetkova]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Sentence Paraphrase Graphs: Classification Based on Predictive Models or Annotators&#8217; Decisions?]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Herrera-Alcántara]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
</person-group>
<source><![CDATA[Advances in Computational Intelligence]]></source>
<year>2016</year>
<numero>10061</numero>
<issue>10061</issue>
<page-range>41-52</page-range><publisher-name><![CDATA[Lecture Notes in Computer Science]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pronoza]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Yagunova]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Comparison of sentence similarity measures for Russian paraphrase identification]]></source>
<year>2015</year>
<conf-name><![CDATA[ Artificial Intelligence and Natural Language and Information Extraction, Social Media and Web Search FRUCT Conference (AINL-ISMW FRUCT)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>74-82</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pronoza]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Yagunova]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Pronoza]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Construction of a Russian Paraphrase Corpus: Unsupervised Paraphrase Extraction]]></article-title>
<source><![CDATA[Communications in Computer and Information Science]]></source>
<year>2015</year>
<page-range>146-57</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Thirunarayan]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Immaneni]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Shaik]]></surname>
<given-names><![CDATA[M. V.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Selecting Labels for News Document Clusters]]></article-title>
<source><![CDATA[Lecture Notes in Computer Science]]></source>
<year>2007</year>
<page-range>119-30</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yagunova]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Pivovarova]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The nature of collocations in the Russian language. The experience of automatic extraction and classification of the material of news texts]]></article-title>
<source><![CDATA[Automatic Documentation and Mathematical Linguistics]]></source>
<year>2010</year>
<volume>44</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>164-75</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Viveros-Jiménez]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Sánchez-Perez]]></surname>
<given-names><![CDATA[M.A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gómez-Adorno]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Posadas-Durán]]></surname>
<given-names><![CDATA[J.P.]]></given-names>
</name>
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Improving the Boilerpipe Algorithm for Boilerplate Removal in News Articles Using HTML Tree Structure]]></article-title>
<source><![CDATA[Computación y Sistemas]]></source>
<year>2018</year>
<volume>22</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>483-9</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yagunova]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Pivovarova]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Volskaya]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[News Text Segmentation in Human Perception]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Sharp]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Delmonte]]></surname>
<given-names><![CDATA[R. de Gruyter]]></given-names>
</name>
</person-group>
<source><![CDATA[Proceedings of Natural Language Processing and Cognitive Science]]></source>
<year>2015</year>
<page-range>63-74</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Antonov]]></surname>
<given-names><![CDATA[A. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Yagunova]]></surname>
<given-names><![CDATA[E. V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Procedure of working with text information collections via information portraits analysis]]></source>
<year>2010</year>
<conf-name><![CDATA[ RCDL'10]]></conf-name>
<conf-loc> </conf-loc>
<page-range>79-84</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
