<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462023000100127</article-id>
<article-id pub-id-type="doi">10.13053/cys-27-1-4528</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Lexical Complexity Evaluation based on Context for Russian Language]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Abramov]]></surname>
<given-names><![CDATA[Aleksei V.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ivanov]]></surname>
<given-names><![CDATA[Vladimir V.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Solovyev]]></surname>
<given-names><![CDATA[Valery D.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Kazan Federal University Instiute of Computational Mathematics and Information Technologies ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Russian Federation</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,Innopolis University Institute of Software Development and Software Engineering ]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Russian Federation</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>03</month>
<year>2023</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>03</month>
<year>2023</year>
</pub-date>
<volume>27</volume>
<numero>1</numero>
<fpage>127</fpage>
<lpage>139</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462023000100127&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462023000100127&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462023000100127&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: The task of identifying complex words within a context usually referred to as Complex Word Identification (CWI) or Lexical Complexity Prediction (LCP), is a vital component in Lexical Simplification pipelines. Correctness of complexity estimation depends on presented features, i.e. hand-crafted features, word embeddings, and presence of surrounding context, as well as on exploited rules or models, i.e. manually designed filtering, classic machine learning models, recurrent neural networks, and Transformer-based models. To our knowledge, the majority of existing works in CWI and LCP areas are devoted to investigating properties of English words and texts, accompanied by studies of German, Spanish, French and Hindu languages with little to no attention to Russian. In this paper, we present a study on lexical complexity estimation for the Russian language, by investigating the following topics: how well do morphological, semantic, and syntactic properties of a word represent its complexity; does a surrounding context significantly affect the accuracy of complexity estimation. We provide a brief description of the dataset of lexical complexity in context based on the Russian Synodal Bible and expand it by presenting a dataset of morphological, semantic, and syntactic features for annotated words. Additionally, we present linear regression and RuBERT models as baselines for lexical complexity estimation respectively.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Lexical complexity]]></kwd>
<kwd lng="en"><![CDATA[Russian language]]></kwd>
<kwd lng="en"><![CDATA[Bible]]></kwd>
<kwd lng="en"><![CDATA[corpus]]></kwd>
<kwd lng="en"><![CDATA[Wiktionary]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Abramov]]></surname>
<given-names><![CDATA[A. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Ivanov]]></surname>
<given-names><![CDATA[V. V.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Collection and evaluation of lexical complexity data for Russian language using crowdsourcing]]></article-title>
<source><![CDATA[Russian Journal of Linguistics]]></source>
<year>2022</year>
<volume>26</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>409-25</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kuratov]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Arkhipov]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Adaptation of deep bidirectional multilingual transformers for Russian language]]></source>
<year>2019</year>
</nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Dale]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The Dale-Chall formula for predicting readability]]></article-title>
<source><![CDATA[Educational Research Bulletin]]></source>
<year>1948</year>
<volume>27</volume>
<page-range>11-20</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chall]]></surname>
<given-names><![CDATA[J. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Dale]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Readability revisited: The new Dale-Chall readability formula]]></source>
<year>1995</year>
<publisher-name><![CDATA[Brookli Books]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Devlin]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[The use of a psycholinguistic database in the simplification of text for aphasic readers]]></source>
<year>1998</year>
</nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carroll]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Minnen]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Canning]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Devlin]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Tait]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Practical simplification of English newspaper text to assist aphasic readers]]></source>
<year>1998</year>
<conf-name><![CDATA[ AAAI-98 Workshop on Integrating Artificial Intelligence and Assistive Technology]]></conf-name>
<conf-loc> </conf-loc>
<page-range>7-10</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Specia]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Jauhar]]></surname>
<given-names><![CDATA[S. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Mihalcea]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[SemEval-2012 task 1: English lexical simplification]]></source>
<year>2012</year>
<conf-name><![CDATA[ First Joint Conference on Lexical and Computational Semantics (SEM)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>347-55</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Amoia]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Romanelli]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Sb: mmsystem-using decompositional semantics for lexical simplification]]></source>
<year>2012</year>
<conf-name><![CDATA[ First Joint Conference on Lexical and Computational Semantics (SEM)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>482-6</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ligozat]]></surname>
<given-names><![CDATA[A. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Grouin]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[García-Fernández]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Bernhard]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Annlor: A naïve notation-system for lexical outputs ranking]]></source>
<year>2012</year>
<conf-name><![CDATA[ First Joint Conference on Lexical and Computational Semantics (SEM)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>487-92</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sinha]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Unt-simprank: Systems for lexical simplification ranking]]></source>
<year>2012</year>
<conf-name><![CDATA[ First Joint Conference on Lexical and Computational Semantics (SEM)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>493-6</page-range></nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jauhar]]></surname>
<given-names><![CDATA[S. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Specia]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Uow-shef: Simplex&#8211;lexical simplicity ranking based on contextual and psycholinguistic features]]></source>
<year>2012</year>
<conf-name><![CDATA[ First Joint Conference on Lexical and Computational Semantics (SEM)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>477-81</page-range></nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gooding]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Kochmar]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Blackwell]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Sarkar]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Comparative judgments are more consistent than binary classification for labelling word complexity]]></source>
<year>2019</year>
<publisher-name><![CDATA[Association for Computational Linguistics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Paetzold]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Specia]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Lexical simplification with neural ranking]]></source>
<year>2017</year>
<volume>2</volume>
<conf-name><![CDATA[ 15th Conference of the European Chapter of the Association for Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>34-40</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Xu]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Callison-Burch]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Napoles]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Problems in current text simplification research: New data can help]]></article-title>
<source><![CDATA[Transactions of the Association for Computational Linguistics]]></source>
<year>2015</year>
<volume>3</volume>
<page-range>283-97</page-range></nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Paetzold]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Specia]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Semeval 2016 task 11: Complex word identification]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation (SemEval´16)]]></conf-name>
<conf-loc> </conf-loc>
<page-range>560-9</page-range></nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shardlow]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[A comparison of techniques to automatically identify complex words]]></source>
<year>2013</year>
<conf-name><![CDATA[ 51st annual meeting of the association for computational linguistics proceedings of the student research workshop]]></conf-name>
<conf-loc> </conf-loc>
<page-range>103-9</page-range></nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shardlow]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[The cw corpus: A new resource for evaluating the identification of complex words]]></source>
<year>2013</year>
<conf-name><![CDATA[ Second Workshop on Predicting and Improving Text Readability for Target Reader Populations]]></conf-name>
<conf-loc> </conf-loc>
<page-range>69-77</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Choubey]]></surname>
<given-names><![CDATA[P. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Pateria]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Garuda &amp; Bhasha at SemEval-2016 task 11: Complex word identification using aggregated learning models]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1006-10</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zampieri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Tan]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[van Genabith]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Macsaar at SemEval-2016 task 11: Zipfian and character features for complex word identification]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1001-5</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kuru]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
</person-group>
<source><![CDATA[Ai-ku at semeval-2016 task 11: Word embeddings and substring features for complex word identification]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1042-6</page-range></nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Quijada]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Medero]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Hmc at SemEval-2016 task 11: Identifying complex words using depth-limited decision trees]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1034-7</page-range></nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Malmasi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Dras]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Zampieri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Ltg at SemEval-2016 task 11: Complex word identification with classifier ensembles]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>996-1000</page-range></nlm-citation>
</ref>
<ref id="B23">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Malmasi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Zampieri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Maza at SemEval-2016 task 11: Detecting lexical complexity using a decision stump meta-classifier]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>991-5</page-range></nlm-citation>
</ref>
<ref id="B24">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Brooke]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Uitdenbogerd]]></surname>
<given-names><![CDATA[A. L.]]></given-names>
</name>
<name>
<surname><![CDATA[Baldwin]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Melbourne at SemEval 2016 task 11: Classifying type-level word complexity using random forests with corpus and word list features]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>975-81</page-range></nlm-citation>
</ref>
<ref id="B25">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nat]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Sensible at SemEval-2016 task 11: Neural nonsense mangled in ensemble mess]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>963-8</page-range></nlm-citation>
</ref>
<ref id="B26">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Paetzold]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Specia]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Sv000gg at Semeval-2016 task 11: Heavy gauge complex word identification with system voting]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>969-74</page-range></nlm-citation>
</ref>
<ref id="B27">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ronzano]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Anke]]></surname>
<given-names><![CDATA[L. E.]]></given-names>
</name>
<name>
<surname><![CDATA[Saggion]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Taln at Semeval-2016 task 11: Modelling complex words by contextual, lexical and semantic features]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1011-6</page-range></nlm-citation>
</ref>
<ref id="B28">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yimam]]></surname>
<given-names><![CDATA[S. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Biemann]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Malmasi]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Paetzold]]></surname>
<given-names><![CDATA[G. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Specia]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[&#352;tajner]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Zampieri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[A report on the complex word identification shared task 2018]]></source>
<year>2018</year>
</nlm-citation>
</ref>
<ref id="B29">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Butnaru]]></surname>
<given-names><![CDATA[A. M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ionescu]]></surname>
<given-names><![CDATA[R. T.]]></given-names>
</name>
</person-group>
<source><![CDATA[UnibucKernel: A kernel-based learning method for complex word identification]]></source>
<year>2018</year>
</nlm-citation>
</ref>
<ref id="B30">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[AbuRa'ed]]></surname>
<given-names><![CDATA[A. G. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Saggion]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[LaSTUS/TALN at complex word identification (CWI) 2018 shared task]]></source>
<year>2018</year>
<conf-name><![CDATA[ Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications]]></conf-name>
<conf-loc> </conf-loc>
<page-range>159-65</page-range></nlm-citation>
</ref>
<ref id="B31">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Alfter]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Pilán]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<source><![CDATA[SB@ GU at the complex word identification 2018 shared task]]></source>
<year>2018</year>
<conf-name><![CDATA[ Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications]]></conf-name>
<conf-loc> </conf-loc>
<page-range>315-21</page-range></nlm-citation>
</ref>
<ref id="B32">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hartmann]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Dos Santos]]></surname>
<given-names><![CDATA[L. B.]]></given-names>
</name>
</person-group>
<source><![CDATA[NILC at CWI 2018: Exploring feature engineering and feature learning]]></source>
<year>2018</year>
<conf-name><![CDATA[ Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications]]></conf-name>
<conf-loc> </conf-loc>
<page-range>335-40</page-range></nlm-citation>
</ref>
<ref id="B33">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Kajiwara]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Komachi]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Complex word identification based on frequency in a learner corpus]]></source>
<year>2018</year>
<conf-name><![CDATA[ Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications]]></conf-name>
<conf-loc> </conf-loc>
<page-range>195-9</page-range></nlm-citation>
</ref>
<ref id="B34">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bingel]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Schluter]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Alonso]]></surname>
<given-names><![CDATA[H. M.]]></given-names>
</name>
</person-group>
<source><![CDATA[CoastalCPH at SemEval-2016 Task 11: The importance of designing your neural networks right]]></source>
<year>2016</year>
<conf-name><![CDATA[ 10th International Workshop on Semantic Evaluation, SemEval´16]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1028-33</page-range></nlm-citation>
</ref>
<ref id="B35">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Aroyehun]]></surname>
<given-names><![CDATA[S. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Ángel]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Álvarez]]></surname>
<given-names><![CDATA[D. A. P.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Complex word identification: convolutional neural network vs. feature engineering]]></source>
<year>2018</year>
<conf-name><![CDATA[ Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications]]></conf-name>
<conf-loc> </conf-loc>
<page-range>322-7</page-range></nlm-citation>
</ref>
<ref id="B36">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Maddela]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Xu]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<source><![CDATA[A word-complexity lexicon and a neural readability ranking model for lexical simplification]]></source>
<year>2018</year>
</nlm-citation>
</ref>
<ref id="B37">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shardlow]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Evans]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Paetzold]]></surname>
<given-names><![CDATA[G. H.]]></given-names>
</name>
<name>
<surname><![CDATA[Zampieri]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Semeval-2021 task 1: Lexical complexity prediction]]></source>
<year>2021</year>
</nlm-citation>
</ref>
<ref id="B38">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Christodouloupoulos]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Steedman]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A massively parallel corpus: the bible in 100 languages]]></article-title>
<source><![CDATA[Language resources and evaluation]]></source>
<year>2015</year>
<volume>49</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>375-95</page-range></nlm-citation>
</ref>
<ref id="B39">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bada]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Eckert]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Evans]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[García]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Shipley]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Sitnikov]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Hunter]]></surname>
<given-names><![CDATA[L. E.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Concept annotation in the CRAFT corpus]]></article-title>
<source><![CDATA[BMC bioinformatics]]></source>
<year>2012</year>
<volume>13</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>1-20</page-range></nlm-citation>
</ref>
<ref id="B40">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Koehn]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Europarl: A parallel corpus for statistical machine translation]]></source>
<year>2005</year>
<conf-name><![CDATA[ machine translation summit x]]></conf-name>
<conf-loc> </conf-loc>
<page-range>79-86</page-range></nlm-citation>
</ref>
<ref id="B41">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Likert]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A technique for the measurement of attitudes]]></article-title>
<source><![CDATA[Archives of psychology]]></source>
<year>1932</year>
</nlm-citation>
</ref>
<ref id="B42">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pan]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Song]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Luo]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
</person-group>
<source><![CDATA[DeepBlueAI at SemEval-2021 Task 1: Lexical complexity prediction with a deep ensemble approach]]></source>
<year>2021</year>
<conf-name><![CDATA[ 15th International Workshop on Semantic Evaluation, SemEval´21]]></conf-name>
<conf-loc> </conf-loc>
<page-range>578-84</page-range></nlm-citation>
</ref>
<ref id="B43">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yaseen]]></surname>
<given-names><![CDATA[T. B.]]></given-names>
</name>
<name>
<surname><![CDATA[Ismail]]></surname>
<given-names><![CDATA[Q.]]></given-names>
</name>
<name>
<surname><![CDATA[Al-Omari]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Al-Sobh]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Abdullah]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[JUST-BLUE at SemEval-2021 Task 1: Predicting lexical complexity using BERT and RoBERTa Pre-Trained language models]]></source>
<year>2021</year>
<conf-name><![CDATA[ 15th International Workshop on Semantic Evaluation, SemEval´21]]></conf-name>
<conf-loc> </conf-loc>
<page-range>661-6</page-range></nlm-citation>
</ref>
<ref id="B44">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rao]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Hou]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Jiang]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Mo]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Shen]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[RG PA at SemEval-2021 Task 1: A contextual attention-based model with RoBERTa for lexical complexity prediction]]></source>
<year>2021</year>
<conf-name><![CDATA[ 15th International Workshop on Semantic Evaluation, SemEval´21]]></conf-name>
<conf-loc> </conf-loc>
<page-range>623-6</page-range></nlm-citation>
</ref>
<ref id="B45">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rotaru]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[ANDI at SemEval-2021 Task 1: Predicting complexity in context using distributional models, behavioural norms, and lexical resources]]></source>
<year>2021</year>
<conf-name><![CDATA[ 15th International Workshop on Semantic Evaluation, SemEval´21]]></conf-name>
<conf-loc> </conf-loc>
<page-range>655-60</page-range></nlm-citation>
</ref>
<ref id="B46">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Taya]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Pereira]]></surname>
<given-names><![CDATA[L. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Cheng]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Kobayashi]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<source><![CDATA[OCHADAI-KYOTO at SemEval-2021 Task 1: Enhancing model generalization and robustness for lexical complexity prediction]]></source>
<year>2021</year>
</nlm-citation>
</ref>
<ref id="B47">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mosquera]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Alejandro Mosquera at SemEval-2021 Task 1: Exploring sentence and word features for lexical complexity prediction]]></source>
<year>2021</year>
<conf-name><![CDATA[ 15th International Workshop on Semantic Evaluation, SemEval´21]]></conf-name>
<conf-loc> </conf-loc>
<page-range>554-9</page-range></nlm-citation>
</ref>
<ref id="B48">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Billami]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[François]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Gala]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[ReSyf: A French lexicon with ranked synonyms]]></source>
<year>2018</year>
<conf-name><![CDATA[ 27th International Conference on Computational Linguistics, COLING´18]]></conf-name>
<conf-loc> </conf-loc>
<page-range>2570-81</page-range></nlm-citation>
</ref>
<ref id="B49">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ortiz-Zambranoa]]></surname>
<given-names><![CDATA[J. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Montejo-Ráezb]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Overview of alexs 2020: First workshop on lexical analysis at sepln]]></source>
<year>2020</year>
<conf-name><![CDATA[ Iberian Languages Evaluation Forum, IberLEF´20]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B50">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[J. S.]]></given-names>
</name>
<name>
<surname><![CDATA[Yeung]]></surname>
<given-names><![CDATA[C. Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Personalizing lexical simplification]]></source>
<year>2018</year>
<conf-name><![CDATA[ 27th International Conference on Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>224-32</page-range></nlm-citation>
</ref>
<ref id="B51">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Nishihara]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Kajiwara]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<source><![CDATA[Word Complexity Estimation for Japanese Lexical Simplification]]></source>
<year>2020</year>
<conf-name><![CDATA[ 12th Language resources and evaluation Conference]]></conf-name>
<conf-loc> </conf-loc>
<page-range>3114-20</page-range></nlm-citation>
</ref>
<ref id="B52">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Maekawa]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Yamazaki]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Maruyama]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Yamaguchi]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ogura]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Kashino]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Den]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[Design, compilation, and preliminary analyses of balanced corpus of contemporary written Japanese]]></source>
<year>2010</year>
<conf-name><![CDATA[ Seventh International Conference on Language resources and evaluation, LREC'10]]></conf-name>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B53">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Smolenska]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Complex Word Identification for Swedish]]></source>
<year>2018</year>
</nlm-citation>
</ref>
<ref id="B54">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Venugopal]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Pramod]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Shekhar]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[CWID-hi: A dataset for complex word identification in Hindi text]]></source>
<year>2022</year>
<conf-name><![CDATA[ Thirteenth Language resources and evaluation Conference]]></conf-name>
<conf-loc> </conf-loc>
<page-range>5627-36</page-range></nlm-citation>
</ref>
<ref id="B55">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vaswani]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Shazeer]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Parmar]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Uszkoreit]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Jones]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Gómez]]></surname>
<given-names><![CDATA[A. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Polosukhin]]></surname>
<given-names><![CDATA[I.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Attention is all you need]]></article-title>
<source><![CDATA[Advances in neural information processing systems]]></source>
<year>2017</year>
</nlm-citation>
</ref>
<ref id="B56">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Devlin]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Chang]]></surname>
<given-names><![CDATA[M. W.]]></given-names>
</name>
<name>
<surname><![CDATA[Lee]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Toutanova]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Bert: Pre-training of deep bidirectional transformers for language understanding]]></source>
<year>2018</year>
</nlm-citation>
</ref>
<ref id="B57">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Ott]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Goyal]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Du]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Joshi]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Stoyanov]]></surname>
<given-names><![CDATA[V.]]></given-names>
</name>
</person-group>
<source><![CDATA[Roberta: A robustly optimized bert pretraining approach]]></source>
<year>2019</year>
</nlm-citation>
</ref>
<ref id="B58">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lan]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Goodman]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Gimpel]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Sharma]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Soricut]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Albert: A lite bert for self-supervised learning of language representations]]></source>
<year>2019</year>
</nlm-citation>
</ref>
<ref id="B59">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sun]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Wang]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Feng]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Wu]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Ernie: Enhanced representation through knowledge integration]]></source>
<year>2019</year>
</nlm-citation>
</ref>
<ref id="B60">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Loukachevitch]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Lashevich]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Multiword expressions in Russian thesauri RuThes and RuWordnet]]></source>
<year>2016</year>
<conf-name><![CDATA[ of IEEE Artificial Intelligence and Natural Language Conference, AINL]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1-6</page-range></nlm-citation>
</ref>
<ref id="B61">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mikolov]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrado]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Dean]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Efficient estimation of word representations in vector space]]></source>
<year>2013</year>
</nlm-citation>
</ref>
<ref id="B62">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pennington]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Socher]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Glove: Global vectors for word representation]]></source>
<year>2014</year>
<conf-name><![CDATA[ Conference on Empirical Methods in Natural Language Processing EMNLP]]></conf-name>
<conf-loc> </conf-loc>
<page-range>1532-43</page-range></nlm-citation>
</ref>
<ref id="B63">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bojanowski]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Grave]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Joulin]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Mikolov]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Enriching word vectors with subword information]]></article-title>
<source><![CDATA[Transactions of the Association for Computational Linguistics]]></source>
<year>2017</year>
<volume>5</volume>
<page-range>135-46</page-range></nlm-citation>
</ref>
<ref id="B64">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[He]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Gao]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
</person-group>
<source><![CDATA[Deberta: Decoding-enhanced bert with disentangled attention]]></source>
<year>2020</year>
</nlm-citation>
</ref>
<ref id="B65">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Clark]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Luong]]></surname>
<given-names><![CDATA[M. T.]]></given-names>
</name>
<name>
<surname><![CDATA[Le]]></surname>
<given-names><![CDATA[Q. V.]]></given-names>
</name>
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[C. D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Electra: Pre-training text encoders as discriminators rather than generators]]></source>
<year>2020</year>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
