<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1405-5546</journal-id>
<journal-title><![CDATA[Computación y Sistemas]]></journal-title>
<abbrev-journal-title><![CDATA[Comp. y Sist.]]></abbrev-journal-title>
<issn>1405-5546</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Investigación en Computación]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1405-55462020000200597</article-id>
<article-id pub-id-type="doi">10.13053/cys-24-2-3393</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Analysis of Automatic Annotations of Real Video Surveillance Images]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Guevara Flores]]></surname>
<given-names><![CDATA[Diana Karina]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Pérez Téllez]]></surname>
<given-names><![CDATA[Fernando]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Pinto Avendaño]]></surname>
<given-names><![CDATA[David Eduardo]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Benemérita Universidad Autónoma de Puebla, Department of Computing]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af2">
<institution><![CDATA[Technological University Dublin, Department of Computing]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Ireland</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2020</year>
</pub-date>
<volume>24</volume>
<numero>2</numero>
<fpage>597</fpage>
<lpage>606</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1405-55462020000200597&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1405-55462020000200597&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1405-55462020000200597&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[This paper presents an analysis of the automatic annotations of real video surveillance sequences. Annotations were generated for the frames of surveillance sequences recorded in the parking lot of a university campus. The purpose of the analysis is to evaluate the quality of the descriptions and to examine the correspondence between the semantic content of each image and its annotation. For the tests, a fixed camera was placed in the campus parking lot and video sequences of about 20 minutes were recorded; each frame was then annotated individually, and a text repository containing all the annotations was built. It was observed that the properties of the video can be exploited to evaluate the performance of the annotator; the crossing of a pedestrian is presented as an example of such an analysis.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Automatic annotation]]></kwd>
<kwd lng="en"><![CDATA[semantic analysis]]></kwd>
<kwd lng="en"><![CDATA[surveillance images]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Banerjee]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Lavie]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[METEOR: An automatic metric for MT evaluation with improved correlation with human judgments]]></source>
<year>2005</year>
<conf-name><![CDATA[ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization]]></conf-name>
<conf-loc> </conf-loc>
<page-range>65-72</page-range></nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bengoechea Isasa]]></surname>
<given-names><![CDATA[J. I.]]></given-names>
</name>
</person-group>
<source><![CDATA[Let me see: diseño de un generador automático de descripciones de imágenes]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bernardi]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Cakici]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Elliott]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Erdem]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Erdem]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Ikizler-Cinbis]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
<name>
<surname><![CDATA[Keller]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Muscat]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Plank]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Automatic description generation from images: A survey of models, datasets, and evaluation measures]]></article-title>
<source><![CDATA[Journal of Artificial Intelligence Research]]></source>
<year>2016</year>
<volume>55</volume>
<page-range>409-42</page-range></nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Lawrence Zitnick]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mind's eye: A recurrent visual representation for image caption generation]]></source>
<year>2015</year>
<conf-name><![CDATA[Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition]]></conf-name>
<conf-loc> </conf-loc>
<page-range>2422-31</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cheng]]></surname>
<given-names><![CDATA[Q.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[Q.]]></given-names>
</name>
<name>
<surname><![CDATA[Fu]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Tu]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A survey and analysis on automatic image annotation]]></article-title>
<source><![CDATA[Pattern Recognition]]></source>
<year>2018</year>
<volume>79</volume>
<page-range>242-59</page-range></nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Everingham]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Van Gool]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Williams]]></surname>
<given-names><![CDATA[C. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Winn]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Zisserman]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The PASCAL Visual Object Classes (VOC) challenge]]></article-title>
<source><![CDATA[International Journal of Computer Vision]]></source>
<year>2010</year>
<volume>88</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>303-38</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ghoshal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Ircing]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Khudanpur]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Hidden Markov models for automatic annotation and content-based retrieval of images and video]]></source>
<year>2005</year>
<conf-name><![CDATA[28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval]]></conf-name>
<conf-loc> </conf-loc>
<page-range>544-51</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hossain]]></surname>
<given-names><![CDATA[M. Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Sohel]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Shiratuddin]]></surname>
<given-names><![CDATA[M. F.]]></given-names>
</name>
<name>
<surname><![CDATA[Laga]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A comprehensive survey of deep learning for image captioning]]></article-title>
<source><![CDATA[ACM Computing Surveys (CSUR)]]></source>
<year>2019</year>
<volume>51</volume>
<numero>6</numero>
<issue>6</issue>
<page-range>1-36</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="book">
<collab>International Standard</collab>
<source><![CDATA[Compact descriptors for video analysis]]></source>
<year>2017</year>
<publisher-name><![CDATA[Organization for Standardization and International Electrotechnical Commission]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lavie]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Agarwal]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[METEOR: An automatic metric for MT evaluation with high levels of correlation with human judgments]]></source>
<year>2007</year>
<conf-name><![CDATA[Second Workshop on Statistical Machine Translation]]></conf-name>
<conf-loc> </conf-loc>
<page-range>228-31</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lin]]></surname>
<given-names><![CDATA[C.-Y.]]></given-names>
</name>
</person-group>
<source><![CDATA[ROUGE: a package for automatic evaluation of summaries]]></source>
<year>2004</year>
<conf-name><![CDATA[Workshop on Text Summarization Branches Out, Post-Conference Workshop of ACL 2004]]></conf-name>
<conf-loc>Barcelona, Spain</conf-loc>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lin]]></surname>
<given-names><![CDATA[T.-Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Maire]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Belongie]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Hays]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Perona]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Ramanan]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Dollár]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Zitnick]]></surname>
<given-names><![CDATA[C. L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Microsoft COCO: Common objects in context]]></source>
<year>2014</year>
<conf-name><![CDATA[European Conference on Computer Vision]]></conf-name>
<conf-loc> </conf-loc>
<page-range>740-55</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Murthy]]></surname>
<given-names><![CDATA[V. N.]]></given-names>
</name>
<name>
<surname><![CDATA[Maji]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Manmatha]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Automatic image annotation using deep learning representations]]></source>
<year>2015</year>
<conf-name><![CDATA[5th ACM on International Conference on Multimedia Retrieval]]></conf-name>
<conf-loc> </conf-loc>
<page-range>603-6</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Papineni]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[Roukos]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Ward]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhu]]></surname>
<given-names><![CDATA[W.-J.]]></given-names>
</name>
</person-group>
<source><![CDATA[BLEU: a method for automatic evaluation of machine translation]]></source>
<year>2002</year>
<conf-name><![CDATA[40th Annual Meeting of the Association for Computational Linguistics]]></conf-name>
<conf-loc> </conf-loc>
<page-range>311-8</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rashtchian]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Young]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Hodosh]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Hockenmaier]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Collecting image annotations using Amazon's Mechanical Turk]]></source>
<year>2010</year>
<conf-name><![CDATA[NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk]]></conf-name>
<conf-loc> </conf-loc>
<page-range>139-47</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tran]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
<name>
<surname><![CDATA[He]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Sun]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Carapcea]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Thrasher]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Buehler]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Sienkiewicz]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Rich image captioning in the wild]]></source>
<year>2016</year>
<conf-name><![CDATA[Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops]]></conf-name>
<conf-loc> </conf-loc>
<page-range>49-56</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Uricchio]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Ballan]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Seidenari]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Del Bimbo]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Automatic image annotation via label transfer in the semantic space]]></article-title>
<source><![CDATA[Pattern Recognition]]></source>
<year>2017</year>
<volume>71</volume>
<page-range>144-57</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vedantam]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Lawrence Zitnick]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Parikh]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[CIDEr: Consensus-based image description evaluation]]></source>
<year>2015</year>
<conf-name><![CDATA[Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition]]></conf-name>
<conf-loc> </conf-loc>
<page-range>4566-75</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Vinyals]]></surname>
<given-names><![CDATA[O.]]></given-names>
</name>
<name>
<surname><![CDATA[Toshev]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Bengio]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
<name>
<surname><![CDATA[Erhan]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Show and tell: A neural image caption generator]]></source>
<year>2015</year>
<conf-name><![CDATA[Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition]]></conf-name>
<conf-loc> </conf-loc>
<page-range>3156-64</page-range></nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ma]]></surname>
<given-names><![CDATA[W.-Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[H.-J.]]></given-names>
</name>
</person-group>
<source><![CDATA[A probabilistic semantic model for image annotation and multimodal image retrieval]]></source>
<year>2005</year>
<volume>1</volume>
<conf-name><![CDATA[Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1]]></conf-name>
<conf-loc> </conf-loc>
<page-range>846-51</page-range><publisher-name><![CDATA[IEEE]]></publisher-name>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
