<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442016000200025</article-id>
<article-id pub-id-type="doi">10.17562/PB-54-4</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Business Process Models Clustering Based on Multimodal Search, K-means, and Cumulative and No-Continuous N-Grams]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ordoñez]]></surname>
<given-names><![CDATA[Hugo]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Merchán]]></surname>
<given-names><![CDATA[Luis]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Ordoñez]]></surname>
<given-names><![CDATA[Armando]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Cobos]]></surname>
<given-names><![CDATA[Carlos]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Universidad de San Buenaventura, Facultad de Ingeniería]]></institution>
<addr-line><![CDATA[Cali]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="Af2">
<institution><![CDATA[Fundación Universitaria de Popayán, Intelligent Management Systems group, Facultad de Ingeniería]]></institution>
<addr-line><![CDATA[Popayán]]></addr-line>
<country>Colombia</country>
</aff>
<aff id="Af3">
<institution><![CDATA[Universidad del Cauca, Facultad de Ingeniería Electrónica y Telecomunicaciones, Departamento de Sistemas]]></institution>
<addr-line><![CDATA[ ]]></addr-line>
<country>Colombia</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2016</year>
</pub-date>
<numero>54</numero>
<fpage>25</fpage>
<lpage>31</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442016000200025&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442016000200025&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442016000200025&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract: Because process repositories are large, finding a particular process can be difficult. This paper presents a method for indexing, searching, and grouping business process models. The method uses linguistic and behavioral information to model each business process. Behavioral information is described using cumulative and no-continuous n-grams. The grouping method is based on the k-means algorithm and uses suffix arrays to define a label for each group. The clustering approach incorporates mechanisms that avoid overlapping and improve the homogeneity of the groups created with the k-means algorithm. The results obtained outperform previous approaches in precision, recall, and F-measure.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Clustering]]></kwd>
<kwd lng="en"><![CDATA[business process models]]></kwd>
<kwd lng="en"><![CDATA[multimodal search]]></kwd>
<kwd lng="en"><![CDATA[cumulative and no-continuous n-grams]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Detecting approximate clones in business process model repositories]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[La Rosa]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Information Systems]]></source>
<year>2015</year>
<volume>49</volume>
<page-range>102-25</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Multimodal representation, indexing, automated annotation and retrieval of image collections via non-negative matrix factorization]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Caicedo]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Ben Abdallah]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[González]]></surname>
<given-names><![CDATA[F. A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Neurocomputing]]></source>
<year>2012</year>
<volume>76</volume>
<page-range>50-60</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[N-gramas sintácticos no-continuos]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Polibits]]></source>
<year>2013</year>
<numero>48</numero>
<issue>48</issue>
<page-range>69-78</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Business Processes Retrieval Based on Multimodal Search and Lingo Clustering Algorithm]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ordoñez]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrales]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cobos]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Latin America Transactions]]></source>
<year>2015</year>
<volume>13</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>769-76</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<source><![CDATA[Construcción no lineal de n-gramas en la lingüística computacional]]></source>
<year>2013</year>
<publisher-loc><![CDATA[Mexico DF ]]></publisher-loc>
<publisher-name><![CDATA[Sociedad Mexicana de Inteligencia Artificial]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Syntactic N-grams as machine learning features for natural language processing]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sidorov]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
<name>
<surname><![CDATA[Velasquez]]></surname>
<given-names><![CDATA[F.]]></given-names>
</name>
<name>
<surname><![CDATA[Stamatatos]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
<name>
<surname><![CDATA[Gelbukh]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Chanona-Hernández]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Expert Systems with Applications]]></source>
<year>2014</year>
<volume>41</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>853-60</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ordoñez]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrales]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cobos]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Wives]]></surname>
<given-names><![CDATA[L. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Thom]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Collaborative Evaluation to Build Closed Repositories on Business Process Models]]></source>
<year>2014</year>
<volume>3</volume>
<conf-name><![CDATA[16th International Conference on Enterprise Information Systems]]></conf-name>
<conf-loc> </conf-loc>
<page-range>311-8</page-range><publisher-loc><![CDATA[SciTePress ]]></publisher-loc>
</nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Recommendation-based editor for business process modeling]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Koschmider]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Hornung]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Oberweis]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data Knowl. Eng.]]></source>
<year>2011</year>
<volume>70</volume>
<numero>6</numero>
<issue>6</issue>
<page-range>483-503</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rosso-Pelayo]]></surname>
<given-names><![CDATA[D. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Trejo-Ramirez]]></surname>
<given-names><![CDATA[R. A.]]></given-names>
</name>
<name>
<surname><![CDATA[Gonzalez-Mendoza]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Hernandez-Gress]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<source><![CDATA[Business Process Mining and Rules Detection for Unstructured Information]]></source>
<year>2010</year>
<conf-name><![CDATA[Ninth Mex. Int. Conf. Artif. Intell.]]></conf-name>
<conf-loc> </conf-loc>
<page-range>81-5</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Turner]]></surname>
<given-names><![CDATA[C. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Tiwari]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Mehnen]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[A genetic programming approach to business process mining]]></source>
<year>2008</year>
<conf-name><![CDATA[10th Annu. Conf. Genet. Evol. Comput.]]></conf-name>
<conf-date>2008</conf-date>
<conf-loc> </conf-loc>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Clustering of Process Schemas by Graph Mining Techniques]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Diamantini]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Potena]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Storti]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Methodology]]></source>
<year>2011</year>
<volume>4</volume>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Visualization and Clustering of Business Process Collections Based on Process Metric Values]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Melcher]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Seese]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Measurement]]></source>
<year>2008</year>
<volume>8</volume>
</nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[A Niching Memetic Algorithm for Simultaneous Clustering and Feature Selection]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sheng]]></surname>
<given-names><![CDATA[W.]]></given-names>
</name>
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Fairhurst]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Transactions on Knowledge and Data Engineering]]></source>
<year>2008</year>
<volume>20</volume>
<numero>7</numero>
<issue>7</issue>
<page-range>868-79</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="book">
<article-title xml:lang=""><![CDATA[Applied Sequence Clustering Techniques for Process Mining]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ferreira]]></surname>
<given-names><![CDATA[D. R.]]></given-names>
</name>
</person-group>
<source><![CDATA[Handbook of Research on Business Process Modeling]]></source>
<year>2009</year>
<page-range>481-502</page-range><publisher-name><![CDATA[IGI Global]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Approaching Process Mining with Sequence Clustering: Experiments and Findings]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ferreira]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Malheiros]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
<name>
<surname><![CDATA[Ferreira]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Engineering]]></source>
<year>2008</year>
<volume>7</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>1-15</page-range></nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Dynamic reconfiguration of composite convergent services supported by multimodal search]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ordonez]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Ordonez]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Figueroa]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cobos]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrales]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Lecture Notes in Business Information Processing]]></source>
<year>2015</year>
<volume>208</volume>
<page-range>127-39</page-range></nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Improving Business Process Retrieval Using Categorization and Multimodal Search]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Figueroa]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Ordoñez]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrales]]></surname>
<given-names><![CDATA[J.-C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cobos]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Wives]]></surname>
<given-names><![CDATA[L. K.]]></given-names>
</name>
<name>
<surname><![CDATA[Herrera-Viedma]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<source><![CDATA[Knowledge-Based Syst.]]></source>
<year>2016</year>
<volume>110</volume>
<page-range>1-17</page-range></nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Manning]]></surname>
<given-names><![CDATA[Christopher D.]]></given-names>
</name>
<name>
<surname><![CDATA[Raghavan]]></surname>
<given-names><![CDATA[Prabhakar]]></given-names>
</name>
<name>
<surname><![CDATA[Schütze]]></surname>
<given-names><![CDATA[Hinrich]]></given-names>
</name>
</person-group>
<source><![CDATA[Introduction to Information Retrieval]]></source>
<year>2008</year>
<publisher-name><![CDATA[Cambridge University Press]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Fast VQ codebook search algorithm for grayscale image coding]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hu]]></surname>
<given-names><![CDATA[Y.-C.]]></given-names>
</name>
<name>
<surname><![CDATA[Su]]></surname>
<given-names><![CDATA[B.-H.]]></given-names>
</name>
<name>
<surname><![CDATA[Tsou]]></surname>
<given-names><![CDATA[C.-C.]]></given-names>
</name>
</person-group>
<source><![CDATA[Image Vis. Comput.]]></source>
<year>2008</year>
<volume>26</volume>
<numero>5</numero>
<issue>5</issue>
<page-range>657-66</page-range></nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Intelligent Kernel K-Means for Clustering Gene Expression]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Handhayani]]></surname>
<given-names><![CDATA[T.]]></given-names>
</name>
<name>
<surname><![CDATA[Hiryanto]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
</person-group>
<source><![CDATA[Procedia Comput. Sci.]]></source>
<year>2015</year>
<volume>59</volume>
<page-range>171-7</page-range></nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Business Processes Retrieval based on Multimodal Search and Lingo Clustering Algorithm]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ordoñez]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrales]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cobos]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<source><![CDATA[IEEE Lat. Am. Trans.]]></source>
<year>2015</year>
<volume>13</volume>
<numero>9</numero>
<issue>9</issue>
<page-range>40-8</page-range></nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Hierarchical clustering of business process models]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jung]]></surname>
<given-names><![CDATA[L. L. Jae-Yoon]]></given-names>
</name>
<name>
<surname><![CDATA[Bae]]></surname>
<given-names><![CDATA[Joonsoo]]></given-names>
</name>
</person-group>
<source><![CDATA[Eng. Inf. Syst. Control]]></source>
<year>2009</year>
<volume>5</volume>
<numero>12</numero>
<issue>12</issue>
<page-range>613-6</page-range></nlm-citation>
</ref>
<ref id="B23">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[WB-index: A sum-of-squares based index for cluster validity]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhao]]></surname>
<given-names><![CDATA[Q.]]></given-names>
</name>
<name>
<surname><![CDATA[Fränti]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<source><![CDATA[Data Knowl. Eng.]]></source>
<year>2014</year>
<volume>92</volume>
<page-range>77-89</page-range></nlm-citation>
</ref>
<ref id="B24">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ordonez]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Corrales]]></surname>
<given-names><![CDATA[J. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Cobos]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Wives]]></surname>
<given-names><![CDATA[L. K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Collaborative grouping of business process models]]></source>
<year>2014</year>
<page-range>1-2</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
