<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1870-9044</journal-id>
<journal-title><![CDATA[Polibits]]></journal-title>
<abbrev-journal-title><![CDATA[Polibits]]></abbrev-journal-title>
<issn>1870-9044</issn>
<publisher>
<publisher-name><![CDATA[Instituto Politécnico Nacional, Centro de Innovación y Desarrollo Tecnológico en Cómputo]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1870-90442016000100031</article-id>
<article-id pub-id-type="doi">10.17562/PB-53-3</article-id>
<title-group>
<article-title xml:lang="en"><![CDATA[Data Reduction and Regression Using Principal Component Analysis in Qualitative Spatial Reasoning and Health Informatics]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Sabharwal]]></surname>
<given-names><![CDATA[Chaman Lal]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Anjum]]></surname>
<given-names><![CDATA[Bushra]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[Missouri University of Science and Technology]]></institution>
<addr-line><![CDATA[Rolla MO]]></addr-line>
<country>USA</country>
</aff>
<aff id="Af2">
<institution><![CDATA[Amazon Inc.]]></institution>
<addr-line><![CDATA[San Luis Obispo, CA]]></addr-line>
<country>USA</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2016</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2016</year>
</pub-date>
<numero>53</numero>
<fpage>31</fpage>
<lpage>42</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S1870-90442016000100031&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S1870-90442016000100031&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S1870-90442016000100031&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="en"><p><![CDATA[The central idea of principal component analysis (PCA) is to reduce the dimensionality of a dataset consisting of a large number of interrelated variables while retaining as much of the variation present in the dataset as possible. In this paper, we use PCA-based algorithms in two diverse domains: qualitative spatial reasoning (QSR), to achieve lossless data reduction, and health informatics, to achieve data reduction along with improved regression analysis. In an adaptive hybrid approach, we apply PCA to traditional regression algorithms to improve their performance and representation. This yields prediction models that have both a better fit and fewer attributes than those produced by standard logistic regression alone. We present examples using both synthetic data and real health datasets from the UCI Repository.]]></p></abstract>
<kwd-group>
<kwd lng="en"><![CDATA[Principal component analysis]]></kwd>
<kwd lng="en"><![CDATA[regression analysis]]></kwd>
<kwd lng="en"><![CDATA[healthcare analytics]]></kwd>
<kwd lng="en"><![CDATA[big data analytics]]></kwd>
<kwd lng="en"><![CDATA[region connection calculus]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Point-Set Topological Relations]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Egenhofer]]></surname>
<given-names><![CDATA[M. J.]]></given-names>
</name>
<name>
<surname><![CDATA[Franzosa]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
</person-group>
<source><![CDATA[International Journal of Geographical Information Systems]]></source>
<year>1991</year>
<volume>5</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>161-74</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Randell]]></surname>
<given-names><![CDATA[D.A.]]></given-names>
</name>
<name>
<surname><![CDATA[Cui]]></surname>
<given-names><![CDATA[Z.]]></given-names>
</name>
<name>
<surname><![CDATA[Cohn]]></surname>
<given-names><![CDATA[A.G.]]></given-names>
</name>
</person-group>
<source><![CDATA[A Spatial Logic Based on Regions and Connection]]></source>
<year>1992</year>
<page-range>165-76</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Sabharwal]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
<name>
<surname><![CDATA[Eloe]]></surname>
<given-names><![CDATA[Nathan]]></given-names>
</name>
</person-group>
<source><![CDATA[A More Expressive 3D Region Connection Calculus]]></source>
<year>2011</year>
<conf-name><![CDATA[2011 International Workshop on Visual Languages and Computing DMS'11]]></conf-name>
<conf-date>Aug. 18-20, 2011</conf-date>
<conf-loc>Florence, Italy</conf-loc>
<page-range>307-11</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[The Promise of Big Data]]></article-title>
<source><![CDATA[Harvard School of Public Health Magazine]]></source>
<year>2012</year>
<page-range>15-43</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Healthcare Industry Sees Big Data As More Than a Bandage]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bernard]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[CIO]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bishop]]></surname>
<given-names><![CDATA[C. M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Pattern Recognition and Machine Learning]]></source>
<year>2006</year>
<publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="book">
<source><![CDATA[The Global Use of Medicines: Outlook Through 2016]]></source>
<year>2012</year>
<page-range>1-36</page-range><publisher-name><![CDATA[IMS Institute for Healthcare Informatics]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Contextual Health vs The Elephant in the Hospital]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Israel]]></surname>
<given-names><![CDATA[S.]]></given-names>
</name>
</person-group>
<source><![CDATA[Forbes]]></source>
<year>2013</year>
<page-range>1-10</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[How the CDC Is Using Big Data to Save You from the Flu]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bort]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Business Insider]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Want to know if you will develop high blood pressure next year? With big data magic you can]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Parmar]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[MedCity News]]></source>
<year>2012</year>
</nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Groves]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
<name>
<surname><![CDATA[Kayyali]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
<name>
<surname><![CDATA[Knott]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Kuiken]]></surname>
<given-names><![CDATA[S. V.]]></given-names>
</name>
</person-group>
<source><![CDATA[The 'Big Data' Revolution in Healthcare]]></source>
<year>2013</year>
<page-range>1-20</page-range><publisher-name><![CDATA[Center of US Health System Reform Business Technology Office]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="">
<article-title xml:lang=""><![CDATA[Predicting Adverse Drug Events from Personal Health Messages]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chee]]></surname>
<given-names><![CDATA[B.W.]]></given-names>
</name>
<name>
<surname><![CDATA[Berlin]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Schatz]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[Annual Symposium Proceedings]]></source>
<year>2011</year>
<page-range>217-26</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="book">
<article-title xml:lang=""><![CDATA[Azdrugminer: An Information Extraction System for Mining Patient Reported Adverse Drug Events in Online Patient Forums]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liu]]></surname>
<given-names><![CDATA[X.]]></given-names>
</name>
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Smart Health]]></source>
<year>2013</year>
<page-range>134-50</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="confpro">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[C. C.]]></given-names>
</name>
<name>
<surname><![CDATA[Jiang]]></surname>
<given-names><![CDATA[L.]]></given-names>
</name>
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Zhang]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<source><![CDATA[Social Media Mining for Drug Safety Signal Detection]]></source>
<year>2012</year>
<conf-name><![CDATA[ACM SHB'12]]></conf-name>
<conf-date>October 29, 2012</conf-date>
<conf-loc>Maui, Hawaii, USA</conf-loc>
</nlm-citation>
</ref>
<ref id="B15">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Using Health-Consumer-Contributed Data to Detect Adverse Drug Reactions by Association Mining with Temporal Analysis]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[C. C.]]></given-names>
</name>
</person-group>
<source><![CDATA[ACM Trans. Intell. Syst. Technol.]]></source>
<year>2015</year>
<volume>6</volume>
<numero>4</numero>
<issue>4</issue>
</nlm-citation>
</ref>
<ref id="B16">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Analysis of a complex of statistical variables into principal components]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hotelling]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<source><![CDATA[Journal of Educational Psychology]]></source>
<year>1933</year>
<volume>24</volume>
<page-range>417-41</page-range></nlm-citation>
</ref>
<ref id="B17">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hefferon]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[Linear Algebra]]></source>
<year>2014</year>
</nlm-citation>
</ref>
<ref id="B18">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Shlens]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<source><![CDATA[A Tutorial on Principal Component Analysis]]></source>
<year>2014</year>
<numero>arXiv:1404.1100 [cs.LG]</numero>
<issue>arXiv:1404.1100 [cs.LG]</issue>
<page-range>1-15</page-range></nlm-citation>
</ref>
<ref id="B19">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Baker]]></surname>
<given-names><![CDATA[K.]]></given-names>
</name>
</person-group>
<source><![CDATA[Singular Value Decomposition Tutorial]]></source>
<year>2013</year>
</nlm-citation>
</ref>
<ref id="B20">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[Principal Component Analysis of Particle Motion]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chen]]></surname>
<given-names><![CDATA[H. Y.]]></given-names>
</name>
<name>
<surname><![CDATA[Liegeois]]></surname>
<given-names><![CDATA[R.]]></given-names>
</name>
<name>
<surname><![CDATA[Bruyn]]></surname>
<given-names><![CDATA[J. R. de]]></given-names>
</name>
<name>
<surname><![CDATA[Soddu]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Phys. Rev. E]]></source>
<year>2015</year>
<volume>91</volume>
<numero>042308</numero>
<issue>042308</issue>
</nlm-citation>
</ref>
<ref id="B21">
<nlm-citation citation-type="">
<article-title xml:lang=""><![CDATA[Discovering Consumer Health Expressions from Consumer-Contributed Content]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jiang]]></surname>
<given-names><![CDATA[Ling]]></given-names>
</name>
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[Christopher C.]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[Jiexun]]></given-names>
</name>
</person-group>
<source><![CDATA[SBP 2013]]></source>
<year>2013</year>
<volume>LNCS 7812</volume>
<page-range>164-74</page-range></nlm-citation>
</ref>
<ref id="B22">
<nlm-citation citation-type="">
<collab>UCI Machine Learning Repository</collab>
<source><![CDATA[Liver Patient Dataset]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B23">
<nlm-citation citation-type="journal">
<article-title xml:lang=""><![CDATA[APACHE II Predicts Long-term Survival in COPD Patients Admitted to a General Medical Ward]]></article-title>
<person-group person-group-type="author">
<name>
<surname><![CDATA[Goel]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Pinckney]]></surname>
<given-names><![CDATA[R. G.]]></given-names>
</name>
<name>
<surname><![CDATA[Littenberg]]></surname>
<given-names><![CDATA[B.]]></given-names>
</name>
</person-group>
<source><![CDATA[J Gen Intern Med]]></source>
<year>2003</year>
<volume>18</volume>
<numero>10</numero>
<issue>10</issue>
<page-range>824-30</page-range></nlm-citation>
</ref>
<ref id="B24">
<nlm-citation citation-type="">
<source><![CDATA[Wikipedia]]></source>
<year></year>
</nlm-citation>
</ref>
<ref id="B25">
<nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Leskovec]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
<name>
<surname><![CDATA[Rajaraman]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
<name>
<surname><![CDATA[Ullman]]></surname>
<given-names><![CDATA[J. D.]]></given-names>
</name>
</person-group>
<source><![CDATA[Mining of Massive Datasets]]></source>
<year>2014</year>
</nlm-citation>
</ref>
</ref-list>
</back>
</article>
