<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>2007-5057</journal-id>
<journal-title><![CDATA[Investigación en educación médica]]></journal-title>
<abbrev-journal-title><![CDATA[Investigación educ. médica]]></abbrev-journal-title>
<issn>2007-5057</issn>
<publisher>
<publisher-name><![CDATA[Universidad Nacional Autónoma de México, Facultad de Medicina]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S2007-50572020000200100</article-id>
<article-id pub-id-type="doi">10.22201/facmed.20075057e.2020.34.221</article-id>
<title-group>
<article-title xml:lang="es"><![CDATA[Amenazas a la validez en evaluación: implicaciones en educación médica]]></article-title>
<article-title xml:lang="en"><![CDATA[Threats to validity in assessment: implications in medical education]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Carrillo Avalos]]></surname>
<given-names><![CDATA[Blanca Ariadna]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Sánchez Mendiola]]></surname>
<given-names><![CDATA[Melchor]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Leenen]]></surname>
<given-names><![CDATA[Iwin]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Universidad Autónoma de San Luis Potosí Facultad de Medicina Departamento de Ciencias Morfológicas]]></institution>
<addr-line><![CDATA[ S. L. P.]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af2">
<institution><![CDATA[,Universidad Nacional Autónoma de México Facultad de Medicina División de Estudios de Posgrado]]></institution>
<addr-line><![CDATA[ Cd. Mx.]]></addr-line>
<country>Mexico</country>
</aff>
<aff id="Af3">
<institution><![CDATA[,Universidad Nacional Autónoma de México Facultad de Psicología División de Estudios de Posgrado]]></institution>
<addr-line><![CDATA[ Cd. Mx.]]></addr-line>
<country>Mexico</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>06</month>
<year>2020</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>06</month>
<year>2020</year>
</pub-date>
<volume>9</volume>
<numero>34</numero>
<fpage>100</fpage>
<lpage>107</lpage>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_arttext&amp;pid=S2007-50572020000200100&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_abstract&amp;pid=S2007-50572020000200100&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://www.scielo.org.mx/scielo.php?script=sci_pdf&amp;pid=S2007-50572020000200100&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="es"><p><![CDATA[Resumen Las amenazas a la validez en evaluación educativa son elementos que interfieren con la interpretación propuesta de los resultados de una prueba, pueden ocurrir tanto en exámenes escritos como en pruebas de desempeño y evaluación de competencias clínicas. Estas amenazas se suelen agrupar en dos clases principales: subrepresentación del constructo y varianza irrelevante al constructo. La primera se refiere a que en la prueba no haya suficientes ítems, casos u observaciones para generalizar apropiadamente al dominio completo que se pretende evaluar. La segunda tiene que ver con la presencia de sesgos que interfieren de manera sistemática con la interpretación de los resultados de una prueba, como pueden ser la calidad de los ítems y errores sistemáticos de los evaluadores, entre otros factores que pueden influir sobre la puntuación obtenida. En este artículo se describen las características de las amenazas principales, su importancia y algunas recomendaciones para evitarlas al elaborar y aplicar instrumentos de evaluación en ciencias de la salud. La comprensión de estas amenazas es útil para desarrollar pruebas cuyos resultados tengan niveles aceptables de validez que nos permitan conocer mejor el desempeño de los estudiantes.]]></p></abstract>
<abstract abstract-type="short" xml:lang="en"><p><![CDATA[Abstract Validity threats in educational assessment are elements that interfere with the proposed interpretation of a test score. They can occur in written tests as well as in performance and clinical competency assessments. They are usually grouped in two major categories: construct underrepresentation and construct-irrelevant variance. The former refers to tests with insufficient items, cases, or observations to make a proper generalization towards the full to-be-assessed domain. The latter is related to the presence of biases that can interfere systematically with the interpretation of a test score, such as item quality and raters&#8217; systematic errors, among other factors that may have an effect on the obtained score. In this paper we describe the characteristics of some of these threats, their importance, and some recommendations to avoid them during the development of assessment instruments in health sciences education. The insights offered can be useful to devise tests and assessment instruments that allow us to draw more valid inferences about students&#8217; knowledge and abilities.]]></p></abstract>
<kwd-group>
<kwd lng="es"><![CDATA[amenazas a la validez]]></kwd>
<kwd lng="es"><![CDATA[evaluación del aprendizaje]]></kwd>
<kwd lng="es"><![CDATA[educación médica]]></kwd>
<kwd lng="es"><![CDATA[México]]></kwd>
<kwd lng="en"><![CDATA[validity]]></kwd>
<kwd lng="en"><![CDATA[validity threats]]></kwd>
<kwd lng="en"><![CDATA[learning assessment]]></kwd>
<kwd lng="en"><![CDATA[medical education]]></kwd>
<kwd lng="en"><![CDATA[Mexico]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Cronbach]]></surname>
<given-names><![CDATA[LJ]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Five perspectives on validity argument]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Wainer]]></surname>
<given-names><![CDATA[H,]]></given-names>
</name>
<name>
<surname><![CDATA[Braun]]></surname>
<given-names><![CDATA[HI,]]></given-names>
</name>
</person-group>
<source><![CDATA[Test validity]]></source>
<year>1988</year>
<page-range>3-17</page-range><publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Routledge]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B2">
<label>2</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM]]></given-names>
</name>
<name>
<surname><![CDATA[Haladyna]]></surname>
<given-names><![CDATA[TM]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Validity threats Overcoming interference with proposed interpretations of assessment data]]></article-title>
<source><![CDATA[Med Educ]]></source>
<year>2004</year>
<volume>38</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>327-33</page-range></nlm-citation>
</ref>
<ref id="B3">
<label>3</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM]]></given-names>
</name>
<name>
<surname><![CDATA[Yudkowski]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
</person-group>
<source><![CDATA[Assessment in health professions education]]></source>
<year>2009</year>
<publisher-loc><![CDATA[New York and London ]]></publisher-loc>
<publisher-name><![CDATA[Routledge]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B4">
<label>4</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Carrillo]]></surname>
<given-names><![CDATA[BA]]></given-names>
</name>
<name>
<surname><![CDATA[Sánchez]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Leenen]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[El concepto moderno de validez y su uso en educación médica]]></article-title>
<source><![CDATA[Inv Ed Med]]></source>
<year>2020</year>
<volume>9</volume>
<numero>33</numero>
<issue>33</issue>
<page-range>98-106</page-range></nlm-citation>
</ref>
<ref id="B5">
<label>5</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Norman]]></surname>
<given-names><![CDATA[G,]]></given-names>
</name>
<name>
<surname><![CDATA[van der Vleuten]]></surname>
<given-names><![CDATA[C,]]></given-names>
</name>
<name>
<surname><![CDATA[Newble]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
<name>
<surname><![CDATA[Norman]]></surname>
<given-names><![CDATA[G,]]></given-names>
</name>
<name>
<surname><![CDATA[van der Vleuten]]></surname>
<given-names><![CDATA[C,]]></given-names>
</name>
<name>
<surname><![CDATA[Newble]]></surname>
<given-names><![CDATA[D,]]></given-names>
</name>
</person-group>
<source><![CDATA[International Handbook of Research in Medical Education]]></source>
<year>2002</year>
<page-range>1106</page-range><publisher-name><![CDATA[Springer]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B6">
<label>6</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jozefowicz]]></surname>
<given-names><![CDATA[RF]]></given-names>
</name>
<name>
<surname><![CDATA[Koeppen]]></surname>
<given-names><![CDATA[BM]]></given-names>
</name>
<name>
<surname><![CDATA[Case]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Galbraith]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Swanson]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Glew]]></surname>
<given-names><![CDATA[RH]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The quality of in-house medical school examinations]]></article-title>
<source><![CDATA[Acad Med]]></source>
<year>2002</year>
<volume>77</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>156-61</page-range></nlm-citation>
</ref>
<ref id="B7">
<label>7</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Ware]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Vik]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Quality assurance of item writing During the introduction of multiple choice questions in medicine for high stakes examinations]]></article-title>
<source><![CDATA[Med Teach]]></source>
<year>2009</year>
<volume>31</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>238-43</page-range></nlm-citation>
</ref>
<ref id="B8">
<label>8</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Tarrant]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Knierim]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Hayes]]></surname>
<given-names><![CDATA[SK]]></given-names>
</name>
<name>
<surname><![CDATA[Ware]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The frequency of item writing flaws in multiple-choice questions used in high stakes nursing assessments]]></article-title>
<source><![CDATA[Nurse Educ Today]]></source>
<year>2006</year>
<volume>26</volume>
<numero>8</numero>
<issue>8</issue>
<page-range>662-71</page-range></nlm-citation>
</ref>
<ref id="B9">
<label>9</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Threats to the validity of locally developed multiple-choice tests in medical education Construct-irrelevant variance and construct underrepresentation]]></article-title>
<source><![CDATA[Adv Heal Sci Educ]]></source>
<year>2002</year>
<volume>7</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>235-41</page-range></nlm-citation>
</ref>
<ref id="B10">
<label>10</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Crooks]]></surname>
<given-names><![CDATA[TJ]]></given-names>
</name>
<name>
<surname><![CDATA[Kane]]></surname>
<given-names><![CDATA[MT]]></given-names>
</name>
<name>
<surname><![CDATA[Cohen]]></surname>
<given-names><![CDATA[AS]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Threats to the valid use of assessments]]></article-title>
<source><![CDATA[Assess Educ Princ Policy Pract]]></source>
<year>1996</year>
<volume>3</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>265-85</page-range></nlm-citation>
</ref>
<ref id="B11">
<label>11</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Messick]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Validity]]></article-title>
<person-group person-group-type="editor">
<name>
<surname><![CDATA[Linn]]></surname>
<given-names><![CDATA[RL]]></given-names>
</name>
</person-group>
<source><![CDATA[Educational Measurement]]></source>
<year>1989</year>
<page-range>13-103</page-range><publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Macmillan]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B12">
<label>12</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Schuwirth]]></surname>
<given-names><![CDATA[LWT]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[General overview of the theories used in assessment AMEE Guide No. 57]]></article-title>
<collab>Van Der Vleuten CPM</collab>
<source><![CDATA[Med Teach]]></source>
<year>2011</year>
<volume>33</volume>
<numero>10</numero>
<issue>10</issue>
<page-range>783-97</page-range></nlm-citation>
</ref>
<ref id="B13">
<label>13</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[De Champlain]]></surname>
<given-names><![CDATA[AF]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A primer on classical test theory and item response theory for assessments in medical education]]></article-title>
<source><![CDATA[Med Educ]]></source>
<year>2010</year>
<volume>44</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>109-17</page-range></nlm-citation>
</ref>
<ref id="B14">
<label>14</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Haladyna]]></surname>
<given-names><![CDATA[TM,]]></given-names>
</name>
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Construct-Irrelevant Variance in High-Stakes Testing]]></article-title>
<source><![CDATA[Educ Meas Issues Pract]]></source>
<year>2004</year>
<volume>23</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>17-27</page-range></nlm-citation>
</ref>
<ref id="B15">
<label>15</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Leenen]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Virtudes y limitaciones de la teoría de respuesta al ítem para la evaluación educativa en las ciencias médicas]]></article-title>
<source><![CDATA[Inv Ed Med]]></source>
<year>2014</year>
<volume>3</volume>
<numero>9</numero>
<issue>9</issue>
<page-range>40-55</page-range></nlm-citation>
</ref>
<ref id="B16">
<label>16</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Reliability on the reproducibility of assessment data]]></article-title>
<source><![CDATA[Med Educ]]></source>
<year>2004</year>
<volume>38</volume>
<page-range>1006-12</page-range></nlm-citation>
</ref>
<ref id="B17">
<label>17</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Miller]]></surname>
<given-names><![CDATA[GE]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The assessment of clinical skills/competence/performance]]></article-title>
<source><![CDATA[Acad Med]]></source>
<year>1990</year>
<volume>65</volume>
<numero>9</numero>
<issue>9</issue>
<page-range>S63-7</page-range></nlm-citation>
</ref>
<ref id="B18">
<label>18</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hawkins]]></surname>
<given-names><![CDATA[RE]]></given-names>
</name>
<name>
<surname><![CDATA[Margolis]]></surname>
<given-names><![CDATA[MJ]]></given-names>
</name>
<name>
<surname><![CDATA[Durning]]></surname>
<given-names><![CDATA[SJ]]></given-names>
</name>
<name>
<surname><![CDATA[Norcini]]></surname>
<given-names><![CDATA[JJ]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Constructing a validity argument for the mini-clinical evaluation exercise A review of the research]]></article-title>
<source><![CDATA[Acad Med]]></source>
<year>2010</year>
<volume>85</volume>
<numero>9</numero>
<issue>9</issue>
<page-range>1453-61</page-range></nlm-citation>
</ref>
<ref id="B19">
<label>19</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moore]]></surname>
<given-names><![CDATA[K,]]></given-names>
</name>
<name>
<surname><![CDATA[Dailey]]></surname>
<given-names><![CDATA[A,]]></given-names>
</name>
<name>
<surname><![CDATA[Agur]]></surname>
<given-names><![CDATA[A.]]></given-names>
</name>
</person-group>
<source><![CDATA[Anatomía con orientación clínica]]></source>
<year>2013</year>
<edition>7a</edition>
<publisher-loc><![CDATA[Philadelphia ]]></publisher-loc>
<publisher-name><![CDATA[Wolters Kluwer Health, S.A.]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B20">
<label>20</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Paniagua]]></surname>
<given-names><![CDATA[MA]]></given-names>
</name>
<name>
<surname><![CDATA[Swygert]]></surname>
<given-names><![CDATA[KA]]></given-names>
</name>
</person-group>
<collab>National Board of Medical Examiners</collab>
<source><![CDATA[Cómo elaborar preguntas para evaluaciones escritas en el área de ciencias básicas y clínicas]]></source>
<year>2016</year>
<edition>4</edition>
<publisher-loc><![CDATA[PA ]]></publisher-loc>
<publisher-name><![CDATA[National Board of Medical Examiners]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B21">
<label>21</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moreno]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Martínez]]></surname>
<given-names><![CDATA[RJ]]></given-names>
</name>
<name>
<surname><![CDATA[Muñiz]]></surname>
<given-names><![CDATA[J.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Directrices para la construcción de ítems de elección múltiple]]></article-title>
<source><![CDATA[Psicothema]]></source>
<year>2004</year>
<volume>16</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>490-7</page-range></nlm-citation>
</ref>
<ref id="B22">
<label>22</label><nlm-citation citation-type="book">
<collab>American Educational Research Association.American Psychological Association.National Council on Measurement in Education</collab>
<source><![CDATA[STANDARDS for Educational and Psychological Testing]]></source>
<year>2014</year>
<edition>6</edition>
<publisher-loc><![CDATA[American Educational Research Association ]]></publisher-loc>
<publisher-name><![CDATA[American Psychological Association &amp; National Council on Measurement in Education]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B23">
<label>23</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Williams]]></surname>
<given-names><![CDATA[BW]]></given-names>
</name>
<name>
<surname><![CDATA[Byrne]]></surname>
<given-names><![CDATA[PD]]></given-names>
</name>
<name>
<surname><![CDATA[Welindt]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Williams M]]></surname>
<given-names><![CDATA[V]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Miller's pyramid and core competency assessment A study in relationship construct validity]]></article-title>
<source><![CDATA[J Contin Educ Health Prof]]></source>
<year>2016</year>
<volume>36</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>295-9</page-range></nlm-citation>
</ref>
<ref id="B24">
<label>24</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Pangaro]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Ten Cate]]></surname>
<given-names><![CDATA[O]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Frameworks for learner assessment in medicine AMEE Guide No. 78]]></article-title>
<source><![CDATA[Med Teach]]></source>
<year>2013</year>
<volume>35</volume>
<page-range>e1197-210</page-range></nlm-citation>
</ref>
<ref id="B25">
<label>25</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hadie]]></surname>
<given-names><![CDATA[SNH]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The Application of Learning Taxonomy in Anatomy Assessment in Medical School]]></article-title>
<source><![CDATA[Educ Med J]]></source>
<year>2018</year>
<volume>10</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>13-23</page-range></nlm-citation>
</ref>
<ref id="B26">
<label>26</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Haladyna]]></surname>
<given-names><![CDATA[TM]]></given-names>
</name>
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM]]></given-names>
</name>
<name>
<surname><![CDATA[Rodriguez]]></surname>
<given-names><![CDATA[MC]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[A Review of Multiple-Choice Item-Writing Guidelines for Classroom Assessment]]></article-title>
<source><![CDATA[Appl Meas Educ]]></source>
<year>2002</year>
<volume>15</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>309-34</page-range></nlm-citation>
</ref>
<ref id="B27">
<label>27</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Construct-irrelevant variance and flawed test questions: Do multiple-choice item-writing principles make any difference?]]></article-title>
<source><![CDATA[Acad Med.]]></source>
<year>2002</year>
<volume>77</volume>
<numero>^s10</numero>
<issue>^s10</issue>
<supplement>10</supplement>
<page-range>103-4</page-range></nlm-citation>
</ref>
<ref id="B28">
<label>28</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Downing]]></surname>
<given-names><![CDATA[SM]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The effects of violating standard item writing principles on tests and students The consequences of using flawed test items on achievement examinations in medical education]]></article-title>
<source><![CDATA[Adv Heal Sci Educ]]></source>
<year>2005</year>
<volume>10</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>133-43</page-range></nlm-citation>
</ref>
<ref id="B29">
<label>29</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Abad]]></surname>
<given-names><![CDATA[FJ]]></given-names>
</name>
<name>
<surname><![CDATA[Olea]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Ponsoda]]></surname>
<given-names><![CDATA[V]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Analysis of the optimum number alternatives from the Item Response Theory]]></article-title>
<source><![CDATA[Psicothema]]></source>
<year>2001</year>
<volume>13</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>152-8</page-range></nlm-citation>
</ref>
<ref id="B30">
<label>30</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Rodriguez]]></surname>
<given-names><![CDATA[MC]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Three options are optimal for multiple-choice items A meta-analysis of 80 years of research]]></article-title>
<source><![CDATA[Educ Meas Issues Pract]]></source>
<year>2005</year>
<volume>24</volume>
<numero>2</numero>
<issue>2</issue>
<page-range>3-13</page-range></nlm-citation>
</ref>
<ref id="B31">
<label>31</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Haladyna]]></surname>
<given-names><![CDATA[TM,]]></given-names>
</name>
<name>
<surname><![CDATA[Rodriguez]]></surname>
<given-names><![CDATA[MC,]]></given-names>
</name>
<name>
<surname><![CDATA[Stevens]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Are Multiple-choice Items Too Fat?]]></article-title>
<source><![CDATA[Appl Meas Educ]]></source>
<year>2019</year>
<volume>32</volume>
<numero>4</numero>
<issue>4</issue>
<page-range>350-64</page-range></nlm-citation>
</ref>
<ref id="B32">
<label>32</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Hicks]]></surname>
<given-names><![CDATA[NA]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Guidelines for identifying and revising culturally biased multiple-choice nursing examination items]]></article-title>
<source><![CDATA[Nurse Educ]]></source>
<year>2011</year>
<volume>36</volume>
<numero>6</numero>
<issue>6</issue>
<page-range>266-70</page-range></nlm-citation>
</ref>
<ref id="B33">
<label>33</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Chiavaroli]]></surname>
<given-names><![CDATA[N]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Negatively-worded multiple choice questions An avoidable threat to validity]]></article-title>
<source><![CDATA[Pract Assessment, Res Eval]]></source>
<year>2017</year>
<volume>22</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>1-14</page-range></nlm-citation>
</ref>
<ref id="B34">
<label>34</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Gómez-Benito]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Sireci]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Padilla]]></surname>
<given-names><![CDATA[JL]]></given-names>
</name>
<name>
<surname><![CDATA[Dolores Hidalgo]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Benítez]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Differential item functioning Beyond validity evidence based on internal structure]]></article-title>
<source><![CDATA[Psicothema]]></source>
<year>2018</year>
<volume>30</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>104-9</page-range></nlm-citation>
</ref>
<ref id="B35">
<label>35</label><nlm-citation citation-type="">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Young]]></surname>
<given-names><![CDATA[JW.]]></given-names>
</name>
</person-group>
<source><![CDATA[Ensuring valid content tests for English Language Learners. Educational Testing Service]]></source>
<year>2008</year>
</nlm-citation>
</ref>
<ref id="B36">
<label>36</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Wong]]></surname>
<given-names><![CDATA[S,]]></given-names>
</name>
<name>
<surname><![CDATA[Yang]]></surname>
<given-names><![CDATA[L,]]></given-names>
</name>
<name>
<surname><![CDATA[Riecke]]></surname>
<given-names><![CDATA[B,]]></given-names>
</name>
<name>
<surname><![CDATA[Cramer]]></surname>
<given-names><![CDATA[E,]]></given-names>
</name>
<name>
<surname><![CDATA[Neustaedter]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Assessing the usability of smartwatches for academic cheating during exams]]></article-title>
<source><![CDATA[Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services, MobileHCI 2017]]></source>
<year>2017</year>
<publisher-name><![CDATA[Association for Computing Machinery]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B37">
<label>37</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bond]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Teaching to the Test Coaching or Corruption]]></article-title>
<source><![CDATA[New Educ]]></source>
<year>2008</year>
<volume>4</volume>
<numero>3</numero>
<issue>3</issue>
<page-range>216-23</page-range></nlm-citation>
</ref>
<ref id="B38">
<label>38</label><nlm-citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Lane]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Raymond]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Haladyna]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
<name>
<surname><![CDATA[Lane]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Raymond]]></surname>
<given-names><![CDATA[M]]></given-names>
</name>
<name>
<surname><![CDATA[Haladyna]]></surname>
<given-names><![CDATA[T]]></given-names>
</name>
</person-group>
<source><![CDATA[Handbook of Test Development]]></source>
<year>2016</year>
<edition>2</edition>
<publisher-loc><![CDATA[New York ]]></publisher-loc>
<publisher-name><![CDATA[Routledge]]></publisher-name>
</nlm-citation>
</ref>
<ref id="B39">
<label>39</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Jurado]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Leenen]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Reflexiones sobre adivinar en preguntas de opción múltiple y cómo afecta el resultado del examen]]></article-title>
<source><![CDATA[Inv Ed Med]]></source>
<year>2016</year>
<volume>5</volume>
<numero>17</numero>
<issue>17</issue>
<page-range>55-63</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
