INTRODUCTION
Development is defined as the dynamic, continuous acquisition of functions and independent domains that interrelate with each other and the environment in a non-linear way to achieve a balance throughout life. Studies have documented the influence of risk and protective factors that influence the course of early childhood development, which should be taken into account in primary health care (Walker et al., 2011). During this first phase of care, monitoring includes activities related to the promotion of normal development and the detection of delays or problems in the early years of life, which implies the continuous participation mainly of health professionals, parents and teachers (Hutchson & Nicoll, 1988).
Due to the complexity of the interaction between development areas and its path, it has been proposed that a delay in one area correlates with delay in others, which are common underlying manifestations of developmental deficits. Regarding this point, the complex, multifaceted relationship between language, cognition, and motor skills has been described, as a result of which intervention programs must take into account and promote these areas (Wang, Lekhal, Aarø, & Schjølberg, 2014; Piek, Dawson, Smith, & Gasson, 2008; Houwen, Visser, van der Putten, & Vlaskamp, 2016). A delay in acquiring functions of some or all areas of development is associated with the combination of several risk factors encompassing economic, social, environmental, and biological aspects, which can affect the level of a person’s functioning and may or may not cause disability (Narbona & Schlumberger, 2008). Thus, disability is defined as an activity limitation and a participation restriction, caused by a disorder that permanently affects the individual (World Health Organization, 2001).
The health care and wellbeing of children at a very early age are subjects of global interest and have been addressed in various fields of knowledge, such as medical, psychological, social, and economic spheres, in which research has been conducted on care programs and the impact they have had not only in the short term, but also at later stages of human development (Heckman, Pinto, & Savelyev, 2013; Heckman, 2011). In 2007, Chile became a pioneer in Latin America to implement a system for developmental care and monitoring from zero to four years, called Chile Grows with You Comprehensive Child Protection System (CHCC). One of its main objectives was to reduce the prevalence of developmental delay risk through specific actions for children and mothers in vulnerable conditions. This program has been evaluated at various levels through indicators to identify areas of opportunity and improve early childhood care by restructuring child health actions and policies in this country (Atalah et al., 2014), which has prompted other countries in the region to develop similar programs (Bedregal, 2014).
In Mexico, interest for early childhood and concern for its care is based on the Official Norm for Child Health Care (Secretaría de Salud, 1999), which not only establishes disease eradication and control, but also stipulates that all health institutions must periodically monitor the healthy growth and development of children under five through clinical observation. Various organizations and collegiate bodies have established guidelines and policies for child development. The American Academy of Pediatrics suggests continuous surveillance and monitoring of development, taking into account both biological and social risk factors (Bright Futures Steering Committee & Medical Home Initiatives for Children with Special Needs Project Advisory Committee, 2006). For this reason, it has been recommended to use simple, accessible methodologies, with valid, reliable instruments through which specific indicators are obtained on the level achieved in areas of development and risks are detected in the general population, together with developmental disorders.
The literature describes screening tests that require specialized, trained human resources for their application, as well as specific materials and clinical spaces (Figueiras, de Souza, Ríos, & Benguigui, 2006). In Mexico, validated, reliable screening instruments are available to assess early childhood development, such as the Neurobehavioral Assessment of the Development of Infants (VANEDELA, with a sensitivity .79 - .85 and specificity of .71 - .84) (Ruiz, Vázquez, & Sánchez, 2013) and the Child Development Evaluation (EDI, with a sensitivity of .81 and specificity of .61) (Rizzoli-Córdoba et al., 2013). At the same time, other instruments have been developed that do not require prior training for health professionals as parents and/or caregivers are the ones who answer the questionnaires, which involves them by taking into account their observations and concerns about their child’s behavior (Glascoe, 2000), since the information explored everyday activities. This is an advantage since it translates into a reduction of both financial and human costs to achieve one of the objectives of early childhood development monitoring programs. The Ages and Stages Questionnaire, known by its acronym ASQ (Ages and Stages Questionnaires) (Squires, Bricker, & Potter, 1997), was designed in the late 1970´s in USA as a development screening instrument from first month of birth to 66 months. It has been modified over time and is now in its third edition. The ASQ-3 is the version that has been most widely studied and has been translated into various languages and is used in several countries (Charafeddine et al., 2013; Filgueiras, Pires, Maissonette, & Landeira-Fernandez, 2013; Vameghi et al., 2013; Velikonja et al., 2016). Various studies have reported the stability, consistency, and validity of the psychometric characteristics (Dionne, McKinnon, Squires, & Clifford, 2014; Kerstjens et al., 2009; San Antonio, Fenick, Shabanova, Leventhal, & Weitzman, 2014). Regarding the specificity and sensitivity of ASQ-3, it has been reported that it is an instrument capable of detecting developmental delays at early ages. Some authors report that the ASQ-3 is better at identifying children at high risk (92.3%) than those at low risk (60%) (Ga & Kwon, 2011; Juneja, Mohanty, Jain, & Ramji, 2012). The ASQ-3 has proved its effectiveness detecting delays in children with vulnerable conditions. One study reported that the systematic application of the ASQ-3 in US children in vulnerable conditions facilitated the detection of delays in all domains of development, particularly in areas that are less explored during routine medical examinations, such as problem-solving, the socio-individual area, and fine motor skills (Jee et al., 2010). The ASQ-3 application in Spanish in Argentinian children showed sensitivity indices of 88% and specificity indices of 94%, with positive predictive values of 88% and negative predictive values of 96%. This suggests that the Spanish version is valid for detection of developmental delays or inconsistencies (Romero et al., 2018).
The objective of the first phase of this study was to determine the internal consistency and inter-rater reliability of the Spanish version of the ASQ-3 in Mexican preschoolers. The aim is to continue studying the psychometric properties of this instrument, because it has been reported in the literature as a useful tool for monitoring early childhood development, which would provide opportunities for health and education systems in Mexico.
Method
Participants
Sample consisted of parents and/or guardians of children aged between 33 and 60 months who attended various Child Development Daycare Centers (CENDIS) in the public and private sector of Mexico City for four consecutive years, beginning with the oldest children in 2011 and ending with the youngest preschoolers in 2015. The sample size was set on the basis of criteria for studying the psychometric properties of instruments (Anastasi, 1981; Pick & López, 1994), which stipulate that at least five subjects per item are required for statistical analysis.
Procedure
After authorization had been obtained from the directors of the CENDIS, parents/guardians and teachers were invited to a briefing session at which the importance of evaluating and monitoring areas of development through reliable, valid screening instruments for Mexican children was explained; then they were asked to participate. Parents/guardians who agreed to participate signed a consent letter and complete the sociodemographic data card and the ASQ-3 according to child’s age. Children with identified medical neurological, and psychiatric conditions in sociodemographic data sheet, as well as those whose information was not answered in full were excluded. Inter-rater reliability was carried out with the teachers of each age group and 30 questionnaires were randomly selected. A brief written report of the ASQ-3 results was delivered to each of the participating parents (Figure 1).
Instruments
Sociodemographic data sheet: An ad hoc questionnaire was designed on the sociodemographic aspects of the child and his/her parents; as well as the child’s prenatal and postnatal health.
Ages and Stages Questionnaire (ASQ-3): This is a screening instrument that identifies developmental delays in children aged 1 to 66 months. It consists of 21 questionnaires for the age ranges of 2, 4, 6, 8, 9, 10, 12, 14, 16, 18, 20, 22, 24, 27, 30, 33, 36, 42, 48, 54, and 60 months. Each questionnaire contains 30 questions that evaluate five areas of development: problem solving; communication; fine motor development; gross motor development; and socio-individual skills. Three response options are provided: 1. the behavior occurs; 2. sometimes occurs, and 3. has not yet occurred. It has a compilation sheet, in which the scores of the five areas are recorded, and three levels of the total score are represented in graphs: 1. The target level that refers to a score above expectations means that the child’s development matches the time of the evaluation; 2. The gray level shows a score barely above expectations and corresponds to scores with -1 standard deviation of the mean (-1 SD) indicates that the child is still in process in one area and it is suggested that additional activities should be proposed to encourage him or her and monitor the process; and 3. The black level, whose score is below expectations, relates to scores with -2 SD with respect to the average, which is an indicator that an additional, more specific evaluation should be carried out. Another section of the ASQ-3, comprising general observations by the parents, refers to family history, concerns, and recent health problems of the child, which allows for additional complementary information. Lastly, there is a section for the follow-up of ASQ-3, in which the suggestion to undertake additional activities and specific medical evaluations is recorded. Time required for answering and grading the questionnaire is approximately 10 to 15 minutes. The third edition, published in 2009, has a Spanish version designed for a basic reading comprehension level. As mentioned earlier, this instrument has acceptable psychometric levels, with a sensitivity of 86.1% and specificity of 85.6% (Pomés, Squires, & Yovanoff, 2016; Squires & Bricker, 2009).
Ethical considerations
Project was registered and approved by the ethics and research commission of a hospital within federal health system. The objective of the study, confidential handling of data, the benefit they would obtain from a personalized written report of the results of the ASQ-3, and recommendations on parenting strategies were explained to parents/guardians. The principle of autonomy was preserved since the parents who agreed to participate voluntarily signed an informed consent form.
Statistical analysis
Sociodemographic variables were analyzed with summary measures, and scores of each of the ASQ-3 scales were analyzed using central tendency measures. Cronbach’s alpha coefficient was used to measure consistency and Pearson correlation coefficients were used to measure the inter-rater correlation level of total and each one of the questionnaire areas considering significant correlations with the p < .05* y p < .01** values. Confirmatory factorial analysis and the Varimax method were used to identify independence between the factors and evaluate validity of the latter. A discriminant analysis was carried out using the variables of age and area of development in order to determine the level of risk in each area by means of total scores for each of them. They were represented on a table of frequencies and percentages according to age range, areas, and three levels of development (achieved, in process, and requiring specialized evaluation), in order to observe population distribution. Mean and standard deviation of each area and age group were obtained, and children with scores of less than one and two standard deviations were grouped into probable cases. Version 19 of the SPSS program for Windows was used.
Results
The sample comprised parents and guardians (n = 1291), out of which 194 parents declined to participate and 45 questionnaires were eliminated because they were not completely filled out, yielding a total of n = 1052 questionnaires, divided into six age groups (33, 36, 42, 48, 54, and 60 months of age) of the ASQ-3. Of the total sample, 48% (n = 505) were boys and 52% girls (n = 547). The parents’ educational attainment was extremely heterogeneous, a high percentage only completed middle school (Table 1).
Note: Ranges of each age group: 33 months (31 months 16 days to 34 months 15 days); 36 months (34 months 16 days to 38 months 30 days); 42 months (39 months to 44 months, 30 days); 48 months (45 months to 50 months); 54 months (51 months to 56 months, 30 days); 60 months (57 months to 66 months).
The reliability levels of the questionnaires for each age group were acceptable, ranging from α = .77 to α = .88. A confirmatory factorial analysis (CFA) was performed to evaluate construct validity, from which five factors were obtained through the Varimax method and Kaiser normalization. The percentages of explained variance ranged from 39.65% to 44.85%, which coincide with theoretical approach of the instrument. In order to under- take a discriminant analysis, the Wilks lambda statistic was obtained, which makes it possible to test the null hypothesis that the centroids of the groups are the same. The statistic obtained in all the tests was lower than the significance level (.05), from which it was concluded that the variables were adequate and significant for discriminating between age groups.
In the inter-rater reliability analysis, the areas having statistical significance are shown below:
the area of communication for 33 months (p < .05), 54 months (p < .05), and 60 months (p < .01);
the gross motor area for 36 months (p < .01) and 60 months (p < .01);
the fine motor area for 33 months (p < .01), 36 months (p < .01), 48 months (p < .01), 54 months (p < .01), and 60 months (p < .05);
the problem solving area for 33 months (p < .05) and 54 months (p < .05);
the socio-individual area for 36 months (p < .01), 48 months (p < .05), and 60 months (p < .05).
Correlation values obtained were .426 < - < .85, indicating moderate to good association levels, with the exception of the group aged 42 months, in which there were no significant correlations in the areas evaluated (Table 2).
Characterization of the levels obtained by ASQ-3 area and age group
In the characterization of levels obtained by each area of the ASQ-3 and age group, it was observed that over 70% of children were within the expectations for their age range (white area), except for the 48 and 54-month age groups, which were below 70% in the areas of fine motor skills and problem solving. Moreover, in the 48-month age group socio-individual area, less than 60% of the children were found in the white area. In the group of possible cases, we perceived that areas requiring more stimulation and specialized evaluation are the socio-individual area, problem solving, and fine motor skills, mainly in groups aged 42, 54, and 48 months. Figure 1 shows that approximately 30% of children evaluated failed to achieve expected level of development for their age in any area at the age of 42 months (Table 3 and Figure 2).
Discussion and conclusion
The results obtained suggest that theASQ-3 is a stable, consistent screening instrument for preschoolers at CENDIS in Mexico City. Levels of reliability for each age group ranged from α = .77 to α = .88, from which it may be inferred that the ASQ-3 possesses a level of accuracy acceptable (Kerlinger, Lee, Pineda, & Mora, 2002). The variation between reliability values reported in this study and those from the article by Hornman, Kerstjens, Winter, Bos, and Reijneveld (2013) can be explained by age variable. However, other authors, such as Vameghi et al. (2013) and Charafeddine et al. (2013), report this variability is due to translations and cultural adaptations of the instrument, as well as higher scores in each of the ASQ-3 areas in comparison with the original version in a US population (Vameghi et al., 2013; Velikonja et al., 2016).
Regarding inter-rater reliability (parents vs. teachers), moderate correlation levels were observed in this study. The areas with greatest correlation occurred at the ages of 54 and 60 months in areas of communication, fine motor and socio-individual skills, as well as gross motor skills for the age of 60 months and problem solving for age of 54 months. In the 42-month age group, no significant correlations were found between parents and teachers in areas of the instrument,which can be explained by findings of the study by Dionne et al. (2014). This study reports high rates of disagreement in areas of communication and problem solving, since it attributes these findings to discrepancy between parent’s expectations of their children development and those from the school system in kindergartens, and the difficulty parents may experience in classifying the frequency of a behavior in the way it is assessed by the ASQ-3 (always or sometimes) (Kerstjens et al., 2009).
According to Squires and Bricker (2009), Filgueiras et al. (2013), and Pomés et al. (2016), the ASQ-3 is easily applied and graded, making possible to distinguish between slight or serious delay problems in areas of development it evaluates.
Likewise, the ASQ-3 is a tool that can be used to periodically monitor early childhood development in pediatric medical care as indicated by the Secretaría de Salud (1999) for timely identification of developmental delays.
Is important to point out that in a developmental evaluation, when an area has not achieved the expected level according to child’s age, this represents a clinical risk and is an indicator that requires attention and stimulation. In this study, we found that areas of fine motor skills, socio-individual, and problem solving at the ages of 42, 48, and 54 months were located in the gray area (in process) or black area (below expectations), indicating need for a specialized clinical evaluation. In this regard, Wang et al. (2014) mentioned that finding two areas that require stimulation or specialized assessment are common underlying manifestations of neurodevelopmental deficits and should not be considered symptoms of separate disorders. Likewise, Houwen et al. (2016) reported that fine and gross motor development are strongly associated with cognition and therefore with language, which underlines the importance of early interventions that promote and stimulate all the areas involved.
Thus, we concluded that ASQ-3 scores are consistent in the age groups evaluated, which indicates the need to continue studying the internal and external validity of the instrument in order to be used in primary health care for development monitoring and timely detection of delays in Mexican children. One limitation of the study is that used the population of CENDIS in various parts of Mexico City and failed to consider other variables been associated with delays in early development, ranging from parent’s level of education and their socioeconomic status, including the mother’s health status during pregnancy, childbirth, postpartum, and the birth of the child ‒such as her weight and height‒. Future directions could meet the need to replicate this study in different populations, checking their psychometric properties and the association between health determinants and developmental delays. It is also essential to specify sensitivity and specificity indices in preschool children.