Print version ISSN 2007-2902
Rev. mex. cienc. geol vol. 25 no. 1 México Jan. 2008
Critical values for 33 discordancy test variants for outliers in normal samples up to sizes 1000, and applications in quality control in Earth Sciences
Surendra P. Verma1,*, Alfredo Quiroz-Ruiz1, and Lorena Díaz-González2
1 Centro de Investigación en Energía, Universidad Nacional Autónoma de México, Priv. Xochicalco s/n, Col. Centro, Apartado Postal 34, Temixco 62580, Mexico.* email@example.com
2 Posgrado en Ingeniería (Energía), sede Centro de Investigación en Energía, Universidad Nacional Autónoma de México, Priv. Xochicalco s/n, Col. Centro, Apartado Postal 34, Temixco 62580, Mexico.
Manuscript received: June 22, 2007
Corrected manuscript received: August 10, 2007
Manuscript accepted: August 12, 2007
In two earlier papers (Verma and Quiroz-Ruiz, 2006, Rev. Mex. Cienc. Geol., 23, 133-161, 302-319), precise critical values were reported for normal univariate samples of sizes n up to 100. However, for greater n, critical values are available for only a few tests: N1 for n up to 147, N4k=2 for n up to 149, and N6, N14, and N15 (for the latter three tests, critical values were reported only for n = 200, 500, and 1000). This clearly demonstrates the need for new critical values for n > 100 obtained through an adequate statistical methodology. Therefore, we report modifications of our earlier simulation procedure as well as new, precise, and accurate critical values or percentage points (with four to eight decimal places; average standard error of the mean 0.00000003 to 0.0039) for 15 discordancy tests with 33 test variants, each at seven significance levels α = 0.30, 0.20, 0.10, 0.05, 0.02, 0.01, and 0.005, for normal samples of sizes n up to 1000, viz., nmin(1)100(5)200(10)500(20)1000. For the first time in the literature, the standard error of the mean is also reported explicitly and individually for each critical value. Similarly, a new methodology involving an artificial neural network (ANN) was used, for the first time in the published literature, to obtain interpolation equations for all 33 discordancy test variants and for each of the seven significance levels. Each equation was fitted using 76 simulated data for n from 100 to 1000 for a given test and significance level. Extremely small sums of squared residuals (5.5 × 10⁻⁸ to 8.4 × 10⁻⁵; generally <10⁻⁵) were obtained for the ANN equations fitted for n = 100 to 1000. As a result, the applicability of these discordancy tests is now extended up to 1000 observations of a particular parameter in a statistical sample.
The new, more precise and accurate critical values will result in more reliable applications of these discordancy tests than have been possible so far in various scientific and engineering fields, particularly for quality control in Earth Sciences. The multiple-test method with the new critical values was shown to perform better than both the box-and-whisker plot and the "two standard deviation" methods used by some researchers, and is therefore the recommended procedure for handling experimental data.
Key words: outlier methods, normal sample, two standard deviation method, 2s method, reference materials, Monte Carlo simulation, critical values, Dixon tests, skewness, kurtosis, artificial neural network, ANN, statistics, petroleum hydrocarbon, Nd isotopes, BCR-1.
INTRODUCTION
Two recent papers (Verma and Quiroz-Ruiz, 2006a, 2006b) reported a highly precise and accurate Monte Carlo-type simulation procedure for N(0,1) random normal variates and presented new, precise, and accurate critical values for seven significance levels α = 0.30, 0.20, 0.10, 0.05, 0.02, 0.01, and 0.005, and for sample sizes n up to 100, for 15 discordancy tests with 33 variants. Table 1 summarizes these tests. However, for greater n, only a few critical values are available in the literature (Barnett and Lewis, 1994; Verma, 2005). These values are for tests N1 (n up to 147); N4k=2 (n up to 149); and N6, N14, and N15 (for the latter three tests, critical values with only two decimal places were reported only for n = 200, 500, and 1000).
Reference materials (RMs) are routinely used for quality control in Earth Sciences (e.g., Verma, 1997, 1998, 2005; Velasco-Tapia et al., 2001; M.P. Verma, 2004; Lozano and Bernal, 2005; Guevara et al., 2005; Sang et al., 2006; Santoyo et al., 2006; Papadakis et al., 2007). In other fields of science and engineering, quality control through RMs has also become mandatory, for example, in biology and medicine (Okamoto et al., 1996; Dybczynski et al., 1998; Patriarca et al., 2005); environmental sciences (Gill et al., 2004; Graybeal et al., 2004; Farre et al., 2006); and food research (In't Veld, 1998; Langton et al., 2002; Gabrovská et al., 2006).
When a large number of laboratories around the world participate in a cooperative study of an RM, the number of individual data (n) for a given chemical element in that RM can exceed 100. In these cases, the multiple-test method initially proposed by Verma (1997) and practiced by Verma (1998, 2005) and Verma and Quiroz-Ruiz (2006a, 2006b), among others, cannot at present be applied appropriately because precise critical values for n > 100 are unavailable for most discordancy tests (Table 1). This clearly demonstrates the need for new critical values for n > 100 obtained through an adequate statistical methodology. Critical values for large n (>100) are also required in the altogether different field of molecular and cellular proteomics (Xia et al., 2006; Murray Hackett, written communication, June 2007).
For the present work, we included most discordancy tests for normal univariate samples (15 tests with 33 test variants; see Table 1) and simulated new, precise, and accurate critical values for the same seven significance levels (α = 0.30 to 0.005) and for n up to 1000, viz., nmin(1)100(5)200(10)500(20)1000 (where nmin is the minimum number of data that can be tested by a given statistical test; see Table 1), using a simulation procedure slightly modified after Verma and Quiroz-Ruiz (2006a, 2006b). Further, a novel approach is followed, for the first time in the literature, for presenting these new critical values along with their respective standard errors and for interpolating the simulated critical values using an artificial neural network (ANN). These results are useful in all fields of science and engineering, especially for quality control in Earth Sciences. We present a few examples of the application of all normal univariate tests (Table 1) for which new, more precise critical values are reported in this paper.
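The sample-size notation nmin(1)100(5)200(10)500(20)1000 can be read as: from nmin to 100 in steps of 1, then up to 200 in steps of 5, up to 500 in steps of 10, and up to 1000 in steps of 20. The following minimal Python sketch expands that notation into an explicit list; the function name `sample_size_grid` is ours and purely illustrative.

```python
def sample_size_grid(n_min):
    """Expand nmin(1)100(5)200(10)500(20)1000 into an explicit list of n."""
    grid = list(range(n_min, 101, 1))    # nmin, ..., 100 in steps of 1
    grid += list(range(105, 201, 5))     # 105, ..., 200 in steps of 5
    grid += list(range(210, 501, 10))    # 210, ..., 500 in steps of 10
    grid += list(range(520, 1001, 20))   # 520, ..., 1000 in steps of 20
    return grid


grid = sample_size_grid(3)                 # nmin = 3 is one of the possible values
fit_sizes = [n for n in grid if n >= 100]  # sizes from 100 to 1000
print(len(grid), len(fit_sizes))
```

Counting the sizes from 100 to 1000 in this grid gives 76 values, which agrees with the 76 simulated data used to fit each interpolation equation.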
We will not repeat the explanation of discordancy tests; the reader is referred to Barnett and Lewis (1994), Verma (2005), or the recent papers by Verma and Quiroz-Ruiz (2006a, 2006b). The 15 tests with their 33 variants for which critical values were simulated are listed in Table 1.
SIMULATION PROCEDURE FOR MOST PRECISE AND ACCURATE CRITICAL VALUES
Our highly precise and accurate Monte Carlo-type simulation procedure has already been described in detail (Verma and Quiroz-Ruiz, 2006a, 2006b) and, therefore, will not be repeated here. However, the changes required for the present work are mentioned below.
In our present work, the simulations were of sizes 500,000 for tests N3 to N5 and N7 to N13; 1,000,000 for N14; and 2,000,000 for N1, N2, N6, and N15. They were repeated ten times (each using a different set of 500,000,000 to 2,000,000,000 random normal variates). Different simulation sizes (500,000 to 2,000,000) were chosen to optimize the simulation time required on personal computers and to obtain, at the same time, "acceptable" simulation errors for all tests. For tests N2, N5 to N8, N14, and N15, the final mean critical value or percentage point and its standard error for each n and α were estimated from ten repetitions. However, for tests such as N1 (Table 1), two independent test statistics (one for an upper and the other for a lower outlier) were simulated, and thus 20 independent results could be obtained from the same simulation scheme as reported earlier (Verma and Quiroz-Ruiz, 2006a, 2006b). Besides test N1, because of the existence of the upper and lower versions of the statistic (Table 1), 20 results of critical values and their error estimates were also obtained for tests N3k=2,3,4, N4k=1,2,3,4, and N9 to N13. For all these tests, therefore, the mean and standard error calculations were based on 20 independent results.
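The repetition scheme described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' program: it assumes the upper version of test N1 has the Grubbs-type form (x_max - mean)/s, it uses the standard-library generator instead of the authors' random-number procedure, and the simulation size (5,000 per repetition) is far smaller than the 500,000 to 2,000,000 used in the paper. The names `n1_upper` and `simulate_critical_value` are hypothetical.

```python
import math
import random


def n1_upper(sample):
    """Assumed form of test N1 (upper outlier): (x_max - mean) / s."""
    n = len(sample)
    mean = sum(sample) / n
    s = math.sqrt(sum((v - mean) ** 2 for v in sample) / (n - 1))
    return (max(sample) - mean) / s


def simulate_critical_value(n, alpha, n_sim=5000, n_rep=10, seed=1):
    """Return (mean critical value, standard error of the mean) from n_rep
    independent repetitions, each based on n_sim simulated N(0,1) samples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rep):
        stats = sorted(
            n1_upper([rng.gauss(0.0, 1.0) for _ in range(n)])
            for _ in range(n_sim)
        )
        # Empirical (1 - alpha) percentage point of the simulated statistic.
        idx = min(n_sim - 1, math.ceil((1.0 - alpha) * n_sim) - 1)
        estimates.append(stats[idx])
    mean_cv = sum(estimates) / n_rep
    sd = math.sqrt(sum((e - mean_cv) ** 2 for e in estimates) / (n_rep - 1))
    return mean_cv, sd / math.sqrt(n_rep)


mean_cv, se_cv = simulate_critical_value(n=10, alpha=0.05)
print(f"critical value ~ {mean_cv:.3f}, standard error ~ {se_cv:.4f}")
```

For n = 10 and α = 0.05 the estimate should fall close to the commonly tabulated one-sided Grubbs value of about 2.18; increasing `n_sim` reduces the standard error, which is the rationale for the larger simulation sizes used in this work.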
RESULTS OF NEW CRITICAL VALUES
Both the mean critical values and their standard errors for the 33 discordancy test variants (Table 1), for n from nmin (3, 4, 5, 6, 7, 8, or 9, depending on the type of statistic to be calculated) up to 1000, viz., nmin(1)100(5)200(10)500(20)1000, and α = 0.30, 0.20, 0.10, 0.05, 0.02, 0.01, and 0.005 (corresponding to confidence levels of 70% to 99.5%, or equivalently significance levels of 30% to 0.5%), are summarized in Tables A1 to A40 (40 tables in the electronic supplement; 20 odd-numbered tables for the standard errors and 20 even-numbered tables for the critical values). Thus, our data presentation approach is novel because, for the first time in the literature, the precision estimates are explicitly tabulated for each critical value. For example, for test N1, Table A1 presents the rounded standard errors individually for each n and α, whereas Table A2 similarly lists the rounded critical values; the rounding procedure follows the guidelines suggested by Verma (2005). Similarly, the standard errors and critical values are presented consecutively for the remaining tests N2 to N15 in Tables A3 to A40.
For all cases, our present values are more reliable (the error is a small number in the third to the eighth decimal place) than the earlier literature values (compiled by Barnett and Lewis, 1994; Verma, 2005), including those reported by Verma and Quiroz-Ruiz (2006a, 2006b). In fact, the errors of these literature critical values, except those by Verma and Quiroz-Ruiz (2006a, 2006b), are not precisely known. A synthesis of the standard errors of the mean for all tests is presented in Table A41 of the electronic supplement. The errors of the present critical values for n up to 1000 (Table A41) range as follows: 0.00000009 to 0.0007 for test N1 (see also Table A1); 0.00000003 to 0.0009 for test N2 (Table A3); 0.00005 to 0.0019 for N3k=2 (Table A5); 0.00009 to 0.0020 for N3k=3 (Table A7); 0.00010 to 0.0021 for N3k=4 (Table A9); 0.00000023 to 0.00040 for N4k=1 (Table A11); 0.00000007 to 0.00025 for N4k=2 (Table A13); 0.0000017 to 0.00021 for N4k=3 (Table A15); 0.0000021 to 0.00018 for N4k=4 (Table A17); 0.00000005 to 0.00035 for N5k=2 (Table A19); 0.00000005 to 0.0012 for N6k=2 (Table A21); 0.000016 to 0.0005 for N7 (Table A23); 0.000015 to 0.0006 for N8 (Table A25); 0.000015 to 0.00028 for N9 (Table A27); 0.000016 to 0.00032 for N10 (Table A29); 0.000015 to 0.00028 for N11k=2 (Table A31); 0.000008 to 0.00025 for N12k=2 (Table A33); 0.000011 to 0.00024 for N13k=2 (Table A35); 0.000023 to 0.0012 for N14 (Table A37); and 0.000015 to 0.0039 for N15 (Table A39).
The much greater precision (and accuracy) of the critical values simulated by Verma and Quiroz-Ruiz (2006a, 2006b) as compared to the literature values has already been documented. Here, we compare the mean values of the standard errors for n up to 100 for all tests obtained in the present work with those obtained by Verma and Quiroz-Ruiz (2006a, 2006b) and show that the critical values now reported are the most precise ever attempted in the literature (Table A41; see also Figure 1). This improvement is due to the much larger simulation sizes of 500,000 to 2,000,000 used in the present work, which resulted in smaller standard errors than was earlier possible from sizes of 100,000 to 500,000. Further, when the simulation sizes in the present work were exactly the same as those in Verma and Quiroz-Ruiz (2006b), for example, 500,000 for tests N3k=2, 3, and 4 (see the footnote of Table A41 for a correction and explanation), the errors were exactly the same (see the diamond symbols that plot right on the diagonal line in Figure 1). For these cases, not only the errors but also the critical values were identical in both simulations (this work and Verma and Quiroz-Ruiz, 2006b). This interesting observation testifies to the high reproducibility of our simulation procedure, because the earlier simulations (Verma and Quiroz-Ruiz, 2006b) were programmed in C whereas the present simulations were programmed in a different language (Java) and were run on a different and faster personal computer equipped with a different processor.
As with our earlier tables for sample sizes up to 100 (Verma and Quiroz-Ruiz, 2006a, 2006b), these new critical value data, along with their individual uncertainty estimates, are available in other formats such as txt, Excel, or Statistica, on request from the authors (S.P. Verma firstname.lastname@example.org, A. Quiroz-Ruiz email@example.com, or L. Díaz-González firstname.lastname@example.org). Similarly, the interpolation equations (see below) can also be obtained in a doc file with plain text format.
RESULTS OF INTERPOLATIONS OF CRITICAL VALUES USING ARTIFICIAL NEURAL NETWORK (ANN)
A new methodology was developed that involved the use of ANN for obtaining the best-fitted interpolation equations. This was required because, for 100 < n < 1000, critical values were not simulated for all n (see Table 1 for tests and Tables A1 to A40 for information on simulated n). No attempt was made in the present work to fit equations to critical values for n <100, mainly because precise and accurate critical values for all n <100 have already been simulated (Verma and Quiroz-Ruiz, 2006a, 2006b; this work). Therefore, interpolation equations were not required for small n. Prior to our work, different kinds of interpolation or fitting (Bugner and Rutledge, 1990; Rorabacher, 1991; Verma et al., 1998) to the low-precision critical values available in the literature (see Barnett and Lewis, 1994; Verma, 2005; Verma and Quiroz-Ruiz, 2006a, 2006b) were used for this purpose.
This is the first time in published literature that ANN was used for fitting highly sophisticated equations to the most precise and accurate simulated critical value data for n between 100 and 1000, with extremely small sums of squares of residuals and thence for predicting interpolated critical values with the smallest error. Details on the ANN can be found in Hassoun (1995) or Haykin (1999).
The fitted equations for test N1, using all critical values listed for n from 100 to 1000 (Table A2) for each α (from 0.30 to 0.005), are presented in Table 2. The values of the sum of squared residuals of each equation, Σ(SIM - ANN)², for the 76 simulated critical values used for equation fitting, corresponding to n = 100(5)200(10)500(20)1000 and α = 0.30, 0.20, 0.10, 0.05, 0.02, 0.01, and 0.005, are also included in Table 2. For the remaining tests (N2 to N15), the equations are summarized in Tables A42 to A60 of the electronic supplement.
The fitting quality parameter Σ(SIM - ANN)² for the interpolation equations for n = 100(5)200(10)500(20)1000 (Tables 2 and A42 to A60) ranges as follows: ~2.4 × 10⁻⁷ to 4.1 × 10⁻⁶ for test N1 (see Table 2); ~4.1 × 10⁻⁷ to 7.8 × 10⁻⁶ for test N2 (Table A42); ~1.8 × 10⁻⁶ to 3.0 × 10⁻⁵ for N3k=2 (Table A43); ~4.1 × 10⁻⁶ to 7.6 × 10⁻⁵ for N3k=3 (Table A44); ~7.7 × 10⁻⁶ to 8.4 × 10⁻⁵ for N3k=4 (Table A45); ~1.9 × 10⁻⁸ to 4.8 × 10⁻⁷ for N4k=1 (Table A46); ~1.0 × 10⁻⁸ to 8.2 × 10⁻⁷ for N4k=2 (Table A47); 9.3 × 10⁻⁸ to 8.7 × 10⁻⁷ for N4k=3 (Table A48); ~1.9 × 10⁻⁷ to 8.8 × 10⁻⁷ for N4k=4 (Table A49); ~7.5 × 10⁻⁸ to 5.3 × 10⁻⁷ for N5k=2 (Table A50); ~6.7 × 10⁻⁷ to 2.2 × 10⁻⁵ for N6k=2 (Table A51); ~4.2 × 10⁻⁷ to 1.3 × 10⁻⁶ for N7 (Table A52); 2.7 × 10⁻⁷ to 9.7 × 10⁻⁷ for N8 (Table A53); 5.1 × 10⁻⁷ to 2.0 × 10⁻⁵ for N9 (Table A54); ~5.7 × 10⁻⁷ to 9.6 × 10⁻⁶ for N10 (Table A55); ~1.4 × 10⁻⁷ to 8.0 × 10⁻⁷ for N11k=2 (Table A56); ~1.9 × 10⁻⁷ to 7.2 × 10⁻⁶ for N12k=2 (Table A57); ~2.8 × 10⁻⁶ to 2.9 × 10⁻⁵ for N13k=2 (Table A58); ~5.7 × 10⁻⁷ to 3.0 × 10⁻⁵ for N14 (Table A59); and ~1.8 × 10⁻⁷ to 2.8 × 10⁻⁵ for N15 (Table A60). Thus, the fitting quality parameter Σ(SIM - ANN)² for the interpolation equations was generally <10⁻⁵ (and always <10⁻⁴).
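The fitting quality parameter is simply the sum, over the simulated sample sizes, of the squared differences between the simulated critical value (SIM) and the value predicted by the fitted equation (ANN). A minimal Python sketch follows; the function name and the three pairs of numbers are hypothetical and are not taken from the tables of this work.

```python
def sum_squared_residuals(simulated, fitted):
    """Sum of squared residuals, i.e. the sum of (SIM - ANN)**2 over all points."""
    if len(simulated) != len(fitted):
        raise ValueError("both sequences must have the same length")
    return sum((s - f) ** 2 for s, f in zip(simulated, fitted))


# Hypothetical values for illustration only.
simulated = [3.0, 3.1, 3.2]     # critical values from simulation
fitted = [3.001, 3.099, 3.2]    # values predicted by an interpolation equation
ssr = sum_squared_residuals(simulated, fitted)
print(f"{ssr:.2e}")
```

In the paper each sum runs over 76 points, so a value below 10⁻⁵ implies a typical misfit per point well below the third decimal place.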
These equations can be used to compute precisely the interpolated critical values for all n between 100 and 1000 for which such values are not listed in Tables A2 to A40 (even-numbered tables). Thus, precise critical values can be made available for all n between nmin and 1000, viz., nmin(1)1000 (see Table 1 for more information on all tests and their nmin values). Figure 2 shows, as an example, the simulated critical values for test N1 (CVTN1) for all values of α (0.30 to 0.005) as a function of n (from 100 to 1000). The respective interpolation equations are also plotted using dotted or dashed curves. Note that these equations match the simulated critical values so closely that the curves can hardly be distinguished from the data in Figure 2.
APPLICATIONS IN SCIENCE AND ENGINEERING
The tests (Table 1), after extending their applicability to samples of sizes up to 1000, can be applied to all examples earlier summarized by Verma and Quiroz-Ruiz (2006a, 2006b). These include all the following fields (but are not limited to them): Agricultural and Soil Sciences; Aquatic Environmental Research; Astronomy; Biology; Biomedicine and Biotechnology; Chemistry; Electronics; Ecology; Geochronology; Geodesy; Geochemistry; Isotope Geology; Medical Science and Technology; Meteorology; Paleontology; Petroleum Hydrocarbons and Organic Compounds in Sediment Samples; Quality Assurance and Assessment Programs in Biology and Biomedicine, in Cement Industry, in Food Science and Technology, in Environmental and Pollution Research, in Nuclear Science, in Rock Chemistry, in Soil Science, and in Water Research; Structural Geology; Water Resources; and Zoology. Further, our new critical values for n up to 1000 will be equally useful for applying these discordancy tests to identify outliers in linear regressions, such as those recently employed by Verma et al. (2006).
APPLICATIONS IN QUALITY CONTROL IN EARTH SCIENCES
As stated earlier in the Introduction section, the most important requirement for simulating critical values for n > 100 was for processing interlaboratory data for RMs. This is confirmed by the information synthesized in Table 3. Thus, the data for all chemical elements, without exception, in all RMs (Table 3) can now be processed for possible outliers and thence for correctly computing central tendency (or location) and dispersion (or scale) parameters (see Verma, 2005, for more details) using outlier-based statistical methods.
We now present examples or case histories to illustrate the use of all discordancy tests for which new critical values have been obtained in this work. For these applications, we chose the strict confidence level of 99% (i.e., we used the 99% CL, or 1% SL, or α = 0.01 column; see the respective critical values in the even-numbered Tables A2 to A40 in the electronic supplementary data file).
Example 1. Comparison of multiple-test method with box-and-whisker plot method: Chlorinated pesticides and petroleum hydrocarbons in a sediment reference sample
Verma and Quiroz-Ruiz (2006b) used interlaboratory data for one sediment RM (IAEA-417; IAEA: International Atomic Energy Agency) to show that, for detecting outliers, the multiple-test method of Verma (1997) performed better than the box-and-whisker plot method used by Villeneuve et al. (2002). Here, we use, as the first example, a different sediment RM (IAEA-408) from an interlaboratory study by Villeneuve et al. (1999) to highlight the use of the multiple-test method and compare its performance with the box-and-whisker plot method used by the original authors. The individual data for nine selected compounds (six chlorinated pesticides and three petroleum hydrocarbons) were compiled in Table A61 (see electronic supplement) from the original report by Villeneuve et al. (1999). The multiple-test method (Table 1) consisted of applying all nine single-outlier tests (with 13 test variants) and seven multiple-outlier tests (with 20 test variants) at the strict confidence level of 99% to a given set of data although, if desired, fewer tests or a less strict confidence level, such as 95%, could be used instead. The final concentration data along with the basic statistical information are summarized in Table 4. For most compounds listed in Table A61, a greater number of discordant outliers were detected by the multiple-test method than by the box-and-whisker plot method used by the original authors (Villeneuve et al., 1999). Consequently, smaller standard deviations (and somewhat different mean values, although probably not statistically different) were obtained from the multiple-test method as compared to the box-and-whisker plot method for all cases except one compound (PCB101), for which neither method detected any outliers. Note also that, because of the presence of outliers, the initial mean and standard deviation data differ strongly from the final statistical parameters.
Finally, had we applied the multiple-test method at a less strict confidence level of 95% (which would probably be consistent with the box-and-whisker plot method, although details on the respective confidence level were not provided by Villeneuve et al., 1999), a greater number of outliers would have been identified in most cases than those obtained at the 99% confidence level, with a consequent reduction of the standard deviation values obtained by our method and possible changes in the mean values (Table 4).
Application of significance tests, such as the F-test and Student t-test (see Verma, 2005, for more details on significance tests), to evaluate the performance of the multiple-test method versus the box-and-whisker plot method is not advisable using the present data, because a rather small number of degrees of freedom (7 to 33; the final number of data remaining after outlier elimination varies from 8 to 31 for nf and from 13 to 34 for nl in Table 4) is involved. We consider these numbers too small for quality control purposes. Therefore, an objective statistical comparison of the multiple-test method and the box-and-whisker plot method should be done using larger datasets.
Nevertheless, as in Verma and Quiroz-Ruiz (2006b), but using different datasets, we now conclude that the multiple-test method exemplified in this work can be advantageously used in future to arrive at the final statistical parameters in such interlaboratory studies.
Example 2. Comparison of multiple-test method with "two standard deviation" (2s) method: Two chemical elements and one isotopic ratio in a geochemical reference material (BCR-1) from U.S.G.S.
We present the example of just two elements (the petrogenetically important trace elements Sm and Nd) and one widely used radiogenic isotopic ratio (143Nd/144Nd) in a rock RM (Columbia River basalt), BCR-1, from the United States Geological Survey (U.S.G.S.), U.S.A. This RM was extensively used four decades ago because it was recommended as the RM for all studies of lunar rocks, i.e., researchers reporting data on lunar rocks had to evaluate their data quality by reporting the analysis of this particular RM. Since then, this RM has been widely used in geochemical laboratories, including isotope laboratories; most Nd isotope studies, even today, report 143Nd/144Nd in this RM. However, because BCR-1 is no longer available for distribution by the U.S.G.S., it has been replaced by BCR-2 (a sample collected at the same site as BCR-1).
The individual data for BCR-1 for Sm, Nd, and 143Nd/144Nd were compiled from Gladney et al. (1990) and are presented in supplementary Tables A62, A63, and A64, respectively. No attempt was made to complement them with more recent data on this widely used RM because our main aim was to compare the performance of the multiple-test method with the "two standard deviation" (2s) method used by Gladney et al. (1990) to process their compiled data. Although Gladney et al. (1990) is a relatively old compilation, it is the latest one available on this particular RM (BCR-1) in the published literature.
The results of the application of the multiple-test method, along with those from the 2s method, are also summarized in Table 4. For Sm, the same number (10) of outliers were detected by both methods (multiple-test versus 2s), whereas for Nd the multiple-test method detected more outliers (22) than the 2s method (11). However, note that the multiple-test method was applied at the strict confidence level of 99% (and not at the less strict level of 95%, which would, in theory, correspond to the 2s method, and is likely to detect more outliers than the 99% level).
Nevertheless, the statistically correct procedure for applying discordancy tests to such analytical datasets (e.g., Sm and Nd data obtained by different types of analytical methods) would be to: (i) construct statistical samples for each method group and process them separately for outliers using the multiple-test method; (ii) apply the ANOVA ("ANalysis Of VAriance") test to decipher any statistically significant differences among different method groups, and isolate a particular group if it is significantly different from the remaining ones at a predetermined confidence level; (iii) combine the data from different method groups showing no significant differences and process them again for possible outliers; and (iv) calculate the final statistical parameters from the normally distributed final data. We suggest that ANOVA be applied at the strict confidence level of 99% (see Verma, 2005, for the respective tabulated critical values). The results from such a statistically correct procedure are also presented for Sm and Nd in rows marked by an asterisk (*) in Table 4. Gladney et al. (1990) did not present such method-group-based results for their 2s method, although they did so for individual techniques. A clear advantage of the multiple-test method as compared to the 2s method is observed for detecting outlying observations for both Sm and Nd concentration data if we compare our results (see rows marked by an asterisk in Table 4) with the results for "all groups" presented by Gladney et al. (1990).
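Step (ii) of the procedure above rests on the one-way ANOVA F statistic, i.e., the ratio of the between-group mean square to the within-group mean square. The following Python sketch computes it for a set of method groups; the function name and the three small groups of numbers are hypothetical, and the comparison with a tabulated critical value at the chosen confidence level is left to the reader.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of groups of measurements."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    ss_between = 0.0
    ss_within = 0.0
    for g in groups:
        group_mean = sum(g) / len(g)
        ss_between += len(g) * (group_mean - grand_mean) ** 2
        ss_within += sum((v - group_mean) ** 2 for v in g)
    df_between = k - 1
    df_within = n_total - k
    f_value = (ss_between / df_between) / (ss_within / df_within)
    return f_value, df_between, df_within


# Hypothetical concentration data for three method groups (illustration only).
method_groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [3.0, 4.0, 5.0]]
f_value, df_b, df_w = one_way_anova_f(method_groups)
print(f_value, df_b, df_w)
```

If the computed F exceeds the tabulated critical value for (df_between, df_within) degrees of freedom, at least one method group differs significantly and would be isolated before the remaining groups are combined in step (iii).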
We applied the F-test and Student t-test to the two sets of Sm and Nd concentration data in order to evaluate whether there were significant differences between the final results obtained by the multiple-test method and the 2s method (Table 4). Significant differences (at the 95% confidence level for both Sm and Nd, and even at the 99% confidence level for Nd) in standard deviation or variance were observed between these two sets of data when the statistically correct procedure involving method groups was used for the multiple-test method (see rows marked by an asterisk in Table 4). Variance estimates of Sm and Nd data processed by the multiple-test method were significantly lower than those obtained by the 2s method. Such results will have important implications for instrumental calibrations (using weighted regression techniques such as those used by Guevara et al., 2005) and data quality assessments (using significance tests). More details can be found in Verma (2005). The comparison of the Nd isotope data is discussed below.
First of all, it is very important to note that for 143Nd/144Nd from California Institute of Technology (CalTec)-type laboratories that normalize, during data acquisition, the Nd isotopic composition to 148Nd/144Nd = 0.243082 (see DePaolo and Wasserburg, 1976, for details on CalTec-type laboratories), the data have to be converted according to the following Equation (1), in order to make them consistent with the numerous laboratories around the world that are Lamont-type and use 146Nd/144Nd = 0.7219 for normalization (see O'Nions et al., 1977, for more details on Lamont-type laboratories).
Almost concurrently with the CalTec and Lamont laboratories, Richard et al. (1976) from the University of Paris also discovered the utility of the Sm-Nd isotope systematics in Earth Sciences, but they used 143Nd/146Nd instead of 143Nd/144Nd to show the usefulness of their work.
The existence of two types of laboratories was probably one of the main reasons why the CalTec researchers introduced the εNd notation (for the definition of εNd see DePaolo and Wasserburg, 1976, 1977). The actual values of 143Nd/144Nd from these two types of laboratories are drastically different: the nine lower values in the compilation by Gladney et al. (1990) are totally distinct from the rest of the compiled data. These values are identified by an asterisk in Table A64, where they are listed after conversion according to Equation (1) and therefore do not differ significantly from the rest of the data. The use of εNd, however, makes the Nd isotope data from these two types of laboratories directly comparable and fully consistent. As it happens, the Lamont-type laboratories have become much more numerous than the CalTec-type laboratories (for example, in the compilation by Gladney et al., 1990, only nine values out of a total of 102 were from CalTec-type laboratories).
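The way εNd removes the normalization difference can be illustrated with its usual definition, εNd = [(143Nd/144Nd)sample / (143Nd/144Nd)CHUR - 1] × 10⁴, where each ratio is compared with the chondritic reference (CHUR) expressed in the same normalization. In the sketch below the two CHUR values (0.512638 and 0.511847) are commonly quoted present-day values that we assume for the Lamont-type and CalTec-type conventions, respectively; they are not taken from this paper, and the two sample ratios are hypothetical.

```python
def epsilon_nd(ratio_sample, ratio_chur):
    """Deviation of 143Nd/144Nd from the CHUR reference, in parts per 10,000."""
    return (ratio_sample / ratio_chur - 1.0) * 1.0e4


# Assumed present-day CHUR values for the two normalization conventions.
CHUR_LAMONT = 0.512638   # assumption: normalization to 146Nd/144Nd = 0.7219
CHUR_CALTEC = 0.511847   # assumption: CalTec-type normalization

# Hypothetical measurements of the same rock in the two types of laboratory.
eps_lamont = epsilon_nd(0.512640, CHUR_LAMONT)
eps_caltec = epsilon_nd(0.511849, CHUR_CALTEC)
print(round(eps_lamont, 2), round(eps_caltec, 2))
```

Although the two raw ratios differ in the third decimal place, their εNd values agree closely, which is why the notation makes data from both types of laboratories directly comparable.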
This essential conversion for handling 143Nd/144Nd from these two types of laboratories was not recognized by Gladney et al. (1990) as a requirement to statistically process Nd isotopic data, which resulted in erroneous processing of the isotope data (Table 4). Thus, their 2s method should have certainly, but erroneously, rejected the CalTectype data (nine data; see Table 4) as outliers although this was not clearly specified by these authors. The multipletest method, on the other hand, showed that no outlier is present in this dataset at the strict confidence level of 99%.
Verma and QuirozRuiz (2006b) extensively commented on the shortcomings of the 2s method and used a rock RM peridotite JP1 from Japan to show that just one multipleoutlier test N3 in its three variants (k=2,3,4) applied at 95% confidence level, performed better than the 2s method for detecting outliers. Here, using different datasets (Sm, Nd and 143Nd/144Nd in BCR1), we conclude that the multipletest method exemplified in this work can be advantageously used in future to arrive at the final statistical parameters in such interlaboratory studies and the probably statistically erroneous 2s method should be abandoned. Such outlier methods based on "fixed" multiples of standard deviation have also been recently criticized by Hayes et al (2007).
Example 3. Other examples of outlier tests applicable in Earth Sciences
We now briefly comment on the need for using the above multiple-test method in numerous geoscience studies. Verma and Quiroz-Ruiz (2006b) already applied the multiple-test method to oxygen isotope data for the Los Azufres hydrothermal system reported by Torres-Alvarado (2002). They also pointed out other studies where the multiple-test method would be useful.
Similarly, this multiple-test method with the new critical values can be successfully applied to process and better interpret: (i) effective weight, variation index and other groundwater data discussed by El-Naqa et al. (2006); (ii) univariate and bivariate data of ammonites documented by López-Palomino et al. (2006), the latter (bivariate) data using studentized residuals from the regression; (iii) bivariate data for naturally fractured reservoirs from Mexico presented by Miranda-Martínez et al. (2006); (iv) data acquisition stage of mass spectrometric instrumentation used for 40Ar-39Ar (Molina-Garza and Ortega-Rivera, 2006) or K-Ar geochronology (Solé et al., 2007), including Rb-Sr, Pb-Pb, Sm-Nd, and Re-Os geochronology or isotope geology (see for example, Wang et al., 1998; Dougherty-Page and Bartlett, 1999, who used just one Dixon test); (v) geochemical data on granitic xenoliths and rocks presented and compiled by Corona-Chávez et al. (2006); (vi) microprobe data recently reported by Colombo et al. (2007); (vii) chemical data of inactive tailings from the Santa Bárbara mineral zone, Chihuahua, documented by Gutiérrez-Ruiz et al. (2007); (viii) chemical data for recent and historic tailings of a Pb-Zn-Ag skarn deposit analyzed by Méndez-Ortiz et al. (2007); (ix) geochemical and stable isotope data for sedimentary rocks obtained by Nagarajan et al. (2007, in press); (x) cation and anion composition data for underground water reported by Ramos-Leal et al. (2007); (xi) geotechnical variables for oil prospect exploration decision making discussed by Salleh et al. (2007); (xii) major and trace element data for metabasic volcanic rocks presented by Shekhawat et al. (2007) and for topaz-bearing rhyolites documented by Rodríguez-Ríos et al. (2007); and (xiii) mineral composition data for metagabbroic rocks reported by Cruz-Gámez et al. (2007).
We note that the discordant outliers, if present, are of great value for further understanding the geological processes, provided they are not caused solely by analytical uncertainty. The remaining normally distributed data (after eliminating outlying observations as judged by the applied discordancy tests) can then be used for correctly calculating the central tendency or location (mean) and dispersion or scale (standard deviation) parameters (see Verma, 2005, for more details).
As an example of the application of the multiple-test method, we can mention that Colombo et al. (2007) applied five single-outlier tests (N1, N2, N7, N8 and N9, with their seven test variants) to ascertain the absence of outliers in their geochemical data before calculating the mean and standard deviation values. We suggest that for small datasets such as those mentioned in this section, the multiple-test method could consist of applying consecutively all nine single-outlier tests (N1, N2, N4k=1, N7-N10, N14, and N15), with their 13 test variants, to detect possible outlier(s). The multiple-outlier tests (k=2 to k=4 types), with 20 test variants, could additionally be used for larger datasets such as those obtained in interlaboratory studies of RMs.
In summary, we emphasize that the multiple-test method proposed by Verma (1997) and exemplified in our paper is a recommended procedure to process experimental data under the assumption that the data are drawn from a normal distribution, and that departure from this assumption due to any contamination or presence of discordant outliers can be properly handled by tests N1 to N15 (all 15 tests with their 33 variants, or only those selected for this purpose).
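The consecutive use of a single-outlier test can be sketched as a loop that tests the most extreme value, removes it if discordant, and repeats on the reduced sample. The code below is a minimal sketch under stated assumptions: it implements only the upper-outlier Grubbs-type statistic (x(n) − mean)/s, taken here to correspond to test N1; the names `n1_upper`, `consecutive_upper_outliers` and the `critical_value` callable are hypothetical, and the constant threshold in the usage lines is a placeholder, not a tabulated critical value. In practice the critical value for each n and significance level would be read from the published tables.

```python
import math


def n1_upper(data):
    """Grubbs-type statistic (x_max - mean) / s for the largest observation."""
    n = len(data)
    mean = sum(data) / n
    s = math.sqrt(sum((x - mean) ** 2 for x in data) / (n - 1))
    if s == 0.0:
        return 0.0  # all values identical: nothing can be discordant
    return (max(data) - mean) / s


def consecutive_upper_outliers(data, critical_value, min_n=3):
    """Test and remove the largest value repeatedly while it is discordant.

    critical_value: callable n -> critical value at the chosen significance
    level (hypothetical interface; real values come from published tables).
    Returns (retained, rejected).
    """
    retained = sorted(data)
    rejected = []
    while len(retained) >= min_n:
        if n1_upper(retained) <= critical_value(len(retained)):
            break  # largest value is not discordant; stop testing
        rejected.append(retained.pop())  # drop the discordant maximum
    return retained, rejected


# Made-up data and a PLACEHOLDER threshold, for illustration only.
values = [10.1, 10.2, 9.9, 10.0, 10.3, 9.8, 10.0, 10.1, 15.0]
retained, rejected = consecutive_upper_outliers(values, lambda n: 2.0)
print("rejected:", rejected, "retained n =", len(retained))
```

A full multiple-test procedure would apply every selected test variant (upper, lower and two-sided forms) in the same consecutive manner; the sketch shows only the control flow for one of them.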
APPLICABILITY AND PERFORMANCE OF DISCORDANCY TESTS
There has been considerable confusion regarding the applicability of discordancy tests. Miller and Miller (2000, p. 54-57), among others, have expressed the view that Dixon tests are applicable only to small data sets, without actually providing any reasons for this limitation. Surprisingly, these authors intuitively refer to the several Dixon tests as a single Dixon test (Dixon's Q test). Dixon's Q test (N7 by Dixon, 1951; N8 by King, 1953; see Table 1) is said to be "valid" for small n = 3 to 7 (Miller and Miller, 2000, p. 54). Unfortunately, this kind of view has plagued the literature in chemistry. Dixon (1951) presented approximate critical values independently for all tests and for n up to 30. Why, then, should Dixon tests be limited to n = 3 to 7? It is true that Dixon tests are especially vulnerable to possible masking effects (Dixon, 1950; Barnett and Lewis, 1994), but this refers to their power and not to their applicability. In fact, the power of tests as inferred by Dixon (1950) could have been seriously affected by the approximate nature of the critical values by Dixon (1951); see the large differences between these values and those simulated by Verma and Quiroz-Ruiz (2006a), or those presented in the present work. This performance evaluation (Dixon, 1950) should also be considered rather incomplete from the modern statistical point of view presented by Barnett and Lewis (1994) and Hayes and Kinsella (2003). Therefore, the evaluation of the performance of Dixon and other tests should await further detailed work, which is already in progress by Verma and collaborators.
Nevertheless, statisticians specializing in outlier theory (e.g., Barnett and Lewis, 1994, in their authoritative book, p. 218-234) have pointed out no such "applicability" limitations of single-outlier tests, including Dixon tests, the only limitations being the availability of critical values (see e.g., Verma, 1997; Velasco et al., 2000; Verma and Quiroz-Ruiz, 2006a, 2006b) and the efficiency of discordancy tests. Barnett and Lewis (1994, p. 126) have, in fact, suggested that single-outlier tests should be classified as consecutive tests, and therefore these tests can be used for identifying multiple (i.e., more than one) outliers (see p. 125-127 in Barnett and Lewis, 1994). The null and alternate hypotheses can thus be repeatedly postulated to test a given dataset for several outliers using single-outlier tests in a consecutive way. Of course, multiple-outlier tests or block procedures (see Barnett and Lewis, 1994) could be more suited for large datasets, but more work is needed to quantitatively evaluate the performance of all single- and multiple-outlier tests. For this reason we believe that new, precise and accurate critical values should be generated not only for small n but also for very large n, as has been accomplished in the present work. We plan to address these questions of utmost importance in our future work; the first paper in this direction is already in preparation by S.P. Verma, L. Díaz-González, and R. González-Ramírez.
The performance of a discordancy test is an important "quality" factor that should be properly evaluated (Barnett and Lewis, 1994; Velasco and Verma, 1998; Velasco et al., 2000; Hayes and Kinsella, 2003; Efstathiou, 2006). More than a decade ago, Barnett and Lewis (1994, p. 131) pointed out that much remains to be done in the area of performance of the various available procedures in the presence of outliers. Even today, this statement seems to be true. Thus, appropriate statistical procedures for handling and interpretation of experimental data should be the focus of future work. Masking and swamping effects are also important factors that require special attention and evaluation (Barnett and Lewis, 1994). Finally, because the performance of discordancy tests is not properly known at present (although some empirical results were reported by Velasco and Verma, 1998, and Velasco et al., 2000), Verma and collaborators (Verma, 1997, 1998, 2005; Verma et al., 1998; Guevara et al., 2001; Velasco-Tapia et al., 2001; Verma and Quiroz-Ruiz, 2006a, 2006b; this work) have proposed and practiced the use of the multiple-test method and not just some selected test(s). The computation of new, precise and accurate critical values as carried out in the present work should make it easier in future to evaluate the performance of discordancy tests empirically, and more thoroughly than attempted by Velasco et al. (2000).
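The point that the statistic itself carries no sample-size restriction can be seen from its definition. The sketch below assumes that test N7 is Dixon's ratio of the gap between the extreme value and its nearest neighbour to the sample range, (x(n) − x(n−1))/(x(n) − x(1)) for an upper outlier and (x(2) − x(1))/(x(n) − x(1)) for a lower one; the function name `dixon_n7` and the data are hypothetical. Only the critical value used for comparison depends on n.

```python
def dixon_n7(data):
    """Return (upper, lower) Dixon gap-to-range ratios for a sample of n >= 3.

    upper tests the largest observation, lower tests the smallest one.
    """
    x = sorted(data)
    if len(x) < 3:
        raise ValueError("at least three observations are required")
    spread = x[-1] - x[0]
    if spread == 0:
        raise ValueError("all observations are identical")
    upper = (x[-1] - x[-2]) / spread
    lower = (x[1] - x[0]) / spread
    return upper, lower


# Made-up measurements with one suspiciously low value (n = 10).
small = [0.189, 0.167, 0.187, 0.183, 0.186, 0.182, 0.181, 0.184, 0.181, 0.177]
print(dixon_n7(small))

# The same function works unchanged for a larger sample (n = 51).
large = list(range(50)) + [100]
print(dixon_n7(large))
```

Whether a computed ratio is significant is then decided by comparison with the critical value for the given n and significance level, which is exactly where the availability of tables for large n matters.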
CONCLUSIONS
We have used our established and well-tested Monte Carlo-type simulation procedure for generating new, precise and accurate critical values for 15 discordancy tests with 33 test variants for sample sizes up to 1000. For the first time in the published literature, these critical values are tabulated along with their individual uncertainty estimates, as well as new interpolation equations obtained by ANN. These new critical values will be very useful in many diverse fields of science and engineering, including quality control in Earth Sciences. Specific examples are presented to highlight the use of these new critical values for quality control. The multiple-test method outlined in the present work seems to perform better than both the box-and-whisker plot and the "two standard deviation" (2s) methods used for processing interlaboratory data on RMs for quality control purposes. Much work is still needed to evaluate the performance of discordancy tests. The new critical values for all sample sizes up to 1000 simulated and interpolated in this work should certainly facilitate the performance evaluation.
ACKNOWLEDGEMENTS
This third manuscript was prepared as a result of the invitation extended to the first author during 2006 by the editor-in-chief Susana Alaniz-Álvarez. We are also grateful to Constantinos E. Efstathiou and an anonymous reviewer for a highly positive evaluation of our work. In spite of this appreciation, both of them pointed out several shortcomings in our earlier manuscript. Their critical comments helped us significantly improve our presentation.
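The general idea of obtaining a critical value by Monte Carlo simulation can be illustrated with a short sketch. It is not the authors' procedure: it assumes Python's built-in normal generator, a modest number of replications, and the upper-outlier statistic (x(n) − mean)/s as a stand-in for one test variant; the function names, seed and replication count are hypothetical choices. The critical value at significance level α is estimated as the empirical (1 − α) quantile of the statistic over simulated normal samples of size n.

```python
import math
import random


def n1_upper(sample):
    """Grubbs-type statistic (x_max - mean) / s for the largest observation."""
    n = len(sample)
    mean = sum(sample) / n
    s = math.sqrt(sum((x - mean) ** 2 for x in sample) / (n - 1))
    return (max(sample) - mean) / s


def simulate_critical_value(n, alpha, replications=20000, seed=12345):
    """Estimate the critical value as the empirical (1 - alpha) quantile.

    Each replication draws n values from N(0, 1) and computes the statistic.
    The seed is fixed only to make this illustration reproducible.
    """
    rng = random.Random(seed)
    stats = sorted(
        n1_upper([rng.gauss(0.0, 1.0) for _ in range(n)])
        for _ in range(replications)
    )
    index = min(replications - 1, math.ceil((1.0 - alpha) * replications) - 1)
    return stats[index]


if __name__ == "__main__":
    for alpha in (0.05, 0.01):
        print(alpha, round(simulate_critical_value(10, alpha), 3))
```

Repeating the whole simulation with independent random streams and averaging the resulting quantiles would also give a standard error for each estimate, which is the kind of uncertainty reported with the tabulated values; this sketch runs a single stream only.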
APPENDIX A. SUPPLEMENTARY DATA
Tables A1-A64 can be found at the journal web site <http://satori.geociencias.unam.mx/>, in the table of contents of this issue (electronic supplement 25-1-01).
REFERENCES
Barnett, V., Lewis, T., 1994, Outliers in Statistical Data. Third edition: Chichester, John Wiley, 584 p.
Bugner, E., Rutledge, D.N., 1990, Modelling of statistical tables for outlier tests: Chemometrics and Intelligent Laboratory Systems, 9(3), 257-259.
Colombo, F., Pannunzio-Miner, E.V., Gay, H.D., Lira, R., Doráis, M.J., 2007, Barbosalita y lipscombita en Cerro Blanco, Córdoba (Argentina): descripción y génesis de fosfatos secundarios en pegmatitas con triplita y apatita: Revista Mexicana de Ciencias Geológicas, 24(1), 120-130.
Corona-Chávez, R., Reyes-Salas, M., Garduño-Monroy, V.H., Israde-Alcántara, I., Lozano-Santa Cruz, R., Morton-Bermea, O., Hernández-Álvarez, E., 2006, Asimilación de xenolitos graníticos en el campo volcánico Michoacán-Guanajuato: el caso de Arócutin Michoacán, México: Revista Mexicana de Ciencias Geológicas, 23(2), 233-245.
Cruz-Gámez, E.M., Maresch, W.V., Cáceres-Govea, D., Balcázar, N., 2007, Significado de las paragénesis de anfíboles en metagabros relacionados con secuencias de margen continental en el NW de Cuba: Revista Mexicana de Ciencias Geológicas, 24(3), 318-327.
DePaolo, D.J., Wasserburg, G.J., 1976, Nd isotopic variations and petrogenetic models: Geophysical Research Letters, 3(5), 249-252.
DePaolo, D.J., Wasserburg, G.J., 1977, The source of island arcs as indicated by Nd and Sr isotopic studies: Geophysical Research Letters, 4(10), 465-468.
Dixon, W.J., 1950, Analysis of extreme values: Annals of Mathematical Statistics, 21(4), 488-506.
Dixon, W.J., 1951, Ratios involving extreme values: Annals of Mathematical Statistics, 22(1), 68-78.
Dougherty-Page, J.S., Bartlett, J.M., 1999, New analytical procedures to increase the resolution of zircon geochronology by the evaporation technique: Chemical Geology, 153(1-4), 227-240.
Dybczynski, R., Polkowska-Motrenko, H., Samczynski, Z., Szopa, Z., 1998, Virginia tobacco leaves (CTA-VTL-2) - new Polish CRM for inorganic trace analysis including microanalysis: Fresenius Journal of Analytical Chemistry, 360(3-4), 384-387.
Efstathiou, C.E., 2006, Estimation of type I error probability from experimental Dixon's "Q" parameter on testing for outliers within small size data sets: Talanta, 69(5), 1068-1071.
El-Naqa, A., Hammouri, N., Kioso, M., 2006, GIS-based evaluation of groundwater vulnerability in the Russeifa area, Jordan: Revista Mexicana de Ciencias Geológicas, 23(3), 277-287.
Farre, M., Martínez, E., Hernando, M.D., Fernández-Alba, A., Fritz, J., Unruh, E., Mihail, O., Sakkas, V., Morbey, A., Albanis, T., Brito, F., Hansen, P.D., Barcelo, D., 2006, European ring exercise on water toxicity using different bioluminescence inhibition tests based on Vibrio fischeri, in support to the implementation of the water framework directive: Talanta, 69(2), 323-333.
Gabrovská, D., Rysová, J., Filová, V., Plicka, J., Cuhra, P., Kubík, M., Barsová, S., 2006, Gluten determination by gliadin enzyme-linked immunosorbent assay kit: Interlaboratory study: Journal of AOAC International, 89(1), 154-160.
Gill, U., Covaci, A., Ryan, J.J., Emond, A., 2004, Determination of persistent organohalogenated pollutants in human hair reference material (BCR 397): an interlaboratory study: Analytical and Bioanalytical Chemistry, 380(7-8), 924-929.
Gladney, E.S., Jones, E.A., Nickell, E.J., Roelandts, I., 1990, 1988 compilation of elemental concentration data for USGS basalt BCR-1: Geostandards Newsletter, 14(2), 209-359.
Gladney, E.S., Jones, E.A., Nickell, E.J., Roelandts, I., 1991, 1988 compilation of elemental concentration data for USGS DTS-1, G-1, PCC-1, and W-1: Geostandards Newsletter, 15(2), 199-396.
Gladney, E.S., Jones, E.A., Nickell, E.J., 1992, 1988 compilation of elemental concentration data for USGS AGV-1, GSP-1 and G-2: Geostandards Newsletter, 16(2), 111-300.
Govindaraju, K., Potts, P.J., Webb, P.C., Watson, J.S., 1994, 1994 Report on Whin Sill dolerite WS-E from England and Pitscurrie microgabbro PM-S from Scotland: assessment by one hundred and four international laboratories: Geostandards Newsletter, 18(2), 211-300.
Govindaraju, K., Potts, P.J., Webb, P.C., Watson, J.S., 1995, Correction to "1994 Report on Whin Sill dolerite WS-E from England and Pitscurrie microgabbro PM-S from Scotland: assessment by one hundred and four international laboratories": Geostandards Newsletter, 19(1), 97.
Graybeal, D.Y., DeGaetano, A.T., Eggleston, K.L., 2004, Improved quality assurance for historical hourly temperature and humidity: development and application to environmental analysis: Journal of Applied Meteorology, 43(11), 1722-1735.
Grubbs, F.E., 1950, Sample criteria for testing outlying observations: Annals of Mathematical Statistics, 21(1), 27-58.
Grubbs, F.E., 1969, Procedures for detecting outlying observations in samples: Technometrics, 11(1), 1-21.
Guevara, M., Verma, S.P., Velasco-Tapia, F., 2001, Evaluation of GSJ intrusive rocks JG1, JG2, JG3, JG1a, and JGb1 by an objective outlier rejection statistical procedure: Revista Mexicana de Ciencias Geológicas, 18(1), 74-88.
Guevara, M., Verma, S.P., Velasco-Tapia, F., Lozano-Santa Cruz, R., Girón, P., 2005, Comparison of linear regression models for quantitative geochemical analysis: Example of X-ray fluorescence spectrometry: Geostandards and Geoanalytical Research, 29(3), 271-284.
Gutiérrez-Ruiz, M., Romero, F.M., González-Hernández, G., 2007, Suelos y sedimentos afectados por la dispersión de jales inactivos de sulfuros metálicos en la zona minera de Santa Bárbara, Chihuahua, México: Revista Mexicana de Ciencias Geológicas, 24(2), 170-184.
Hassoun, M.H., 1995, Fundamentals of artificial neural networks: Massachusetts Institute of Technology, London, England, 511 p.
Hayes, K., Kinsella, A., 2003, Spurious and non-spurious power in performance criteria for tests of discordancy: The Statistician, 52(1), 69-82.
Hayes, K., Kinsella, A., Coffey, N., 2007, A note on the use of outlier criteria in Ontario laboratory quality control schemes: Clinical Biochemistry, 40(3-4), 147-152.
Haykin, S., 1999, Neural network, a comprehensive foundation: New York, McMillan, 842 p.
In't Veld, P.H., 1998, The use of reference materials in quality assurance programmes in food microbiology laboratories: International Journal of Food Microbiology, 45(1), 35-41.
King, E.P., 1953, On some procedures for the rejection of suspected data: Journal of American Statistical Association, 48(263), 531-533.
Langton, S.D., Chevennement, R., Nagelkerke, N., Lombard, B., 2002, Analysing collaborative trials for qualitative microbiological methods: accordance and concordance: International Journal of Food Microbiology, 79(3), 175-181.
López-Palomino, R.I., Villaseñor-Martínez, A.B., Olóriz-Sáez, F., 2006, Primer registro del género Vinalesphinctes (Ammonitina) en el Oxfordiano de México: significación bioestratigráfica y consideraciones paleobiográficas en el Jurásico Superior de América: Revista Mexicana de Ciencias Geológicas, 23(2), 162-183.
Lozano, R., Bernal, J.P., 2005, Assessment of eight new geochemical reference materials for XRF major and trace element analysis: Revista Mexicana de Ciencias Geológicas, 22(3), 329-344.
Méndez-Ortiz, B.A., Carrillo-Chávez, A., Monroy Fernández, M.G., 2007, Acid rock drainage and metal leaching on mine waste material (tailings) from a Pb-Zn-Ag skarn deposit: Environmental assessment through static and kinetic laboratory tests: Revista Mexicana de Ciencias Geológicas, 24(2), 161-169.
Miller, J.N., Miller, J.C., 2000, Statistics and Chemometrics for Analytical Chemistry: Essex, England, Prentice Hall, 271 p.
Miranda-Martínez, M.E., Oleschko, K., Parrot, J.F., Castrejón-Vacio, F., Taud, H., Brambila-Paz, F., 2006, Porosidad de los yacimientos naturalmente fracturados: una clasificación fractal: Revista Mexicana de Ciencias Geológicas, 23(2), 199-214.
Molina-Garza, R.S., Ortega-Rivera, A., 2006, Chronostratigraphy and paleomagnetism of the Balsas Group in the Tuzantlán-Copalillo basin, northern Guerrero state, Mexico: Revista Mexicana de Ciencias Geológicas, 23(2), 215-232.
Nagarajan, R., Madhavaraju, J., Nagendra, R., Armstrong-Altrin, J.S., Moutte, J., 2007, Geochemistry of Neoproterozoic shales of the Rabanpalli Formation, Bhima Basin, Northern Karnataka, southern India: implications for provenance and paleoredox conditions: Revista Mexicana de Ciencias Geológicas, 24(2), 150-160.
Nagarajan, R., Sial, A.N., Armstrong-Altrin, J.S., Madhavaraju, J., Nagendra, R., in press, Carbon and oxygen isotope geochemistry of Neoproterozoic limestones of the Shahabad Formation, Bhima Basin, Karnataka, southern India: Revista Mexicana de Ciencias Geológicas.
Okamoto, K., Yoshinaga, J., Morita, M., 1996, Biological and environmental reference materials from the National Institute for Environmental Studies (Japan): Mikrochimica Acta, 123(1), 15-21.
O'Nions, R.K., Hamilton, P.J., Evensen, N.M., 1977, Variations in 143Nd/144Nd and 87Sr/86Sr ratios in oceanic basalts: Earth and Planetary Science Letters, 34(1), 13-22.
Papadakis, I., Van Nevel, L., Harper, C., Aregbe, Y., Taylor, P.D.P., 2007, IMEP-12: trace elements in water; objective evaluation of the performance of the laboratories when measuring quality parameters prescribed in the European Directive 98/83/EC: Accreditation and Quality Assurance, 12(2), 105-111.
Patriarca, M., Chiodo, E., Castelli, M., Corsetti, E., Menditto, A., 2005, Twenty years of the Me.Tos. project: an Italian national external quality assessment scheme for trace elements in biological fluids: Microchemical Journal, 79(1-2), 337-340.
Ramos-Leal, J.A., Durazo, J., González-Moran, T., Juárez-Sánchez, F., Cortés-Silva, A., Johannesson, K.H., 2007, Evidencias hidrogeoquímicas de mezclas de flujos regionales en el acuífero de la Muralla, Guanajuato: Revista Mexicana de Ciencias Geológicas, 24(3), 293-305.
Richard, P., Shimizu, N., Allègre, C.J., 1976, 143Nd/146Nd, a natural tracer: an application to oceanic basalts: Earth and Planetary Science Letters, 31, 269-278.
Rodríguez-Ríos, R., Aguillón-Robles, A., Leroy, J.L., 2007, Evolución petrológica y geoquímica de un complejo de domos topacíferos en el campo volcánico de San Luis Potosí (México): Revista Mexicana de Ciencias Geológicas, 24(3), 328-343.
Rorabacher, D.B., 1991, Statistical treatment for rejection of deviant values: critical values of Dixon's "Q" parameter and related subrange ratios at the 95% confidence level: Analytical Chemistry, 63(2), 139-146.
Salleh, S.H., Rosales, E., Flores de la Mota, I., 2007, Influence of different probability based models on oil prospect exploration decision making: A case from Southern Mexico: Revista Mexicana de Ciencias Geológicas, 24(3), 306-317.
Sang, H.Q., Wang, E., He, H.Y., Wang, Y.L., Yang, L.K., Zhu, R.X., 2006, Intercalibration of ZBH-25 biotite reference material utilized for K-Ar and 40Ar-39Ar age determination: Acta Petrologica Sinica, 22(12), 3059-3078.
Santoyo, E., Guevara, M., Verma, S.P., 2006, Determination of lanthanides in international geochemical reference materials by reversed-phase high performance liquid chromatography: An application of error propagation theory to estimate total analysis uncertainties: Journal of Chromatography A, 1118(1), 73-81.
Shekhawat, L.S., Pandit, M.K., Joshi, D.W., 2007, Geology and geochemistry of low grade metabasic volcanic rocks from Salumber area in the Palaeoproterozoic Aravalli Supergroup, NW India: Journal of Earth System Science, 116(6), 511-524.
Solé, J., Salinas, J.C., González-Torres, E., Cendejas Cruz, J.E., 2007, Edades K/Ar de 54 rocas ígneas y metamórficas del occidente, centro y sur de México: Revista Mexicana de Ciencias Geológicas, 24(1), 104-119.
Torres-Alvarado, I.S., 2002, Chemical equilibrium in hydrothermal systems: the case of Los Azufres geothermal field, Mexico: International Geology Review, 44(7), 639-652.
Velasco, F., Verma, S.P., 1998, Importance of skewness and kurtosis statistical tests for outlier detection and elimination in evaluation of Geochemical Reference Materials: Mathematical Geology, 30(1), 109-128.
Velasco, F., Verma, S.P., Guevara, M., 2000, Comparison of the performance of fourteen statistical tests for detection of outlying values in geochemical reference material databases: Mathematical Geology, 32(4), 439-464.
Velasco-Tapia, F., Guevara, M., Verma, S.P., 2001, Evaluation of concentration data in geochemical reference materials: Chemie der Erde-Geochemistry, 61(1), 69-91.
Verma, M.P., 2004, A revised analytical method for HCO3− and CO32− determinations in geothermal waters: an assessment of IAGC and IAEA interlaboratory comparisons: Geostandards and Geoanalytical Research, 28(3), 391-409.
Verma, S.P., 1997, Sixteen statistical tests for outlier detection and rejection in evaluation of international geochemical reference materials: example of microgabbro PM-S: Geostandards Newsletter: Journal of Geostandards and Geoanalysis, 21(1), 59-75.
Verma, S.P., 1998, Improved concentration data in two international geochemical reference materials (USGS basalt BIR-1 and GSJ peridotite JP-1) by outlier rejection: Geofísica Internacional, 37(3), 215-250.
Verma, S.P., 2005, Estadística Básica para el Manejo de Datos Experimentales: Aplicación en la Geoquímica (Geoquimiometría): México, D. F., Universidad Nacional Autónoma de México, 186 p.
Verma, S.P., Quiroz-Ruiz, A., 2006a, Critical values for six Dixon tests for outliers in normal samples up to sizes 100, and applications in science and engineering: Revista Mexicana de Ciencias Geológicas, 23(2), 133-161.
Verma, S.P., Quiroz-Ruiz, A., 2006b, Critical values for 22 discordancy test variants for outliers in normal samples up to sizes 100, and applications in science and engineering: Revista Mexicana de Ciencias Geológicas, 23(3), 302-319; with electronic tables available at <http://satori.geociencias.unam.mx/233.htm>.
Verma, S.P., Orduña-Galván, L.J., Guevara, M., 1998, SIPVADE: A new computer programme with seventeen statistical tests for outlier detection in evaluation of international geochemical reference materials and its application to Whin Sill dolerite WS-E from England and Soil-5 from Peru: Geostandards Newsletter: Journal of Geostandards and Geoanalysis, 22(2), 209-234.
Verma, S.P., Díaz-González, L., Sánchez-Upton, P., Santoyo, E., 2006, OYNYL: A new Computer Program for Ordinary, York, and New York least-Squares linear regressions: WSEAS Transactions on Environment and Development, 2(8), 997-1002.
Villeneuve, J.P., de Mora, S.J., Cattini, C., Carvalho, F.P., 1999, Worldwide and regional intercomparison for the determination of organochlorine compounds, petroleum hydrocarbons, and sterols in sediment sample IAEA-408: Vienna, Austria, International Atomic Energy Agency, Marine Environment Laboratory, 80 p.
Villeneuve, J.P., de Mora, S.J., Cattini, C., 2002, Worldwide and regional intercomparison for the determination of organochlorine compounds and petroleum hydrocarbons in sediment sample IAEA-417: Vienna, Austria, International Atomic Energy Agency, Analytical Quality Control Services, 136 p.
Wang, X.D., Soderlund, U., Lindh, A., Johansson, L., 1998, U-Pb and Sm-Nd dating of high-pressure granulite and upper amphibolite facies rocks from SW Sweden: Precambrian Research, 92(4), 319-339.
Xia, Q.W., Hendrickson, E.L., Zhang, Y., Wang, T.S., Taub, F., Moore, B.C., Porat, I., Whitman, W.B., Hackett, M., Leigh, J.A., 2006, Quantitative proteomics of the Archaeon Methanococcus maripaludis validated by microarray analysis and real time PCR: Molecular & Cellular Proteomics, 5(5), 868-881.