## Ingeniería, investigación y tecnología

*Print version* ISSN 1405-7743

### Ing. invest. y tecnol. vol. 14 no. 2 México Apr./Jun. 2013

**Estimation of Extreme Wind Speeds by Using Mixed Distributions**

**Estimación de velocidades de viento extremo utilizando distribuciones mezcladas**

**Escalante-Sandoval Carlos Agustín**

*División de Ingenierías Civil y Geomática, Facultad de Ingeniería, Universidad Nacional Autónoma de México.* E-mail: caes@unam.mx

Information on the article: received November 2009, accepted June 2012

**Abstract**

Structures are designed to withstand ordinary and extreme wind loads safely over their entire intended economic lifetime. Because extreme wind speeds are essentially random, appropriate statistical procedures are needed in order to design wind-sensitive structures more accurately. Five mixed extreme value distributions, with Gumbel, Reverse Weibull and General Extreme Value components, along with the Two Component Extreme Value distribution, were used to model extreme wind speeds. The general procedure for estimating their parameters by the maximum likelihood method is presented in the paper. A total of 45 sets of largest annual wind speeds, with record lengths ranging from 9 to 56 years, gathered from stations located in The Netherlands, were fitted to the mixed distributions. The best model was selected based on a goodness-of-fit test. The return levels were estimated and compared with those obtained by assuming that the data arise from a single distribution. 87% of the analyzed samples were better fitted with a mixed distribution. The best mixed models were the mixed Reverse Weibull distribution and the Gumbel-Reverse Weibull mixture. Results suggest that it is very important to consider mixed distributions as an additional mathematical tool when analyzing extreme wind speeds.

**Keywords:** wind speed frequency analysis, mixed extreme value distributions, maximum likelihood parameter estimation, goodness-of-fit.

**Resumen**

Las estructuras son diseñadas para resistir de forma segura las cargas de viento ordinarias o extremas en el periodo de su vida útil. Debido a que las velocidades de viento son esencialmente aleatorias se requiere de procedimientos estadísticos que estimen de manera más confiable la carga por viento, para la cual una estructura trabajará eficientemente. En este trabajo se presentan cinco distribuciones de probabilidad de valores extremos mixtas, cuyas componentes son las distribuciones Gumbel, Weibull, General de Valores Extremos y TCEV para modelar velocidades extremas de viento. Los parámetros de dichas distribuciones son obtenidos por la técnica de máxima verosimilitud. Para aplicar las distribuciones mezcladas propuestas se utilizaron los registros de velocidades de viento máximo anual de 45 estaciones localizadas en Holanda, cuyas longitudes varían de 9 a 56 años. El mejor modelo univariado o mezclado fue elegido a través de un criterio de bondad de ajuste. Un 87% de las muestras analizadas se ajustaron mejor a una distribución mezclada y las mejores combinaciones fueron las de Gumbel-Weibull y la Weibull-Weibull. Los resultados sugieren que es muy importante considerar a las distribuciones mezcladas como una herramienta adicional en el análisis de velocidades de vientos extremos.

**Descriptores:** análisis de frecuencias de velocidades de viento, distribuciones de valores extremos mixtas, estimación de parámetros por máxima verosimilitud, bondad de ajuste.

**Introduction**

Structures are designed to withstand ordinary and extreme wind loads safely over their entire intended economic lifetime. The wind pressures on a structure are a function of the characteristics of the approaching wind, the geometry of the structure under consideration, and the geometry and proximity of the structures upwind. The pressures are not uniformly distributed over the surface of the structure, and they can result in fatigue damage and in a probable dynamic excitation. Because of the many uncertainties involved, the maximum wind loads experienced by a structure during its lifetime may vary widely from those assumed in design. Design for wind must address the following requirements:

1) Stability against overturning, uplift and/or sliding of the structure as a whole.

2) Strength of the structural components of the building, which must be sufficient to withstand the imposed loading without failure during the life of the structure.

3) Serviceability; for buildings, for example, interstorey and overall deflections are expected to remain within acceptable limits.

The ultimate limit state wind speed is adopted by most international codes to satisfy stability and strength limit state requirements. In many codes such a speed has a return period of fifty years (*Û*_{50}).

The objective of wind speed frequency analysis is to obtain the most accurate estimates for any return period through the use of probability distributions.

Much of the work in extreme value theory begins with the assumption that *X*_{1}, *X*_{2}, . . . , *X*_{n} are independent and identically distributed observations with some common, but unknown, distribution function *F*(*x*). Under this assumption, the limiting distribution of the sample maximum belongs to one of three types: the Fréchet distribution (with infinite upper tail), the Gumbel distribution (with infinite upper tail) and the reverse Weibull distribution, whose upper tail is finite (Castillo, 1988).

In the early 1970s two competing models of extreme wind speeds were widely used: the extreme value type II or Fréchet distribution and the extreme value type I or Gumbel distribution. However, for long return periods the Fréchet distribution can lead to unrealistically high estimated speeds, which are inefficient for design purposes (Simiu *et al*., 1978).

In some works (Dukes and Palutikof, 1995; Simiu and Heckert, 1996; Heckert and Simiu, 1998, and Simiu *et al*., 2001) the Reverse Weibull distribution, based on epochal and peaks over threshold (POT) approaches, has been considered better than the Gumbel distribution for modeling extreme wind speeds.

In contrast, Galambos and Macri (1999) found that the assumption of bounded wind speeds and the subsequent implementation of the POT method for estimating the required parameters from wind speed data lead to contradictions, and that the Gumbel distribution is better suited to modeling extreme wind speeds. Perrin *et al*. (2006) also found that the Reverse Weibull distribution generates incorrect estimates of the tails of the distributions of wind speeds and of the distribution of annual maximum wind speeds.

According to the results obtained in those works, neither of the two extreme value distributions (Gumbel or Reverse Weibull) can be considered better or totally adequate to model extreme wind speeds.

Simiu (2002) wrote "*It is likely that better probabilistic models of extreme wind speeds could be developed if statistics of thunderstorm and large-scale storm wind speeds could be developed separately and combined in mixed distributions*". Efforts in this direction have already been reported (Holmes and Moriarty, 1999; Dougherty *et al*., 2003).

To pursue this line of work, six mixed extreme value distributions are proposed to model annual maximum wind speed samples.

**Univariate extreme value distributions**

In general, extreme value distributions have been widely used for fitting the distribution of extreme wind speeds. The name extreme value is attached to these distributions because they can be obtained as limiting distributions (as n → ∞) of the greatest value among *n* independent random variables, each having the same continuous distribution.

The general solution of the functional equation that the extreme values must satisfy is called the General Extreme Value distribution, which directly represents the Type II and Type III extreme value distributions. The Type I distribution results as a limiting case of the General Extreme Value distribution. Each type is characterized by the value of the shape parameter β: Gumbel distribution β = 0, Fréchet distribution β < 0 and Weibull distribution β > 0.

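The probability density function (pdf) of the Gumbel distribution is

$$f(x) = \frac{1}{\alpha}\exp\left[-\frac{x-\upsilon}{\alpha}-\exp\left(-\frac{x-\upsilon}{\alpha}\right)\right] \qquad (1)$$

where *υ* and α are the location and scale parameters, and α > 0.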

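The pdf of the standard Fréchet distribution is

$$f(x) = \frac{\lambda}{\sigma}\left(\frac{\sigma}{x}\right)^{\lambda+1}\exp\left[-\left(\frac{\sigma}{x}\right)^{\lambda}\right], \quad x > 0 \qquad (2)$$

where σ and λ are the scale and shape parameters, with σ > 0 and λ > 0.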

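The pdf of the Reverse Weibull distribution is

$$f(x) = \frac{\kappa}{\varphi}\left(\frac{x}{\varphi}\right)^{\kappa-1}\exp\left[-\left(\frac{x}{\varphi}\right)^{\kappa}\right], \quad x > 0 \qquad (3)$$

where φ and κ are the scale and shape parameters, with φ > 0 and κ > 0.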

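The pdf of the General Extreme Value distribution is

$$f(x) = \frac{1}{\eta}\left[1-\beta\left(\frac{x-\omega}{\eta}\right)\right]^{1/\beta-1}\exp\left\{-\left[1-\beta\left(\frac{x-\omega}{\eta}\right)\right]^{1/\beta}\right\} \qquad (4)$$

where ω, η, and β are the location, scale and shape parameters, and η > 0.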
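The role of the shape parameter can be checked numerically. The sketch below assumes the cdf form F(x) = exp{-[1 - β(x - ω)/η]^(1/β)}, which matches the sign convention stated above (β > 0 bounded above, β < 0 bounded below, β → 0 Gumbel); the function names are hypothetical and the parameter values are illustrative only.

```python
import math


def gumbel_cdf(x, loc, scale):
    """Gumbel cdf with location `loc` and scale `scale` (> 0)."""
    return math.exp(-math.exp(-(x - loc) / scale))


def gev_cdf(x, loc, scale, shape):
    """GEV cdf in the convention where shape > 0 gives a finite upper bound."""
    if abs(shape) < 1e-12:
        # Limiting case: the Gumbel (Type I) distribution.
        return gumbel_cdf(x, loc, scale)
    t = 1.0 - shape * (x - loc) / scale
    if t <= 0.0:
        # Outside the support: above the upper bound when shape > 0,
        # below the lower bound when shape < 0.
        return 1.0 if shape > 0.0 else 0.0
    return math.exp(-(t ** (1.0 / shape)))


# Illustrative values (m/s), not taken from the paper.
near_gumbel = gev_cdf(25.0, 20.0, 2.0, 1e-6)
exact_gumbel = gumbel_cdf(25.0, 20.0, 2.0)
above_bound = gev_cdf(100.0, 20.0, 2.0, 0.5)  # upper bound is 20 + 2/0.5 = 24
```

With a very small shape value the GEV cdf is numerically indistinguishable from the Gumbel cdf, and with β = 0.5 any speed above ω + η/β has zero probability of being exceeded.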

**Mixed distributions**

Extreme wind speeds (EWS) have been analyzed through the use of univariate distributions. Several assumptions underlie the statistical estimate of the wind speed. The most important one, that all extremes (up to return periods of 10^{4} yr) belong to the same population, is hard to verify from the short observational sets available.

Van den Brink *et al*. (2004) noticed the existence of areas where the extreme value distribution of extratropical winds was double populated.

They demonstrated that the local wind can be caused by two meteorological systems "*1*" and "*2*" of different physical nature, each of them generating its own distribution *F*_{1}(*x*) and *F*_{2}(*x*). Then, the parent distribution *F*(*x*) is said to be mixed.

The use of a mixture of probability distribution functions for modeling samples of data coming from two populations was proposed long ago (Mood *et al*., 1974):

$$F(x) = p\,F_1(x) + (1-p)\,F_2(x) \qquad (5)$$

where *p* is a factor used to weight the relative contribution of each population (0 < *p* < 1).
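A minimal sketch of such a two-population mixture is given below, assuming the usual weighted form p·F1(x) + (1 - p)·F2(x) with Gumbel components. The function names and the parameter values are hypothetical and serve only to illustrate the construction.

```python
import math


def gumbel_cdf(x, loc, scale):
    """Gumbel cdf with location `loc` and scale `scale` (> 0)."""
    return math.exp(-math.exp(-(x - loc) / scale))


def mixture_cdf(x, p, cdf_1, cdf_2):
    """Two-population mixture: p * F1(x) + (1 - p) * F2(x), with 0 < p < 1."""
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return p * cdf_1(x) + (1.0 - p) * cdf_2(x)


# Hypothetical populations (m/s): frequent moderate storms and rarer severe ones.
def population_1(x):
    return gumbel_cdf(x, 19.0, 1.2)


def population_2(x):
    return gumbel_cdf(x, 24.0, 1.5)


non_exceedance = mixture_cdf(22.0, 0.7, population_1, population_2)
print(round(non_exceedance, 4))
```

Because the mixture is a convex combination, its value at any speed always lies between the values of the two component distributions.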

*Mixed Gumbel Distribution* (MG)

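If *F*_{1}(*x*) and *F*_{2}(*x*) of equation (5) are Gumbel distributions, the corresponding mixed pdf is (Raynal and Guevara, 1997):

$$f(x) = \frac{p}{\alpha_1}\exp\left[-\frac{x-\upsilon_1}{\alpha_1}-\exp\left(-\frac{x-\upsilon_1}{\alpha_1}\right)\right] + \frac{1-p}{\alpha_2}\exp\left[-\frac{x-\upsilon_2}{\alpha_2}-\exp\left(-\frac{x-\upsilon_2}{\alpha_2}\right)\right] \qquad (6)$$

where *υ*_{1}, *α*_{1} and *υ*_{2}, *α*_{2} are the location and scale parameters for the first and second population, respectively, and *p* is the association parameter (0 < *p* < 1).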

*Mixed General Extreme Value Distribution* (MGEV)

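If *F*_{1}(*x*) and *F*_{2}(*x*) of equation (5) are General Extreme Value distributions, the mixed pdf is (Raynal and Santillan, 1986):

$$f(x) = \frac{p}{\eta_1}\left[1-\beta_1\left(\frac{x-\omega_1}{\eta_1}\right)\right]^{1/\beta_1-1}\exp\left\{-\left[1-\beta_1\left(\frac{x-\omega_1}{\eta_1}\right)\right]^{1/\beta_1}\right\} + \frac{1-p}{\eta_2}\left[1-\beta_2\left(\frac{x-\omega_2}{\eta_2}\right)\right]^{1/\beta_2-1}\exp\left\{-\left[1-\beta_2\left(\frac{x-\omega_2}{\eta_2}\right)\right]^{1/\beta_2}\right\} \qquad (7)$$

where ω_{1}, η_{1}, β_{1} and ω_{2}, η_{2}, β_{2} are the location, scale and shape parameters for the first and second population, respectively, and *p* is the association parameter (0 < *p* < 1).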

*Mixed Reverse Weibull Distribution* (MRW)

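If *F*_{1}(*x*) and *F*_{2}(*x*) of equation (5) are Reverse Weibull distributions, the mixed pdf is (Escalante, 2006):

$$f(x) = p\,\frac{\kappa_1}{\varphi_1}\left(\frac{x}{\varphi_1}\right)^{\kappa_1-1}\exp\left[-\left(\frac{x}{\varphi_1}\right)^{\kappa_1}\right] + (1-p)\,\frac{\kappa_2}{\varphi_2}\left(\frac{x}{\varphi_2}\right)^{\kappa_2-1}\exp\left[-\left(\frac{x}{\varphi_2}\right)^{\kappa_2}\right] \qquad (8)$$

where φ_{1}, κ_{1} and φ_{2}, κ_{2} are the scale and shape parameters for the first and second population, respectively, and *p* is the association parameter (0 < *p* < 1).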

*Mixed Gumbel-Reverse Weibull Distribution* (G-RW)

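Assuming that the first and second populations behave as Gumbel and Reverse Weibull distributions, respectively, the pdf of equation (5) yields the five-parameter mixture model:

$$f(x) = \frac{p}{\alpha_1}\exp\left[-\frac{x-\upsilon_1}{\alpha_1}-\exp\left(-\frac{x-\upsilon_1}{\alpha_1}\right)\right] + (1-p)\,\frac{\kappa_2}{\varphi_2}\left(\frac{x}{\varphi_2}\right)^{\kappa_2-1}\exp\left[-\left(\frac{x}{\varphi_2}\right)^{\kappa_2}\right] \qquad (9)$$

where *υ*_{1}, *α*_{1} are the location and scale parameters for the first population, φ_{2}, κ_{2} are the scale and shape parameters for the second population, and *p* is the association parameter (0 < *p* < 1).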

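*Mixed Gumbel-General Extreme Value Distribution* (G-GEV)

Assuming that the first and second populations behave as Gumbel and General Extreme Value distributions, respectively, the pdf of equation (5) yields the six-parameter mixture model:

$$f(x) = \frac{p}{\alpha_1}\exp\left[-\frac{x-\upsilon_1}{\alpha_1}-\exp\left(-\frac{x-\upsilon_1}{\alpha_1}\right)\right] + \frac{1-p}{\eta_2}\left[1-\beta_2\left(\frac{x-\omega_2}{\eta_2}\right)\right]^{1/\beta_2-1}\exp\left\{-\left[1-\beta_2\left(\frac{x-\omega_2}{\eta_2}\right)\right]^{1/\beta_2}\right\} \qquad (10)$$

where *υ*_{1}, *α*_{1} are the location and scale parameters for the first population, ω_{2}, η_{2}, β_{2} are the location, scale and shape parameters for the second population, and *p* is the association parameter (0 < *p* < 1).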

*Two Component Extreme Value* (TCEV) *Distribution*

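The cumulative distribution function is (Rossi *et al*., 1984):

$$F(x) = \exp\left[-\Lambda_1\exp\left(-\frac{x}{\theta_1}\right)-\Lambda_2\exp\left(-\frac{x}{\theta_2}\right)\right], \quad x \ge 0 \qquad (11)$$

The corresponding pdf is

$$f(x) = \left[\frac{\Lambda_1}{\theta_1}\exp\left(-\frac{x}{\theta_1}\right)+\frac{\Lambda_2}{\theta_2}\exp\left(-\frac{x}{\theta_2}\right)\right]F(x) \qquad (12)$$

where Λ_{1}, θ_{1} and Λ_{2}, θ_{2} are the parameters of the first and second components, with Λ_{i} > 0 and θ_{i} > 0.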

**Estimation of parameters by maximum likelihood**

Since the parameters of the mixed distributions are unknown, they must be estimated from data. The method of maximum likelihood for estimation of the parameters of the mixed extreme value distribution was selected due to its wide applicability and the efficiency features associated with it, which are not easily found in other methods of parameter estimation.

The likelihood function of *n* random variables is defined to be the joint density of the *n* random variables, regarded as a function of the parameters. If *x*_{1}, *x*_{2}, . . . , *x*_{n} is a random sample of a univariate density function, the corresponding likelihood function is (Mood *et al*., 1974):

$$L(\theta) = \prod_{i=1}^{n} f(x_i, \theta) \qquad (13)$$

The logarithm of the likelihood function will be used instead because it is easier to handle. So, equation (13) is transformed into:

$$\operatorname{Ln} L(\theta) = \sum_{i=1}^{n} \operatorname{Ln} f(x_i, \theta) \qquad (14)$$

where *L* is the likelihood function; Ln is the natural logarithm; *θ* is the set of parameters to be estimated, and *f*(*x*, *θ*) is the univariate or mixed pdf.

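For the case of the G-GEV distribution, equation (14) is

$$\operatorname{Ln} L = \sum_{i=1}^{n}\operatorname{Ln}\left\{\frac{p}{\alpha_1}\exp\left[-\frac{x_i-\upsilon_1}{\alpha_1}-\exp\left(-\frac{x_i-\upsilon_1}{\alpha_1}\right)\right] + \frac{1-p}{\eta_2}\left[1-\beta_2\left(\frac{x_i-\omega_2}{\eta_2}\right)\right]^{1/\beta_2-1}\exp\left\{-\left[1-\beta_2\left(\frac{x_i-\omega_2}{\eta_2}\right)\right]^{1/\beta_2}\right\}\right\} \qquad (15)$$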

Due to the complexity of the mathematical expressions in (14) and the partial derivatives with respect to the parameters, the constrained multivariable Rosenbrock method (Kuester and Mize, 1973) was applied to obtain the estimators of the parameters by the direct maximization of (14).
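The idea of direct maximization can be sketched in a few lines. The example below is not the constrained Rosenbrock routine of Kuester and Mize (1973); it substitutes a plain derivative-free coordinate search, applied to the negative log-likelihood of a mixed Gumbel density, with the constraints handled by rejecting infeasible points. The sample is synthetic, and all names, starting values and step sizes are hypothetical.

```python
import math
import random


def gumbel_pdf(x, loc, scale):
    z = (x - loc) / scale
    if z < -700.0:  # exp(-z) would overflow; the density is numerically zero
        return 0.0
    return math.exp(-z - math.exp(-z)) / scale


def neg_log_likelihood(theta, sample):
    """Negative log-likelihood of the mixed Gumbel pdf p*f1 + (1-p)*f2."""
    p, loc1, scale1, loc2, scale2 = theta
    # Constraints: 0 < p < 1 and positive scales; infeasible points cost +inf.
    if not 0.0 < p < 1.0 or scale1 <= 0.0 or scale2 <= 0.0:
        return math.inf
    total = 0.0
    for x in sample:
        density = p * gumbel_pdf(x, loc1, scale1) + (1.0 - p) * gumbel_pdf(x, loc2, scale2)
        if density <= 0.0:
            return math.inf
        total -= math.log(density)
    return total


def coordinate_search(objective, start, steps, shrink=0.5, tol=1e-4, max_iter=2000):
    """Try +/- step on each parameter, keep improvements, halve steps when stuck."""
    best, best_val, steps = list(start), objective(start), list(steps)
    for _ in range(max_iter):
        improved = False
        for i in range(len(best)):
            for direction in (1.0, -1.0):
                trial = list(best)
                trial[i] += direction * steps[i]
                val = objective(trial)
                if val < best_val:
                    best, best_val, improved = trial, val, True
                    break
        if not improved:
            steps = [s * shrink for s in steps]
            if max(steps) < tol:
                break
    return best, best_val


def sample_gumbel(rng, loc, scale):
    u = rng.uniform(1e-9, 1.0 - 1e-9)
    return loc - scale * math.log(-math.log(u))


# Synthetic annual maxima (m/s) drawn from two Gumbel populations.
rng = random.Random(2013)
sample = [sample_gumbel(rng, 19.0, 1.2) for _ in range(30)]
sample += [sample_gumbel(rng, 24.0, 1.5) for _ in range(15)]

mean = sum(sample) / len(sample)
start = [0.5, mean - 1.0, 1.0, mean + 1.0, 1.0]  # p, loc1, scale1, loc2, scale2
start_nll = neg_log_likelihood(start, sample)
estimate, final_nll = coordinate_search(
    lambda theta: neg_log_likelihood(theta, sample), start, [0.1, 0.5, 0.25, 0.5, 0.25]
)
print(estimate, final_nll)
```

Like any local search, this sketch can stop at a local optimum, so the quality of the starting point matters; it is meant to show the structure of the problem rather than to replace a tested optimizer.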

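Once the parameters have been obtained, the quantiles for different return periods can be estimated by solving equation (5). For the case of the MG distribution:

$$p\exp\left[-\exp\left(-\frac{\hat{U}_T-\upsilon_1}{\alpha_1}\right)\right] + (1-p)\exp\left[-\exp\left(-\frac{\hat{U}_T-\upsilon_2}{\alpha_2}\right)\right] = 1-\frac{1}{T} \qquad (16)$$

where *Û*_{T} is the maximum extreme wind speed (in m/s) associated with a return period of T years.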
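Since the mixed cdf cannot be inverted in closed form, the return level has to be found numerically. The sketch below assumes the mixed Gumbel cdf p·F1 + (1 - p)·F2 and the usual relation F(Û_T) = 1 - 1/T, and solves it by bisection; the parameter values and the search bracket are hypothetical.

```python
import math


def gumbel_cdf(x, loc, scale):
    z = (x - loc) / scale
    if z < -700.0:  # exp(-z) would overflow; the cdf is numerically zero
        return 0.0
    return math.exp(-math.exp(-z))


def mg_cdf(x, p, loc1, scale1, loc2, scale2):
    """Mixed Gumbel cdf."""
    return p * gumbel_cdf(x, loc1, scale1) + (1.0 - p) * gumbel_cdf(x, loc2, scale2)


def return_level(T, params, lower=0.0, upper=200.0, tol=1e-8):
    """Solve F(u) = 1 - 1/T by bisection on the bracket [lower, upper] (m/s)."""
    target = 1.0 - 1.0 / T
    for _ in range(200):
        middle = 0.5 * (lower + upper)
        if mg_cdf(middle, *params) < target:
            lower = middle
        else:
            upper = middle
        if upper - lower < tol:
            break
    return 0.5 * (lower + upper)


# Hypothetical parameters: p, loc1, scale1, loc2, scale2 (speeds in m/s).
params = (0.67, 18.7, 1.2, 23.0, 1.5)
u_50 = return_level(50, params)
u_100 = return_level(100, params)
print(round(u_50, 2), round(u_100, 2))
```

Bisection is used here because the cdf is continuous and increasing, so a bracket that contains the target probability is guaranteed to converge.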

The best model can be selected based on the criterion of minimum standard error of fit (*SEF*), as defined by Kite (1988):

$$SEF_j = \left[\frac{\sum_{i=1}^{n}\left(g_i-h_i\right)^2}{n-q}\right]^{1/2} \qquad (17)$$

where g_{i}, *i* = 1, ... , *n* are the recorded events; h_{i}, *i* = 1, ... , *n* are the event magnitudes computed from the univariate or mixed distributions at probabilities obtained from the sorted ranks of g_{i}, *i* = 1, ... , *n*; *q* is the number of parameters estimated for the univariate or mixed distributions; *n* is the length of record, and *j* is the number of the analyzed station.

So, *q* = 2 for the Gumbel and Reverse Weibull distributions; *q* = 3 for the General Extreme Value distribution; *q* = 4 for the TCEV distribution; *q* = 5 for the MG, MRW and G-RW distributions; *q* = 6 for the G-GEV distribution, and *q* = 7 for the MGEV distribution, which has three parameters per population plus *p*.
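The criterion can be sketched as follows, assuming the usual form of the standard error of fit, the square root of the sum of squared differences divided by *n* - *q*. The plotting-position formula i/(n + 1) used for the sorted ranks is an assumption, since the text does not say which one was applied, and the numbers are made up for illustration.

```python
import math


def plotting_positions(n):
    """Non-exceedance probabilities i/(n + 1) for ranks i = 1..n (assumed formula)."""
    return [i / (n + 1) for i in range(1, n + 1)]


def standard_error_of_fit(recorded, computed, n_params):
    """Square root of sum((g_i - h_i)^2) / (n - q)."""
    n = len(recorded)
    if n != len(computed):
        raise ValueError("recorded and computed series must have the same length")
    if n <= n_params:
        raise ValueError("record length must exceed the number of parameters")
    squared_error = sum((g - h) ** 2 for g, h in zip(recorded, computed))
    return math.sqrt(squared_error / (n - n_params))


# Made-up sorted events (m/s) and the magnitudes a fitted model would return.
recorded = [20.0, 21.0, 23.0, 26.0]
computed = [20.0, 22.0, 23.0, 27.0]
sef = standard_error_of_fit(recorded, computed, n_params=2)
print(sef)
```

The division by *n* - *q* penalizes models with more parameters, which matters when a five- to seven-parameter mixture is compared with a two-parameter distribution on a short record.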

**Case study**

The mixed extreme value distributions were applied to model the annual maximum wind speeds extracted from the hourly potential winds computed at 45 stations located in The Netherlands (Figure 1). This country has a typical mid-latitude oceanic climate with prevailing westerly winds. Winter storms are the result of differences in temperature between the polar air masses and the air in the middle latitudes in autumn and winter. These extratropical cyclones generally have less destructive power than tropical cyclones, but they are able to produce damaging winds over wide coastal and inland areas.

Data are available from the Royal Netherlands Meteorological Institute (KNMI). Lengths of record vary from 9 to 56 years (Table 1).

As is well known, no multivariable constrained non-linear optimization technique can guarantee global optimality. Therefore, care must be taken to avoid a local optimum. It is suggested to always start from a set of initial parameters given by the moment estimators. For example, at Schiphol station, for the case of the G-RW distribution, the sample is sorted in decreasing order of magnitude and divided into two parts. The first one contains a third of the sample (association parameter *p* = 0.33), with a mean equal to 23.85 m/s and a standard deviation equal to 1.62 m/s. With these values and by using equations (18) and (19), the initial parameters for the Reverse Weibull distribution are computed (κ_{2} = 18.546, φ_{2} = 24.546). For the rest of the sample, with a mean equal to 19.41 m/s and a standard deviation equal to 1.56 m/s, the initial parameters for the Gumbel distribution are computed with equations (20) and (21) (*υ*_{1} = 18.70, *α*_{1} = 1.214).
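The starting values quoted for Schiphol can be approximated with standard moment relations. Equations (18) to (21) are not reproduced in this text, so the formulas below are assumptions: the Gumbel relations scale = s·√6/π and location = mean - 0.5772·scale, and, for the two-parameter Weibull form, the common empirical approximation shape ≈ (s/mean)^(-1.086) with scale = mean/Γ(1 + 1/shape). The function names are hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329


def gumbel_moment_start(mean, std):
    """Assumed Gumbel moment estimators: returns (location, scale)."""
    scale = std * math.sqrt(6.0) / math.pi
    location = mean - EULER_GAMMA * scale
    return location, scale


def weibull_moment_start(mean, std):
    """Assumed Weibull moment approximation: returns (scale, shape)."""
    shape = (std / mean) ** -1.086  # empirical power-law approximation
    scale = mean / math.gamma(1.0 + 1.0 / shape)
    return scale, shape


# Sub-sample statistics reported for Schiphol station (m/s).
rw_scale, rw_shape = weibull_moment_start(23.85, 1.62)
gumbel_location, gumbel_scale = gumbel_moment_start(19.41, 1.56)
print(round(rw_shape, 2), round(rw_scale, 2))
print(round(gumbel_location, 2), round(gumbel_scale, 2))
```

Under these assumed relations the results come out close to the reported starting values, with small differences that are consistent with rounding of the sample statistics.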

The final maximum likelihood estimators by the direct maximization of equation (14) are:

In this station the best univariate fit was obtained using the General Extreme Value distribution with a *SEF* = 0.320 m/s and *Û*_{50} = 26.7 m/s.

In Figure 2, a graphical comparison between the empirical and fitted distributions (G-RW) is made.

The univariate and mixed return levels *U* (m/s) for different return periods T (years), along with the minimum value of the standard error of fit, were obtained for each analyzed station. If only the univariate distributions had been considered in the wind speed frequency analysis, 40% of the samples would have been better fitted with the Gumbel distribution, 56% with the General Extreme Value distribution, and 4% with the Reverse Weibull distribution.

It was possible to reduce the standard error of fit when mixed distributions were applied. 40% of the samples were better fitted with the MRW distribution, and another 40% with the G-RW distribution. For instance, in station K13, with 22 years of record, the best univariate fit was obtained with the General Extreme Value distribution, with *SEF* = 0.910 m/s and *Û*_{50} = 32.7 m/s, and the best mixed fit was obtained with the MRW distribution, with *SEF* = 0.443 m/s and the return level reduced to *Û*_{50} = 30.9 m/s, which also represents a significant difference for design purposes.

It was also seen that the reduction of the *SEF* was important in the cases where the analyzed sample has a short length of record. This fact represents a great advantage of the mixed distributions over the univariate distributions. The final values of the return levels are shown in Table 2.

**Conclusions**

The general objective of this study is to show how the mixed distributions can be applied to model extreme wind speeds.

Five mixed extreme value distributions, with Gumbel, Reverse Weibull, and General Extreme Value components, along with the Two Component Extreme Value distribution, were used to model extreme wind speeds. The maximum likelihood estimators of the parameters were obtained numerically by using the multivariable constrained Rosenbrock optimization algorithm, which performed well in all cases.

Results have shown that there is a reduction in the standard error of fit when the parameters are estimated with mixed distributions instead of their univariate counterparts, and that the differences between univariate and mixed design events can be significant as the return period increases. 87% of the samples were better fitted with a mixed distribution. In 34 of the analyzed samples at least one of the components of the mixed distribution is the Reverse Weibull distribution. Besides, the final return levels did not appear to be unrealistic design events, even for long return periods.

Results suggest that it is very important to consider the mixed distributions as an additional mathematical tool when analyzing extreme wind speeds.

**References**

Castillo E. *Extreme Value Theory in Engineering*, Boston, USA, Academic Press, 1988.

Dougherty A.M., Corotis R.B., Segurson A. Design Wind Speed Prediction. *Journal of Structural Engineering,* volume 129 (issue 9), 2003: 1268-1274.

Dukes M., Palutikof J. Estimation of Extreme Wind Speeds with Very Long Return Period. *Journal of Applied Meteorology* volume 34, 1995: 1950-1961.

Escalante C. Application of Bivariate Extreme Value Distribution to Flood Frequency Analysis: A Case Study of Northwestern Mexico. *Natural Hazards* DOI: 10.1007/s11069-006-9044-7, 2006a, On line.

Galambos J., Macri N. Classical Extreme Value Model and Prediction of Extreme Events. *Journal of Structural Engineering,* volume 125 (issue 7), 1999: 792-794.

Heckert N.A., Simiu E. Estimates of Hurricane Wind Speeds by "Peaks Over Threshold" Method. *Journal of Structural Engineering,* volume 124 (issue 4), 1998: 445-449.

Holmes J.D., Moriarty W.W. Application of the Generalized Pareto Distribution to Extreme Value Analysis in Wind Engineering. *Journal of Wind Engineering and Industrial Aerodynamics,* volume 83, 1999: 1-10.

Kite G.W. *Frequency and Risk Analyses in Hydrology*, USA, Water Resources Publication, 1988.

Kuester J.L., Mize J.H. *Optimization Techniques with FORTRAN*, USA, McGraw-Hill, 1973.

Mood A., Graybill F., Boes D. *Introduction to the Theory of Statistics*, 3rd. Ed., McGraw-Hill, USA, 1974.

Perrin O., Rootzén H., Taesler R. A Discussion of Statistical Methods Used to Estimate Extreme Wind Speeds. *Theoretical and Applied Climatology,* volume 85, 2006: 203-215.

Raynal J., Guevara J. Maximum Likelihood Estimators for the Two Populations Gumbel Distribution. *Hydrological Science and Technology,* volume 13 (issues 1-4), 1997: 47-56.

Raynal J., Santillan O. Maximum Likelihood Estimators of the Parameters of the Mixed GEV Distribution, on: IX Congreso Nacional de Hidráulica, AMH, Querétaro, Qro., Mex., 1986, pp. 79-90, (In Spanish).

Rossi F., Fiorentino M., Versace P. Two-Component Extreme Value Distribution for Flood Frequency Analysis. *Water Resources Research,* volume 20 (issue 7), 1984: 847-856.

Simiu E., Biétry J., Filliben J.J. Sampling Errors in Estimation of Extreme Wind Speeds. *Journal of Structural Division ASCE,* volume 104, 1978: 491-501.

Simiu E., Heckert N.A. Extreme Wind Distribution Tails: a "Peaks Over Threshold" Approach. *Journal of Structural Engineering,* volume 122 (issue 5), 1996: 539-547.

Simiu E., Heckert N.A., Filliben J., Johnson S. Extreme Wind Load Estimates on the Gumbel Distribution of Dynamic Pressures: An Assessment. *Structural Safety,* volume 23, 2001: 221-229.

Simiu E. *Meteorological Extremes*, on: Encyclopedia of Environmetrics, UK, El-Shaarawi A.H., Piegorsch W.W. (eds), volume 3, John Wiley and Sons, 2002, pp. 1255-1259.

Van-Den-Brink H.W., Konnen G.P., Opsteegh J.D. Statistics of Extreme Synoptic-Scale Wind Speeds in Ensemble Simulations of Current and Future Climate. *J. Climate*, volume 17, 2004: 4564-4574.

**About the author**

*Carlos Agustín Escalante-Sandoval*. Civil engineer (BUAP, 1985), M.E. with a major in water resources (UNAM, 1988), PhD with a major in hydraulics (UNAM, 1991). He was head of the Hydraulics Department up to 2007 and is currently head of the Civil Engineering Graduate Department, both in the Faculty of Engineering at UNAM. He has been granted academic and scientific prizes such as the Gabino Barreda Medal in 1991 by UNAM and the "Enzo Levi" prize for research in 2000 by the Mexican Association of Hydraulics. He is a member of the ASCE, AWRA, AGU, AMC, AI and the National System of Researchers.