On the Use of Indices and Parameters in Forecasting Severe Storms - Electronic Journal of Severe Storms Meteorology
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Doswell, C. A. III, and D. M. Schultz, 2006: On the use of indices and parameters in forecasting severe storms. Electronic J. Severe Storms Meteor., 1(3), 1–22. On the Use of Indices and Parameters in Forecasting Severe Storms CHARLES A. DOSWELL III Cooperative Institute for Mesoscale Meteorological Studies, University of Oklahoma, Norman, Oklahoma DAVID M. SCHULTZ Cooperative Institute for Mesoscale Meteorological Studies, University of Oklahoma, and NOAA/National Severe Storms Laboratory, Norman, Oklahoma (Submitted 11 May 2006; in final form 05 November 2006) ABSTRACT This paper describes our concept of the proper (and improper) use of diagnostic variables in severe-storm forecasting. A framework for classification of diagnostic variables is developed, indicating the limitations of such variables and their suitability for operational diagnosis and forecasting. The utility of diagnostic variables as forecast parameters is discussed, in terms of what we consider to be the relevant issues in de- signing new diagnostic variables used for making weather forecasts. Finally, criteria required to determine whether a new diagnostic variable represents an effective forecast parameter are proposed. We argue that many diagnostic variables in widespread use in forecasting severe convective storms have not met these criteria for demonstrated utility as forecast parameters. –––––––––––––––––––––––– 1. Introduction owing to various issues tied to their computation and representativeness. Although there is noth- Operational and research meteorologists often ing inherently wrong with diagnostic variables, refer to diagnostic variables, such as convective forecasters need to be aware of the limitations on available potential energy (CAPE) or the super- their use. For example, one of the most common cell composite parameter (SCP; Thompson et al. and well established among these is CAPE in 2003), as forecast parameters. We contend that various forms. Monteverdi et al. (2003) showed they are not necessarily forecast parameters; that for at least one type of severe weather fore- rather, they constitute a set of diagnostic vari- casting (tornadoes in California), CAPE proved ables. For most such variables, their forecast to be of little value in discriminating tornado value generally has not been established via rig- cases from nontornadic cases when tested as a orous verification. That operational forecasters forecast parameter. The forecasts based on have used many of them for decades is not suffi- CAPE were verified against observed events, cient, in our opinion, to establish their capability whereas another variable (0–1-km shear) as forecast parameters. appeared to be capable of making such dis- criminations. Furthermore, diagnostic variables can lead to faulty perceptions of the state of the atmosphere, Diagnostic variables have a long history in asso- __________________________ ciation with forecasting severe convection (Schaefer 1986; Johns and Doswell 1992). It is Corresponding author address: Dr. Charles A. Doswell III, The University of Oklahoma/CIMMS, 120 David L. Boren evident that the value of such variables is Blvd, Suite 2100, Norman, OK 73072-7304. strongly associated with their capacity to sum- E-mail: cdoswell@hoth.gcn.ou.edu marize in a single number (or field variable) 1
DOSWELL AND SCHULTZ 05 November 2006 some characteristic of the severe storm environ- cally as follows. Let Φ be an n-dimensional vec- ment. Rather than having to consider the full tor of atmospheric state variables complexity of four-dimensional atmospheric Φ = (φ1 ,φ 2 ,K ,φ n ). data, it is often regarded as a benefit for forecast- In general, this vector is a function of its location ers working with hard deadlines to be able to in space, denoted by the position vector X, and is distill that complexity into a single variable. known at some particular time, t = t 0 , such that Forecasters and researchers generally acknowl- Φ = Φ (X, t = t 0 ) ≡ Φ0 . edge that any single diagnostic variable consid- A basic principle of numerical weather predic- ered in isolation has little forecast value. Never- tion (NWP) is that the information about the theless, in our experience, we have seen in- state of the atmosphere at some initial time can stances where forecasters, often under forecast be used to make a forecast of the state of the deadline pressure, will make forecast decisions atmosphere at some future time in the following based heavily, if not primarily, on a single diag- way: nostic variable. It has been our observation that ∂Φ forecasters are most prone to rely heavily on a Φ (X,t = t 0 + δt ) = Φ (X, t = t 0 ) + δt . single diagnostic variable in the context of de- ∂t t=t 0 termining the general likelihood of severe Thus, the state of the atmosphere at some future weather. For example, when CAPE or strong time is the sum of its state at the initial time (the vertical wind shear is found to be absent at the diagnosis) and the product of the time step, δt, diagnosis time, the likelihood of severe convec- with the local time rate of change of Φ, where tive weather subsequently is sometimes dis- ∂Φ missed as unlikely. An important concern is that ∂t [ (X,t ) = f Φ (X,t ) .] most of the widely used diagnostic variables have not been validated as proper forecast pa- That is, the time trend of the atmospheric state rameters. (This process for validation is defined variables at any point in space at a particular in section 2). As diagnostic variables, they can time is a function of the current spatial distribu- be useful in assessing quantitatively the state of tion of those state variables. In NWP, this time the atmosphere at the time of their calculation, trend is expressed in the form of a set of govern- but their capability to inform forecasters about ing equations used for a particular NWP model. weather in the future can be quite limited, at Thus, although it is strictly true that a forecast best. can be calculated from the state variable vector Φ0 at the initial time, we exclude from consid- One purpose of this paper is to propose a classi- eration as diagnostic variables the explicit calcu- fication scheme for diagnostic variables in use lation of local time tendencies for atmospheric for severe weather forecasting, in order to under- state variables via this NWP-like process. This stand their characteristics and limitations. An- does not, as we will show, exclude as diagnostic other goal is to address the issue of what it takes variables the estimation of time tendencies by to validate the value of a variable as a proper other means. forecast parameter. We are not seeking to dis- courage the use of diagnostic variables, per se, in Establishing a quantitative understanding of the forecasting severe convective storms. Rather, state of the atmosphere at a given time using we seek to develop a perspective through which diagnostic variables is possible. Qualitative un- their value to forecasting can be maximized. derstanding, in turn, can be substantially en- hanced by this quantitative analysis. For exam- 2. Diagnostic variables ple, a forecaster can simply look at the plotted winds on a surface chart and recognize zones of What specifically do we mean by a diagnostic strong rotation (vorticity) and convergence, but variable? A diagnostic variable is some quan- without a quantitative evaluation of the vorticity tity, valid at a specific instant in time, which and convergence values, a forecaster’s sense of either is a basic observed variable (e.g., pressure, the strength of those variables is obviously sub- temperature, wind, humidity) or can be calcu- jective. A quantitative assessment is generally lated from those variables. The relationship be- preferred over subjective assessment in forecast- tween diagnosis and prognosis, as developed by ing; numerical values of variables often matter Doswell (1986), can be expressed mathemati- to the forecast decision.. 2
DOSWELL AND SCHULTZ 05 November 2006 We define a diagnostic variable being used as a properly, forecasters need to be aware of the forecast parameter as one that allows a fore- story behind each of them. In order to discuss caster to make an accurate weather forecast diagnostic variables and their limitations, we based on the current values of that variable. For propose a classification scheme as follows. a proper forecast parameter, there should be a time-lagged correlation between the parameter a. Simple observed variables and the weather being forecast. This is to be distinguished from using a diagnostic variable Simple observed variables are those measured by calculated from a forecast of the state variables meteorological instruments. Strictly speaking, (say, from NWP model gridded forecast fields) most modern meteorological instruments make valid at some future time to make a forecast of electronic measurements (e.g., resistance or ca- the weather for that valid time. How accurately pacitance) that are calibrated to provide readouts an NWP model forecasts that diagnostic variable of the meteorological variables (e.g., pressure, is a very different issue from how accurate a temperature, dewpoint, wind direction and forecast can be using only the current distribu- speed). tion of that variable. A diagnostic variable might be very good at discriminating between different b. Simple calculated variables weather events when known at the time of the event, but still be ineffective as a forecast pa- Calculated variables are not observed directly rameter because its values before the events oc- but are computed from the raw measurements cur make no directly useful statement about the using relatively simple conversion formulae. future weather. Calculated variables typically involve combina- tions of two or more observed variables, but they Obviously, diagnostic variables change with are not combined in an arbitrary fashion. Rather, time, and indeed, a description of the time ten- they are combined in ways having a physical dency for some diagnostic variable can be an basis. Such calculated variables are usually important diagnostic variable in its own right and sought because they have some valuable physical may even be valuable as a proper forecast pa- property, such as being conserved under certain rameter. The accuracy of forecasts made using a reasonable assumptions. For example, at the diagnostic variable as a forecast parameter will surface, the temperature and dewpoint tempera- tend to decay with time. Some forecast parame- ture are the common observed variables. How- ters might permit reasonably accurate forecasts ever, for reasons discussed in Sanders and Dos- only for a short period very close to the time of well (1995), mixing ratio and potential tempera- their diagnosis, whereas others might show high ture are conserved variables that incorporate the lagged correlation with the weather for a rela- pressure observations, as well as the temperature tively long time. and dewpoint. The formulae for calculating the mixing ratio and potential temperature are de- Therefore, what we are concerned with herein is termined by the laws governing the physical a forecast by a human forecaster that makes a properties of air parcels under the assumption of statement about the future weather (say, the oc- adiabatic flow. currence of severe storms), rather than a state- ment about the distribution of atmospheric state c. Derivatives or integrals (spatial or temporal) variables. For a variable to be useful in making of simple observed or calculated variables a weather forecast by a human, it must be shown rigorously that there is some measure of forecast Time and space derivatives and integrals of the skill when the current values of the variable are observed or calculated variables form the next used to make that weather forecast. This argu- class of diagnostic variables. In effect, these ment will be developed in detail in what follows. diagnostic quantities allow estimates of the terms in formulae that might arise in mathematical When calculating diagnostic variables, forecast- descriptions of atmospheric structure. An impor- ers unaware of the caveats associated with the tant caveat in the calculation of such diagnostic particular variable being analyzed can be misled. variables is the inevitable truncation error that What are the unique sensitivities in those calcu- arises from the limited temporal and spatial reso- lations? How much confidence can one put in lution of our meteorological information, the numbers and how they might change over whether it be observed or modeled. If we were time? If diagnostic variables are to be used to calculate, say, the Eulerian time rate of change 3
DOSWELL AND SCHULTZ 05 November 2006 of the 500-hPa height at some location where by a careful statistical verification study. Its soundings are taken, the true instantaneous value popularity is based almost entirely on anec- is not known, simply because soundings gener- dotal evidence and heuristic arguments. ally are taken at 12-h intervals. Of course, esti- 3. Term 1 on the rhs of the MFC equation mating that same variable at some point other associated with horizontal divergence is than where a sounding is launched 1 is problem- characteristically much larger than term 2, atic because of the small number of locations associated with moisture advection. The where soundings are made. The usual approach MFC field tends to look very much like that is to make estimates using available information; of the horizontal divergence field alone on this typically involves making assumptions about mesoscale and smaller scales. the behavior of the fields where we have no in- formation. Realizing the limitations of those 4. MFC is an inadequate tool as a forecast pa- calculations is critical. Just because the calcula- rameter because it combines two of the three tions were performed by computer is no guaran- ingredients required for deep, moist convec- tee that they are even remotely accurate. tion (e.g., Johns and Doswell 1992). The two components of the major term in this d. Combined variables combination, the mixing ratio and the diver- gence field, can evolve quasi-independently. Next is the class of combined diagnostic vari- What this variable shows is where those two ables. Many ways exist to take two or more di- main components overlap at the time of the agnostic variables and combine them in some analysis. Thus, MFC is first and foremost a way that might be more useful for some specific diagnostic variable. purpose than the raw observations or simple de- rivatives and integrals of those variables. Mois- CAPE is another example of a combined vari- ture flux convergence (MFC), discussed in detail able; its calculation involves a vertical integral of by Banacos and Schultz (2005), is an example of multiple state variables and incorporates a num- a combined variable. The formulation of MFC ber of assumptions (e.g., Doswell and Rasmus- can vary from one application to another, but it is sen 1994; Doswell and Markowski 2004). Nev- often formulated as the finite difference calcula- ertheless, that calculation can be represented by tion of the quantity: its most basic components: large CAPE is gen- MFC = −∇ h • (rVh ) = r ∇ • V + V • ∇ r erally observed where low-level moisture is 1 4h243h 1 h 24 4 h 3 found in the presence of conditionally unstable 1 2 lapse rates in the lower mid-troposphere. As with MFC, these constituent fields can evolve where r is the mixing ratio, ∇ h is the horizontal quasi-independently and can be superimposed by gradient operator, and Vh is the horizontal wind differential advection processes. Prior to that vector. This calculation involves a calculated superposition, the air streams carrying condition- variable (the mixing ratio) and a spatial deriva- ally unstable lapse rates and low-level moisture tive of an observed variable (the horizontal wind (typically at different levels in the atmosphere vector). MFC is often calculated using the rela- advecting variables in different directions and at tively dense surface observations, and, as used in different speeds) have not yet interacted and so forecasting severe storms, MFC is purported to little or no CAPE is found. show where ascent is occurring in the presence of surface moisture. Banacos and Schultz (2005) The presence of CAPE indicates when its con- emphasize four points about MFC relevant to stituents overlap, but an absence of CAPE prior this discussion. to that superpositioning cannot be used to infer 1. The scientific justification for using MFC as large CAPE will not be present in the future. a forecast tool for convective initiation is in- CAPE depicts where moisture and conditional adequate. instability are already superimposed, not where they will (or will not) be superimposed in the 2. The putative value of MFC as a forecast future. A comparable statement can be made variable has never been firmly established about MFC. This aspect of combined variables is an important element for understanding the 1 difference between a diagnostic variable and a In certain situations, the displacement of the sounding in- strument from its original launch location has to be accounted proper forecast parameter. for in the calculations. 4
DOSWELL AND SCHULTZ 05 November 2006 e. Indices 850–500-hPa lapse rate is very nearly dry adia- batic, the parcel is likely to arrive at 500 hPa Indices, the final class of diagnostic variables, colder than the observed 500-hPa temperature can be broken down into two distinct subclasses: because it will have followed a dry adiabat for indices based on physically based formulae and most of its ascent. Conversely, when the 850-hPa indices representing more or less arbitrary com- temperature–dewpoint spread is small, the parcel binations of diagnostic variables. This is a com- will ascend mostly along a moist adiabat and is plex topic and is the subject of a wholly separate likely to be warmer than the observed 500-hPa section (section 4, below). Before we discuss temperature, unless the lapse rate is less than indices, however, we consider several issues moist adiabatic. So far, so good, but consider the associated with diagnostic variables that affect example shown in Fig. 1a. If, as in this case, the their utility in diagnosing the current state of the low-level moisture decreases rapidly just below atmosphere. 850 hPa, the Showalter index may be calculated correctly, but its implications about the convective 3. Issues affecting the suitability of diagnostic variables All diagnostic variables are subject to error, but not all are equally error-prone. Errors can be de- composed into measurement errors, which have a number of sources that are generally instrument- dependent (Brock and Richardson 2001, their section 1.1.3), and sampling errors (associated with the finite number of observations). Meas- urement and sampling errors are associated with much of the volatility of diagnostic variables. Here we use volatility to mean that the variable can vary considerably over both time and space as a result of sensitivity to both measurement and sampling errors. Some diagnostic variables are inherently more volatile than others. Consider CAPE as an example of a combined variable, and compare it to the Showalter (1947) index. Both of these purport to be variables per- tinent to forecasting severe convective weather, and both are based on simple parcel theory. The Showalter index is determined by the tempera- ture difference between a hypothetical parcel lifted from 850 hPa to 500 hPa (a calculated di- agnostic variable) and that at 500 hPa (an ob- served diagnostic variable). The Showalter in- dex was originally proposed because it uses only three observed variables: 500-hPa and 850-hPa temperatures, and 850-hPa dewpoint tempera- ture. At the time, manual processing of sound- ings meant that the mandatory pressure-level data were available more quickly than the rest of the sounding. Calculating an index requiring only mandatory-level data was advantageous because it gave forecasters a quick look at the convective instability before the whole sounding came in later. Figure 1. Rawinsonde plots on a Skew-T–log p The temperature of a parcel lifted from 850 to 500 diagram for (a) Rapid City, SD on 09 June 1972, hPa depends strongly on the 850-hPa dewpoint and (b) Amarillo, TX on 04 March 2004. See depression. When that value is large, unless the text for discussion. 5
DOSWELL AND SCHULTZ 05 November 2006 instability of the atmosphere can be misleading. tions occurs in part because the Showalter index This example was taken on the morning of a involves a differential quantity, and derivatives devastating flash flood in the vicinity that eve- generally are much more sensitive to errors than ning—the calculated Showalter index nominally are the basic quantities being differenced. would indicate little or no chance for thunder- storms simply because of the low dewpoint tem- On the other hand, the CAPE calculation is a perature at 850 hPa. vertical integral that uses measurements at more than two levels. By virtue of being an integral quantity, rather than a derivative, calculation of CAPE is inherently less sensitive to small differ- ences that might arise from measurement errors. Unfortunately, that apparently useful property of integration renders its values non-unique; that is, you can get the same CAPE value from distinctly different vertical distributions of a parcel’s ther- mal buoyancy (Fig. 2 - see also Blanchard 1998). It is likely that one would interpret these sound- ings differently in terms of the weather forecasts, but considering only the CAPE values without seeing the soundings would permit no such dis- crimination. Moreover, as discussed in Brooks et al. (1994), finding a representative sounding can be some- thing of a challenge, owing to undersampled vari- ability in space and time. This undersampled variability also affects any interpretation of the Showalter index, of course. At least in the short term, little can be done about measurement errors and undersampling. However, when considering how to interpret diagnostic variables, it seems quite unlikely to be able to define a diagnostic variable that distills the relatively rich complexity of a complete sounding into a single number that isn’t vulnerable to being unrepresentative under some circumstances. In fact, we assert that no single number can replace the value of a fore- caster simply looking at the soundings, as well as looking at diverse diagnostic variable computa- tions based on those soundings. Any obvious errors in the sounding, or any characteristics that would render diagnostic variable computations misleading, will be apparent to someone experi- enced in sounding interpretation. Figure 2. Soundings with very nearly equal CAPE values, but showing distinctly different Further, we suggest that this principle applies to vertical distributions of CAPE—(a) for Bis- any diagnostic variable, not just those based on marck, ND at 00 UTC on 24 May 2000, and (b) soundings. The diagnostic variable calculations for Slidell, LA at 00 UTC on 31 October 2004. still add value beyond what our hypothetical forecaster can see simply by viewing the data on Indices can be affected dramatically when a display, but using only the diagnostic variables soundings rise through precipitation (as in Fig. as a substitute for considering all the information 1b), or from errors that just happen to fall at in the data is inherently risky in making weather mandatory pressure levels, thereby rendering forecasts. calculation of parameters such as the Showalter index unrepresentative or even meaningless. Volatility is associated with the specific way in This sensitivity to a small number of observa- which the observations are used to construct the 6
DOSWELL AND SCHULTZ 05 November 2006 fields of a diagnostic variable. Consider the hori- If divergence is calculated from a sparse network zontal divergence, expressed in conventional of points, such as rawinsonde sites (L~400 km), meteorological notation as: a rough order of magnitude for the divergence, ⎛ ∂u ∂v ⎞ according to the above scaling rule, is about (1– div h = ∇ h • Vh = ⎜ + ⎟ , 10 m s–1) ÷ 400 km = 2.5 x (10–6–10–5) s–1. In ⎝ ∂x ∂y ⎠ contrast, for the network of surface observations where (u,v) is the Cartesian coordinate (x,y) form (L~100 km), the simple scaling rule gives a value of the horizontal velocity vector. When ex- of 1 x (10–5–10–4) s–1, which is four times larger. pressed in terms of natural coordinates—along and normal to the horizontal wind vector— Panofsky (1964, p. 33 ff.) has shown that under “normal circumstances” (i.e., quasi-hydrostatic flow) the horizontal divergence calculation in- volves the difference between two relatively large numbers, so it can be quite sensitive to small changes in the winds. Note that this sensi- tivity is not present when calculating the vertical component of the vorticity, ζ, where ⎛ ∂v ∂u ⎞ ς = k• ∇× V = ⎜ −⎟, ⎝ ∂x ∂y ⎠ and where k is the unit vector in the vertical— the two terms in this calculation have no charac- teristic tendency to cancel one another. Further, a divergence (or vertical vorticity) cal- culation is dependent on the resolution of the data. To show this, assume a characteristic mag- nitude for the difference in wind velocity be- tween two points (δV, which is the same order of magnitude as the wind velocity itself, V), sepa- rated by a characteristic distance scale L, so that δV div h ~ , L which is a simple scaling law for the divergence (or vorticity) magnitude. The order of δV turns out to be not very scale dependent, typically of order 1–10 m s–1. This characteristic difference rarely reaches 100 m s–1 or becomes as small 0.1 Figure 3. Objective analyses of surface diver- m s–1. Thus, over a fairly wide range of scales, gence at (a) 1100 UTC and (b) 1200 UTC 14 the calculated divergence tends to depend most February 2000, illustrating the volatility of the strongly on the distance between sample points. 2 details. Solid contours are nonnegative diver- gence, starting at zero, and dashed contours are 2 negative values. Conventional wind barbs show Note that at synoptic scales (where L ~ 1000 km), because the wind is nearly geostrophic, the characteristic divergence the observations used in the calculations. magnitude is an order of magnitude less than this simple scaling law predicts because the geostrophic wind calculated on an f-plane in many meteorological coordinate systems is According to this simple scaling law, horizontal nondivergent (see Doswell 1988). This can be accounted for divergence is inversely proportional to the scale by changing the scaling law to of the data resolution over a wide range of V scales. divh ~ Ro L as shown by Haltiner and Williams (1980, p. 57), where Ro is Like the Showalter index, divergence calcula- the Rossby number and is of order 10–1 on synoptic scales, tions can be volatile. This volatility also follows but dynamics-based scale analysis is beyond the scope of this paper. Again, synoptic-scale vertical vorticity scaling is not from the simple scaling law above: if winds are subject to this consideration. only known to within 2 m s–1, then a divergence 7
DOSWELL AND SCHULTZ 05 November 2006 estimate based on a station separation of 100 km address topics in turbulence theory (Tennekes is known only to within 2 x 10–5 s–1. The fields and Lumley 1972, p. 98 ff.). tend to be noisy and behave somewhat erratic- ally over time as a result of undersampled vari- Many of these indices, including the SWEAT ability in the wind field and measurement er- index, CT, EHI, SCP, and STP, have combined rors, although the basic shape of the fields variables in ways that have no physical rationale. might be fairly consistent from one time to the In other words, the process of forming sums, next (Fig. 3). products, and ratios has not been done in accor- dance with a formula originating in the mathe- To some extent, this volatility can be reduced by matics describing a physical process. Rather, the heavy smoothing that limits the spatial and/or mathematical expression for the index is more or temporal scales of the features retained to those less arbitrary. Why a sum of two variables di- that can be depicted reliably in the analysis vided by a third? Why not one variable raised to (Doswell 1977). The consistency of the basic a power defined by a second variable multiplied shape of the fields, combined with the volatility by a third? of the details, is a direct indication of the sensi- tivity of the details in the field to small changes As we have mentioned for the case of simple in the wind. diagnostic variables calculated from observations (cf. section 2b), combining two or more vari- ables in a way that has a physical basis affects 4. Indices the interpretation and use of the resulting vari- able. If a variable is conserved during certain Having considered several of the issues that con- physical processes, for example, that is quite front users of diagnostic variables, we are now relevant to its application in diagnosis or fore- prepared to consider the topic of indices. Indices casting. have a long history in severe storms forecasting that perhaps began with the Showalter index At issue is whether or not the variable can be (SI). Their use has continued through a growing related to physical principles. Examples of diag- plethora of constructs, including, among many nostic variables based on physical principles others, the lifted index (LI), SWEAT index, bulk might be something like the static stability time Richardson number (BRN), energy–helicity in- tendency, potential vorticity, or energy dissipa- dex (EHI), Cross Totals (CT), SCP, significant tion rate. Combining two or more variables in an tornado parameter (STP), and enhanced stretch- arbitrary way leaves open many questions and ing potential (ESP; J. Davies 2005, personal makes it difficult to relate the variable to any communication). These and selected other indi- physical understanding of the process. The indi- ces are listed in Table 1. Indices also have been vidual diagnostic variables used to form an index applied to forecasts other than severe convective may have physical relevance to the problem at weather. One example is the Garcia (1994) hand, but when a specific formula combining method for forecasting snowfall. Wetzel and them is unphysical, this can be problematic. Martin (2001) and Schultz et al. (2002) discuss the scientific integrity of the Garcia (1994) Furthermore, the physical dimensions of these method and other approaches. indices may or may not make any physical sense. For example, the ratio of CAPE to shear (a ratio Some indices are associated with a physical ar- used in one form or another for several indices gument. The early stability indices (e.g., SI, LI) within Table 1) has dimensions of J kg–1 s, the were based on simple parcel theory, although we product of energy per unit mass and time, have suggested some issues with their use as whereas the product of CAPE and shear has di- diagnostic variables previously, to say nothing of mensions of J kg–1 s–1. The former has no obvi- their use as forecast variables. The dimen- ous physical interpretation, whereas the latter has sionless BRN is at least related to the true dimensions of energy per unit mass per unit Richardson number, but the actual physical sig- time, which at least can be related to terms in an nificance of any Richardson number to the phys- energy budget. Thus, the product of CAPE and ics of deep moist convection is unclear, as the shear might yield a more physically meaningful original intent of the Richardson number was to parameter than the quotient of CAPE and shear. 8
DOSWELL AND SCHULTZ 05 November 2006 Table 1. A selection of indices commonly used in the United States for severe storm forecasting. In the formulae, T denotes a temperature and D denotes a dewpoint temperature in ºC, with a subscript indicating at what mandatory pressure level (in hPa) this value is to be taken from; α denotes the specific volume and a subscript lp denotes a value associated with a lifted parcel; LFC stands for a lifted parcel’s level of free convection and EL stands for its equilibrium level. For the Lifted index, the lifted parcel is for a surface parcel with forecast properties at a representative time of day. For the SWEAT index, V denotes a wind speed (in knots), and ΔV denotes a wind direction difference (in degrees). For the Bulk Richardson num- ber, U denotes the density-weighted speed of the mean vector wind in the layer 0–6 km, and U 0 denotes ( ) the speed of the mean vector wind in the layer from the surface to 500 m—the quantity U − U 0 is some- times referred to as the “BRN shear”. For the storm-relative helicity, C denotes the storm motion vector. See Thompson et al. (2003) for an explanation of symbols used for the SCP and STP calculations. Index Name Formula Reference Showalter index SI = T500 – Tlp(850 hPa) Showalter (1947) Lifted index LI = Tlp(fcst surface) - T500 Galway (1956) K-index K = (T850 − T500 ) + D850 − (T700 − D700 ) George (1960, pp. 407–415) ∫ (α ) Convective EL Glickman available poten- CAPE = lp − α dp (2000, p. 176) LFC tial energy Vertical totals VT = T850 – T500 Miller (1972) Cross totals CT = D850 – T500 Miller (1972) Total totals TT = VT +CT Miller (1972) SWEAT index SWEAT = 20(TT–49ºC) + 12D850 + 2V850 + V500 + 125[sin(ΔV500- Miller (1972) 850) + 0.2] Bulk Richard- CAPE Weisman and BRN = son number (U − U 0 ) 1 2 Klemp (1982) 2 Storm-relative ⎡ ∂V ⎤ Davies-Jones SRH = − ∫ z k • ⎢(Vh − C) × h ⎥ dz z helicity o ⎣ ∂z ⎦ et al. (1990) Energy-helicity EHI = (CAPE )(SRH ) Hart and index 160,000 Korotky (1991) Supercell com- ⎛ MUCAPE ⎞⎛ SRH 0−3 km ⎞⎛ U − U 0 ⎞ Thompson et posite parameter SCP = ⎜ −1 ⎟⎜ 2 −2 ⎟⎜ −1 ⎟ al. (2003) ⎝1000 J kg ⎠⎝100 m s ⎠⎝ 40 ms ⎠ Significant tor- ⎛ MLCAPE ⎞⎛ SHR 0− 6 km ⎞⎛ SRH 0−1km ⎞⎛ (2000m − MLLCL )⎞ Thompson et nado parameter STP = ⎜ −1 ⎟⎜ −1 ⎟⎜ 2 −2 ⎟⎜ ⎟ al. (2003) ⎝ 1000 J kg ⎠⎝ 20 ms ⎠⎝ 100 m s ⎠⎝ 1500 m ⎠ Enhanced ⎛ ∂T ⎞⎛ MLCAPE 3 km ⎞ Davies (2005 – stretching po- ESP = ⎜⎜ − 7 o Ckm−1 ⎟⎟⎜ ⎟ personal com- ⎝ ∂z −1 tential 0−2 km ⎠⎝ 50 J kg ⎠ munication) 9
DOSWELL AND SCHULTZ 05 November 2006 a. Example of representative problems with forecasting tool: if the values of SRH and indices: EHI CAPE, fall within this presumably complex re- gion of 2D phase space, supercell tornadoes al- In order to demonstrate these issues using one of ways occur. Anywhere outside of this region, these indices, we use the EHI as a representative supercell tornadoes never occur. Use of the EHI example, although any of the above-listed indi- compresses the information within this hypo- ces would reveal similar problems. As described thetical 2D space into a single number, eliminat- in Rasmussen and Blanchard (1998, p. 1154), ing our ability to apply the information in this “This index [using the 0–1-km layer] is used phase space. operationally for supercell and tornado forecast- ing, with values larger than 1.0 indicating a po- Furthermore, EHI is scaled by 160,000 some- tential for supercells, and EHI > 2.0 indicating a what arbitrarily, so that whether the value of the large probability of supercells.” Our problems EHI is 1 or 100 is similarly arbitrary. EHI’s with EHI center around five issues: combination scaling constants are associated with a “stan- of ingredients, arbitrary construction, choice of dard” value for CAPE of 1000 J kg–1 and for scaling constant, ambiguous physical meaning, SRH of 160 m2 s–2—note that these units are and lack of proper validation. actually equivalent and so EHI has dimensions of (J kg–1 = m2 s–2)2, which has little obvious First, EHI is a combination of two separate vari- physical interpretation. For indices of this sort, ables that may not even be collocated during the the scaling constants are determined by what the event and can evolve separately, as discussed in developer felt to be typical values for the input section 3. For instance, although large CAPE variable. What is typical for some variable asso- and large SRH are both pertinent to supercell ciated with severe weather in one part of the forecasting, they need not be precisely collocated world may not be typical elsewhere. Hence, this in space—many severe storm forecasters believe can lead to incorrect interpretations of the index CAPE and SRH need simply be in proximity to when used outside of the region in which it was each other, perhaps with some overlap. There- developed (Tudurí and Ramis 1997). It is rea- fore, a variable based on the combination of sonable, we believe, to ask that a forecast pa- these two variables may not adequately reflect rameter’s utility and interpretation should not the true potential for storms. vary from one location to another. Further, EHI is the combination of two ingredi- Moreover, consider our hypothetical world ents in an unphysical, arbitrary fashion. Can it be again, in which the constituent parameters for shown that the formation of a supercell depends EHI can be used to construct a 2D phase space in some well-defined physical way on the prod- that includes some complex region wherein su- uct of CAPE and SRH? Although we cannot percell tornadoes always occur. It is conceivable exclude such a possibility in the future, as of this that some transformation of the EHI’s constituent writing, this product has not been shown to be variables (CAPE and SRH) would convert the physically pertinent, in the sense of appearing in complex region into a simple one—say, a circle. some physically-relevant mathematical formula. However, it seems highly unlikely that the exist- Schultz et al. (2002) made a similar argument ing scaling and unphysical construction of EHI about the lack of scientific justification in refer- could correspond to such a transformation. ence to the PVQ parameter (the product of PVes and the divergence of Q when both are negative) This leads to the next point: how does a fore- defined and proposed by Wetzel and Martin caster interpret EHI values? Put another way, (2001). does a situation where EHI=2 mean that super- cells are twice as likely as a situation where Consider the following argument. Imagine a EHI=1? If the CAPE doubles, doubling the EHI hypothetical world where the EHI’s two con- (assume SRH remains unchanged), does this stituent parameters were all that is needed to imply twice the chance of supercells? There forecast supercell tornadoes perfectly. Then might be some empirical way to determine the picture a two-dimensional (2D) phase space of significance of these values, but there is no SRH and CAPE in which some irregular region physical rationale for interpreting them. Is the within this phase space was associated with the relationship between EHI and severe weather occurrence of supercell tornadoes. By assump- linear or nonlinear? How does the lagged corre- tion, such a scenario would represent a perfect lation between EHI and the observed weather 10
DOSWELL AND SCHULTZ 05 November 2006 vary as a function of the lag time? Such ques- As already noted, many severe weather forecast- tions ought to be of concern to any severe ers already recognize the risks in relying on a weather forecaster, but are certainly not readily single forecast parameter. Certainly most would answerable by the method EHI was constructed. never consider using a single variable to deter- mine their forecast, but our experience suggests Finally, how has the forecast value of EHI been that some forecasters might be tempted to do so, validated? Rasmussen (2003) described his perhaps because the use of some diagnostic vari- evaluation of EHI for three classes of observed able (such as CAPE) is so widespread (e.g., the proximity soundings: supercells with significant “disengaged” forecasters studied by Pliske et al. tornadoes (those rated F2 and greater), supercells 2004). The pervasive use and ready availability without significant tornadoes (no tornadoes re- of diagnostic variables is a trap for the unwary. ported, or only F0 or F1 tornadoes), and nonsu- In situations where the time pressure becomes percell convection, defined in Rasmussen and intense, some might be inclined to do so in the Blanchard (1998). The weather events occurred interest of making a quick decision. Such a prac- anywhere from three hours before to six hours tice is antithetical to good forecasting, in general. after the nominal 00 UTC sounding time. Ras- mussen (2003) found, “only 25% of [proximity In a related line of reasoning, the press of time soundings from supercells without significant can encourage severe weather forecasters’ use of tornadoes] had EHI > 0.5, whereas nearly 2/3 of indices and other diagnostic variables to obtain a the [proximity soundings from supercells with quick look at the data for the purpose of identify- significant tornadoes] had values this large.” ing the hot spots upon to focus more attention on Given the number of soundings in each dataset in a diagnosis. By itself, this is a reasonable from Rasmussen and Blanchard (1998), this strategy. However, a serious forecaster is not means about 30 events occurred in each cate- likely to get much forecasting help from such a gory—a relatively small sample size. Taken to- cursory consideration of atmospheric structure. gether, these results indicate that when a supercell There is nothing inherently wrong with a quick occurs with an EHI > 0.5 there is about a 50% look, unless the forecaster limits the diagnosis to chance of it producing a significant tornado. This that. For all the reasons we have described, a figure amounts to a conditional probability, where conscientious weather forecaster always should the condition is the presence of a supercell. try to find the time to do a comprehensive analy- sis of the data. We believe that using ingredi- Rasmussen and Blanchard (1998) used a contin- ents-based forecasting methods (e.g., Doswell et gency table and scatter diagrams (see section 5) al. 1996) is a scientifically sound way to keep the to evaluate the diagnostic potential for EHI and diagnosis within practical time limitations in an several other candidate variables. But their study operational forecasting environment. evaluated indices from proximity soundings on the basis of observed events occurring in a time It also can be argued from a purely utilitarian per- period around the sounding’s nominal time. spective that if a forecast parameter works suc- Therefore, it is not truly an analysis of the fore- cessfully in forecasting, no matter how it was de- cast potential for the variables considered—it is rived, it seems unreasonable to ask forecasters to instead directed at a related problem: How well cease using it. We don’t disagree with this at all, do diagnosed values of the indices discriminate especially when proven forecast parameters based among the observed events? This sort of analy- on physical arguments are either unavailable or sis does not consider the topic of the lagged cor- demonstrably inferior to a nonphysically con- relation between the proposed forecast parameter structed variable. If it can be shown rigorously and the forecast severe weather events. that an arbitrarily constructed index does indeed have forecast utility (see the following section), b. Pros and cons regarding the use of indices we do not advocate ignoring its proven value to the challenge of weather forecasting—unless a We have shown, using the example of EHI, physically-based forecast parameter is known to what types of problems can arise with the use of be superior for forecasting. Generally, physical indices constructed in an arbitrary, unphysical reasoning is always preferable for the construction way. We recognize that there are advantages as of diagnostic variables and indices, owing to the well as drawbacks to their use in severe weather relative ease with which such forecast parameters forecasting. can be interpreted and applied globally. 11
DOSWELL AND SCHULTZ 05 November 2006 5. Evaluation of forecast utility for a candi- of the developmental dataset—in other words, a date prognostic variable wholly different set of cases than those used for development and testing of the threshold values Because many diagnostic variables with poten- for the variable. If the results using the verifica- tial forecast utility never have been tested rigor- tion dataset are comparable to those found from ously as forecast parameters in their own right, the developmental data, confidence in the use of we develop herein a general description of what the variable as a forecast variable is correspond- we believe are the requirements that a proper ingly high. If there is a statistically significant forecast parameter would have to meet. As al- difference between the results from the two data- ready discussed, diagnostic variables have their sets, then perhaps a larger sample is needed, but own specific purposes, but a diagnostic variable in any case, confidence in the forecast value of might also have the capability to make a rea- the variable (and its associated threshold value) sonably accurate and perhaps even skillful pre- is correspondingly low. diction of the weather at some future time. See Murphy (1993) for a discussion of the difference Two concerns often arise when a forecast pa- between accuracy and skill. rameter is proposed. First, many assessments of potential forecast parameter are done with a One way to conduct a rigorous assessment of a small number of cases, perhaps as few as one. variable as a forecast parameter is to use a devel- Knowledge of how to determine an appropriate opmental data set to form a classic 2 x 2 contin- sample size is outside the scope of this paper; see gency table, the standard verification table when the discussion of hypothesis testing by Wilks considering a dichotomous (yes/no) forecast for (2006, chapter 5). It is incumbent on the devel- some dichotomous event (Wilks 2006, p. 260 ff). oper of a forecast parameter to provide a rea- One example is given by Monteverdi et al. sonably thorough test using a robust sample of (2003) for tornadoes in California (see their Ta- enough cases. Second, many attempts to vali- ble 3 and Fig. 8). To create such a table for a date the utility of some variable as a forecast potential forecast parameter, begin with choosing parameter make the logical mistake of consider- a threshold value for the candidate variable— ing only values of the parameter when forecast forecast "yes" if the variable is at or above the events are known to have occurred. Diagnostic threshold, and "no" if the variable is below the variables are of little use in forecasting until they threshold. An assessment of the accuracy of the can be shown to discriminate successfully be- forecasts using the developmental dataset would tween events and nonevents. Correct predictions be done by filling in the contingency table using of nonevents are not inevitably easy (e.g., Dos- that threshold. Optimizing the choice for the well et al. 2002) although in forecasting severe threshold value of the variable using the so- storms (relatively rare events), many nonevents called Relative (or Receiver) Operating Charac- are obvious. teristic curves associated with signal detection theory is possible; see Wilks (2006, p. 294 ff.) Further, some diagnostic variables may have for more information. value as forecast parameters, but it needs to be shown just how far in advance of the event they The accuracy of the forecasts using that thresh- exhibit forecast accuracy and/or skill. That is, the old can be assessed using standard methods accuracy of any diagnostic variable used as a based on the contingency table. The skill of the forecast parameter is likely to increase as the forecasts is determined by comparing the accu- time lag between the diagnosis and the event racy of the proposed forecasts based on use of decreases, but this may not necessarily be a sim- that variable against the accuracy of some stan- ple relationship. Contingency tables and full dard forecasting method (e.g., climatology or assessments as described above would have to be persistence, or some other forecast scheme, such developed for a variety of diagnosis times rela- as Model Output Statistics). If the forecast tive to the beginning of the forecast events—say, scheme using the proposed diagnostic variable 12, 6, 3, and 1 h before the actual events begin. shows statistically significant skill in comparison Alternatively, an analysis of the time-lagged with some standard method, then it can be con- correlation between the forecast parameter and sidered a useful forecast parameter. the observed weather could be carried out at a variety of lag times. However it is done, the To do a thorough assessment, however, another accuracy of a candidate variable as a forecast dataset is needed that is completely independent parameter should be known as a function of time 12
DOSWELL AND SCHULTZ 05 November 2006 before the event. Quantitative knowledge of information at their disposal as effectively as forecast accuracy as a function of lead time is possible. obviously important when using a diagnostic variable as a forecast parameter. Acknowledgments. Funding was provided by NOAA/Office of Oceanic and Atmospheric Re- Another way to verify the potential of a forecast search under NOAA–University of Oklahoma variable would be to construct a multidimen- Cooperative Agreement NA17RJ1227, U.S. De- sional scatter diagram (say, for the case of two partment of Commerce. We would like to thank dimensions, CAPE and shear) in which both our reviewers for their careful and thoughtful events and nonevents are plotted with respect to reviews and helpful suggestions, which have observed values for the diagnostic variables. improved our presentation. Using this plot, a probability of occurrence of the weather event as a function of its location in the REFERENCES scatter diagram could developed, perhaps facili- tated by the use of kernel density estimation Banacos, P. C., and D. M. Schultz, 2005: The methods (e.g., Ramsay and Doswell 2005). Ex- use of moisture flux convergence in forecast- amples of such verification are found in Rasmus- ing convective initiation: Historical and op- sen (2003) and Brooks et al. (2003), although erational perspectives. Wea. Forecasting, 20, this method would require diagnostic variable 351–366. values prior to the event, rather than proximity Blanchard, D. O., 1998: Assessing the vertical data. distribution of convective available potential energy. Wea. Forecasting, 13, 870–877. Of course, the preceding does not present the Brock, F. V., and S. J. Richardson, 2001: Mete- only ways to assess the effectiveness of a pro- orological Measurement Systems. Oxford posed forecast variable. Many other methods University, 290 pp. could be used, but it does represent the level of Brooks, H. E, C. A. Doswell III, and J. Cooper, rigor we believe is necessary before asserting 1994: On the environments of tornadic and that a diagnostic variable has real value as a true non-tornadic mesocyclones. Wea. Forecast- forecast parameter. ing, 10, 606–618. ––––––––, J. W. Lee, and J. P. Craven, 2003: 6. Conclusions The spatial distribution of severe thunder- storm and tornado environments from global In our experience, many severe weather forecast- reanalysis data. Atmos. Res., 67–68, 73–94. ers and researchers are seeking a “magic bullet” Davies-Jones, R., D. Burgess, and M. Foster, when they offer yet another combined variable or 1990: Test of helicity as a tornado forecast index for consideration, whether or not they real- parameter. Preprints, 16th Conf. on Severe ize it. If some single variable or combination of Local Storms, Kananaskis Park, AB, Canada, variables made forecasting so simple, then the Amer. Meteor. Soc., 588–592. need for human forecasters effectively vanishes. Doswell, C. A. III, 1977: Obtaining meteoro- There may be other reasons for the demise of human forecasters, but distilling the complex logically significant surface divergence fields atmosphere with its nonlinear, possibly chaotic, through the filtering property of objective interactions into an all-encompassing variable analysis. Mon. Wea. Rev., 105, 885–892. seems improbable. Any forecaster seeking to ––––––––, 1986: Short range forecasting. find such a variable is not only unlikely to be Mesoscale Meteorology and Forecasting. P. successful, but, if success were achieved, the Ray, Ed., Amer. Meteor. Soc., 689–719. need for a human forecasting that event van- ––––––––, 1988: Comments on “An improved ishes! Should advances in the science of severe technique for computing the horizontal pres- convective storms ever produce such a forecast sure-gradient force at the earth's surface.” parameter, or should NWP models become near- Mon. Wea. Rev., 116, 1251–1254. perfect in terms of forecasting severe convection, ––––––––, 2004: Weather forecasting by hu- then the need for human forecasters will indeed mans—Heuristics and decision making. disappear (e.g., Doswell 2004), whatever our Wea. Forecasting, 19, 1115–1126. wishes might be. But it is our belief that this is ––––––––, and E. N. Rasmussen, 1994: The not very likely to happen soon. Even if such an effect of neglecting the virtual temperature unlikely development ever occurs, in the interim, correction on CAPE calculations. Wea. it remains incumbent on forecasters to use the Forecasting, 9, 625–629. 13
DOSWELL AND SCHULTZ 05 November 2006 ––––––––, and P. M. Markowski, 2004: Is Pliske, R. M., B. Crandall, and G. Klein, 2004: buoyancy a relative quantity? Mon. Wea. Competence in weather forecasting. Psycho- Rev., 132, 853–863. logical Investigations of Competence in Deci- ––––––––, H. E. Brooks, and R. A. Maddox, sion Making. K. Smith, J. Shanteau, and P. 1996: Flash flood forecasting: An ingredi- Johnson, Eds., Cambridge University Press, ents-based methodology. Wea. Forecasting, 40–68. 11, 560–581. Ramsay, H., and C. A. Doswell III, 2005: A ––––––––, D. V. Baker and C. A. Liles 2002: sensitivity study of hodograph-based methods Recognition of negative mesoscale factors for for estimating supercell motion. Wea. Fore- severe weather potential: A case study. Wea. casting, 20, 954–970. Forecasting, 17, 937–954. Rasmussen, E. N., 2003: Refined supercell and Galway, J. G., 1956: The lifted index as a pre- tornado forecast parameters. Wea. Forecast- dictor of latent instability. Bull. Amer. Me- ing, 18, 530–535. teor. Soc., 43, 528–529. ––––––––, and D. O. Blanchard, 1998: A base- Garcia, C., Jr., 1994: Forecasting snowfall using line climatology of sounding-derived super- mixing ratios on an isentropic surface—An cell and tornado forecast parameters. Wea. empirical study. NOAA Tech. Memo. NWS Forecasting, 13, 1148–1164. CR-105, PB 94-188760 NOAA/NWS, 31 pp. Sanders, F., and C. A. Doswell III, 1995: A case [Available from NOAA/National Weather for detailed surface analysis. Bull. Amer. Me- Service Central Region Headquarters, Kansas teor. Soc., 76, 505–521. City, MO 64106-2897.] Schaefer, J. T., 1986: Severe thunderstorm fore- George, J. J., 1960: Weather Forecasting for casting: A historical perspective. Wea. Aeronautics. Academic Press, 673 pp. Forecasting, 1, 164–189. Glickman, T. S., Ed., 2000: Glossary of Meteor- Schultz, D. M., J. V. Cortinas Jr., and C. A. ology. 2nd ed., Amer. Meteor. Soc., 855 pp. Doswell III, 2002: Comments on “An opera- Haltiner, G. J., and R. T. Williams, 1980: Nu- tional ingredients-based methodology for merical Prediction and Dynamic Meteorol- forecasting midlatitude winter season precipi- ogy. 2d ed., John Wiley & Sons, 477 pp. tation.” Wea. Forecasting, 17, 160–167. Hart, J. A., and W. Korotky, 1991: The SHARP Showalter, A. K., 1947: A stability index for workstation v1.50 users guide. National forecasting thunderstorms. Bull. Amer. Me- Weather Service, NOAA, U.S. Department of teor. Soc., 34, 250–252. Commerce, 30 pp. [Available from NWS Thompson, R. L., R. Edwards, J. A. Hart, K. L. Eastern Region Headquarters, 630 Johnson Elmore, and P. Markowski, 2003: Close Ave., Bohemia, NY 11716.] proximity soundings within supercell envi- Johns, R. H., and C. A. Doswell III, 1992: Se- ronments obtained from the Rapid Update vere local storms forecasting. Wea. Forecast- Cycle. Wea. Forecasting, 18, 1243–1261. ing, 7, 588–612. Tennekes, H., and J. L. Lumley, 1972: A First Miller, R. C., 1972: Notes on analysis and se- Course in Turbulence. MIT Press, 300 pp. vere storm forecasting procedures of the Air Tudurí, E., and C. Ramis, 1997: The environ- Force Global Weather Central. Tech. Report ments of significant convective events in the 200(R), Headquarters, Air Weather Service, western Mediterranean. Wea. Forecasting, Scott Air Force Base, IL 62225, 190 pp. 12, 294–306. Monteverdi, J. P., C. A. Doswell III, and G. S. Weisman, M. L., and J. B. Klemp, 1982: The Lipari, 2003: Shear parameter thresholds for dependence of numerically simulated convec- forecasting tornadic thunderstorms in north- tive storms on vertical wind shear and buoy- ern and central California. Wea. Forecasting, ancy. Mon. Wea. Rev., 110, 504–520. 18, 357–370. Wetzel, S. W., and J. E. Martin, 2001: An op- Murphy, A. H., 1993: What is a good forecast? erational ingredients-based methodology for An essay on the nature of goodness in forecasting midlatitude winter season precipi- weather forecasting. Wea. Forecasting, 8, tation. Wea. Forecasting, 16, 156–167. 281–293. ––––––––, and ––––––––, 2002: Reply. Wea. Panofsky, H., 1964: Introduction to Dynamic Forecasting, 17, 168–171. Meteorology. Pennsylvania State University, Wilks, 2006: Statistical Methods in the Atmos- 243 pp. pheric Sciences, 2d ed. Academic Press, 627 pp. 14
You can also read