|
In practise many cases are aggregated in order to evaluate the forecast behaviour of the ensemble. However, it is always useful to complement such assessments with case studies of individual events, like the one in this exercise, to get a more complete picture of IFS performance and identify weaker aspects that need further exploration. |
The exercises described below are available as a set of Metview macros with the accompanying data. This is available as a downloadable tarfile for use with Metview. It is also available as part of the OpenIFS/Metview virtual machine, which can be run on different operating systems.
For more details of the OpenIFS virtual machine and how to get the workshop files, please contact: openifs-support@ecmwf.int.
In 2012, at the time of this case study, ECMWF operational forecasts consisted of:
In 2016, the ECMWF operational forecasts has been upgraded compared to 2012 and consisted of:
Please follow this link to see more details on changes to the ECMWF IFS forecast system (http://www.ecmwf.int/en/forecasts/documentation-and-support/changes-ecmwf-model)
If using the OpenIFS/Metview virtual machine with these exercises the recommended memory is at least 6Gb, the minimum is 4Gb. If using 4Gb, do not use more than 2 parameters per plot.
These exercises use a relatively large domain with high resolution data. Some of the plotting options can therefore require significant amounts of memory. If the virtual machine freezes when running metview, please try increasing the memory assigned to the VM.
To begin:
metview & |
Please enter the folder 'openifs_2018' to begin working. |
To save images during these exercises for discussion later, you can either use:
"Export" button in Metview's display window under the 'File' menu to save to PNG image format. This will also allow animations to be saved into postscript.
or use the following command to take a 'snapshot' of the screen:
ksnapshot |
If you want to create animations, save the figures as postscript and then use the convert
command:
convert -delay 75 -rotate "90<" in.ps out.gif |
In this exercise, the development of Hurricane Nadine and the cut-off flow up to the 20th September 2012 is studied.
Begin by entering the folder labelled 'Analysis':
an_1x1.mv : this plots horizontal maps of parameters from the ECMWF analyses overlaid on one plot. an_2x2.mv : this plots horizontal maps of parameters from the ECMWF analyses four plots to a page (two by two). an_xs.mv : this plots vertical cross-sections of parameters from the ECMWF analyses. |
Right-click on the 'an_1x1.mv' icon and select the 'Visualise' menu item (see figure)
After a pause, this will generate a map showing mean-sea-level pressure (MSLP).
Drag and drop the mv_track.mv icon onto the map to add the track of Hurricane Nadine.
In the plot window, use the play button in the animation controls to animate the map and follow the development and track of Hurricane Nadine.
You can use the 'Speed' menu to change the animation speed (each frame is every 6 hours).
If the contour lines appear jagged, in the plot window, select the menu item 'Tools -> Antialias'. |
Please close any unused plot windows if using a virtual machine. This case study uses high resolution data over a relatively large domain. Multiple plot windows can therefore require significant amounts of computer memory which can be a problem for virtual machines with restricted memory. |
This task creates Figure 2. from Pantillon et al.
Right-click the mouse button on the 'an_1x1.mv' icon and select the 'Edit' menu item.
An edit window appears showing the Metview macro code used to generate the plot. During these exercises you can change the top lines of these macros to alter the choice of parameters and plot types.
# Available parameters: # mslp,t2,wind10,speed10,sst # t,z,pt,eqpt [850,700,500,200] # wind,speed,r[925,850,700,500,200] # w700, vo850, pv320K |
The surface fields (single level) are: mslp (mean-sea-level-pressure), t2 (2-metre temperature), wind10 (10-metre wind arrows), speed10 (wind-speed at 10m : sqrt(u^2+v^2)), sst (sea-surface temperature).
The upper level fields are: t (temperature), z (geopotential), pt (potential temperature), eqpt (equivalent potential temperature), wind (wind arrows), speed (wind-speed as contours), r (relative humidity).
These fields have a list of available pressure levels in square brackets.
To plot upper level fields, specify the pressure level after the name. e.g. z500 would plot geopotential at 500hPa.
Some extra fields are also provided: w700 (vertical velocity at 700hPa), vo850 (relative vorticity at 850hPa) and pv320K (potential vorticity at 320K).
Wind fields are normally plotted as coloured arrows. To plot them as wind barbs add the suffix '.flag'. e.g. "wind10.flag" will plot 10m wind as barbs.
With the edit window open, find the line that defines 'plot1':
Change this line to:
The '.s' means plot the 500hPa geopotential as a shaded plot instead of using contours (this style is not available for all fields). Click the play button and then animate the map that appears |
Change the value of
You might add the mslp or z500 fields to this plot e.g.
Note that the fields are plotted in the order specified in the list! |
Q. When does the cut-off low form (see z500)? |
Right-click on 'an_1x1.mv' icon and select 'Edit'.
In the edit window that appears
With With With |
Change, mapType=0
to mapType=1
then click the play button at the top of the window.
Repeat using mapType=2
to see the smaller region over France.
These different regions will be used in the following exercises.
Animate the storm on this smaller geographical map.
The 'an_2x2.mv' icon plots up to 4 separate figures on a single frame.
Right-click on the 'an_2x2.mv' icon and select the 'Edit' menu item.
#Define plot list (min 1- max 4) plot1=["mslp"] plot2=["wind10"] plot3=["speed500","z500"] plot4=["sst"] |
Click the play button at the top of the window to run this macro with the existing plots as shown above.
Each plot can be a single field or overlays of different fields.
Wind parameters can be shown either as arrows or as wind flags ('barbs') by adding '.flag' to the end of variable name e.g. "wind10.flag".
Animating. If only one field on the 2x2 plot animates, make sure the menu item 'Animation -> Animate all scenes' is selected. Plotting may be slow depending on the computer used. This reads a lot of data files. |
Q. What do you notice about the SST field? |
Open the folder 'satellite' (back in the top level folder 'openifs_2018') (scroll the window if it is not visible).
This folder contains satellite images (water vapour, infra-red, false colour) for 00Z on 20-09-2012 and animations of the infra-red and water vapour images. The animations go beyond the 20-09-2012.
Double click the images to display them.
Use the an_1x1.mv and/or the an_2x2.mv macros to compare the ECMWF analyses with the satellite images.
Should this task should move to the second look at analyses after 20/9? |
The last task in this exercise is to look at cross-sections through Hurricane Nadine and the cut-off low.
Right click on the icon 'an_xs.mv', select 'Edit' and push the play button.
The plot shows potential vorticity (PV), wind vectors and potential temperature roughly through the centre of the Hurricane and the cut-off low. The red line on the map of MSLP shows the location of the cross-section.
Q. Look at the PV field, how do the vertical structures of Nadine and the cut-off low differ? |
Cross-section data is only available every 24hrs.
This means the 'steps' value in the macros is only valid for the times: [2012-09-20 00:00], [2012-09-21 00:00], [2012-09-22 00:00], [2012-09-23 00:00], [2012-09-24 00:00], [2012-09-25 00:00]
To change the date/time of the plot, edit the macro and change the line:
steps=[2012-09-22 00:00] |
A reduced number of fields is available for cross-sections: temperature (t), potential temperature (pt), relative humidity (r), potential vorticity (pv), vertical velocity (w), wind-speed (speed; sqrt(u*u+v*v)) and wind vectors (wind3).
#Cross section line [ South, West, North, East ] line = [30,-29,45,-15] |
The cross-section location (red line) can be changed in this macro by defining the end points of the line as shown above.
Remember that if the forecast time is changed, the storm centres will move and the cross-section line will need to be repositioned to follow specific features. This is not computed automatically, but must be changed by altering the coordinates above.
Exercise 1 looked at the synoptic development up to the 20-Sept-2012. This exercise looks at the ECMWF HRES forecast from this date and how the IFS model developed the interaction between Hurricane Nadine and the cut-off low.
Enter the folder 'HRES_forecast'
in the 'openifs_2018'
folder to begin.
The ECMWF operational deterministic forecast is called HRES. At the time of this case study, the model ran with a spectral resolution of T1279, equivalent to 16km grid spacing.
Only a single forecast is run at this resolution as the computational resources required are demanding. The ensemble forecasts are run at a lower resolution.
Before looking at the ensemble forecasts, first understand the performance of the operational HRES forecast of the time.
Data is provided for a single 5 day forecast starting from 20th Sept 2012, as used in the paper by Pantillon et al. referred to at the start of this tutorial.
Data is provided at the same resolution as the operational model, in order to give the best representation of the Hurricane and cut-off low iterations. This may mean that some plotting will be slow.
A new parameter is total precipitation : tp.
The parameters available in the analyses are also available in the forecast data.
For this exercise, you will use the metview icons in the folder ' hres_1x1.mv & |
From 20th to 25th September, Nadine had lost hurricane intensity and was slowly moving over the North Atlantic. A short-wave trough separated into a cutoff and approached from the north. Nadine interacted with a cold front associated with the cutoff on 21st September.
Study the interaction of the systems and their subsequent development.
First plot the geopotential at 500hPa and MSLP (as in figure 1 in Pantillon et al.) using the hres_1x1.mv
macro script:
plot1=["z500.s","mslp"] |
Using either the hres_1x1.mv
or hres_2x2.mv
macro plot some of these other maps to study the synoptic development. Note for example that there should not be a strong horizontal temperature gradient around Nadine (why?) and the winds should be stronger for Nadine than the cutoff.
|
It would be instructive to view a cross-section through Nadine and the cutoff. Choose from the following (note the cross-section macro hres_xs.mv
uses slightly different names for the parameters)
|
You do not need to plot all of these maps. Focus on what interests you and prepare some plots for group discussion.
For help on how to save images, see the beginning of this tutorial.
Q. How strongly does Nadine appear to interact with the cutoff? |
Choose a hres macro to use, plot the total precipitation (parameter: tp), near surface wind field (parameter: wind10), relative humidity (parameter: r) (and any other parameters of interest).
The macros hres_1x1.mv, hres_2x2.mv and hres_xs.mv can be used to plot and animate fields or overlays of fields from the HRES forecast.
Q. What do you think about the quality of this forecast? And why? |
In this case study, there are two operational ensemble datasets.
An ensemble forecast consists of:
ens_oper: This dataset is the operational ensemble from 2012 and was used in the Pantillon et al. publication. A key feature of this ensemble is use of a climatological SST field (you should have seen this in the earlier tasks!).
ens_2016: This dataset is a reforecast of the 2012 event using the ECMWF operational ensemble of March 2016. Two key differences between the 2016 and 2012 operational ensembles are: higher horizontal resolution, and coupling of NEMO ocean model to provide SST from the start of the forecast.
The analysis was not rerun for 20-Sept-2012. This means the reforecast using the 2016 ensemble will be using the original 2012 analyses. Also only 10 ensemble data assimilation (EDA) members were used in 2012, whereas 25 are in use for 2016 operational ensembles, so each EDA member will be used multiple times for this reforecast. This will impact on the spread and clustering seen in the tasks in this exercise.
Visualising ensemble forecasts can be done in various ways. During this exercise and the following exercises a number of visualisation techniques will be used in order to understand the errors and uncertainties in the forecast,
Key parameters: MSLP and z500. We suggest concentrating on viewing these fields. If time, visualize other parameters (e.g. PV320K).
For these exercises please use the Metview icons in the row labelled 'ENS'. ens_rmse.mv : this is similar to the hres_rmse.mv in the previous exercise. It will plot the root-mean-square-error growth for the ensemble forecasts. ens_to_an.mv : this will plot (a) the mean of the ensemble forecast, (b) the ensemble spread, (c) the HRES deterministic forecast and (d) the analysis for the same date. ens_to_an_runs_spag.mv : this plots a 'spaghetti map' for a given parameter for the ensemble forecasts compared to the analysis. Another way of visualizing ensemble spread. stamp.mv : this plots all of the ensemble forecasts for a particular field and lead time. Each forecast is shown in a stamp sized map. Very useful for a quick visual inspection of each ensemble forecast. stamp_diff.mv : similar to stamp.mv except that for each forecast it plots a difference map from the analysis. Very useful for quick visual inspection of the forecast differences of each ensemble forecast.
Additional plots for further analysis: pf_to_cf_diff.mv : this useful macro allows two individual ensemble forecasts to be compared to the control forecast. As well as plotting the forecasts from the members, it also shows a difference map for each. ens_to_an_diff.mv : this will plot the difference between the ensemble control, ensemble mean or an individual ensemble member and the analysis for a given parameter. |
If working in groups, each group could follow the tasks below with a different ensemble forecast. e.g. one group uses the 'ens_oper', another group uses 'ens_2016' and so on.
Choose your ensemble dataset by setting the value of 'expId', either 'ens_oper' or 'ens_2016' for this exercise.
One of the OpenIFS ensembles could also be used but it's recommended one of the operational ensembles is studied first.
#The experiment. Possible values are: # ens_oper = operational ENS # ens_2016 = 2016 operational ENS expId="ens_oper" |
In these tasks, the performance of the ensemble forecast is studied.
Q. How does the ensemble mean MSLP and Z500 fields compare to the HRES forecast and analysis? |
This is similar to task 1 in exercise 2, except the RMSE curves for all the ensemble members from a particular forecast will be plotted.
Right-click the ens_rmse.mv icon, select 'Edit' and plot the curves for 'mslp' and 'z500'.
Change 'expID' for your choice of ensemble.
clustersId="off" |
Clustering will be used in later tasks.
Q. How do the HRES, ensemble control forecast and ensemble mean compare? |
There might be some evidence of clustering in the ensemble plumes.
There might be some individual forecasts that give a lower RMS error than the control forecast.
If time:
In the previous task, uncertainty in the forecast by starting from different initial conditions and the stochastic parameterizations can result in significant differences in the RMSE (for this particular case and geographical region).
This task will explore the difference in another way by looking at the 'ensemble spread'.
Use the ens_to_an.mv icon and plot the MSLP and z500. This will produce plots showing: the mean of all the ensemble forecasts, the spread of the ensemble forecasts, the operational HRES deterministic forecast and the analysis.
Change 'expId' if required.
Animate this plot to see how the spread grows.
This macro can also be used to look at clusters of ensemble members. It will be used later in the clustering tasks. For this task, make sure all the members of the ensemble are used.
#ENS members (use ["all"] or a list of members like [1,2,3] members=["all"] #[1,2,3,4,5] or ["all"] or ["cl.example.1"] |
Q. How does the mean of the ensemble forecasts compare to the HRES & analysis? |
A "spaghetti" plot is where a single contour of a parameter is plotted for all ensemble members. It is another way of visualizing the differences between the ensemble members and focussing on features.
Use the ens_to_an_runs_spag.mv icon. Plot and animate the MSLP and z500 fields using your suitable choice for the contour level. Find a value that highlights the low pressure centres. Note that not all members may reach the low pressure set by the contour.
The red contour line shows the control forecast of the ensemble.
Note that this macro may animate slowly because of the computations required.
Experiment with changing the contour value and (if time) plotting other fields.
So far we have been looking at reducing the information in some way to visualise the ensemble.
Stamp maps are used to visualise all the ensemble members as normal maps. These are small, stamp sized contour maps plotted for each ensemble member using a small set of contours.
There are two icons to use, stamp.mv and stamp_diff.mv.
Use stamp.mv to plot the MSLP and z500 fields in the ensemble.
The stamp map is slow to plot as it reads a lot of data. Rather than animate each forecast step, a particular date can be set by changing the 'steps' variable.
#Define forecast steps steps=[2012-09-24 00:00,"to",2012-09-24 00:00,"by",6] |
Make sure clustersId="off
" for this task.
Use stamp.mv and plot total precipitation ('tp') over France (mapType=2) for 00Z 24-09-2012 (compare with Figure 2 in Pantillon).
Note, stamp_diff.mv cannot be used for 'tp' as there is no precipitation data in the analyses.
Use the stamp_diff.mv plot to look at the differences between the ensemble members and the analysis. It can be easier to understand the difference in the ensembles by using difference stamp maps.
Q. Using the stamp and stamp difference maps, study the ensemble. Identify which ensembles produce "better" forecasts. |
After visualizing the stamp maps, it can be useful to animate a comparison of individual ensemble members to the analyses.
ens_to_an_diff.mv and pf_to_cf_diff.mv can be used to compare ensemble members.
To animate the difference in MSLP of an individual ensemble member 30 to the analysis, edit the lines:
To compare the control forecast:
|
This will show the forecasts from the ensemble members and also their difference with the ensemble control forecast. To animate the difference in MSLP with ensemble members '30' and '50', set:
|
Compare the SST parameter used for the ens_oper and ens_2016 ensemble forecasts. The 2016 reforecast of this case study used a coupled ocean model unlike the 2012 ensemble and HRES forecast that used climatology for the first 5 days. |
To show a cross-section of a particular ensemble member, use the macro 'ens_xs.mv'. This works in the same way as the an_xs.mv and hres_xs.mv macros. |
Find ensemble members that appear to produce a better forecast and look to see how the initial development in these members differs.
|
The probability distribution function of the normal distribution or Gaussian distribution. The probabilities expressed as a percentage for various widths of standard deviations (σ) represent the area under the curve. |
---|
Figure from Wikipedia. |
Cumulative distribution function for a normal |
---|
Figure from Wikipedia. |
The figures above illustrate the relationship between a normal distribution and its associated cumulative distribution function. The CDF is constructed from the area under the probability density function.
The CDF gives the probability that a value on the curve will be found to have a value less than or equal to the corresponding value on the x-axis. For example, in the figure, the probability for values less than or equal to X=0 is 50%.
The shape of the CDF curve is related to the shape of the normal distribution. The width of the CDF curve is directly related to the value of the standard deviation of the probability distribution function.
For an ensemble, the width is therefore related to the 'ensemble spread'.
For a forecast ensemble where all values were the same, the CDF would be a vertical straight line.
This exercise uses the cdf.mv icon. Right-click, select 'Edit' and then:
Note that only MSLP, 2m temperature (t2) and 10m wind-speed (speed10) are available for the CDF.
Make sure useClusters='off'.
Q. Compare the CDF from the different forecast ensembles; what can you say about the spread? |
The paper by Pantillon et al, describes the use of clustering to identify the main scenarios among the ensemble members.
This exercise repeats some of the plots from the previous one but this time with clustering enabled.
Using clustering will highlight the ensemble members in each cluster in the plots.
In this exercise you will:
Clusters can be created manually from lists of the ensemble members.
Choose members for two clusters. The stamp maps are useful for this task.
From the stamp map of z500 at 24/9/2012 (t+96), identify ensemble members that represent the two most likely forecast scenarios.
It is usual to create clusters from z500 as it represents the large-scale flow and is not a noisy field. However, for this particular case study, the stamp map of 'tp' (total precipitation) over France is also very indicative of the distinct forecast scenarios.
Right-click 'ens_oper_cluster.example.txt' and select Edit (or make a duplicate) The file contains two example lines:
The first line defines the list of members for 'Cluster 1': in this example, members 2, 3, 4, 9, 22, 33, 40. The second line defines the list of members for 'Cluster 2': in this example, members 10, 11, 12, 31, 49. Change these two lines!. You can create multiple cluster definitions by using the 'Duplicate' menu option to make copies of the file for use in the plotting macros.. The filename is important! |
Use the clusters of ensemble members you have created in Set Replot ensembles:RMSE: plot the RMSE curves using Stamp maps: the stamp maps will be reordered such at the ensemble members will be groups according to their cluster. Applies to Spaghetti maps: with clusters enabled, two additional maps are produced which show the contour lines for each cluster. The spaghetti maps are similar to Figure 10. in Pantillon et al. |
The macro Use If your cluster definition file is called 'ens_oper_cluster.example.txt', then Edit
If your cluster definition file is has another name, e.g. ens_oper_cluster.fred.txt, then members_1=["cl.fred.1"]. Plot other parameters:Plot total precipitation for France ( |
Q. Experiment with the choice of members in each clusters and plot z500 at t+96 (Figure 7 in Pantillon et al.). How similar are your cluster maps? |
A quantitative way of clustering an ensemble is by computing empirical orthogonal functions from the differences between the ensemble members and the control forecast.
Although geopotential height at 500hPa at 00 24/9/2012 is used in the paper by Pantillon et al., the steps described below can be used for any parameter at any step.
The eof.mv
macro computes the EOFs and the clustering.
Always use the Otherwise cluster_to_an.mv and other plots with clustering enabled will fail or plot with the wrong clustering of ensemble members. If you change step or ensemble, recompute the EOFS and cluster definitions using eof.mv. Note however, that once a cluster has been computed, it can be used for all steps with any parameter. |
Edit 'eof.mv' Set the parameter to use, choice of ensemble and forecast step required for the EOF computation:
Run the macro. The above example will compute the EOFs of geopotential height anomaly at 500hPa using the 2012 operational ensemble at forecast step 00Z on 24/09/2012. A plot will appear showing the first two EOFs (similar to Figure 5 in Pantillon et al.) The geographical area for the EOF computation is: 35-55N, 10W-20E (same as in Pantillon et al). If desired it can be changed in |
The eof.mv macro will create a text file with the cluster definitions, in the same format as described above in the previous task. The filename will be different, it will have 'eof' in the filename to indicate it was created by using empirical orthogonal functions.
If a different ensemble forecast is used, for example This cluster definition file can then be used to plot any variable at all steps (as for task 1). |
Q. What do the EOFs plotted by eof.mv show? |
Use the cluster definition file computed by eof.mv to the plot ensembles and maps with clusters enabled (as described for task 1, but this time with the 'eof' cluster file). The macro Use Edit
Run the macro. If time also look at the total precipitation (tp) over France and PV/320K. |
Q. How similar is the PCA computed clusters to your manual clustering? |
For those interested: The code that computes the clusters can be found in the Python script: This uses the 'ward' cluster method from SciPy. Other cluster algorithms are available. See http://docs.scipy.org/doc/scipy/reference/generated/scipy.cluster.hierarchy.linkage.html#scipy.cluster.hierarchy.linkage The python code can be changed to a different algorithm or the more adventurous can write their own cluster algorithm! |
To further compare the 2012 and 2016 ensemble forecasts, plots showing the percentile amount and probabilities above a threshold can be made for total precipitation.
Use these icons:
Both these macros will use the 6-hourly total precipitation for forecast steps at 90, 96 and 102 hours, plotted over France.
Edit the percentile_tp_compare.mv
icon.
Set the percentile for the total precipitation to 75%:
#The percentile of ENS precipitation forecast perc=75 |
Run the macro and compare the percentiles from both the forecasts. Change the percentiles to see how the forecasts differ.
This macro will produce maps showing the probability of 6-hourly precipitation for the same area as in Task 1.
In this case, the maps show the probability that total precipitation exceeds a threshold expressed in mm.
Edit the prob_tp_compare.mv
and set the probability to 20mm:
#The probability of precipitation greater than prob=20 |
Run the macro and view the map. Try changing the threshold value and run.
Q. Using these two macros, compare the 2012 and 2016 forecast ensemble. Which was the better forecast for HyMEX flight planning? |
In this exercise, various methods for presenting the forecast error are presented.
hres_rmse.mv : this plots the root-mean-square-error growth curves for the operational HRES forecast compared to the ECMWF analyses. hres_to_an_diff.mv : this plots a single parameter as a difference map between the operational HRES forecast and the ECMWF analysis. Use this to understand the forecast errors. |
In this task, we'll look at the difference between the forecast and the analysis by using "root-mean-square error" (RMSE) curves as a way of summarising the performance of the forecast.
Root-mean square error curves are a standard measure to determine forecast error compared to the analysis and several of the exercises will use them. The RMSE is computed by taking the square-root of the mean of the forecast difference between the HRES and analyses. RMSE of the 500hPa geopotential is a standard measure for assessing forecast model performance at ECMWF (for more information see: http://www.ecmwf.int/en/forecasts/quality-our-forecasts).
Right-click the hres_rmse.mv icon, select 'Edit' and plot the RMSE curve for z500.
Repeat for the mean-sea-level pressure mslp.
Repeat for both geographical regions: mapType=1 (Atlantic) and mapType=2 (France).
Q. What do the RMSE curves show? |
Use the hres_to_an_diff.mv icon and plot the difference map between the HRES forecast and the analysis for z500 and mslp.
Q. What differences can be seen? |
If time: look at other fields to study the behaviour of the forecast.
For more information on the stochastic physics scheme in (Open)IFS, see the article:
Shutts et al, 2011, ECMWF Newsletter 129.
We gratefully acknowledge the following for their contributions in preparing these exercises. From ECMWF: Glenn Carver, Sandor Kertesz, Linus Magnusson, Iain Russell, Simon Lang, Filip Vana. From ENM/Meteo-France: Frédéric Ferry, Etienne Chabot, David Pollack and Thierry Barthet for IT support at ENM.