...
The paper by Pantillon et al , describes the use of clustering to identify the main scenarios among the ensemble members.
...
It is usual to create clusters from z500 as it represents the large-scale flow and is not a noisy field. However, for this particular case study, the stamp map of 'tp' (total precipitation) over France is also very indicative of the distinct forecast scenarios. You might also try using other fields, such as 'mslp' to compare.
Panel | ||
---|---|---|
| ||
Right-click 'ens_oper_cluster.example.txt' and select Edit (or make a duplicate) The file contains two example lines:
The first line defines the list of members for 'Cluster 1': in this example, members 2, 3, 4, 9, 22, 33, 40. The second line defines the list of members for 'Cluster 2': in this example, members 10, 11, 12, 31, 49. Change these two lines!. You can create multiple cluster definitions by using the 'Duplicate' menu option to make copies of the file for use in the plotting macros.. The filename is important! |
...
Panel | ||
---|---|---|
| ||
Use the clusters of ensemble members you have created in Set Change Replot ensembles:RMSE: plot the RMSE curves using Stamp maps: the stamp maps will be reordered such at so the ensemble members will be groups grouped according to their cluster. Applies to Spaghetti maps: with clusters enabled, two additional maps are produced which show the contour lines for each cluster. The spaghetti maps are similar to Figure 10. in Pantillon et al. |
Panel | |||||
---|---|---|---|---|---|
| |||||
The macro Use If your cluster definition file is called 'ens_oper_cluster.example.txt', then Edit
If your cluster definition file is has another name, e.g. ens_oper_cluster.fred.txt, then members_1=["cl.fred.1"]. Plot other parameters:Plot total precipitation 'tp' for France ( |
Panel | ||
---|---|---|
| ||
Q. Experiment with the choice of members in each clusters and plot z500 at t+96 (Figure 7 in Pantillon et al.). How similar are your cluster maps? |
...
A quantitative way of clustering an ensemble is by computing a principal component analysis using empirical orthogonal functions. These are computed from the differences between the ensemble members and the control forecastthe ensemble mean, then computing the eigenvalues and eigenfunctions over all the members such that the difference of each member can be expressed as a linear combination of these eigenfuctions, also known as empirical orthogonal functions (EOFs).
Although geopotential height at 500hPa at 00 24/9/2012 is used in the paper by Pantillon et al., the steps described below can be used for any parameter at any step.
...
Panel | |||||||
---|---|---|---|---|---|---|---|
| |||||||
The eof.mv macro will create a text file with the cluster definitions, in the same format as described above in the previous task. The filename will be different, it will have 'eof' in the filename to indicate it was created by using empirical orthogonal functions.
If a different ensemble forecast is used, for example This cluster definition file can then be used to plot any variable at all steps (as for task 1). |
Info |
---|
If you rerun the |
Panel | ||
---|---|---|
| ||
Q. What do the EOFs plotted by eof.mv show? |
Panel | |||||
---|---|---|---|---|---|
| |||||
Use the cluster definition file computed by The macro Use Edit
Run the macro. If time also look at the total precipitation (tp) over France and PV/320K. |
...
Panel | ||
---|---|---|
| ||
For those interested: The code that computes the clusters can be found in the Python script: This uses the 'ward' cluster method from SciPy. Other cluster algorithms are available. See http://docs.scipy.org/doc/scipy/reference/generated/scipy.cluster.hierarchy.linkage.html#scipy.cluster.hierarchy.linkage The python code can be changed to a different algorithm or the more adventurous can write their own cluster algorithm! |
...