You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 15 Next »

Introduction

(include an explanation about previous conventions.... SPECS, CF, ACDD)

include link to standard names, and list of variables...

Encoding Guide

Global attributes

The following properties are intended to provide information about where the data came from and what has been done to it. This information is mainly for the benefit of human readers and data discovery mechanisms. The attribute values are all character strings. When an attribute appears both globally and as a variable attribute, the variable’s version has precedence.

 

Attribute NameValueExamplesComment
ConventionsCF convention string  [Other convention] :..."CF-1.6"
"CF-1.6 C3S-0.1"

Multiple conventions may be included (separated by blank spaces)

title

A controlled vocabulary will be provided

CF: Free text

ACDD (highly recommended)

"IPSL-CM5A-LR model output prepared for CMIP5 RCP4.5"
A short phrase or sentence describing the dataset. In many discovery systems, the title will be displayed in the results list from a search, and therefore should be human readable and reasonable to display in a list of such names
references

URIs (such as a URL or DOI) for papers or other references. A valid doi is recommended

CF: Free text

"doi:10.5194/gmd-8-1509-2015"
Published or web-based references that describe the data or methods used to produce it.
sourceA methodology to build this attribute will be provided
  • "world model v.0.1"
  • "model-generated, GloSea5-GC2"
  • "IPSL-CM5A-LR (2010) : atmos : LMDZ4 (LMDZ4_v5, 96x95x39); ocean : ORCA2 (NEMOV2_3, 2x2L31); seaIce : LIM2 (NEMOV2_3); ocnBgchem : PISCES (NEMOV2_3); land : ORCHIDEE (orchidee_1_9_4_AR5)"

The method of production of the original data. If it was model-generated, source should name the model and its version, as specifically as it could be useful

institution

A controlled vocabulary will be provided

CF: Free text

 

"Met Office"

Specifies where the original data was produced. The name of the institution principally responsible for originating this data.

contact

Copernicus User Support URI should be used
http://copernicus-support.ecmwf.int

CF: Free text

"http://copernicus-support.ecmwf.int"

 

project

"C3S Seasonal Forecast" should be used

CF: Free text

 

"C3S Seasonal Forecast"

 

creation_date

SPECS: YYYY-MM-DDThh:mm:ss<zone>

ISO 8601:2004 extended format

"2011-06-24T02:53:46Z"

NOTE: The ACDD 1.3 names this attribute as date_create
SPECS conventions will be followed
 

commentFree text
  • "Produced by University of Hamburg for DWD at ECMWF HPC facilities"
  • "Run by CMCC at CINECA"
forecast_type

"forecast" or "hindcast"

"forecast"To identify the type of data
history

Each line should begin with a timestamp indicating the date and time of day when the program was executed

CF: Free Text

  • "Produced using CDS Toolbox on 1/6/2016"
  • "Model raw output postprocessing with modelling environment (IMDI) at DKRZ: URL: http://svn-mad.zmaw.de/svn/mad/Model/IMDI/trunk, REV: 3436 2011-07-17T15:14:45Z CMOR rewrote data to comply with CF standards and CMIP5 requirements."

To record relevant information, such as the command history which led to this file being produced. Provides an audit trail for modifications to the original data.
Well-behaved generic netCDF filters will automatically append their name and the parameters with which they were invoked to the global history attribute of an input netCDF file

 

commit,

iso_lineage or lineage

Free text (ISO Lineage model 19115-2)"Produced using CDS Toolbox v1.0"

trace of the tools/scripts used.

Paco: include information about the versioning of the software used to create the data

Antonio S. Cofino Gonzalez: We need a more implementtios examples on this. This could achiived in EQC WP where metadata is been part of their activities (i.e. WP4@QA4SEAS). ISO 19115-2 defines a linage model where this is been considered. TBD.

summary

The content will be provided

ACDD (highly recommended): Text, defined phrase

 A short paragraph describing the dataset
keywords

The content will be provided

ACDD (highly recommended) : text, controlled vocabulary

 A comma separated list of key words and phrases.
forecast_reference_time

SPECS: YYYY-MM-DDThh:mm:ssZ

NOTE: This is ISO 8601:2004 extended format, but time zone is required to be UTC

"2011-06-01T00:00:00Z"

time of the analysis from which the forecast was made
Introduced as global attribute to keep compatibility with SPECS
NOTE: This works fine for SPECS data structure, i.e. one variable per start time per file

Spatial Coordinates

Type
(CMIP5)
Coordinate Name
(CMIP5)
Dimension Names
(CMIP5)
Axisstandard_namelong_name
(CMIP5)
units
(CF canonical units)
positivevalid_min
(CMIP5)
valid_max
(CMIP5)
boundsNotes
doublelatlatYlatitudelatitudedegrees_northN/A-90.90.lat_bounds

Values (1x1deg grid) prescribed:
dimension lat=181

[-90. , -89. , ..., 0., ... 90.]

doublelonlonXlongitudelongitudedegrees_eastN/A0.360.lon_boundsValues (1x1deg grid) prescribed:
dimension lon=360

[0. , 1. , ..., 358., 359.]

doubleplevplevZair_pressurepressure

Pa

downN/AN/Abounds?

This is also referred to as isobaric level by some tools
Values prescribed:
dimension plev=11

[925., 850., 700., 500., 400., 300., 200., 100., 50., 30., 10.]
(NOTE: values here written in hPa, they should be Pa)

doubledepthdepthZdepthdepth

m

downN/AN/Adepth_boundsOnly used for soil model levels
NOTE: Number and depth of levels is not prescribed by C3S
doubleheightheightZheightheightmup or down

CMIP5:

2mtemp: 1.
10mu/v: 1.

CMIP5:

2mtemp: 10.
10mu/v: 30.

 

Used for single level fields (height, soil,SST)

e.g. 2 m (for Temperature)

C3S: string

 

 realization

C3S: realization_dim

CF: a different name is needed for dim/variable
(see comment)


E realizationrealization1N/AN/AN/A members are not a physical quantity. Realization is a discrete coordinate and the mebers it categorical values (ordered or non-ordered ones)

Time Coordinates

TypeCoordinate NameDimension NamesAxisstandard_namelong_namecalendarunitsboundsNotes
doublereftimeN/ATforecast_reference_time"Start date of the forecast"gregorianUDUNITS time units
e.g.
"hours since YYYY-MM-DD hh:mm:ss TZhh:TZmm"
bounds?In SPECS it was a "global_attribute"
It has been additionally introduced here as a coordinate variable to ease future netCDF management (e.g. file merging)
doubleleadtimetimeN/Aforecast_period"Time elapsed since the start of the forecast"N/A

SPECS: days
C3S: requested units can be relaxed to equivalent time units

bounds?

The interval of time between the forecast reference time and the valid time

doubletimetimeT

time

"Verification time of the forecast"gregorian

SPECS: "days since 1850-01-01"

C3S: requested units can be relaxed to equivalent time units

time_bounds

Time for which the forecast is valid


NOTE: Definitions for "leadtime" and "time" have been taken from SPECS. The introduction of "reftime" as a variable has been adapted from SPECS global attribute description for the forecast reference time.

NOTE: Even though there are different requested time steps among the variables (6h, 12h, 24h), just one set of time axes has been defined, as that would be enough when applying the requirement of "one variable per file"

Cell boundaries

 As described in section 7.1 Cell Boundaries of CF convention.

To represent cells we add the attribute bounds to the appropriate coordinate variable(s). The value of bounds is the name of the variable that contains the vertices of the cell boundaries. We refer to this type of variable as a "boundary variable." A boundary variable will have one more dimension than its associated coordinate or auxiliary coordinate variable. The additional dimension should be the most rapidly varying one, and its size is the maximum number of cell vertices. Since a boundary variable is considered to be part of a coordinate variable’s metadata, it is not necessary to provide it with attributes such as long_name and units

Bounds NameDimensionsComments
time_boundstime,bounds
  • where to put the time coordinate (beginning, middle, end of the bounds) ?

e.g.
time=20160922 06
timebounds = [20160922 00, 20160922 06]

  • for 24h time steps:

[0,24] is that convention always valid?

lat_boundslat, boundsValues (1x1deg grid) prescribed:
[-90., 89.], [-89., -88.], ... [89., 90.]
lon_boundslon, bounds

Values (1x1deg grid) prescribed:

[0., 1.], [1., 2.], ... [359., 360.]

depth_boundsdepth,boundsShould define the full vertical extent of the soil model layers

Grid mapping

 As described in section 5.6 Grid Mappings and Projections of CF convention.

When the coordinate variables for a horizontal grid are longitude and latitude, a grid mapping variable with grid_mapping_name of latitude_longitude may be used to specify the ellipsoid and prime meridian.


char latitude_longitude ;
    latitude_longitude:grid_mapping_name = "latitude_longitude" ;

NOTE: Wouldn't a different name be more appropriate? (e.g. CRS as in CF examples)

Variables

NOTE: coordinates should list first of all the auxiliary coordinate(s) and then all the other coordinates.... SHOULD reftime and leadtime be included as well????

NOTE: type double/real????

Static Fields

  attributes 
name
(CMIP5)
dimensionsstandard_namelong_name
(CMIP5)
unitscoordinatesgrid_mappingNOTES
sftlflat,lonland_area_fraction"Land Area Fraction"1

"lat lon"

 

 

latitude_longitude


oroglat,lonsurface_altitude"Surface Altitude"m"lat lon"latitude_longitude 

Surface Fields (defined at a given height level)

  attributes 
name
(CMIP5)
dimensionsstandard_namelong_name
(CMIP5)
unitscoordinatescell_methodsgrid_mappingNOTES
tastime,lat,lonair_temperature"Near-Surface Air Temperature"K

"height time lat lon"

 

 

"time: point"

C3S: required 
CF:
recommended


latitude_longitude

height is usually 2m

tasmaxtime,lat,lonair_temperature"Daily Maximum Near-Surface Air Temperature"K

"height time lat lon"

"time: maximum (interval: <value> <unit>)"

C3S: required.
CF:
interval is optional

latitude_longitude

height is usually 2m

C3S: The interval is required to have a value<=3 hours)

tasmintime,lat,lonair_temperature"Daily Minimum Near-Surface Air Temperature"K

"height time lat lon"

"time: minimum (interval: <value> <unit>)"

C3S: required.
CF:
interval is optional
latitude_longitude

height is usually 2m

C3S: The interval is required to have a value<=3 hours)

 time,lat,londew_point_temperature K"height time lat lon""time: point"

C3S: required 
CF:
recommended
latitude_longitude

height is usually 2m

 

uastime,lat,lonx_windEastward Near-Surface Windm s-1"height time lat lon""time: point"

C3S: required 
CF:
recommended
latitude_longitude

height is usually 10m

vastime,lat,lony_windNorthward Near-Surface Windm s-1"height time lat lon""time: point"

C3S: required 
CF:
recommended
latitude_longitude

height is usually 10m

 time,lat,lonwind_speed_of_gust m s-1"height time lat lon" "time: maximum (interval: <value> <unit>)"
C3S: required.
CF:
interval is optional
 
latitude_longitude

height is usually 10m

C3S: The interval is required to have a value<=3 hours)

 

Surface Fields (not defined at a height level)

  attributes 
name
(CMIP5)
dimensionsstandard_namelong_name
(CMIP5)
unitscoordinatescell_methodsgrid_mappingNOTES
 time,lat,lonair_pressure_at_sea_level Pa "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,loncloud_area_fraction 1 "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonsoil_temperature K "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonsea_surface_temperature K "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonsea_ice_temperature K "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonsea_ice_area_fraction 1 "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,depth,lat,lonmass_content_of_water_in_soil_layer kg m-2 "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonlwe_thickness_of_surface_snow_amount m "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonsnow_density kg m-3 "time: point"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonlwe_thickness_of_stratiform_precipitation_amount m "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonlwe_thickness_of_convective_precipitation_amount m "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonlwe_thickness_of_precipitation_amount m "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonlwe_thickness_of_snowfall_amount m "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_surface_downward_sensible_heat_of_flux_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_surface_downward_latent_heat_of_flux_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_surface_downwelling_shortwave_flux_in_air_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_surface_net_downward_shortwave_flux_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_surface_net_downward_longwave_flux_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_toa_net_downward_shortwave_flux_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_toa_net_downward_longwave_flux_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_toa_incoming_shortwave_flux_wrt_time W s m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_surface_downward_eastward_stress_wrt_time Pa s "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonintegral_of_surface_downward_northward_stress_wrt_time Pa s "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonlwe_thickness_of_watert_evaporation_amount m "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonrunoff_amount kg m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonsurface_runoff_amount kg m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 
 time,lat,lonsubsurface_runoff_amount kg m-2 "time: sum"

C3S: required 
CF:
recommended

latitude_longitude 

Pressure Level Fields

  attributes 
name
(CMIP5)
dimensionsstandard_namelong_name
(CMIP5)
unitscoordinatescell_methodsgrid_mappingNOTES
 time,lat,lonair_pressure_at_sea_level Pa "time: point"

C3S: required 
CF:
recommended

latitude_longitude 

Additional Questions to be addressed

QuestionDiscussionDecision
File format to be used?
Francisco Doblas-Reyes NetCDF4? With or without compression?
Kevin Marsh netCDF4 classic model (with deflate =6 suggested by Pierre-Antoine)
 
File naming,

Kevin Marsh Pierre-Antoine Bretonniere proposed  follow SPECS convention

 
forecast/hindcast matching and labelling  
File size recommendation (maximum size)?

Kevin Marsh Pierre-Antoine Bretonniere suggested 4GB recommended maximum size

Kevin Marsh recommend 4GB Max Size for data files

Versioning of data files?  
DOI

Kevin Marsh DOI likely to be assigned at dataset level

Kevin Marsh DOI likely to be assigned at dataset level

Variable short names to be specified?

Kevin Marsh  Antonio S. Cofino Gonzalez suggested follow cmip5 short names

Kevin Marsh follow cmip5 short names

Coordinate short names to be specified?

Kevin Marsh Antonio S. Cofino Gonzalez suggested  follow cmip5 coordinate short names

Kevin Marsh follow cmip5 coordinate short names

Extension to include ocean data for C3S?

Kevin Marsh yes, but not in the initial convention release

Kevin Marsh Not considered in initial release

Grids, resolution etc to be specified?

Kevin Marsh Antonio S. Cofino Gonzalez agreed 1 degree grid specified with valid max/min, but actual grid points not specified

Kevin Marsh 1 degree grid specified with valid max/min, but actual grid points not specified

MARS attributes to be specified?

Kevin Marsh These will be added by C3S, rather than data provider

Kevin Marsh These will be added by C3S

 standard name request/assignment process?

Kevin Marsh requested via standard name mailing list. Note that this process can take some considerable time.

Kevin Marsh requested via standard name mailing list

 

Discussion about time coordinates

NOTE: The SPECS approach (2 1D time coordinates) has been chosen for the "providers" convention

 

The encoding of multiple time coordinates requires particular consideration. An explicit example of the structure is given below.

Example of encoding data with multiple time axis informations

  
double forecast_reference_time(forecast_reference_time) ;
       forecast_reference_time:bounds = "forecast_reference_time_bnds" ;
       forecast_reference_time:units = "hours since 1970-01-01 00:00:00" ;
       forecast_reference_time:standard_name = "forecast_reference_time" ;
       forecast_reference_time:calendar = "gregorian" ;
double leadtime(leadtime) ;
       leadtime:bounds = "leadtime_bnds" ;
       leadtime:units = "hours" ;
       leadtime:standard_name = "forecast_period" ;
       leadtime:calendar = "gregorian" ;
double time(forecast_reference_time,leadtime) ;
       time:axis = "T" ;
       time:bounds = "time_bnds" ;
       time:units = "hours since 1970-01-01 00:00:00" ;
       time:standard_name = "time" ;
float temp(forecast_reference_time,leadtime,pressure,latitude,longitude);
      temp:units = "K";
      temp:standard_name = "air_temperature";
      temp:coordinates = "time";

Francisco Doblas-Reyes I interpret this as the time coordinates being a hypercube, where there could be missing data; this won't be consistent with the CMIP files; I still find this confusing unless a discussion about what to do with the missing data is undertaken.

Eduardo Penabad: Wouldn't that be solved by clarifying that different variables within the same file could potentially have different time coordinates/dimensions?

Francisco Doblas-Reyes Not sure. If to simplify you assume one variable only and this variable has in one file data for two start dates, one with three forecast time steps and another one with only two, the time dimensions will be forecast_reference_time=2, leadtime=3, but one of the values of temp() will have missing values, unless I haven't understood the model.

Antonio S. Cofino Gonzalez: discussion on multi-time dimension data

 

 

 

 

  • No labels