Common metrics of calibration for continuous Gaussian data and
exceedance probabilities

Glowienka-Hense, Rita; Hense, Andreas; Spangehl, Thomas; Schröder, Marc

doi:https://doi.org/10.5194/gmd-2018-141

Preprints

https://doi.org/10.5194/gmd-2018-141

Preprints

Submitted as: methods for assessment of models

11 Oct 2018

Submitted as: methods for assessment of models |

| 11 Oct 2018

Status: this preprint was under review for the journal GMD but the revision was not accepted.

Common metrics of calibration for continuous Gaussian data and exceedance probabilities

Rita Glowienka-Hense, Andreas Hense, Thomas Spangehl, and Marc Schröder

Abstract. A framework of ensemble forecast verification tools is discussed which is founded on the concept of information entropy. It can be based on a common yardstick namely that of "correlation". With these measures calibration is deduced from the balance between ensemble sharpness and resolution. With the same units these features can be put into one diagram for continuous time series from Gaussian processes and exceedance probabilities, the latter usually tested with the reliability term from the Brier score. The sharpness and resolution terms allow to use the same vocabulary of over- and underdispersion which is established for frequency histograms. The concept is based on the fact that mutual information (MI) of two Gaussian processes is directly related to Pearson's anomaly correlation. Further MI can be written as the Kullback-Leibler divergence of the conditional probability of observations given the model forecasts and the unconditioned observations. Thus the MI is a measure of resolution. The mean of the UTILITY defined by (Kleeman, 2002) is the corresponding measure of sharpness. For Gaussian processes the mean UTILITY is very close to the ratio of ensemble mean variance to mean ensemble variance (ANOVA) which is the analysis of variance factor when time is taken as treatment. The ensemble spread score (ESS) (Palmer et al., 2006) is shown to be a measure of calibration if model and observed data are scaled with their respective means and standard deviations. For exceedance probabilities the resolution term of the divergence score (Weijs et al., 2010) is already defined as a MI term and it is here complemented with a mean UTILITY formed similarly to the resolution term but with forecasts only. The entropy terms are then rescaled to the "correlation" yardstick. The concept is applied to temperature data from the German project on decadal climate prediction, Mittelfristige Klimaprognose (MiKlip). It is shown that both over – and underdispersion can be found for the 2m temperature forecasts. Increasing ensemble sharpness of surface ocean temperature with lead year in the southern ocean hints at model-data inconsistencies at some locations in the ocean. Finally empirical orthogonal functions (EOF) of northern hemisphere annual mean surface temperature for ERA-40/ERA-Interim and MiKlip retrospective hindcasts are determined. For both data sets the respective first EOF represents the low frequency temperature development. The time coefficients of the EOF are used to compare resolution and sharpness of continuous data and exceedance probabilities in one diagram.

Received: 11 Jun 2018 – Discussion started: 11 Oct 2018

Download & links

Rita Glowienka-Hense, Andreas Hense, Thomas Spangehl, and Marc Schröder

Status: closed

AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment

- Printer-friendly version

- Supplement

RC1: 'Comments on `Common metrics of calibration for continuous Gaussian data and exceedance probabilities'', Anonymous Referee #1, 16 Nov 2018
- AC1: 'Comment to reviewer 1', Rita Glowienka-Hense, 19 Dec 2018
RC2: 'See attached report', Anonymous Referee #2, 23 Nov 2018
- AC2: 'Answers to reviewer 2', Rita Glowienka-Hense, 19 Dec 2018

Status: closed

AC: Author comment | RC: Referee comment | SC: Short comment | EC: Editor comment

- Printer-friendly version

- Supplement

RC1: 'Comments on `Common metrics of calibration for continuous Gaussian data and exceedance probabilities'', Anonymous Referee #1, 16 Nov 2018
- AC1: 'Comment to reviewer 1', Rita Glowienka-Hense, 19 Dec 2018
RC2: 'See attached report', Anonymous Referee #2, 23 Nov 2018
- AC2: 'Answers to reviewer 2', Rita Glowienka-Hense, 19 Dec 2018

Rita Glowienka-Hense, Andreas Hense, Thomas Spangehl, and Marc Schröder

Viewed

Total article views: 1,604 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
1,040	481	83	1,604	73	60

HTML: 1,040
PDF: 481
XML: 83
Total: 1,604
BibTeX: 73
EndNote: 60

Views and downloads (calculated since 11 Oct 2018)

Month	HTML	PDF	XML	Total
Oct 2018	157	30	3	190
Nov 2018	34	12	1	47
Dec 2018	13	7	1	21
Jan 2019	10	6	0	16
Feb 2019	13	2	0	15
Mar 2019	17	7	0	24
Apr 2019	10	5	1	16
May 2019	8	6	0	14
Jun 2019	15	5	0	20
Jul 2019	3	5	0	8
Aug 2019	5	4	0	9
Sep 2019	10	5	0	15
Oct 2019	12	6	1	19
Nov 2019	11	6	0	17
Dec 2019	5	9	0	14
Jan 2020	23	6	0	29
Feb 2020	12	3	2	17
Mar 2020	2	6	0	8
Apr 2020	10	3	0	13
May 2020	13	6	0	19
Jun 2020	11	10	2	23
Jul 2020	41	28	28	97
Aug 2020	8	8	0	16
Sep 2020	17	9	0	26
Oct 2020	31	19	2	52
Nov 2020	19	19	1	39
Dec 2020	24	17	0	41
Jan 2021	27	25	0	52
Feb 2021	20	18	0	38
Mar 2021	21	17	0	38
Apr 2021	22	14	0	36
May 2021	15	12	0	27
Jun 2021	12	3	0	15
Jul 2021	14	6	0	20
Aug 2021	15	2	0	17
Sep 2021	6	2	0	8
Oct 2021	14	11	0	25
Nov 2021	31	19	1	51
Dec 2021	18	6	6	30
Jan 2022	17	8	4	29
Feb 2022	24	5	2	31
Mar 2022	10	4	1	15
Apr 2022	9	0	9
May 2022	11	8	2	21
Jun 2022	2	1	3
Jul 2022	7	4	1	12
Aug 2022	8	5	1	14
Sep 2022	17	4	0	21
Oct 2022	7	4	0	11
Nov 2022	10	6	0	16
Dec 2022	12	2	2	16
Jan 2023	16	4	2	22
Feb 2023	10	2	1	13
Mar 2023	9	2	1	12
Apr 2023	5	2	0	7
May 2023	5	3	2	10
Jun 2023	9	2	2	13
Jul 2023	8	3	2	13
Aug 2023	7	2	0	9
Sep 2023	9	2	1	12
Oct 2023	11	2	0	13
Nov 2023	5	0	5
Dec 2023	14	2	1	17
Jan 2024	12	1	13
Feb 2024	13	5	2	20
Mar 2024	11	7	4	22
Apr 2024	13	9	1	23

Cumulative views and downloads (calculated since 11 Oct 2018)

Month	HTML	PDF	XML	Total
Oct 2018	157	30	3	190
Nov 2018	34	12	1	47
Dec 2018	13	7	1	21
Jan 2019	10	6	0	16
Feb 2019	13	2	0	15
Mar 2019	17	7	0	24
Apr 2019	10	5	1	16
May 2019	8	6	0	14
Jun 2019	15	5	0	20
Jul 2019	3	5	0	8
Aug 2019	5	4	0	9
Sep 2019	10	5	0	15
Oct 2019	12	6	1	19
Nov 2019	11	6	0	17
Dec 2019	5	9	0	14
Jan 2020	23	6	0	29
Feb 2020	12	3	2	17
Mar 2020	2	6	0	8
Apr 2020	10	3	0	13
May 2020	13	6	0	19
Jun 2020	11	10	2	23
Jul 2020	41	28	28	97
Aug 2020	8	8	0	16
Sep 2020	17	9	0	26
Oct 2020	31	19	2	52
Nov 2020	19	19	1	39
Dec 2020	24	17	0	41
Jan 2021	27	25	0	52
Feb 2021	20	18	0	38
Mar 2021	21	17	0	38
Apr 2021	22	14	0	36
May 2021	15	12	0	27
Jun 2021	12	3	0	15
Jul 2021	14	6	0	20
Aug 2021	15	2	0	17
Sep 2021	6	2	0	8
Oct 2021	14	11	0	25
Nov 2021	31	19	1	51
Dec 2021	18	6	6	30
Jan 2022	17	8	4	29
Feb 2022	24	5	2	31
Mar 2022	10	4	1	15
Apr 2022	9	0	9
May 2022	11	8	2	21
Jun 2022	2	1	3
Jul 2022	7	4	1	12
Aug 2022	8	5	1	14
Sep 2022	17	4	0	21
Oct 2022	7	4	0	11
Nov 2022	10	6	0	16
Dec 2022	12	2	2	16
Jan 2023	16	4	2	22
Feb 2023	10	2	1	13
Mar 2023	9	2	1	12
Apr 2023	5	2	0	7
May 2023	5	3	2	10
Jun 2023	9	2	2	13
Jul 2023	8	3	2	13
Aug 2023	7	2	0	9
Sep 2023	9	2	1	12
Oct 2023	11	2	0	13
Nov 2023	5	0	5
Dec 2023	14	2	1	17
Jan 2024	12	1	13
Feb 2024	13	5	2	20
Mar 2024	11	7	4	22
Apr 2024	13	9	1	23

Viewed (geographical distribution)

Total article views: 1,430 (including HTML, PDF, and XML) Thereof 1,429 with geography defined and 1 with unknown origin.

Country	#	Views	%

Latest update: 18 Apr 2024

Short summary

Ensemble forecast verification treats the issues of forecast errors and uncertainty estimated from ensemble spread. We suggest measures based on relative entropy. For continuous variables correlation and the mean ratio of the ensemble spread to climate variance (analysis of variance (anova)) are related to these entropies. For categorical data corresponding scores are deduced that allow the comparison with continuous data.


Total:	0
HTML:	0
PDF:	0
XML:	0