Efficient bootstrap estimates for tail statistics

Breivik, Øyvind; Aarnes, Ole Johan

doi:https://doi.org/10.5194/nhess-17-357-2017

Articles | Volume 17, issue 3

© Author(s) 2017. This work is distributed under
the Creative Commons Attribution 3.0 License.

Research article | 08 Mar 2017

Abstract. Bootstrap resamples can be used to investigate the tail of empirical distributions as well as return value estimates from the extremal behaviour of the sample. Specifically, the confidence intervals on return value estimates or bounds on in-sample tail statistics can be obtained using bootstrap techniques. However, non-parametric bootstrapping from the entire sample is expensive. It is shown here that it suffices to bootstrap from a small subset consisting of the highest entries in the sequence to make estimates that are essentially identical to bootstraps from the entire sample. Similarly, bootstrap estimates of confidence intervals of threshold return estimates are found to be well approximated by using a subset consisting of the highest entries. This has practical consequences in fields such as meteorology, oceanography and hydrology where return values are calculated from very large gridded model integrations spanning decades at high temporal resolution or from large ensembles of independent and identically distributed model fields. In such cases the computational savings are substantial.
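The central claim of the abstract can be illustrated with a minimal sketch. It is one possible realization of the idea under stated assumptions, not necessarily the procedure used by the authors. The function names (`rth_largest`, `bootstrap_full`, `bootstrap_top_subset`), the synthetic exponential sample, and the values of `r`, `m` and `n_boot` are all hypothetical choices made for illustration. The sketch assumes the tail statistic is the r-th largest entry and that the subset size `m` is much larger than `r`. The number of resample draws that land in the top-`m` subset is drawn from a binomial distribution, so only those draws need to be generated.

```python
import numpy as np


def rth_largest(values, r):
    """Return the r-th largest entry of a 1-D array (r=1 is the maximum)."""
    k = values.size - r
    return np.partition(values, k)[k]


def bootstrap_full(sample, r, n_boot, rng):
    """Reference: resample all N entries, keep the r-th largest each time."""
    n = sample.size
    return np.array([
        rth_largest(rng.choice(sample, size=n, replace=True), r)
        for _ in range(n_boot)
    ])


def bootstrap_top_subset(sample, r, m, n_boot, rng):
    """Resample only from the m highest entries (assumes m >> r)."""
    n = sample.size
    top = np.partition(sample, n - m)[n - m:]  # the m highest entries
    out = np.empty(n_boot)
    for b in range(n_boot):
        # How many of the N draws of a full resample would hit the top subset.
        k = rng.binomial(n, m / n)
        # Redraw in the very unlikely case of fewer than r hits; this
        # conditioning is an approximation that is negligible when m >> r.
        while k < r:
            k = rng.binomial(n, m / n)
        out[b] = rth_largest(rng.choice(top, size=k, replace=True), r)
    return out


# Synthetic data standing in for a long model time series (an assumption).
rng = np.random.default_rng(0)
sample = rng.exponential(size=20_000)
r, m, n_boot = 10, 200, 300

full = bootstrap_full(sample, r, n_boot, rng)
sub = bootstrap_top_subset(sample, r, m, n_boot, rng)

# Percentile intervals from the two approaches should be close.
print("full  :", np.percentile(full, [2.5, 50, 97.5]))
print("subset:", np.percentile(sub, [2.5, 50, 97.5]))
```

In this sketch each subset resample draws on the order of `m` values instead of `N`, which is where the saving comes from when `N` is the length of a multi-decade, high-resolution series.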

Received: 06 Jul 2016 – Discussion started: 05 Sep 2016 – Revised: 21 Feb 2017 – Accepted: 25 Feb 2017 – Published: 08 Mar 2017