In this paper we investigate a divide-And-conquer algorithm for estimating the extreme value index when data are stored in multiple machines. The oracle property of such an algorithm based on extreme value methods is not guaranteed by the general theory of distributed inference. We propose a distributed Hill estimator and establish its asymptotic theories. We consider various cases where the number of observations involved in each machine can be either homogeneous or heterogeneous, and either fixed or varying according to the total sample size. In each case we provide a sufficient, sometimes also necessary, condition under which the oracle property holds.
|Number of pages||8|
|Journal||Biometrika. A Journal for the Statistical Study of Biological Problems|
|Publication status||Published - 30 Jan 2021|
Bibliographical notePublisher Copyright:
© 2021 The Author(s) 2021. Published by Oxford University Press on behalf of the Biometrika Trust. All rights reserved.