Extreme value statistics in semi-supervised models

H Ahmed, John H.J. Einmahl, Chen Zhou

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

We consider extreme value analysis in a semi-supervised setting, where we observe, next to the n data on the target variable, n + m data on one or more covariates. This is called the semi-supervised model with n labeled and m unlabeled data. By exploiting the tail dependence between the target variable and the covariates, we derive estimators for the extreme value index and extreme quantiles of the target variable in this setting and establish their asymptotic behavior. Our estimators substantially improve the univariate estimators, based on only the n target variable data, in terms of asymptotic variances whereas the asymptotic biases remain unchanged. A simulation study confirms the substantially improved behavior of both estimators. Finally the estimation method is applied to rainfall data in France. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.

Original languageEnglish
JournalJournal of the American Statistical Association
DOIs
Publication statusAccepted/In press - 2024

Bibliographical note

Publisher Copyright:
© 2024 The Author(s). Published with license by Taylor & Francis Group, LLC.

Fingerprint

Dive into the research topics of 'Extreme value statistics in semi-supervised models'. Together they form a unique fingerprint.

Cite this