Article · The Annals of Statistics · Oct 1, 2013 · Green OA

Equivalence of distance-based and RKHS-based statistics in hypothesis testing

UCL Australia · Max Planck Society · +3 more institutions

Indexed in: arXiv, Crossref

Abstract

We provide a unifying framework linking two classes of statistics used in two-sample and independence testing: on the one hand, the energy distances and distance covariances from the statistics literature; on the other, maximum mean discrepancies (MMD), that is, distances between embeddings of distributions to reproducing kernel Hilbert spaces (RKHS), as established in machine learning. In the case where the energy distance is computed with a semimetric of negative type, a positive definite kernel, termed distance kernel, may be defined such that the MMD corresponds exactly to the energy distance. Conversely, for any positive definite kernel, we can interpret the MMD as energy distance with respect to some…
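The correspondence described in the abstract can be checked numerically on finite samples. The sketch below is illustrative only and is not taken from the paper's code: it assumes the Euclidean distance as the semimetric of negative type, uses a distance kernel of the form k(x, y) = ½[ρ(x, x0) + ρ(y, x0) − ρ(x, y)] centred at an arbitrary point x0, and computes plain V-statistics. The function names (`rho`, `distance_kernel`, `mean_pairwise`, `energy_distance`, `mmd_squared`) and the sample sizes are hypothetical choices. With this ½ normalization, expanding the kernel shows that the energy distance equals twice the squared MMD, and that the centre point x0 cancels.

```python
import math
import random


def rho(x, y):
    """Euclidean distance, assumed here as the semimetric of negative type."""
    return math.dist(x, y)


def distance_kernel(x, y, x0):
    """Kernel induced by rho, centred at an arbitrary (hypothetical) point x0."""
    return 0.5 * (rho(x, x0) + rho(y, x0) - rho(x, y))


def mean_pairwise(f, xs, ys):
    """Average of f over all pairs (a V-statistic: diagonal terms included)."""
    return sum(f(x, y) for x in xs for y in ys) / (len(xs) * len(ys))


def energy_distance(xs, ys):
    """Empirical energy distance: 2 E rho(X,Y) - E rho(X,X') - E rho(Y,Y')."""
    return (2.0 * mean_pairwise(rho, xs, ys)
            - mean_pairwise(rho, xs, xs)
            - mean_pairwise(rho, ys, ys))


def mmd_squared(xs, ys, x0):
    """Empirical squared MMD under the distance kernel centred at x0."""
    def k(a, b):
        return distance_kernel(a, b, x0)
    return (mean_pairwise(k, xs, xs)
            + mean_pairwise(k, ys, ys)
            - 2.0 * mean_pairwise(k, xs, ys))


# Two small synthetic samples in the plane, differing by a mean shift.
rng = random.Random(0)
xs = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(50)]
ys = [(rng.gauss(1, 1), rng.gauss(0, 1)) for _ in range(60)]

e = energy_distance(xs, ys)
m_origin = mmd_squared(xs, ys, x0=(0.0, 0.0))
m_shifted = mmd_squared(xs, ys, x0=(3.0, -2.0))

# Up to floating-point error: energy distance == 2 * MMD^2 for either centre.
print(math.isclose(e, 2.0 * m_origin, abs_tol=1e-9))
print(math.isclose(m_origin, m_shifted, abs_tol=1e-9))
```

The choice of x0 changes the kernel but not the resulting MMD, which is why two different centres are compared at the end.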

Citation impact

Total citations: 590
FWCI: 21.60
Percentile: 100%
References: 57

Authors

4

Topics & keywords

Keywords
  • Mathematics
  • Reproducing kernel Hilbert space
  • Statistics
  • Equivalence (formal languages)
  • Kernel (algebra)
  • Covariance
  • Sample size determination
  • Statistical hypothesis testing
UN Sustainable Development Goals
  • Affordable and clean energy