Testing closeness of discrete distributions

Batu, Tugkan; Fortnow, Lance; Rubinfeld, Ronitt; Smith, Warren D.; and White, Patrick. Testing closeness of discrete distributions. Journal of the ACM, 60 (1): 4. ISSN 0004-5411

Given samples from two distributions over an n-element set, we wish to test whether these distributions are statistically close. We present an algorithm which uses a number of independent samples from each distribution that is sublinear in n, specifically $O(n^{2/3} \varepsilon^{-8/3} \log n)$, runs in time linear in the sample size, makes no assumptions about the structure of the distributions, and distinguishes the cases when the distance between the distributions is small (less than $\max\{\varepsilon^{4/3} n^{-1/3}/32,\ \varepsilon n^{-1/2}/4\}$) or large (more than $\varepsilon$) in $\ell_1$ distance. This result can be compared to the lower bound of $\Omega(n^{2/3} \varepsilon^{-2/3})$ for this problem given by Valiant [2008]. Our algorithm has applications to the problem of testing whether a given Markov process is rapidly mixing. We present sublinear algorithms for several variants of this problem as well.
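
To get a feel for the quantities in the abstract, the following minimal Python sketch evaluates the three expressions for concrete n and ε. It is not the paper's algorithm. The function names are hypothetical, the constants hidden by the O and Ω notation are taken to be 1, the logarithm is taken as natural, and the braced pair in the "small" case is read as the maximum of its two terms; all of these are assumptions made for illustration only.

```python
import math


def upper_bound_samples(n: int, eps: float) -> float:
    """n^(2/3) * eps^(-8/3) * log n.

    Assumption: the constant hidden by O(.) is 1 and log is natural.
    """
    return n ** (2 / 3) * eps ** (-8 / 3) * math.log(n)


def lower_bound_samples(n: int, eps: float) -> float:
    """n^(2/3) * eps^(-2/3).

    Assumption: the constant hidden by Omega(.) is 1.
    """
    return n ** (2 / 3) * eps ** (-2 / 3)


def closeness_threshold(n: int, eps: float) -> float:
    """Largest l1 distance treated as "small".

    Assumption: the braced pair in the abstract is read as a maximum.
    """
    first = eps ** (4 / 3) * n ** (-1 / 3) / 32
    second = eps * n ** (-1 / 2) / 4
    return max(first, second)


if __name__ == "__main__":
    n, eps = 10**6, 0.5
    print("upper bound expression:", upper_bound_samples(n, eps))
    print("lower bound expression:", lower_bound_samples(n, eps))
    print("'small' threshold:     ", closeness_threshold(n, eps))
    print("'large' threshold:     ", eps)
```

Under these assumptions the two sample-size expressions differ by a factor of exactly ε^{-2} log n, and the first term of the threshold is the larger one precisely when ε²n ≥ 8^6 = 262144.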

Full text not available from this repository.
