Compression of data streams down to their information content
According to the Kolmogorov complexity, every finite binary string is compressible to a shortest code-its information content-from which it is effectively recoverable. We investigate the extent to which this holds for the infinite binary sequences (streams). We devise a new coding method that uniformly codes every stream X into an algorithmically random stream Y , in such a way that the first n bits of X are recoverable from the first I(X \upharpoonright -{n}) bits of Y , where I is any partial computable information content measure that is defined on all prefixes of X , and where X \upharpoonright -{n} is the initial segment of X of length n. As a consequence, if g is any computable upper bound on the initial segment prefix-free complexity of X , then X is computable from an algorithmically random Y with oracle-use at most g. Alternatively (making no use of such a computable bound g ), one can achieve an the oracle-use bounded above by K(X \upharpoonright -{n})+\log n. This provides a strong analogue of Shannon's source coding theorem for the algorithmic information theory.
| Item Type | Article |
|---|---|
| Copyright holders | © 2019 IEEE |
| Keywords | source coding, algorithmic information theory, compression, Kolmogorov complexity, prefixfree codes, layered Kraft-Chaitin theorem |
| Departments | Mathematics |
| DOI | 10.1109/TIT.2019.2896638 |
| Date Deposited | 07 Feb 2019 11:30 |
| Acceptance Date | 2019-01-21 |
| URI | https://researchonline.lse.ac.uk/id/eprint/100040 |
-
picture_as_pdf -
subject - Accepted Version