On reconstructing a string from its substring compositions
Jayadev Acharya, Hirakendu Das, Olgica Milenkovic, Alon Orlitsky, Shengjun Pan
Proceedings of the IEEE International Symposium on Information Theory, Austin, TX, USA, June 2010
Abstract

Motivated by protein sequencing, we consider the problem of reconstructing a string from the compositions of its substrings. We provide several results, including the following: general classes of strings that cannot be distinguished from their substring compositions; an almost complete characterization of the lengths for which reconstruction is possible; bounds on the number of strings with the same substring compositions in terms of the number of divisors of the string length plus one; and a relation to the turnpike problem and a bivariate polynomial formulation of string reconstruction.
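
To make the setting concrete, the following is a minimal Python sketch, not taken from the paper. It assumes that the composition of a substring is the multiset of its symbols (order ignored), and that a string is described by the multiset of compositions of all its contiguous substrings. The function names `substring_compositions` and `confusable_classes` are hypothetical, and the brute-force grouping is only practical for short strings.

```python
from collections import Counter
from itertools import product


def substring_compositions(s):
    """Return the multiset of compositions of all contiguous substrings of s.

    Assumption: a composition is the symbol counts of a substring,
    stored here as a sorted tuple of (symbol, count) pairs.
    """
    comps = Counter()
    for i in range(len(s)):
        running = Counter()  # symbol counts of s[i:j+1], extended one symbol at a time
        for j in range(i, len(s)):
            running[s[j]] += 1
            comps[tuple(sorted(running.items()))] += 1
    return comps


def confusable_classes(n, alphabet="01"):
    """Group all length-n strings by their substring-composition multiset.

    Brute force over len(alphabet) ** n strings, for illustration only.
    """
    groups = {}
    for chars in product(alphabet, repeat=n):
        s = "".join(chars)
        key = frozenset(substring_compositions(s).items())
        groups.setdefault(key, []).append(s)
    return list(groups.values())


if __name__ == "__main__":
    # A string and its reversal always share the same substring compositions.
    print(substring_compositions("001") == substring_compositions("100"))
    # Classes of binary strings of length 3 that cannot be told apart.
    for group in confusable_classes(3):
        print(group)
```

A string and its reversal always fall into the same group, so reconstruction can be unique only up to reversal. Groups containing strings that are not reversals of one another are instances of the indistinguishable classes mentioned in the abstract.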