Fundamental Limits of Database Alignment (1805.03829v1)
Published 10 May 2018 in cs.IT and math.IT
Abstract: We consider the problem of aligning a pair of databases with correlated entries. We introduce a new measure of correlation in a joint distribution that we call cycle mutual information. This measure has operational significance: it determines whether exact recovery of the correspondence between database entries is possible for any algorithm. Additionally, there is an efficient algorithm for database alignment that achieves this information-theoretic threshold.