2000 character limit reached
Overlapping Communities Detection via Measure Space Embedding (1504.06796v2)
Published 26 Apr 2015 in cs.LG, cs.SI, and stat.ML
Abstract: We present a new algorithm for community detection. The algorithm uses random walks to embed the graph in a space of measures, after which a modification of $k$-means in that space is applied. The algorithm is therefore fast and easily parallelizable. We evaluate the algorithm on standard random graph benchmarks, including some overlapping community benchmarks, and find its performance to be better or at least as good as previously known algorithms. We also prove a linear time (in number of edges) guarantee for the algorithm on a $p,q$-stochastic block model with $p \geq c\cdot N{-\frac{1}{2} + \epsilon}$ and $p-q \geq c' \sqrt{p N{-\frac{1}{2} + \epsilon} \log N}$.