TY - GEN
T1 - Efficient processing node proximity via random walk with restart
AU - Lv, Bingqing
AU - Yu, Weiren
AU - Wang, Liping
AU - McCann, Julie A.
PY - 2014
Y1 - 2014
N2 - Graph is a useful tool to model complicated data structures. One important task in graph analysis is assessing node proximity based on graph topology. Recently, Random Walk with Restart (RWR) tends to pop up as a promising measure of node proximity, due to its proliferative applications in e.g. recommender systems, and image segmentation. However, the best-known algorithm for computing RWR resorts to a large LU matrix factorization on an entire graph, which is cost-inhibitive. In this paper, we propose hybrid techniques to efficiently compute RWR. First, a novel divide-and-conquer paradigm is designed, aiming to convert the large LU decomposition into small triangular matrix operations recursively on several partitioned subgraphs. Then, on every subgraph, a “sparse accelerator” is devised to further reduce the time of RWR without any sacrifice in accuracy. Our experimental results on real and synthetic datasets show that our approach outperforms the baseline algorithms by at least one constant factor without loss of exactness.
AB - Graph is a useful tool to model complicated data structures. One important task in graph analysis is assessing node proximity based on graph topology. Recently, Random Walk with Restart (RWR) tends to pop up as a promising measure of node proximity, due to its proliferative applications in e.g. recommender systems, and image segmentation. However, the best-known algorithm for computing RWR resorts to a large LU matrix factorization on an entire graph, which is cost-inhibitive. In this paper, we propose hybrid techniques to efficiently compute RWR. First, a novel divide-and-conquer paradigm is designed, aiming to convert the large LU decomposition into small triangular matrix operations recursively on several partitioned subgraphs. Then, on every subgraph, a “sparse accelerator” is devised to further reduce the time of RWR without any sacrifice in accuracy. Our experimental results on real and synthetic datasets show that our approach outperforms the baseline algorithms by at least one constant factor without loss of exactness.
UR - https://link.springer.com/chapter/10.1007/978-3-319-11116-2_50
U2 - 10.1007/978-3-319-11116-2_50
DO - 10.1007/978-3-319-11116-2_50
M3 - Conference publication
SN - 978-3-319-11115-5
T3 - Lecture Notes in Computer Science
SP - 542
EP - 549
BT - Web Technologies and Applications
A2 - Chen, Lei
A2 - Jia, Yan
A2 - Sellis, Timos
A2 - Liu, Guanfeng
PB - Springer
CY - Cham (CH)
T2 - 16th Asia-Pacific Web Conference
Y2 - 5 September 2014 through 7 September 2014
ER -