Edinburgh Research Archive

Graph pattern matching on social network analysis

dc.contributor.advisor
Fan, Wenfei
en
dc.contributor.advisor
Geerts, Floris
en
dc.contributor.advisor
Libkin, Leonid
en
dc.contributor.author
Wang, Xin
en
dc.date.accessioned
2013-12-11T10:23:04Z
dc.date.available
2013-12-11T10:23:04Z
dc.date.issued
2013-11-28
dc.description.abstract
Graph pattern matching is fundamental to social network analysis. Its effectiveness for identifying social communities and social positions, making recommendations and so on has been repeatedly demonstrated. However, the social network analysis raises new challenges to graph pattern matching. As real-life social graphs are typically large, it is often prohibitively expensive to conduct graph pattern matching over such large graphs, e.g., NP-complete for subgraph isomorphism, cubic time for bounded simulation, and quadratic time for simulation. These hinder the applicability of graph pattern matching on social network analysis. In response to these challenges, the thesis presents a series of effective techniques for querying large, dynamic, and distributively stored social networks. First of all, we propose a notion of query preserving graph compression, to compress large social graphs relative to a class Q of queries. We then develop both batch and incremental compression strategies for two commonly used pattern queries. Via both theoretical analysis and experimental studies, we show that (1) using compressed graphs Gr benefits graph pattern matching dramatically; and (2) the computation of Gr as well as its maintenance can be processed efficiently. Secondly, we investigate the distributed graph pattern matching problem, and explore parallel computation for graph pattern matching. We show that our techniques possess following performance guarantees: (1) each site is visited only once; (2) the total network traffic is independent of the size of G; and (3) the response time is decided by the size of largest fragment of G rather than the size of entire G. Furthermore, we show how these distributed algorithms can be implemented in the MapReduce framework. Thirdly, we study the problem of answering graph pattern matching using views since view based techniques have proven an effective technique for speeding up query evaluation. We propose a notion of pattern containment to characterise graph pattern matching using views, and introduce efficient algorithms to answer graph pattern matching using views. Moreover, we identify three problems related to graph pattern containment, and provide efficient algorithms for containment checking (approximation when the problem is intractable). Fourthly, we revise graph pattern matching by supporting a designated output node, which we treat as “query focus”. We then introduce algorithms for computing the top-k relevant matches w.r.t. the output node for both acyclic and cyclic pattern graphs, respectively, with early termination property. Furthermore, we investigate the diversified top-k matching problem, and develop an approximation algorithm with performance guarantee and a heuristic algorithm with early termination property. Finally, we introduce an expert search system, called ExpFinder, for large and dynamic social networks. ExpFinder identifies top-k experts in social networks by graph pattern matching, and copes with the sheer size of real-life social networks by integrating incremental graph pattern matching, query preserving compression and top-k matching computation. In particular, we also introduce bounded (resp. unbounded) incremental algorithms to maintain the weighted landmark vectors which are used for incremental maintenance for cached results.
en
dc.identifier.uri
http://hdl.handle.net/1842/8277
dc.language.iso
en
dc.publisher
The University of Edinburgh
en
dc.relation.hasversion
Wenfei Fan, Jianzhong Li, Jizhou Luo, Zijing Tan, Xin Wang, and YinghuiWu. Incremental graph pattern matching. In SIGMOD, 2011.
en
dc.relation.hasversion
Wenfei Fan, Jianzhong Li, XinWang, and YinghuiWu. Query preserving graph compression. In SIGMOD, 2012.
en
dc.relation.hasversion
Wenfei Fan, Xin Wang, and Yinghui Wu. Performance guarantees for distributed reachability queries. In PVLDB, 2012.
en
dc.relation.hasversion
Wenfei Fan, Xin Wang, and Yinghui Wu. Expfinder: Finding experts by graph pattern matching. In ICDE demo, 2013.
en
dc.relation.hasversion
Wenfei Fan, Xin Wang, and Yinghui Wu. Incremental graph pattern matching. In ACM Transactions on Database Systems (TODS), 2013.
en
dc.relation.hasversion
Wenfei Fan, Xin Wang, and Yinghui Wu. Diversified top-k graph pattern matching. In VLDB, 2013.
en
dc.subject
graph pattern matching
en
dc.subject
social network
en
dc.title
Graph pattern matching on social network analysis
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
Doctoral
en
dc.type.qualificationname
PhD Doctor of Philosophy
en

Files

Original bundle

Now showing 1 - 2 of 2
Name:
Wang2013.pdf
Size:
2.62 MB
Format:
Adobe Portable Document Format
Description:
Name:
thesis files.zip
Size:
16.57 MB
Format:
Unknown data format
Description:

This item appears in the following Collection(s)