Show simple item record

dc.contributor.advisorKalnis, Panos
dc.contributor.authorEl Saeedy, Mohammed El Sayed
dc.date.accessioned2012-02-04T08:11:42Z
dc.date.available2012-02-04T08:11:42Z
dc.date.issued2011-07-24
dc.identifier.doi10.25781/KAUST-G70TD
dc.identifier.urihttp://hdl.handle.net/10754/209372
dc.description.abstractMining frequent subgraphs is an important operation on graphs. Most existing work assumes a database of many small graphs, but modern applications, such as social networks, citation graphs or protein-protein interaction in bioinformatics, are modeled as a single large graph. Interesting interactions in such applications may be transitive (e.g., friend of a friend). Existing methods, however, search for frequent isomorphic (i.e., exact match) subgraphs and cannot discover many useful patterns. In this paper we propose GRAMI, a framework that generalizes frequent subgraph mining in a large single graph. GRAMI discovers frequent patterns. A pattern is a graph where edges are generalized to distance-constrained paths. Depending on the definition of the distance function, many instantiations of the framework are possible. Both directed and undirected graphs, as well as multiple labels per vertex, are supported. We developed an efficient implementation of the framework that models the frequency resolution phase as a constraint satisfaction problem, in order to avoid the costly enumeration of all instances of each pattern in the graph. We also implemented CGRAMI, a version that supports structural and semantic constraints; and AGRAMI, an approximate version that supports very large graphs. Our experiments on real data demonstrate that our framework is up to 3 orders of magnitude faster and discovers more interesting patterns than existing approaches.
dc.language.isoen
dc.titleGRAMI: Generalized Frequent Subgraph Mining in Large Graphs
dc.typeThesis
dc.contributor.departmentComputer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
thesis.degree.grantorKing Abdullah University of Science and Technology
dc.contributor.committeememberGao, Xin
dc.contributor.committeememberRavasi, Timothy
thesis.degree.disciplineComputer Science
thesis.degree.nameMaster of Science


Files in this item

Thumbnail
Name:
Mohammed El Sayed Thesis.pdf
Size:
923.3Kb
Format:
PDF
Description:
PDF file

This item appears in the following Collection(s)

Show simple item record