×

Scalable summarization of data graphs

  • US 8,984,019 B2
  • Filed: 11/20/2012
  • Issued: 03/17/2015
  • Est. Priority Date: 11/20/2012
  • Status: Active Grant
First Claim
Patent Images

1. A method for summarizing resource description framework datasets, the method comprising:

  • creating a condensed view of the resource description framework dataset graph comprising a plurality of entity vertices, type vertices and keyword vertices connected by a plurality of predicate edges by combining entity, keyword and type vertices into a plurality of condensed vertices linked only by inter entity vertex predicate edges from the resource description framework dataset, the condensed view comprising a dataset graph;

    removing entity information and keyword information from each condensed vertex and maintaining only type information in each condensed vertex;

    grouping the plurality of condensed vertices by common type information;

    splitting the condensed view of the resource description framework dataset graph into a plurality of partitions, each partition associated with a unique common type value selected from the type vertices and comprising a plurality of vertices and predicate edges connecting the vertices, splitting the condensed view of the resource description framework dataset graph comprising;

    creating a plurality of predicate edge disjoint subgraphs by selecting condensed vertices on which to begin predicate edge disjoint graphs by group and exhausting all condensed vertices in a given group before advancing to a subsequent group, each subgraph beginning at a given condensed vertex and extending out a predetermined number of hops through the condensed view of the resource description framework, each partition comprising all subgraphs beginning at condensed vertices comprising common type information; and

    defining a minimum set of common type based structures summarizing the plurality of partitions.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×