System and method of performing profile matching with a structured document
Abstract
A profile matching system and associated method match the path expressions in a structured or semi-structured document, such as an XML document, to an indexed resource. The system, having assigned weighting values to the elements in the index, maps the document path expressions and attempts to match them to the index elements according to a predetermined schema. The system comprises an indexing module that maps the document and identifies its content attributes, and a matching module that matches the document content attributes to weighted elements of an index. The system defines the attribute or set of attributes in the schema that results in a match. The matching criteria are identified in a map specification file that specifies the qualification criteria. The indexing module uses the map specification information to produce efficient indices from XML documents. The matching module uses a match specification language (MSL) and a match operator. Each rule in the MSL is a pair of path expressions: one for the source document and one for the target document or index.
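The indexing step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names `map_path_expressions` and `build_index`, the `{path: (value, weight)}` index layout, the default weight, and the sample job document are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def map_path_expressions(xml_text):
    """Return {path expression: text} for every leaf element of the document.

    Simplification: repeated sibling elements with the same tag overwrite
    each other, because the path alone is used as the key.
    """
    root = ET.fromstring(xml_text)
    paths = {}

    def walk(node, prefix):
        path = f"{prefix}/{node.tag}"
        children = list(node)
        if children:
            for child in children:
                walk(child, path)
        else:
            paths[path] = (node.text or "").strip()

    walk(root, "")
    return paths


def build_index(xml_text, weights, default_weight=1.0):
    """Build a weighted index: {path expression: (value, weight)}.

    `weights` is a hypothetical stand-in for the weighting values that the
    abstract says are assigned to index elements.
    """
    return {
        path: (value, weights.get(path, default_weight))
        for path, value in map_path_expressions(xml_text).items()
    }


# Hypothetical target document and weighting.
job = "<job><title>Engineer</title><skills><language>Java</language></skills></job>"
index = build_index(job, weights={"/job/skills/language": 3.0})
```

Here the element at `/job/skills/language` carries a higher weight than the default, so a match on it would count for more in any later scoring step.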
37 Claims
1. A method of dynamically performing an interactive bidirectional profile matching between a plurality of source documents and a plurality of structured target documents, comprising:
extracting an index from each of the structured target documents, the index containing a plurality of elements;
identifying a schema for the index;
assigning weighting values to the elements of the index;
mapping path expressions in the plurality of source documents;
for at least some of the structured target documents, matching the path expressions in the plurality of source documents to the index from each of the structured target documents, according to the index schema;
as a new target document is added, automatically identifying an index schema for the new target document, and incrementally matching the index schema for the new target document to the path expressions in the plurality of source documents, according to the index schema of the new target document; and
as a new source document is added, automatically mapping a new path expression for the new source document, and incrementally matching the new path expression to the index from each of the structured target documents, according to the index schema of each of the structured target documents, for dynamically performing the interactive bidirectional profile matching.
Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
further comprising assigning weighting values to the elements of the index.
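The incremental, bidirectional behavior recited in claim 1 can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed method itself: the class `IncrementalMatcher`, the scoring function `weighted_overlap`, and the convention that source and target share the same path expressions are all hypothetical. The point shown is only that adding a document on either side triggers matching of that one document against the existing documents on the other side, rather than a full recomputation.

```python
class IncrementalMatcher:
    """Keeps match scores up to date as source or target documents arrive."""

    def __init__(self, match_fn):
        self.match_fn = match_fn  # (source paths, target index) -> score
        self.sources = {}         # source id -> {path: value}
        self.targets = {}         # target id -> {path: (value, weight)}
        self.scores = {}          # (source id, target id) -> score

    def add_source(self, source_id, paths):
        # Match only the new source against every existing target index.
        self.sources[source_id] = paths
        for target_id, index in self.targets.items():
            self.scores[(source_id, target_id)] = self.match_fn(paths, index)

    def add_target(self, target_id, index):
        # Match only the new target index against every existing source.
        self.targets[target_id] = index
        for source_id, paths in self.sources.items():
            self.scores[(source_id, target_id)] = self.match_fn(paths, index)


def weighted_overlap(paths, index):
    """Hypothetical score: sum the weights of index elements whose value
    equals the source value found at the same path."""
    return sum(w for path, (value, w) in index.items() if paths.get(path) == value)


# Hypothetical documents, already reduced to path expressions.
m = IncrementalMatcher(weighted_overlap)
m.add_source("s1", {"/profile/skill": "Java", "/profile/city": "Paris"})
m.add_target("t1", {"/profile/skill": ("Java", 3.0), "/profile/city": ("Rome", 1.0)})
m.add_source("s2", {"/profile/skill": "Java", "/profile/city": "Rome"})
```

After these three calls, `m.scores` holds one entry per source/target pair, and each call computed only the pairs involving the newly added document.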
11. A system for dynamically performing an interactive bidirectional profile matching between a plurality of source documents and a plurality of structured target documents, comprising:
an indexing module for extracting an index from each of the structured target documents, the index containing a plurality of elements;
the indexing module identifying a schema for the index and assigning weighting values to the elements of the index;
a mapping module for mapping path expressions in the plurality of source documents;
for at least some of the structured target documents, a matching module for matching the path expressions in the plurality of source documents to the index from each of the structured target documents, according to the index schema;
as a new target document is added, the indexing module automatically identifies an index schema for the new target document, and the matching module incrementally matches the index schema for the new target document to the path expressions in the plurality of source documents, according to the index schema of the new target document; and
as a new source document is added, the mapping module automatically maps a new path expression for the new source document, and the matching module incrementally matches the new path expression to the index from each of the structured target documents, according to the index schema of each of the structured target documents, for dynamically performing the interactive bidirectional profile matching.
Dependent Claims (12, 13, 14, 15, 16, 17, 18)
wherein for each rule in the match specification language the matching module specifies a pair of path expressions: one for the documents and one for the index.
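A rule that pairs a document path expression with an index path expression, combined with a match operator, might be modeled as below. The concrete MSL syntax is not given in this section, so the `Rule` dataclass, the operator names in `OPERATORS`, the `score` function, and the resume and job paths are all hypothetical; the sketch only shows the pairing idea.

```python
import operator
from dataclasses import dataclass

# Hypothetical match operators; the real MSL operator set is not specified here.
OPERATORS = {
    "eq": operator.eq,
    "ge": operator.ge,
    "contains": lambda source_value, index_value: index_value in source_value,
}


@dataclass(frozen=True)
class Rule:
    source_path: str  # path expression in the source document
    index_path: str   # path expression in the target index
    op: str = "eq"    # key into OPERATORS


def score(source, index, rules):
    """Sum the weights of index elements whose rule is satisfied.

    `source` is {path: value}; `index` is {path: (value, weight)}.
    Rules whose paths are missing on either side are skipped.
    """
    total = 0.0
    for rule in rules:
        if rule.source_path not in source or rule.index_path not in index:
            continue
        index_value, weight = index[rule.index_path]
        if OPERATORS[rule.op](source[rule.source_path], index_value):
            total += weight
    return total


# Hypothetical source document, target index, and rule set.
resume = {"/resume/skill": "Java", "/resume/years": 5}
job_index = {"/job/requirement/skill": ("Java", 3.0),
             "/job/requirement/min_years": (3, 2.0)}
rules = [
    Rule("/resume/skill", "/job/requirement/skill", "eq"),
    Rule("/resume/years", "/job/requirement/min_years", "ge"),
]
total = score(resume, job_index, rules)
```

Keeping the two path expressions separate lets the source and the index use different schemas, which is the situation the rule pairs appear intended to handle.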
16. The system according to claim 11, wherein the plurality of structured target documents comprise a semi-structured target document.
17. The system according to claim 11, wherein the plurality of structured target documents comprise a structured target document.
18. The system according to claim 11, wherein the index contains a plurality of elements; and
wherein the indexing module assigns weighting values to the elements of the index.
19. A computer program product having instruction codes for performing an interactive bidirectional profile matching between a plurality of source documents and a plurality of structured target documents, comprising:
a first set of instruction codes for extracting an index from each of the structured target documents, the index containing a plurality of elements;
a second set of instruction codes for identifying a schema for the index and for assigning weighting values to the elements of the index;
a third set of instruction codes for mapping path expressions in the plurality of source documents;
a fourth set of instruction codes, which, for at least some of the structured target documents, matches the path expressions in the plurality of source documents to the index from each of the structured target documents, according to the index schema;
as a new target document is added, the second set of instruction codes automatically identifies an index schema for the new target document, and the fourth set of instruction codes incrementally matches the index schema for the new target document to the path expressions in the plurality of source documents, according to the index schema of the new target document; and
as a new source document is added, the third set of instruction codes automatically maps a new path expression for the new source document, and the fourth set of instruction codes incrementally matches the new path expression to the index from each of the structured target documents, according to the index schema of each of the structured target documents, for dynamically performing the interactive bidirectional profile matching.
Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28)
wherein the second set of instruction codes further assigns weighting values to the elements of the index.
29. A system for performing an interactive bidirectional profile matching between a plurality of source documents and a plurality of structured target documents, comprising:
means for extracting an index from each of the structured target documents, the index containing a plurality of elements and for assigning weighting values to the elements of the index;
means for identifying a schema for the index;
means for mapping path expressions in the plurality of source documents;
for at least some of the structured target documents, means for matching the path expressions in the plurality of source documents to the index from each of the structured target documents, according to the index schema;
as a new target document is added, the identifying means automatically identifies an index schema for the new target document, and the matching means incrementally matches the index schema for the new target document to the path expressions in the plurality of source documents, according to the index schema of the new target document; and
as a new source document is added, the mapping means automatically maps a new path expression for the new source document, and the matching means incrementally matches the new path expression to the index from each of the structured target documents, according to the index schema of each of the structured target documents, for dynamically performing the interactive bidirectional profile matching.
Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37)
wherein the identifying means further assigns weighting values to the elements of the index.
Specification