System and method for positional representation of content for efficient indexing, search, retrieval, and compression
First Claim
Patent Images
1. A method of generating a positional representation of a document, comprising:
- identifying unique terms in a document and determining positions in the document at which each of the unique terms appear; and
for each of the unique terms, storing positional information derived from the positions into a positional representation.
5 Assignments
0 Petitions
Accused Products
Abstract
A method of generating a positional representation of a document, including identifying each unique term in a document and positions in the document at which the unique term appears, and for the each unique term, storing positional information derived from the positions into a positional representation.
37 Citations
20 Claims
-
1. A method of generating a positional representation of a document, comprising:
-
identifying unique terms in a document and determining positions in the document at which each of the unique terms appear; and for each of the unique terms, storing positional information derived from the positions into a positional representation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer readable medium including computer code for generating a positional representation of a document comprising:
-
computer code for identifying each of the unique terms in a document and determining positions in the document at which each of the unique terms appear; and for each of the unique terms, computer code for storing positional information derived from the positions into a positional representation.
-
-
16. A method of generating an inverted index from a positional representation of a document, comprising:
-
inputting a positional representation of a document having a document identifier and positional records, wherein the positional records include a term of the document and occurrence positions of the term in the document; generating an entry for each of the positional records, wherein the entry includes the term and a document record, wherein the document record includes the document identifier and the occurrence positions; and inserting the entry into an inverted index. - View Dependent Claims (17, 18, 19)
-
-
20. An apparatus for generating a positional representation of a text document, comprising:
a processor for converting a document to a positional representation by extracting each of the unique terms from the document and their respective occurrence positions in the document, generating entries for each of the unique terms which include a first one of the unique terms and a set of the occurrence positions corresponding to the first one of the unique terms, and adding each of the entries to a positional representation.
Specification