Expanded inverted index
Abstract
Documents are indexed by generating an inverted index for a collection of one or more documents. The inverted index includes an inverted list for an index term appearing in one or more of the documents in the collection, and the inverted list includes one or more postings. A posting includes a document identifier identifying a document in the collection of documents, a position identifier identifying a position of the index term in the document, and proximity information specifying whether the index term is positioned in a predefined proximal relationship with a second index term in the document.
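As an illustration only, the posting described in the abstract could be sketched in Python as follows. The names `Posting` and `build_index` are hypothetical, tokenization is plain whitespace splitting, and the "predefined proximal relationship" is assumed here to mean "immediately followed by another term"; the patent text above does not fix any of these choices.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Posting:
    doc_id: int               # document identifier
    position: int             # 0-based token offset of the index term
    next_term: Optional[str]  # proximity info (assumption: the term that follows)


def build_index(docs: dict[int, str]) -> dict[str, list[Posting]]:
    """Build one inverted list (a list of postings) per index term."""
    index: dict[str, list[Posting]] = defaultdict(list)
    for doc_id, text in docs.items():
        tokens = text.lower().split()
        for pos, term in enumerate(tokens):
            # Record the neighbouring term, or None at the end of the document.
            nxt = tokens[pos + 1] if pos + 1 < len(tokens) else None
            index[term].append(Posting(doc_id, pos, nxt))
    return dict(index)


docs = {1: "the quick brown fox", 2: "quick fox"}
index = build_index(docs)
quick_postings = index["quick"]
```

With these two sample documents, the inverted list for "quick" holds one posting per occurrence, each carrying its document identifier, its position, and the neighbouring term as proximity information.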
28 Claims
1. A computer-implemented method for indexing documents, the method comprising:
generating an inverted index for a collection of one or more documents, the inverted index comprising:
an inverted list for a single index term appearing in one or more of the documents in the collection, the inverted list including one or more postings, where a posting comprises:
a document identifier identifying a document in the collection of documents,
a position identifier identifying a position of the index term in the document; and
proximity information specifying a proximal relationship between the index term and another index term in the document.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11
12. A computer-implemented method for indexing documents, the method comprising:
creating an inverted index for a collection of one or more documents, the inverted index comprising:
an inverted list for a single index term included in the collection, the inverted list including one or more postings, where a posting comprises:
a document identifier identifying a document in the collection of documents,
a flag indicating the index term is positioned next to a common term in the document;
a frequency of the index term occurring in the document;
a common term identifier identifying the common term; and
a position identifier identifying a position of the index term in the document.
Dependent claims: 13, 14
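The posting fields recited in claim 12 could be sketched as below. This is a minimal illustration under stated assumptions, not the patented implementation: the `COMMON_TERM_IDS` table, the class and function names, the decision to check only the following token for "next to", and the choice to give common terms no inverted list of their own are all hypothetical.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass
from typing import Optional

# Hypothetical table mapping common terms to identifiers.
COMMON_TERM_IDS = {"the": 0, "of": 1, "a": 2}


@dataclass(frozen=True)
class CommonTermPosting:
    doc_id: int                    # document identifier
    position: int                  # 0-based token offset of the index term
    frequency: int                 # occurrences of the index term in the document
    next_to_common: bool           # flag: term is followed by a common term
    common_term_id: Optional[int]  # identifier of that common term, if any


def build_common_term_index(docs: dict[int, str]) -> dict[str, list[CommonTermPosting]]:
    index: dict[str, list[CommonTermPosting]] = defaultdict(list)
    for doc_id, text in docs.items():
        tokens = text.lower().split()
        counts = Counter(tokens)
        for pos, term in enumerate(tokens):
            if term in COMMON_TERM_IDS:
                continue  # assumption: common terms are not indexed themselves
            # Assumption: "next to" is checked against the following token only.
            following = tokens[pos + 1] if pos + 1 < len(tokens) else None
            common_id = COMMON_TERM_IDS.get(following)
            index[term].append(
                CommonTermPosting(doc_id, pos, counts[term],
                                  common_id is not None, common_id)
            )
    return dict(index)


common_index = build_common_term_index({7: "state of the art state"})
state_postings = common_index["state"]
```

In this sample, the first occurrence of "state" is followed by the common term "of", so its posting sets the flag and stores that term's identifier, while the second occurrence leaves both unset.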
15. An article comprising a machine-readable storage medium storing instructions operable to cause one or more machines to perform operations comprising:
generating an inverted index for a collection of one or more documents, the inverted index comprising:
an inverted list for a single index term appearing in one or more of the documents in the collection, the inverted list including one or more postings, where a posting comprises:
a document identifier identifying a document in the collection of documents,
a position identifier identifying a position of the index term in the document; and
proximity information specifying whether the index term is positioned to have a predefined proximal relationship with a second index term in the document.
Dependent claims: 16, 17, 18, 19, 20, 21, 22, 23, 24, 25
26. An article comprising a machine-readable storage medium storing instructions operable to cause one or more machines to perform operations comprising:
creating an inverted index for a collection of one or more documents, the inverted index comprising:
an inverted list for a single index term included in the collection, the inverted list including one or more postings, where a posting comprises:
a document identifier identifying a document in the collection of documents,
a flag indicating the index term is positioned next to a common term in the document;
a frequency of the index term occurring in the document;
a common term identifier identifying the common term; and
a position identifier identifying a position of the index term in the document.
Dependent claims: 27, 28
Specification