Efficient computation of ontology affinity matrices
First Claim
1. A method comprising:
- accessing an inverted index comprising a plurality of inverted index lists, each inverted index list corresponding to a term, each inverted index list comprising a term identifier of the term and one or more document identifiers indicating one or more documents of a document set in which the term appears;
organizing a plurality of ordered pairs, each ordered pair comprising a term identifier and a document identifier of an inverted index list, the organizing primarily based on the document identifiers of the ordered pairs; and
generating a term identifier index according to the inverted index, the term identifier index comprising a plurality of sections, each section corresponding to a document, each section comprising one or more term identifiers of one or more terms that appear in the document, the generating the term identifier index according to the inverted index comprising organizing the term identifiers of the ordered pairs in the sections of the term identifier index.
1 Assignment
0 Petitions
Accused Products
Abstract
In one embodiment, generating an ontology includes accessing an inverted index comprising a plurality of inverted index lists. An inverted index list may correspond to a term of a language. Each inverted index list may comprise a term identifier of the term and one or more document identifiers indicating one or more documents of a document set in which the term appears. The embodiment also includes generating a term identifier index according to the inverted index. The term identifier index comprises a plurality of sections and each section corresponds to a document. Each section may comprise one or more term identifiers of one or more terms that appear in the document.
-
Citations
24 Claims
-
1. A method comprising:
-
accessing an inverted index comprising a plurality of inverted index lists, each inverted index list corresponding to a term, each inverted index list comprising a term identifier of the term and one or more document identifiers indicating one or more documents of a document set in which the term appears; organizing a plurality of ordered pairs, each ordered pair comprising a term identifier and a document identifier of an inverted index list, the organizing primarily based on the document identifiers of the ordered pairs; and generating a term identifier index according to the inverted index, the term identifier index comprising a plurality of sections, each section corresponding to a document, each section comprising one or more term identifiers of one or more terms that appear in the document, the generating the term identifier index according to the inverted index comprising organizing the term identifiers of the ordered pairs in the sections of the term identifier index. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. An apparatus comprising:
-
a memory operable to; store an inverted index comprising a plurality of inverted index lists, each inverted index list corresponding to a term, each inverted index list comprising a term identifier of the term and one or more document identifiers indicating one or more documents of a document set in which the term appears; and a processor operable to execute logic to; organize a plurality of ordered pairs, each ordered pair comprising a term identifier and a document identifier of an inverted index list, the organizing primarily based on the document identifiers of the ordered pairs; and generate a term identifier index according to the inverted index, the term identifier index comprising a plurality of sections, each section corresponding to a document, each section comprising one or more term identifiers of one or more terms that appear in the document, the generating the term identifier index according to the inverted index comprising organizing the term identifiers of the ordered pairs in the sections of the term identifier index. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A tangible computer-readable medium having computer-executable code, when executed by a computer operable to:
-
access an inverted index comprising a plurality of inverted index lists, each inverted index list corresponding to a term, each inverted index list comprising a term identifier of the term and one or more document identifiers indicating one or more documents of a document set in which the term appears; organize a plurality of ordered pairs, each ordered pair comprising a term identifier and a document identifier of an inverted index list, the organizing primarily based on the document identifiers of the ordered pairs; and generate a term identifier index according to the inverted index, the term identifier index comprising a plurality of sections, each section corresponding to a document, each section comprising one or more term identifiers of one or more terms that appear in the document, the generating the term identifier index according to the inverted index comprising organizing the term identifiers of the ordered pairs in the sections of the term identifier index. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24)
-
Specification