Information retrieval using enhanced document vectors
First Claim
Patent Images
1. A method comprising:
- generating a plurality of document vectors for a corresponding plurality of documents, said document vectors including text components and non-text components; and
performing an information retrieval operation using the generated document vectors.
1 Assignment
0 Petitions
Accused Products
Abstract
An information retrieval system includes an enhanced document vector module to generate enhanced document vectors representative of documents in a collection. The enhanced document vectors include text- and non-text components. The non-text components may include the location, in-links, and/or out-links in hypertext documents and attributes of the documents, e.g., size, create-date, and response-time. A processor uses the enhanced document vectors to perform an information retrieval operation, such as a clustering or classification operation.
-
Citations
36 Claims
-
1. A method comprising:
-
generating a plurality of document vectors for a corresponding plurality of documents, said document vectors including text components and non-text components; and
performing an information retrieval operation using the generated document vectors. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. Apparatus comprising:
a processor operative to generate a plurality of enhanced document vectors representative of a plurality of documents, at least one of the enhanced document vectors in said plurality including text components and non-text components. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 36)
-
24. A system comprising:
-
a source of a first plurality of documents, documents in said first plurality including text components and non-text components;
an input device operative to receive a user query;
a search engine operative to retrieve a second plurality of documents from the first plurality of documents in response to the user query;
an enhanced document vector module operative to generate a plurality of enhanced document vectors representative of documents in the second plurality of documents, said enhanced document vectors including text components and non-text components; and
a processor operative to perform an information retrieval operation using said enhanced document vectors.
-
-
35. An article comprising a machine-readable medium including machine-executable instructions operative to cause a machine to:
-
generate a plurality of enhanced document vectors for a corresponding plurality of documents, said enhanced document vectors including text components and non-text components; and
perform an information retrieval operation using said enhanced document vectors.
-
Specification