Document analysis and retrieval
First Claim
1. A method for document analysis and retrieval, comprising the steps of:
- receiving a document having text therein from a host of a first computing system;
generating document keys associated with said text from analysis of said text, each said document key selected from the group consisting of a keyword of said text and a keyphrase of said text;
providing a document taxonomy having categories, each category having category keys, each said category key selected from the group consisting of a keyword of said category and a keyphrase of said category;
comparing the category keys of each category with said document keys to make a determination of a distance between the document and each category as a measure of how close the document is to each category; and
returning a subset of said categories to said host, wherein said subset of said categories reflects said determination.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system for document analysis and retrieval. A document that includes text is received from a host. Document keys (i.e., keywords and keyphrases) associated with the text are generated. In first embodiments, a document taxonomy is provided. The taxonomy has categories and associated category keys (i.e., keywords and keyphrases). The category keys of each category are compared with the document keys to determine a distance between the document and each category as a measure of how close the document is to each category. A subset of the categories is returned to the host, wherein the subset of the categories reflects the determined distances. In second embodiments, a search string is created as a logical function of a subset of the document keys. The search string is submitted to a search engine. Links to related documents are received from the search engine and returned to the host.
27 Citations
21 Claims
-
1. A method for document analysis and retrieval, comprising the steps of:
-
receiving a document having text therein from a host of a first computing system;
generating document keys associated with said text from analysis of said text, each said document key selected from the group consisting of a keyword of said text and a keyphrase of said text;
providing a document taxonomy having categories, each category having category keys, each said category key selected from the group consisting of a keyword of said category and a keyphrase of said category;
comparing the category keys of each category with said document keys to make a determination of a distance between the document and each category as a measure of how close the document is to each category; and
returning a subset of said categories to said host, wherein said subset of said categories reflects said determination. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method for document analysis and retrieval, comprising the steps of:
-
receiving a document having text therein from a host of a first computing system;
generating document keys associated with said text from analysis of said text, each said document key selected from the group consisting of a keyword of said text and a keyphrase of said text;
creating a search string, said search string comprising a logical function of a subset of said document keys;
submitting said search string to a search engine;
receiving links to related documents from said search engine, said links being based on said search string; and
returning said links to said host. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A computer program product, comprising a computer usable medium having a computer readable program code embodied therein for document analysis and retrieval, wherein the computer readable program code comprises an algorithm adapted to:
-
receive a document having text therein from a host of a first computing system;
generate document keys associated with said text from analysis of said text, each said document key selected from the group consisting of a keyword of said text and a keyphrase of said text;
provide a document taxonomy having categories, each category having category keys, each said category key selected from the group consisting of a keyword of said category and a keyphrase of said category;
compare the category keys of each category with said document keys to make a determination of a distance between the document and each category as a measure of how close the document is to each category; and
return a subset of said categories to said host, wherein said subset of said categories reflects said determination.
-
-
21. A computer program product, comprising a computer usable medium having a computer readable program code embodied therein for document analysis and retrieval, wherein the computer readable program code comprises an algorithm adapted to:
-
receive a document having text therein from a host of a first computing system;
generate document keys associated with said text from analysis of said text, each said document key selected from the group consisting of a keyword of said text and a keyphrase of said text;
create a search string, said search string comprising a logical function of a subset of said document keys;
submit said search string to a search engine;
receive links to related documents from said search engine, said links being based on said search string; and
return said links to said host.
-
Specification