Categorized document bases
Abstract
A method of managing information comprises generating a categorized document base. Generating the document base comprises providing a pre-existing classification of things other than documents, providing a source collection of documents, and automatically assessing the documents using Information Retrieval techniques to assign at least some of the documents to one or more taxa of the classification. For each taxon in the classification one or more numerical scores are assigned, based at least in part on a composition, makeup or constitution of the documents assigned to the taxon of the categorized document base.
36 Claims
1. A method of managing information comprising generating a categorized document base, comprising:
providing a source collection of documents;
automatically assessing the documents using Information Retrieval (IR) techniques to assign at least some of the documents to one or more first categories; and
assigning for each first category one or more numerical scores based at least in part on a composition, makeup or constitution of the documents assigned to the category.
Dependent claims: 2 to 14
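The steps of claim 1 can be pictured with a minimal Python sketch. Everything in it is an illustrative assumption rather than the claimed method itself: the claim does not name a particular IR technique or score, so simple term matching stands in for "Information Retrieval techniques", and the two per-category scores (document count and mean term hits) are hypothetical examples of scores derived from the makeup of the assigned documents. The names `CATEGORIES`, `DOCS`, `assign_categories` and `score_categories` are invented for the example.

```python
import re
from collections import Counter

# Hypothetical first categories, each described by a few query terms.
CATEGORIES = {
    "batteries": {"battery", "lithium", "anode"},
    "solar": {"solar", "photovoltaic", "panel"},
}

# Hypothetical source collection of documents.
DOCS = {
    "d1": "A lithium battery with a silicon anode.",
    "d2": "Solar panel mounted above a battery store.",
    "d3": "A method of brewing tea.",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def assign_categories(doc_text, categories, min_hits=1):
    """Return {category: term_hits} for each category the document matches.

    Term matching is a stand-in for whatever IR technique is actually used.
    """
    counts = Counter(tokenize(doc_text))
    hits = {}
    for name, terms in categories.items():
        n = sum(counts[t] for t in terms)
        if n >= min_hits:
            hits[name] = n
    return hits


def score_categories(docs, categories):
    """Assign documents to categories, then score each category.

    The scores depend only on the documents assigned to the category:
    how many there are and how strongly, on average, they matched.
    """
    assigned = {name: [] for name in categories}
    for doc_id, text in docs.items():
        for name, n in assign_categories(text, categories).items():
            assigned[name].append((doc_id, n))

    scores = {}
    for name, members in assigned.items():
        count = len(members)
        mean_hits = sum(n for _, n in members) / count if count else 0.0
        scores[name] = {"doc_count": count, "mean_hits": mean_hits}
    return assigned, scores


assigned, scores = score_categories(DOCS, CATEGORIES)
```

In this sketch a document may fall into several categories (`d2`) or none (`d3`), which mirrors the claim wording "at least some of the documents" and "one or more first categories".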
15. A method for analyzing documents, comprising:
providing at least first and second sets of categories;
providing a source collection of documents, at least some of the documents being assigned to one or more categories of each set of categories; and
generating at least one of an array of documents and an array of data relating to documents, wherein the categories provide axes of the array.
Dependent claims: 16 to 28
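The array of claim 15 can be sketched in Python under stated assumptions. The category sets (`ROWS`, `COLS`), the document assignments (`ASSIGNMENTS`) and the function `build_arrays` are hypothetical; the claim does not prescribe what the two sets of categories are or what data the cells hold. Here one set of categories supplies the row axis, the other supplies the column axis, and two arrays are produced: one holding the documents in each cell and one holding data relating to them (a simple count).

```python
# Hypothetical first and second sets of categories, used as the two axes.
ROWS = ["batteries", "solar"]   # first set of categories
COLS = ["US", "EP"]             # second set of categories

# Hypothetical assignments: document id -> (first-set categories, second-set categories).
ASSIGNMENTS = {
    "d1": ({"batteries"}, {"US"}),
    "d2": ({"batteries", "solar"}, {"EP"}),
    "d3": ({"solar"}, {"US", "EP"}),
}


def build_arrays(assignments, rows, cols):
    """Build a document array and a count array with categories as axes.

    Cell [i][j] of the document array lists every document assigned to both
    rows[i] and cols[j]; the count array holds the size of each cell.
    """
    doc_array = [[[] for _ in cols] for _ in rows]
    for doc_id in sorted(assignments):
        first_set, second_set = assignments[doc_id]
        for i, row_cat in enumerate(rows):
            for j, col_cat in enumerate(cols):
                if row_cat in first_set and col_cat in second_set:
                    doc_array[i][j].append(doc_id)
    count_array = [[len(cell) for cell in row] for row in doc_array]
    return doc_array, count_array


doc_array, count_array = build_arrays(ASSIGNMENTS, ROWS, COLS)
```

A document assigned to several categories on either axis appears in several cells, so the cell counts need not sum to the number of documents.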
29. A system for managing information, arranged in operation to:
receive a source collection of documents;
automatically assess the documents using Information Retrieval (IR) techniques to assign at least some of the documents to one or more first categories; and
assign for each first category one or more numerical scores based at least in part on a composition, makeup or constitution of the documents assigned to the category.
Dependent claim: 30
31. A system for managing information, arranged in operation to:
receive at least first and second sets of categories;
receive a source collection of documents, at least some of the documents being assigned to one or more categories of each set of categories; and
generate at least one of an array of documents and an array of data relating to documents, wherein the categories provide axes of the array.
Dependent claim: 32
33. A software program which, when running on a computing system is arranged to cause the computing system to:
receive a source collection of documents;
automatically assess the documents using Information Retrieval (IR) techniques to assign at least some of the documents to one or more first categories; and
assign for each first category one or more numerical scores based at least in part on a composition, makeup or constitution of the documents assigned to the category.
Dependent claim: 34
35. A software program which, when running on a computing system is arranged to cause the computing system to:
receive at least first and second sets of categories;
receive a source collection of documents;
automatically assess the documents using Information Retrieval (IR) techniques to assign at least some of the documents to one or more categories of each set of categories; and
generate at least one of an array of documents and an array of data relating to documents, wherein the categories provide axes of the array.
Dependent claim: 36
Specification