Method and apparatus for generating metadata for a document
First Claim
1. A computer-implemented method of processing a document, said method comprising:
- converting a document into a common format document;
recognizing a concept in said common format document, wherein said concept represents a basic idea expressed in said common format document; and
incorporating said concept in a conceptual model.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and system of generating metadata for a document so that the document may be identified by a subsequent search. A conceptual model is generated for the document, wherein the conceptual model indicates one or more concepts that are recognized in the document. A concept is defined by a plurality of features, each feature being associated with a feature weight. By referencing the conceptual model, one or more auto-attributes may be assigned to the document. Also, by referencing the conceptual model, the document may be categorized to one or more categories of a categorization taxonomy by assigning one or more auto-categories. The generated metadata, including the conceptual model, the one or more auto-attributes, and the one or more auto-categories, may be stored in a memory so that the subsequent search may identify the document by examining the generated metadata.
178 Citations
20 Claims
-
1. A computer-implemented method of processing a document, said method comprising:
-
converting a document into a common format document;
recognizing a concept in said common format document, wherein said concept represents a basic idea expressed in said common format document; and
incorporating said concept in a conceptual model. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A computer-readable medium to direct a computer to function in a specified manner, comprising:
-
instructions to recognize a basic idea expressed in a document;
instructions to assign a concept identification to said basic idea; and
instructions to generate a conceptual model based upon said concept identification. - View Dependent Claims (8, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 20)
-
-
14. A computer, comprising:
-
a processor; and
a memory connected to said processor, wherein said memory includes;
a document modeling module, said document modeling module having;
a first module configured to direct said processor to recognize a concept in a document, wherein said concept represents a basic idea expressed in said document; and
a second module configured to direct said processor to generate a conceptual model based upon said concept.
-
Specification