Method, system, and apparatus for validation
Abstract
In a method for validating data, a text of a document is received. At least one fact is extracted from the text. At least one expert refinement is merged with the at least one fact to create at least one modified fact. The at least one modified fact is provided for a review. An expert refinement to the at least one modified fact is captured in response to the review. A superset document based on the at least one pre-existing refinement and the expert refinement is stored.
31 Claims
1. A method for associating documents with searchable metadata, the method comprising:
receiving as input at least one text document; and
operating at least one programmed processor to perform acts of
creating metadata to be associated with the at least one text document, the metadata comprising at least one text keyword, the creating comprising
extracting a set of one or more data elements from text of the at least one text document, the set of one or more data elements comprising at least one keyword that appears in the text of the at least one text document;
normalizing said set of data elements to create a set of normalized data elements, wherein the normalizing comprises, for a first keyword of the at least one keyword, determining at least one other keyword similar to the first keyword, the at least one other keyword not being a keyword appearing in the text of the at least one text document, and adding the at least one other keyword to the set of normalized data elements;
identifying at least one previously-validated keyword that is associated as metadata with at least one previously-stored text document, the at least one previously-stored text document not being one of the at least one text document, the at least one previously-validated keyword not being in the set of normalized data elements;
merging said set of normalized data elements with the at least one previously-validated keyword to form a preliminary set of data elements;
presenting said preliminary set of data elements for review by a user; and
receiving user input validating a validated set of data elements; and
in response to the user input validating the validated set of data elements, storing the at least one text document and storing the validated set of data elements as the metadata, the metadata being associated with the at least one text document such that the at least one text document may be located through a search for any data element included in the validated set of data elements.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20)
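The steps recited in claim 1 can be read as a small pipeline. The following Python sketch is only an illustration under stated assumptions, not the patented implementation: the `SIMILAR` table, the `STOPWORDS` set, the `Store` class, and the `review` callback (standing in for the user's validation step) are all hypothetical names invented here, and the claim does not say how similar keywords are determined.

```python
import re
from dataclasses import dataclass, field

# Hypothetical lookup tables, invented for this sketch only.
SIMILAR = {"mi": ["myocardial infarction"], "htn": ["hypertension"]}
STOPWORDS = {"the", "a", "and", "of", "with", "for", "was", "is"}


@dataclass
class Store:
    """In-memory stand-in for the document and metadata storage."""
    docs: dict = field(default_factory=dict)      # doc_id -> text
    metadata: dict = field(default_factory=dict)  # doc_id -> validated keywords

    def search(self, keyword):
        # A document is located by any keyword in its validated metadata.
        return sorted(d for d, kws in self.metadata.items() if keyword in kws)


def extract(text):
    """Extract keywords that appear in the text (naive tokenizer)."""
    return {w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS}


def normalize(keywords, text):
    """Add similar keywords that do not themselves appear in the text."""
    normalized = set(keywords)
    lowered = text.lower()
    for kw in keywords:
        for other in SIMILAR.get(kw, []):
            if other not in lowered:
                normalized.add(other)
    return normalized


def previously_validated(store, normalized, exclude_id):
    """Collect validated keywords of other stored documents, minus known ones."""
    found = set()
    for doc_id, kws in store.metadata.items():
        if doc_id != exclude_id:
            found |= kws
    return found - normalized


def ingest(store, doc_id, text, review):
    """Run extract, normalize, merge, review, then store the result."""
    normalized = normalize(extract(text), text)
    preliminary = normalized | previously_validated(store, normalized, doc_id)
    validated = set(review(preliminary))  # user validates the presented set
    store.docs[doc_id] = text
    store.metadata[doc_id] = validated
    return validated


store = Store()
store.docs["old"] = "Prior cardiology note"
store.metadata["old"] = {"cardiology"}
validated = ingest(store, "new", "Patient with MI",
                   review=lambda prelim: prelim - {"patient"})
```

In this sketch the reviewer drops `patient` and accepts the rest, so the new document becomes searchable both by the added similar keyword and by the keyword inherited from the previously stored document.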
21. A computer-usable medium having computer-readable instructions stored thereon for execution by a processor, wherein the instructions, when executed by the processor, cause the processor to perform a method for associating documents with searchable metadata, the method comprising:
receiving as input at least one text document;
creating metadata to be associated with the at least one text document, the metadata comprising at least one text keyword, the creating comprising:
extracting a set of one or more data elements from text of the at least one text document, the set of one or more data elements comprising at least one keyword that appears in the text of the at least one text document;
normalizing said set of data elements to create a set of normalized data elements, wherein the normalizing comprises, for a first keyword of the at least one keyword, determining at least one other keyword similar to the first keyword, the at least one other keyword not appearing in the text of the at least one document, and adding the at least one other keyword to the set of normalized data elements;
identifying at least one previously-stored document by examining a set of one or more previously-stored documents to identify documents related to the at least one text document;
merging said set of normalized data elements with at least one previously-validated keyword that is associated as metadata with the at least one previously-stored document to form a preliminary set of data elements, the at least one previously-validated not being in the set of normalized data elements;
presenting said preliminary set of data elements for review by a user; and
receiving user input validating a validated set of data elements; and
in response to the user input validating the validated set of data elements, storing the at least one text document and storing the validated set of data elements as the metadata, the metadata being associated with the at least one text document such that the at least one text document may be located through a search for any data element included in the validated set of data elements.
- View Dependent Claims (22, 23, 24, 25)
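Claim 21 differs from claim 1 mainly in that the previously-stored documents are first examined to find those related to the incoming document. The claim does not state how relatedness is measured, so the sketch below assumes a Jaccard overlap between keyword sets with an arbitrary threshold. The function names, the threshold value, and the sample data are hypothetical and chosen only for illustration.

```python
def jaccard(a, b):
    """Jaccard similarity of two keyword sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def related_documents(normalized, stored_metadata, threshold=0.25):
    """Return ids of stored documents whose metadata overlaps enough.

    The overlap measure and threshold are assumptions of this sketch.
    """
    return sorted(
        doc_id
        for doc_id, kws in stored_metadata.items()
        if jaccard(normalized, kws) >= threshold
    )


def merge_with_related(normalized, stored_metadata, threshold=0.25):
    """Form the preliminary set from normalized keywords plus related metadata."""
    preliminary = set(normalized)
    for doc_id in related_documents(normalized, stored_metadata, threshold):
        preliminary |= stored_metadata[doc_id]
    return preliminary


normalized = {"mi", "myocardial infarction", "chest"}
stored_metadata = {
    "a": {"mi", "cardiology"},
    "b": {"fracture", "orthopedics"},
}
related = related_documents(normalized, stored_metadata)
preliminary = merge_with_related(normalized, stored_metadata)
```

Here only document `a` shares a keyword with the incoming set, so its validated keyword `cardiology` joins the preliminary set shown to the user, while the unrelated document `b` contributes nothing.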
26. An apparatus for associating documents with searchable metadata, the apparatus comprising:
at least one processor programmed to:
receive as input a text document;
create metadata to be associated with the at least one text document, the metadata comprising at least one text keyword, the at least one processor being programmed to create at least in part by:
extracting a set of one or more data elements from text of the text document, the set of one or more data elements comprising at least one keyword that appears in the text of the text document;
normalizing said set of data elements to create a set of normalized data elements, wherein the normalizing comprises, for a first keyword of the at least one keyword, determining at least one other keyword similar to the first keyword, the at least one other keyword not appearing in the text of the text document, and adding the at least one other keyword to the set of normalized data elements;
identifying at least one previously-validated keyword that is associated as metadata with at least one previously-stored text document, the at least one previously-validated keyword not being in the set of normalized data elements;
merging said set of normalized data elements with the at least one previously-validated keyword to form a preliminary set of data elements for the text document;
presenting said preliminary set of data elements for review by a user; and
following presenting said preliminary set of data elements to the user and in response to user input validating a validated set of data elements store the text document and store the validated set of data elements as metadata associated with the text document such that the at least one text document may be located through a search for any data element included in the validated set of data elements.
- View Dependent Claims (27, 28, 29, 30, 31)
Specification