Classifying information captured in different formats for search and display
First Claim
1. A method comprising:
- receiving, by a computing device, a plurality of medical documents, the plurality of medical documents including a first set of medical documents in a first format in an image based format and a second set of medical documents in a second format in a structured data format;
determining, by the computing device, a schema based on a review of an organizing principle to which a common set of data fields can be used to classify information from the first set of medical documents and the second set of medical documents;
determining, by the computing device, via extraction, unstructured information in the first set of medical documents relevant to the common set of data fields, wherein the unstructured information is based on information included in images of the first set of medical documents and in a format that is insertable into a table;
classifying, by the computing device, structured information from a first data field in the second set of medical documents and a portion of the unstructured information in the first set of medical documents to a common data field in the common set of data fields, wherein the portion of unstructured information is included in an image of a medical document in the first set of medical documents;
inserting, by the computing device, the structured information and the unstructured information into the table based on the common set of data fields, wherein the common data field in the table includes the structured information from the first data field and the portion of unstructured information;
storing, by the computing device, the table based on the common set of data fields to allow for searching for information in both the first set of medical documents and the second set of medical documents using a query, wherein the query retrieves the structured information from the first data field and the portion of unstructured information from the common data field in the table;
receiving the query;
determining search results for the query using the set of common data fields associated with the schema, wherein the search results include information in the table from a subset of medical documents from the first set of medical documents and the second set of medical documents that are determined to match the query; and
displaying the search results in an interface, wherein the information from the subset of medical documents is displayed in a common data format.
0 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, a method receives a plurality of documents. The documents may be received from different medical providers. Also, the documents may be medical record documents generated or captured in a first format and a second format. The first format may be an unstructured data format and the second format may be a structured data format. The first and second documents are then converted to a common format. For example, a common format may emerge as the most restrictive or constrained denominator of the first format and the second format. A schema is determined that provides an organizational structure with categories that can be used to index the content of the first and second documents while they are being converted to the common format. The schema and indexing enable the different formats of documents to be combined and organized simultaneously into a single view for a comprehensive review.
-
Citations
16 Claims
-
1. A method comprising:
-
receiving, by a computing device, a plurality of medical documents, the plurality of medical documents including a first set of medical documents in a first format in an image based format and a second set of medical documents in a second format in a structured data format; determining, by the computing device, a schema based on a review of an organizing principle to which a common set of data fields can be used to classify information from the first set of medical documents and the second set of medical documents; determining, by the computing device, via extraction, unstructured information in the first set of medical documents relevant to the common set of data fields, wherein the unstructured information is based on information included in images of the first set of medical documents and in a format that is insertable into a table; classifying, by the computing device, structured information from a first data field in the second set of medical documents and a portion of the unstructured information in the first set of medical documents to a common data field in the common set of data fields, wherein the portion of unstructured information is included in an image of a medical document in the first set of medical documents; inserting, by the computing device, the structured information and the unstructured information into the table based on the common set of data fields, wherein the common data field in the table includes the structured information from the first data field and the portion of unstructured information; storing, by the computing device, the table based on the common set of data fields to allow for searching for information in both the first set of medical documents and the second set of medical documents using a query, wherein the query retrieves the structured information from the first data field and the portion of unstructured information from the common data field in the table; receiving the query; determining search results for the query using the set of common data fields associated with the schema, wherein the search results include information in the table from a subset of medical documents from the first set of medical documents and the second set of medical documents that are determined to match the query; and displaying the search results in an interface, wherein the information from the subset of medical documents is displayed in a common data format. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer-readable storage medium containing instructions, that when executed, control a computer system to be configured for:
-
receiving a plurality of medical documents, the plurality of medical documents including a first set of medical documents in a first format in an image based format and a second set of medical documents in a second format in a structured data format; determining a schema based on a review of an organizing principle to which a common set of data fields can be used to classify information from the first set of medical documents and the second set of medical documents; determining, via extraction, unstructured information in the first set of medical documents relevant to the common set of data fields, wherein the unstructured information is based on information included in images of the first set of medical documents and in a format that is insertable into a table; classifying structured information from a first data field in the second set of medical documents and a portion of the unstructured information in the first set of medical documents to a common data field in the common set of data fields, wherein the portion of unstructured information is included in an image of a medical document in the first set of medical documents; inserting the structured information and the unstructured information into the table based on the common set of data fields, wherein the common data field in the table includes the structured information from the first data field and the portion of unstructured information; storing the table based on the common set of data fields to allow for searching for information in both the first set of medical documents and the second set of medical documents using a query, wherein the query retrieves the structured information from the first data field and the portion of unstructured information from the common data field in the table; receiving the query; determining search results for the query using the set of common data fields associated with the schema, wherein the search results include information in the table from a subset of medical documents from the first set of medical documents and the second set of medical documents that are determined to match the query; and displaying the search results in an interface, wherein the information from the subset of medical documents is displayed in a common data format. - View Dependent Claims (10, 11, 12, 13, 14, 15)
-
-
16. An apparatus comprising:
-
one or more computer processors; and a non-transitory computer-readable storage medium comprising instructions, that when executed, control the one or more computer processors to be configured for; receiving a plurality of medical documents, the plurality of medical documents including a first set of medical documents in a first format in an image based format and a second set of medical documents in a second format in a structured data format; determining a schema based on a review of an organizing principle to which a common set of data fields can be used to classify information from the first set of medical documents and the second set of medical documents; determining, via extraction, unstructured information in the first set of medical documents relevant to the common set of data fields, wherein the unstructured information is based on information included in images of the first set of medical documents and in a format that is insertable into a table; classifying structured information from a first data field in the second set of medical documents and a portion of the unstructured information in the first set of medical documents to a common data field in the common set of data fields, wherein the portion of unstructured information is included in an image of a medical document in the first set of medical documents; inserting the structured information and the unstructured information into the table based on the common set of data fields, wherein the common data field in the table includes the structured information from the first data field and the portion of unstructured information; storing the table based on the common set of data fields to allow for searching for information in both the first set of medical documents and the second set of medical documents using a query, wherein the query retrieves the structured information from the first data field and the portion of unstructured information from the common data field in the table; receiving the query; determining search results for the query using the set of common data fields associated with the schema, wherein the search results include information in the table from a subset of medical documents from the first set of medical documents and the second set of medical documents that are determined to match the query; and displaying the search results in an interface, wherein the information from the subset of medical documents is displayed in a common data format.
-
Specification