Classifying information captured in different formats for search and display in an image-based format
First Claim
1. A method comprising:
- receiving a plurality of medical documents, the plurality of medical documents including a first set of medical documents in a first format and a second set of medical documents in a second format, wherein the first format is an image-based format and the second format is in a structured data format, and wherein the structured data format is in a non-image text data organized by fields and respective values;
determining a schema, the schema including a plurality of categories determined based on an expected structural content in the first set of medical documents or second set of medical documents;
determining indices from the plurality of categories to be used for tagging the first set of medical documents and the second set of medical documents when converted into the image-based format based on a review of an organizing principle to which a common set of descriptors can be identified to tag and organize images of the first set of medical documents and the second set of medical documents;
converting the non-image text data of the second set of medical documents into images in the image-based format based on the determined indices, wherein the second format is removed in the conversion and content included in each image is determined based on the determined indices applying to the fields and respective values;
indexing images of the first medical documents and the images of the second medical documents in the image-based format by associating the determined indices based on the schema with content determined from the first and second medical documents; and
storing the images of the first set of medical documents and the images of the second set of medical documents in the image-based format with the indices to allow searching of both the first set of medical documents and the second set of medical documents together based on a search query.
0 Assignments
0 Petitions
Accused Products
Abstract
In one embodiment, a method receives a plurality of documents. The documents may be received from different medical providers. Also, the documents may be medical record documents generated or captured in a first format and a second format. The first format may be an unstructured data format and the second format may be a structured data format. The first and second documents are then converted to a common format. For example, a common format may emerge as the most restrictive or constrained denominator of the first format and the second format. A schema is determined that provides an organizational structure with categories that can be used to index the content of the first and second documents while they are being converted to the common format. The schema and indexing enable the different formats of documents to be combined and organized simultaneously into a single view for a comprehensive review.
-
Citations
18 Claims
-
1. A method comprising:
-
receiving a plurality of medical documents, the plurality of medical documents including a first set of medical documents in a first format and a second set of medical documents in a second format, wherein the first format is an image-based format and the second format is in a structured data format, and wherein the structured data format is in a non-image text data organized by fields and respective values; determining a schema, the schema including a plurality of categories determined based on an expected structural content in the first set of medical documents or second set of medical documents; determining indices from the plurality of categories to be used for tagging the first set of medical documents and the second set of medical documents when converted into the image-based format based on a review of an organizing principle to which a common set of descriptors can be identified to tag and organize images of the first set of medical documents and the second set of medical documents; converting the non-image text data of the second set of medical documents into images in the image-based format based on the determined indices, wherein the second format is removed in the conversion and content included in each image is determined based on the determined indices applying to the fields and respective values; indexing images of the first medical documents and the images of the second medical documents in the image-based format by associating the determined indices based on the schema with content determined from the first and second medical documents; and storing the images of the first set of medical documents and the images of the second set of medical documents in the image-based format with the indices to allow searching of both the first set of medical documents and the second set of medical documents together based on a search query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A non-transitory computer-readable storage medium containing instructions, that when executed, control a computer system to be configured for:
-
receiving a plurality of medical documents, the plurality of medical documents including a first set of medical documents in a first format and a second set of medical documents in a second format, wherein the first format is an image-based format and the second format is in a structured data format, and wherein the structured data format is in a non-image text data organized by fields and respective values; determining a schema, the schema including a plurality of categories determined based on an expected structural content in the first set of medical documents or second set of medical documents; determining indices from the plurality of categories to be used for tagging the first set of medical documents and the second set of medical documents when converted into the image-based format based on a review of an organizing principle to which a common set of descriptors can be identified to tag and organize images of the first set of medical documents and the second set of medical documents; converting the non-image text data of the second set of medical documents into images in the image-based format based on the determined indices, wherein the second format is removed in the conversion and content included in each image is determined based on the determined indices applying to the fields and respective values; indexing images of the first medical documents and the images of the second medical documents in the image-based format by associating the determined indices based on the schema with content determined from the first and second medical documents; and storing the images of the first set of medical documents and the images of the second set of medical documents in the image-based format with the indices to allow searching of both the first set of medical documents and the second set of medical documents together based on a search query. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
receiving a plurality of medical documents from multiple medical providers using disparate formats, wherein the plurality of medical documents include first medical documents of images in an image-based format and second medical documents in a structured data format, and wherein the structured data format is non-image text data organized by fields and respective values; determining a schema, the schema including a plurality of categories determined based on an expected structural content in the first medical documents or second medical documents; determining indices from the plurality of categories to be used for tagging the first medical documents and the second medical documents when converted into the image-based format based on a review of an organizing principle to which a common set of descriptors can be identified to tag and organize images of the first medical documents and the second medical documents; converting the text data of the second medical documents having the structured data format into images for the second medical documents in the image-based format based on the determined indices, wherein the structured data format is removed in the conversion and the content included in each image is determined based on the determined indices applying to the fields and respective values; indexing the images of the first medical documents and the images of the second medical documents in the image-based format by associating the determined indices based on the schema with the first and second medical documents; storing the images of first medical documents and the second medical documents in the image-based format with the indices to allow searching of both the first medical documents and the second medical documents together based on a search query; and storing the second medical documents with an active link to view specific data in the structured data format provided by the second medical documents for enabling access to the structured data. - View Dependent Claims (16, 17, 18)
-
Specification