Classification and storage of documents
First Claim
1. A method comprising:
- defining a plurality of known document types that are each associated with feature information that is determined for a respective known document type by extracting features from manually classified documents that were each identified as corresponding to the respective known document type by a manual classification input;
monitoring an email account associated with a user to detect that an email message has been received at the email account associated with the user, wherein a subject document is included as an attachment to the email message; and
processing the subject document without action by the user, in response to detecting that the email message has been received, including the steps of;
extracting one or more features from the subject document,comparing the one or more features from the subject document to the feature information,determining a document type of the subject document from the plurality of known document types based on the comparison of the one or more features from the subject document to the feature information,selecting a dedicated storage location at a cloud storage system from a plurality of dedicated storage locations associated with the user based on the respective known document type that is associated with the subject document, wherein each of the plurality of dedicated storage locations corresponds to a respective folder of a hierarchical storage structure of the cloud storage system, and wherein the selected dedicated storage location is associated with the user and contains only documents of the respective known document type, andtransmitting the subject document to a cloud storage system for storage in the selected dedicated storage location.
2 Assignments
0 Petitions
Accused Products
Abstract
A method includes defining a plurality of known document types, obtaining a collection of previously classified documents that are each associated with one of the known document types, and extracting features from each document from the collection of previously classified documents to define feature information. The method also includes obtaining a subject document that is associated with a user, extracting one or more features from the subject document, comparing the one or more features from the subject document to the feature information, associating the subject document with one of the known document types based on the comparison, and transmitting the document to a cloud storage system for storage in a dedicated storage location that is associated with the user and contains only documents of the respective known document type that is associated with the subject document.
153 Citations
16 Claims
-
1. A method comprising:
-
defining a plurality of known document types that are each associated with feature information that is determined for a respective known document type by extracting features from manually classified documents that were each identified as corresponding to the respective known document type by a manual classification input; monitoring an email account associated with a user to detect that an email message has been received at the email account associated with the user, wherein a subject document is included as an attachment to the email message; and processing the subject document without action by the user, in response to detecting that the email message has been received, including the steps of; extracting one or more features from the subject document, comparing the one or more features from the subject document to the feature information, determining a document type of the subject document from the plurality of known document types based on the comparison of the one or more features from the subject document to the feature information, selecting a dedicated storage location at a cloud storage system from a plurality of dedicated storage locations associated with the user based on the respective known document type that is associated with the subject document, wherein each of the plurality of dedicated storage locations corresponds to a respective folder of a hierarchical storage structure of the cloud storage system, and wherein the selected dedicated storage location is associated with the user and contains only documents of the respective known document type, and transmitting the subject document to a cloud storage system for storage in the selected dedicated storage location. - View Dependent Claims (2, 3)
-
-
4. An apparatus comprising:
-
a memory; and a processor configured to execute instructions stored in the memory to; define a plurality of known document types that are each associated with feature information that is determined for a respective known document type by extracting features from manually classified documents that were each identified as corresponding to the respective known document type by a manual classification input, monitor an email account associated with a user to detect that an email message has been received at the email account associated with the user, wherein a subject document is included as an attachment to the email message, and process the subject document without action by the user, in response to detecting that the email message has been received, by executing further instruction stored in the memory to; extract one or more features from the subject document, compare the one or more features from the subject document to the feature information, determine a document type of the subject document from the plurality of known document types based on the comparison of the one or more features from the subject document to the feature information, select a dedicated storage location at a cloud storage system from a plurality of dedicated storage locations associated with the user based on the respective known document type that is associated with the subject document, wherein each of the plurality of dedicated storage locations corresponds to a respective folder of a hierarchical storage structure of the cloud storage system, and wherein the selected dedicated storage location is associated with the user and contains only documents of the respective known document type, and transmit the subject document to a cloud storage system for storage in the selected dedicated storage location. - View Dependent Claims (5, 6)
-
-
7. A method comprising:
-
obtaining a document that is associated with a user; extracting one or more features from the document, wherein the one or more features are alphanumeric text-based features that include at least one of text size, text font, text positioning, or substantive text content; comparing the one or more features from the document to receipt feature information describing features extracted from a collection of sales receipts; determining that a document of type of the document is a sales receipt from a plurality of known document types based on the comparison of one or more features from the document to the receipt feature information; selecting a dedicated storage location at a cloud storage system from a plurality of dedicated storage locations associated with the user based on the determination that the document is a sales receipt, wherein each of the plurality of dedicated storage locations corresponds to a respective folder of a hierarchical storage structure of the cloud storage system, and wherein the selected dedicated storage location is associated with the user and contains only sales receipt documents; and transmitting the document to a cloud storage system for storage in the selected dedicated storage location. - View Dependent Claims (8, 9, 10, 11)
-
-
12. An apparatus comprising:
-
a memory; and a processor configured to execute instructions stored in the memory to; obtain a document that is associated with a user, extract one or more features from the document, wherein the one or more features are alphanumeric text-based features that include at least one of text size, text font, text positioning, or substantive text content, compare the one or more features from the document to receipt feature information describing features extracted from a collection of sales receipts, determine that a document type of the document is a sales receipt from a plurality of known document types based on the comparison of one or more features from the document to the receipt feature information, select a dedicated storage location at a cloud storage system from a plurality of dedicated storage locations associated with the user based on the determination that the document is a sales receipt, wherein each of the plurality of dedicated storage locations corresponds to a respective folder of a hierarchical storage structure of the cloud storage system, and wherein the selected dedicated storage location is associated with the user and contains only sales receipt documents, and transmit the document to a cloud storage system for storage in the selected dedicated storage location that is associated with the user. - View Dependent Claims (13, 14, 15, 16)
-
Specification