System and method for classifying content with a web service
First Claim
1. A method, comprising:
receiving, at a classification process, a classification request from a client associated with an internal network of an organization to classify an archived dataset, the request comprising:
a content identifier indicative of a remote data repository provided for the organization by a cloud service provider and corresponding to the archived dataset; and
authorization credentials;
determining whether the remote data repository allows access from the classification process;
in response to receiving the classification request, sending, by a server (1) external to the internal network of the organization, (2) external to the cloud service provider, and (3) executing the classification process, a data request to the remote data repository, wherein if the remote data repository does not allow access from the classification process, including in the data request an authorization object that conforms to a security scheme of the organization based on the authorization credentials, and wherein the server is configured to provide document classification services to a plurality of organizations having documents that are stored at a plurality of cloud service providers and that are secured by different security schemes;
receiving, by the server, the archived dataset from the remote data repository in response to the data request;
in response to receiving the archived dataset, extracting a plurality of documents from the archived dataset;
classifying the plurality of documents using a classification engine, the classification engine being operable to, for each document in the plurality of documents:
determine whether the document is suitable for dissemination outside an organization based on whether the document contains sensitive information regarding the organization; and
apply a corresponding classification to the document based on whether the document is suitable for dissemination outside the organization; and
transmitting a response from the server to the client, the response comprising a list of each of the extracted plurality of documents and each corresponding classification.
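The steps recited in claim 1 can be sketched end to end as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the archived dataset is assumed to be a ZIP file, the remote data repository is replaced by an in-memory stand-in, and every name here (`ClassificationRequest`, `InMemoryRepository`, `handle_request`, `build_archive`) is hypothetical. The authorization-object builder and the classifier are passed in as plain callables because the claim does not fix their form.

```python
import io
import zipfile
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ClassificationRequest:
    content_identifier: str                      # hypothetical locator of the archived dataset
    authorization_credentials: Optional[dict] = None


class InMemoryRepository:
    """Stand-in for a remote data repository (assumption: no real network calls)."""

    def __init__(self, archives: dict, open_access: bool, expected_token: str = ""):
        self.archives = archives
        self.open_access = open_access
        self.expected_token = expected_token

    def allows_access(self) -> bool:
        return self.open_access

    def fetch(self, content_identifier: str, authorization_object: Optional[str] = None) -> bytes:
        if not self.open_access and authorization_object != self.expected_token:
            raise PermissionError("repository rejected the data request")
        return self.archives[content_identifier]


def build_archive(files: dict) -> bytes:
    """Helper for the example: pack name -> text pairs into ZIP bytes."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for name, text in files.items():
            archive.writestr(name, text)
    return buffer.getvalue()


def extract_documents(archive_bytes: bytes) -> dict:
    """Extract every file in a ZIP archive as name -> decoded text."""
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as archive:
        return {
            name: archive.read(name).decode("utf-8", errors="replace")
            for name in archive.namelist()
            if not name.endswith("/")            # skip directory entries
        }


def handle_request(
    request: ClassificationRequest,
    repository: InMemoryRepository,
    make_authorization_object: Callable[[dict], str],
    classify: Callable[[str], str],
) -> list:
    # Include an authorization object only when the repository does not allow access.
    authorization_object = None
    if not repository.allows_access():
        authorization_object = make_authorization_object(request.authorization_credentials)
    archive_bytes = repository.fetch(request.content_identifier, authorization_object)
    documents = extract_documents(archive_bytes)
    # The response lists each extracted document with its classification.
    return [
        {"document": name, "classification": classify(text)}
        for name, text in documents.items()
    ]
```

In a real deployment the repository call would be a network request to the cloud service provider; the stand-in only makes the control flow (access check, conditional authorization object, extraction, per-document classification, response list) easy to follow.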
Abstract
A method includes receiving a classification request from a client. The request includes a content identifier indicative of a data repository. The request also includes authorization credentials. The method further includes, in response to receiving the classification request, sending a data request to the data repository. The data request includes an authorization object based on the authorization credentials. The method also includes receiving a plurality of documents from the data repository in response to the data request. The method additionally includes classifying the plurality of documents using a classification engine. The classification engine is operable to determine respective classifications for the plurality of documents. The method still further includes transmitting a response to the client. The response includes the classifications.
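The abstract does not say what form the authorization object takes. As one possible reading, the sketch below derives an HTTP `Authorization` header from the supplied credentials, using either the standard Basic scheme (base64 of `username:password`) or a Bearer token. The scheme names, the credential keys, and the function names are assumptions made for illustration.

```python
import base64


def basic_authorization_header(username: str, password: str) -> dict:
    """Build an HTTP Basic header: base64 of 'username:password'."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    return {"Authorization": f"Basic {token}"}


def bearer_authorization_header(access_token: str) -> dict:
    """Build an HTTP Bearer header from an already-issued token."""
    return {"Authorization": f"Bearer {access_token}"}


# Hypothetical registry: one builder per security scheme the service supports.
SCHEME_BUILDERS = {
    "basic": lambda c: basic_authorization_header(c["username"], c["password"]),
    "bearer": lambda c: bearer_authorization_header(c["access_token"]),
}


def build_authorization_object(scheme: str, credentials: dict) -> dict:
    """Turn client-supplied credentials into a header for the given scheme."""
    try:
        builder = SCHEME_BUILDERS[scheme]
    except KeyError:
        raise ValueError(f"unsupported security scheme: {scheme}") from None
    return builder(credentials)
```

A registry keyed by scheme is one simple way for a single service to handle repositories that are secured differently: adding a scheme means adding one entry rather than changing the request-handling code.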
27 Citations
20 Claims
1. A method, as recited in full under First Claim above.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A computer configured to access a storage device, the computer comprising:
a processor; and
a non-transitory, computer-readable storage medium storing computer-readable instructions that when executed by the processor cause the computer to perform:
receiving, at a classification process, a classification request from a client associated with an internal network of an organization to classify an archived dataset, the request comprising:
a content identifier indicative of a remote data repository provided for the organization by a cloud service provider and corresponding to the archived dataset; and
authorization credentials;
determining whether the remote data repository allows access from the classification process;
in response to receiving the classification request, sending, by a server (1) external to the internal network of the organization, (2) external to the cloud service provider, and (3) executing the classification process, a data request to the remote data repository, wherein if the remote data repository does not allow access from the classification process, including in the data request an authorization object that conforms to a security scheme of the organization based on the authorization credentials, and wherein the server is configured to provide document classification services to a plurality of organizations having documents that are stored at a plurality of cloud service providers and that are secured by different security schemes;
receiving, by the server, the archived dataset from the remote data repository in response to the data request;
in response to receiving the archived dataset, extracting a plurality of documents from the archived dataset;
classifying the plurality of documents using a classification engine, the classification engine being operable to, for each document in the plurality of documents:
determine whether the document is suitable for dissemination outside an organization based on whether the document contains sensitive information regarding the organization; and
apply a corresponding classification to the document based on whether the document is suitable for dissemination outside the organization; and
transmitting a response from the server to the client, the response comprising a list of each of the extracted plurality of documents and each corresponding classification.
Dependent claims: 9, 10, 11, 12, 13, 14
15. A computer program product comprising:
a non-transitory computer-readable storage medium having computer-readable program code embodied therewith, the computer-readable program code comprising:
computer-readable program code configured to receive, at a classification process, a classification request from a client associated with an internal network of an organization to classify the archived dataset, the request comprising:
a content identifier indicative of a remote data repository provided for the organization by a cloud service provider and corresponding to the archived dataset; and
authorization credentials;
computer-readable program code configured to, determine whether the remote data repository allows access from the classification process;
computer-readable program code configured to, in response to receiving the classification request, send, by a server (1) external to the internal network of the organization, (2) external to the cloud service provider, and (3) executing the classification process, a data request to the remote data repository, wherein if the remote data repository does not allow access from the classification process, including in the data request an authorization object that conforms to a security scheme of the organization based on the authorization credentials, and wherein the server is configured to provide document classification services to a plurality of organizations having documents that are stored at a plurality of cloud service providers and that are secured by different security schemes;
computer-readable program code configured to, receive, by the server, the archived dataset from the remote data repository in response to the data request;
computer-readable program code configured to, in response to receiving the archived dataset, extracting a plurality of documents from the archived dataset;
computer-readable program code configured to, classify the plurality of documents using a classification engine, the classification engine being operable to, for each document in the plurality of documents:
determine whether the document is suitable for dissemination outside an organization based on whether the document contains sensitive information regarding the organization; and
apply a corresponding classification to the document based on whether the document is suitable for dissemination outside the organization; and
computer-readable program code configured to transmit a response from the server to the client, the response comprising a list of each of the extracted plurality of documents and each corresponding classification.
Dependent claims: 16, 17, 18, 19, 20
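The classification engine in the independent claims makes a two-part decision per document: determine whether the document is suitable for dissemination outside the organization, then apply a matching classification. The claims do not say how sensitive information is detected, so the sketch below substitutes a simple pattern match as a placeholder. The patterns, the labels `external-ok` and `internal-only`, and the function names are all hypothetical.

```python
import re

# Hypothetical patterns; the source does not specify a detection technique.
SENSITIVE_PATTERNS = [
    re.compile(r"\bconfidential\b", re.IGNORECASE),
    re.compile(r"\binternal use only\b", re.IGNORECASE),
    re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),   # digits laid out like a US Social Security number
]


def is_suitable_for_dissemination(text: str) -> bool:
    """A document is suitable only if no sensitive pattern matches."""
    return not any(pattern.search(text) for pattern in SENSITIVE_PATTERNS)


def classify_documents(documents: dict) -> list:
    """Return one entry per document, pairing its name with its classification."""
    results = []
    for name, text in documents.items():
        label = "external-ok" if is_suitable_for_dissemination(text) else "internal-only"
        results.append({"document": name, "classification": label})
    return results
```

For example, `classify_documents({"memo.txt": "Internal use only: merger plan"})` labels the memo `internal-only`. A production engine would likely use a trained model or organization-specific rules; the point here is only the shape of the per-document decision and of the resulting list.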
Specification