Crawling secure data sources
First Claim
1. A method of crawling secure documents in an enterprise system, comprising:
- specifying in metadata for a secure data source an indication that a temporary storage option is selected for the secure data source, the temporary storage option causing security credentials to be stored temporarily;
examining, by a processor, the metadata for the secure data source at crawl time to determine that the temporary storage option is selected for the secure data source;
prompting for and receiving security credentials;
in response to determining that the temporary storage option is selected, writing, by a processor, the security credentials to a temporary storage;
reading the security credentials from the temporary storage;
crawling, by a processor, a set of documents provided by the secure data source, wherein each document of the set of documents is associated with a set of security attributes;
wherein crawling further comprises presenting the security credentials to the secure data source;
indexing, by a processor, one or more documents of the set of documents, wherein each document of the one or more documents is associated with a set of security attributes that allow access to said each document based on the security credentials; and
deleting, by a processor, the security credentials in response to completing use of the security credentials.
0 Assignments
0 Petitions
Accused Products
Abstract
It is desirable to provide a secure search mechanism to provide for searching over any and all content, such as across an enterprise. A secure search, however, requires access to the secure content repositories holding the data to be searched. In some cases the credentials required to crawl a repository may be extremely sensitive, or the user may be reluctant or unwilling to store user identification information in memory or on disk for any longer than is absolutely necessary. An approach is provided that allows a user or an administrator to provide security credentials to be stored and used only during a crawl, and to erase the credentials from the system when the crawl is complete.
189 Citations
18 Claims
-
1. A method of crawling secure documents in an enterprise system, comprising:
-
specifying in metadata for a secure data source an indication that a temporary storage option is selected for the secure data source, the temporary storage option causing security credentials to be stored temporarily; examining, by a processor, the metadata for the secure data source at crawl time to determine that the temporary storage option is selected for the secure data source; prompting for and receiving security credentials; in response to determining that the temporary storage option is selected, writing, by a processor, the security credentials to a temporary storage; reading the security credentials from the temporary storage; crawling, by a processor, a set of documents provided by the secure data source, wherein each document of the set of documents is associated with a set of security attributes; wherein crawling further comprises presenting the security credentials to the secure data source; indexing, by a processor, one or more documents of the set of documents, wherein each document of the one or more documents is associated with a set of security attributes that allow access to said each document based on the security credentials; and deleting, by a processor, the security credentials in response to completing use of the security credentials. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A non-transitory computer-readable storage medium storing instructions, the instructions which when executed by one or more processors, cause the one or more processors to crawl secure documents in an enterprise system, the instructions comprising instructions for:
-
specifying in metadata for a secure data source an indication that a temporary storage option is selected for the secure data source, the temporary storage option causing security credentials to be stored temporarily; examining the metadata for the secure data source at crawl time to determine that the temporary storage option is selected for the secure data source; prompting for and receiving security credentials; in response to determining that the temporary storage option is selected, writing the security credentials to a temporary storage; reading the security credentials from the temporary storage; crawling a set of documents provided by the secure data source, wherein each document of the set of documents is associated with a set of security attributes; wherein crawling further comprises presenting the security credentials to the secure data source; indexing one or more documents of the set of documents, each document of the one or more documents associated with a set of security attributes that allow access to the one or more documents based on the security credentials; and deleting the security credentials in response to completing use of the security credentials. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
Specification