DISTRIBUTED INDEXING OF FILE CONTENT
Abstract
Described herein is technology for, among other things, distributed indexing of file content. Content-based indexing of a file involves determining whether content-based index information for the file is available from an external source. This check avoids repeating content analysis that has already been performed, which is time-consuming and computationally intensive, especially for non-text files. The content-based index information, if it is available, is received from the external source and may be stored. If the content-based index information is not available or is not complete, content-based index information for the file is generated and stored. Moreover, the generated content-based index information is shared with the external source. Once content analysis of the file has been performed to generate content-based index information, that information is available and sharable as needed, so there is no need to repeat the same content analysis on the file.
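The decision flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `IndexInfo`, `ExternalSource`, `content_key`, `analyze`, and `index_file` are hypothetical, the external source is modeled as an in-memory dictionary, files are assumed to be identified by a SHA-256 hash of their content, and the "content analysis" is a placeholder tokenizer.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class IndexInfo:
    """Content-based index information for one file (hypothetical shape)."""
    terms: frozenset
    complete: bool = True


@dataclass
class ExternalSource:
    """In-memory stand-in for the external source (assumption)."""
    entries: Dict[str, IndexInfo] = field(default_factory=dict)

    def lookup(self, key: str) -> Optional[IndexInfo]:
        return self.entries.get(key)

    def share(self, key: str, info: IndexInfo) -> None:
        self.entries[key] = info


def content_key(data: bytes) -> str:
    # Assumption: a file is identified by a hash of its content.
    return hashlib.sha256(data).hexdigest()


def analyze(data: bytes) -> IndexInfo:
    # Placeholder content analysis: lowercase whitespace-separated tokens.
    text = data.decode("utf-8", "ignore").lower()
    return IndexInfo(terms=frozenset(text.split()))


def index_file(
    data: bytes,
    source: ExternalSource,
    local_index: Dict[str, IndexInfo],
    analyzer: Callable[[bytes], IndexInfo] = analyze,
) -> str:
    """Index one file, reusing external index information when possible."""
    key = content_key(data)
    info = source.lookup(key)
    if info is not None and info.complete:
        # Available and complete: receive and store, skipping analysis.
        local_index[key] = info
        return "received"
    # Not available or not complete: generate, store, and share.
    info = analyzer(data)
    local_index[key] = info
    source.share(key, info)
    return "generated"
```

In this sketch, the first machine to index a given file returns "generated" and publishes the result; any later machine indexing the same content returns "received" and never runs the analyzer.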
20 Claims
1. A method of content-based indexing a file, said method comprising:
   determining whether content-based index information for said file is available from an external source;
   if said content-based index information for said file is available from said external source, receiving and storing said content-based index information from said external source; and
   if occurrence of any one of said content-based index information for said file is not available from said external source and said content-based index information for said file is not complete, generating and storing content-based index information for said file and sharing said generated content-based index information with said external source.
   Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A method of creating an index for files, said method comprising:
   receiving and storing content-based index information for said files; and
   generating and storing content-based index information for said files,
   wherein said index comprises said received content-based index information and said generated content-based index information.
   Dependent claims: 12, 13, 14
15. An apparatus comprising:
   a processor;
   an indexing unit operable to utilize said processor to request and receive content-based index information for files from an external source, generate content-based index information for files, and create an index comprising said received content-based index information and said generated content-based index information; and
   a storage unit operable to store said received content-based index information and said generated content-based index information.
   Dependent claims: 16, 17, 18, 19, 20
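As a rough illustration of an index that comprises both received and generated index information (claims 11 and 15), the sketch below merges the two kinds of entries into a single inverted index. The `StorageUnit` class, the `build_index` function, the per-file term sets, and the inverted-index layout are all assumptions made for this example; the claims do not prescribe any particular data structure.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class StorageUnit:
    """Stores received and generated index information separately (illustrative)."""
    received: Dict[str, Set[str]] = field(default_factory=dict)
    generated: Dict[str, Set[str]] = field(default_factory=dict)


def build_index(storage: StorageUnit) -> Dict[str, Set[str]]:
    """Merge both kinds of entries into one inverted index: term -> file ids."""
    inverted: Dict[str, Set[str]] = {}
    for entries in (storage.received, storage.generated):
        for file_id, terms in entries.items():
            for term in terms:
                inverted.setdefault(term, set()).add(file_id)
    return inverted


storage = StorageUnit()
# Received from an external source (e.g. analysis of a non-text file).
storage.received["photo.jpg"] = {"beach", "sunset"}
# Generated locally because no external information was available.
storage.generated["notes.txt"] = {"sunset", "meeting"}

index = build_index(storage)
print(sorted(index["sunset"]))  # prints ['notes.txt', 'photo.jpg']
```

A query against the merged index does not need to know whether a file's entry was received or generated; keeping the two stores separate simply makes the provenance visible.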
Specification