Method and system for offline indexing of content and classifying stored data
First Claim
1. A non-transitory computer-readable storage medium storing computer-executable instructions that, when executed by at least one data storage device, performs a method of controlling a computer system to identify stored data comprising:
- receiving a search request directed to target content, wherein the search request contains classifications associated with the target content;
searching an index containing information associated with the target content to generate search results,wherein the index contains information identifying at least one content item that is not available from mounted disk media or faster media,wherein the faster media has a retrieval time or accessibility that is faster than mounted disk media,wherein the information contained in the index is generated based upon analysis of content of a secondary copy of data created from a primary copy of data,wherein the analysis is performed without impacting one or more systems from which the primary copy of data is available, andwherein the information is associated with either the primary copy, the secondary copy, or both the primary copy and the secondary copy;
for search results identifying content items that are not available from mounted disk media or faster media, retrieving information about the target content from an archive or secondary storage location;
providing the search results in response to the search request,wherein searching the index includes searching the index based on information about availability of content items, wherein the availability is based upon locations of the content items, and wherein providing search results includes providing search results indicating times required to access the content items based upon respective availabilities.
3 Assignments
0 Petitions
Accused Products
Abstract
A method and system for creating an index of content without interfering with the source of the content includes an offline content indexing system that creates an index of content from an offline copy of data. The system may associate additional properties or tags with data that are not part of traditional indexing of content, such as the time the content was last available or user attributes associated with the content. Users can search the created index to locate content that is no longer available or based on the associate attributes.
-
Citations
25 Claims
-
1. A non-transitory computer-readable storage medium storing computer-executable instructions that, when executed by at least one data storage device, performs a method of controlling a computer system to identify stored data comprising:
-
receiving a search request directed to target content, wherein the search request contains classifications associated with the target content; searching an index containing information associated with the target content to generate search results, wherein the index contains information identifying at least one content item that is not available from mounted disk media or faster media, wherein the faster media has a retrieval time or accessibility that is faster than mounted disk media, wherein the information contained in the index is generated based upon analysis of content of a secondary copy of data created from a primary copy of data, wherein the analysis is performed without impacting one or more systems from which the primary copy of data is available, and wherein the information is associated with either the primary copy, the secondary copy, or both the primary copy and the secondary copy; for search results identifying content items that are not available from mounted disk media or faster media, retrieving information about the target content from an archive or secondary storage location; providing the search results in response to the search request, wherein searching the index includes searching the index based on information about availability of content items, wherein the availability is based upon locations of the content items, and wherein providing search results includes providing search results indicating times required to access the content items based upon respective availabilities. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of controlling a computer system to identify archived data, comprising:
-
in response to receiving a search request directed to desired data, searching an index containing information associated with the desired data to generate search results, wherein the index contains information identifying at least one data item that is not available from mounted disk media or faster media, wherein the faster media has a retrieval time or accessibility that is faster than mounted disk media, wherein the information is associated with either the primary copy, the secondary copy, or both the primary copy and the secondary copy; for search results identifying desired data items that are not available from mounted disk media or faster media, retrieving information about the desired data from an archive or secondary storage location; and providing the search results in response to the search request, wherein searching the index includes searching the index based on information about availability of at least the desired data, wherein the availability is based upon locations of at least the desired data, and wherein providing search results includes providing search results indicating estimated times required to access at least the desired data based upon respective availabilities. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17)
-
-
18. A non-transitory computer-readable storage medium storing computer-executable instructions that, when executed by at least one data storage device, performs a method of controlling a computer system to identify stored data comprising:
-
in response to receiving a search request directed to desired data, searching an index containing information associated with the desired data to generate search results, wherein the index contains information identifying at least one data item that is not available from mounted disk media or faster media, wherein the faster media has a retrieval time or accessibility that is faster than mounted disk media, wherein the information is associated with either the primary copy, the secondary copy, or both the primary copy and the secondary copy; for search results identifying desired data items that are not available from mounted disk media or faster media, retrieving information about the desired data from an archive or secondary storage location; and providing the search results in response to the search request, wherein searching the index includes searching the index based on information about availability of at least the desired data, wherein the availability is based upon locations of at least the desired data, and wherein providing search results includes providing search results indicating estimated times required to access at least the desired data based upon respective availabilities. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25)
-
Specification