DISTRIBUTED ARCHITECTURE FOR CONTENT INDEXING EMAILS
First Claim
1. A networked information management system for content indexing emails, the networked information management system comprising:
- a content indexing proxy having one or more first hardware processors, wherein the content indexing proxy is configured with first computer-executable instructions that, when executed, cause the content indexing proxy to:
receive, by a first thread executing on the content indexing proxy, identification of emails assigned to the content indexing proxy by a master content indexing proxy, wherein the identified emails are each associated with an email page in a plurality of email pages; and
for each email page in the plurality of email pages,
transmit, by the first thread to an indexing storage system, a query for secondary copy location data corresponding to the emails associated with the respective email page,
receive, by the first thread, the secondary copy location data,
transmit, by a second thread executing on the content indexing proxy, an instruction to a first computing device that executes a media agent to restore secondary copies stored at locations indicated by the secondary copy location data,
receive, by a third thread executing on the content indexing proxy, an acknowledgment from the first computing device that a restoration of the secondary copies is complete, and
transmit, by a fourth thread executing on the content indexing proxy, a request to content index the restored secondary copies; and
- one or more computing devices in communication with the content indexing proxy, wherein the one or more computing devices each have one or more second hardware processors, wherein the one or more computing devices are configured with second computer-executable instructions that, when executed, cause the one or more computing devices to content index the restored secondary copies.
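The four-thread hand-off recited in the claim can be pictured as a chain of queues. The following is only a minimal sketch under stated assumptions: the in-memory `LOCATIONS` table, the function names, and the simulated media agent that acknowledges immediately are all hypothetical and are not taken from the patent.

```python
import queue
import threading

# Hypothetical stand-in for the indexing storage system (assumption).
LOCATIONS = {
    "email-1": "tape-7/offset-100",
    "email-2": "tape-7/offset-220",
    "email-3": "disk-2/offset-5",
}

DONE = object()  # sentinel passed down the chain when a stage has no more work


def query_locations(email_ids):
    """Simulated query for secondary copy location data."""
    return {email_id: LOCATIONS[email_id] for email_id in email_ids}


def send_restore_instruction(page_id, locations, ack_queue):
    """Simulated media agent: 'restores' at once and posts an acknowledgment."""
    ack_queue.put((page_id, sorted(locations)))


def run_proxy(pages):
    """Push each email page through four cooperating threads."""
    located, acks, restored = queue.Queue(), queue.Queue(), queue.Queue()
    index_requests = []

    def first_thread():
        # Query for and receive the location data, one page at a time.
        for page_id, email_ids in pages.items():
            located.put((page_id, query_locations(email_ids)))
        located.put(DONE)

    def second_thread():
        # Instruct the media agent to restore the located secondary copies.
        while (item := located.get()) is not DONE:
            send_restore_instruction(item[0], item[1], acks)
        acks.put(DONE)

    def third_thread():
        # Receive the acknowledgment that a restoration is complete.
        while (item := acks.get()) is not DONE:
            restored.put(item)
        restored.put(DONE)

    def fourth_thread():
        # Record the request to content index the restored copies.
        while (item := restored.get()) is not DONE:
            index_requests.append(item)

    stages = (first_thread, second_thread, third_thread, fourth_thread)
    threads = [threading.Thread(target=stage) for stage in stages]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return index_requests
```

Because each stage is a single thread reading from a FIFO queue, pages leave the sketch in the order they entered, and a later page can be queried while an earlier one is still being restored.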
Abstract
An improved content indexing (CI) system is disclosed herein. For example, the improved CI system may include a distributed architecture of client computing devices, media agents, a single backup and CI database, and a pool of servers. After a file backup occurs, the backup and CI database may include file metadata indices and other information associated with backed up files. Servers in the pool of servers may, in parallel, query the backup and CI database for a list of files assigned to the respective server that have not been content indexed. The servers may then request a media agent to restore the assigned files from secondary storage and provide the restored files to the servers. The servers may then content index the received restored files. Once the content indexing is complete, the servers can send the content index information to the backup and CI database for storage.
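The flow in the abstract (servers query in parallel for their unindexed files, ask a media agent to restore them, index the content, and send the results back) might look roughly like the sketch below. Everything here is an illustrative assumption: the dictionaries standing in for the backup and CI database and for secondary storage, the function names, and the simple word-to-file inverted index are not described in the source.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory stand-ins (assumptions, not from the source).
SECONDARY_STORAGE = {
    "a.txt": "quarterly backup report",
    "b.txt": "backup schedule",
    "c.txt": "meeting notes",
}
BACKUP_CI_DB = {
    "files": [
        {"name": "a.txt", "server": 0, "indexed": False},
        {"name": "b.txt", "server": 1, "indexed": False},
        {"name": "c.txt", "server": 0, "indexed": True},
    ],
    "content_index": {},
}


def media_agent_restore(names):
    """Simulated media agent: return restored file contents by name."""
    return {name: SECONDARY_STORAGE[name] for name in names}


def server_job(server_id):
    """One server: find its unindexed files, restore them, index them."""
    pending = [
        entry["name"]
        for entry in BACKUP_CI_DB["files"]
        if entry["server"] == server_id and not entry["indexed"]
    ]
    restored = media_agent_restore(pending)
    index = {}
    for name, text in restored.items():
        for word in text.lower().split():
            index.setdefault(word, set()).add(name)
    return pending, index


def run_pool(num_servers):
    """Run all servers in parallel, then store their results in the database."""
    with ThreadPoolExecutor(max_workers=num_servers) as pool:
        results = list(pool.map(server_job, range(num_servers)))
    # Results are merged by the caller so the shared dict has a single writer.
    for pending, index in results:
        for word, names in index.items():
            BACKUP_CI_DB["content_index"].setdefault(word, set()).update(names)
        for entry in BACKUP_CI_DB["files"]:
            if entry["name"] in pending:
                entry["indexed"] = True
    return BACKUP_CI_DB["content_index"]
```

Calling `run_pool(2)` indexes only `a.txt` and `b.txt`, since `c.txt` is already marked as indexed in this toy database.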
20 Claims
1. A networked information management system for content indexing emails (recited in full under First Claim above). Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10.
11. A computer-implemented method for content indexing emails, the computer-implemented method comprising:
receiving, by a first thread executing on a content indexing proxy, identification of emails assigned to the content indexing proxy by a master content indexing proxy, wherein the identified emails are each associated with an email page in a plurality of email pages; and
for each email page in the plurality of email pages,
transmitting, by the first thread to an indexing storage system, a query for secondary copy location data corresponding to the emails associated with the respective email page,
receiving, by the first thread, the secondary copy location data,
transmitting, by a second thread executing on the content indexing proxy, an instruction to a first computing device that executes a media agent to restore secondary copies stored at locations indicated by the secondary copy location data,
receiving, by a third thread executing on the content indexing proxy, an acknowledgment from the first computing device that a restoration of the secondary copies is complete, and
transmitting, by a fourth thread executing on the content indexing proxy, a request to content index the restored secondary copies.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20.
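The method starts from emails that a master content indexing proxy has already assigned, each tied to an email page. The claim does not say how pages are formed or distributed, so the sketch below simply assumes fixed-size pages handed out round-robin; `paginate`, `assign_pages`, and the proxy names are hypothetical.

```python
def paginate(email_ids, page_size):
    """Split a list of email IDs into consecutive pages of at most page_size."""
    return [
        email_ids[start:start + page_size]
        for start in range(0, len(email_ids), page_size)
    ]


def assign_pages(email_ids, page_size, proxies):
    """Assign pages to proxies round-robin (an assumed policy, not the patent's)."""
    assignments = {proxy: {} for proxy in proxies}
    for page_no, page in enumerate(paginate(email_ids, page_size)):
        proxy = proxies[page_no % len(proxies)]
        assignments[proxy][page_no] = page
    return assignments
```

With five emails, a page size of two, and two proxies, the first proxy receives pages 0 and 2 and the second receives page 1.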
Specification