×

Distributed indexing system for data storage

  • US 8,335,776 B2
  • Filed: 07/02/2008
  • Issued: 12/18/2012
  • Est. Priority Date: 07/02/2008
  • Status: Active Grant
First Claim
Patent Images

1. A method of storing index information describing secondary copies of data, the method comprising:

  • receiving at a media agent executing in one or more computer processors, data copied during a first data storage operation, wherein the media agent is configured to convey data from a primary copy of data between a client computer and one or more first data storage devices associated with the media agent to create a secondary copy of the primary copy of data, the primary copy of data generated by one or more software applications running on the client computer and stored in a data store associated with the client computer that is separate from the one or more first data storage devices;

    indexing with the media agent the secondary copy to determine content from the secondary copy, wherein indexing the secondary copy creates a first index of indexed data,wherein the first index is associated with a primary index server and wherein the indexed data comprises information about the content of the secondary copy and information about location of the secondary copy on the first data storage devices;

    selecting at least a secondary index server among multiple available index servers based on a failover policy wherein the secondary index server is configured to store a second index that is created using the first index and is a replica of the first index, wherein the multiple index servers are networked together and collectively provide a distributed index;

    sending from the media agent a reference to the indexed data associated with the secondary copy to the primary index server and to the secondary index server, wherein the primary index server and the secondary index server retrieve the indexed data from the media agent using the reference such that the second index remains a replica of the first index;

    updating the distributed index using the indexed data retrieved from the media agent;

    receiving an index update about migrated data associated with a migration of at least a portion of the secondary copy from the first data storage devices to at least a second data storage device based on one or more storage policies;

    wherein the index update comprises information about the new location of the migrated data on the second storage device and information about the content of the migrated data;

    determining that the primary index server is not available; and

    updating the distributed index about the migrated data using the index update retrieved from the media agent via the secondary index server.

View all claims
  • 4 Assignments
Timeline View
Assignment View
    ×
    ×