×

STORAGE SYSTEM

  • US 20110246431A1
  • Filed: 06/13/2011
  • Published: 10/06/2011
  • Est. Priority Date: 12/26/2006
  • Status: Active Grant
First Claim
Patent Images

1. A storage system which stores files of data inputted in/outputted from a host apparatus in a storage area, comprising:

  • a file server apparatus which includes a de-duplicate processing unit which performs de-duplicate processing to a plurality of files having the same content in a file group stored in the storage area and which creates de-duplication information which is stored in the storage area, and which comprises control information indicating a de-duplicate status of the file server apparatus including presence/absence of files having the same content and information of a representative file in the storage area; and

    a full-text search processing apparatus which performs a full-text search processing including an index information creation processing to the file group stored in the storage area to create index information and which performs a de-duplicate group information creation processing to create de-duplicate group information based on said de-duplication information, wherein the de-duplicate group information indicates a group of files having the same content, representative files in the group of files and a link between the files,wherein the index information includes keyword occurrence position information in a data body of the file and further includes said de-duplicate group information,wherein the index information is de-duplicated by inhibiting the keyword occurrence position information creation processing performed to the plurality of files having the same content by the full-text search processing apparatus according to a status of the de-duplicate processing to the file group performed by the de-duplicate processing unit of the file server apparatus,wherein the full-text search processing apparatus executes de-duplicate correspondence processing based on the index information,wherein, said full-text search server apparatus responds to the host apparatus by providing search result information comprising information regarding a representative file included in a search result, and further provides, by referring to said de-duplicate group information, information of another file which belongs to a de-duplicate group of the representative file and which has the same content as said representative file, said representative file being searched by said de-duplicate correspondence processing,wherein the file server apparatus is coupled between the host apparatus and the storage area, wherein the full-text search processing apparatus is coupled between the host apparatus and the index information, and wherein the host apparatus and the file server apparatus are coupled between the full-text search server apparatus and the storage area, andwherein the de-duplicate correspondence processing performed by the full text search processing apparatus is a separate function from the de-duplicate processing performed by the de-duplicate processing unit of the file server apparatus, and wherein the de-duplicate correspondence processing performed by the full text search processing apparatus is controlled by the de-duplicate information created by the de-duplicate processing unit of the file server apparatus.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×