System and method for batched indexing of network documents

System and method for batched indexing of network documents

  • CN 1,755,676 B
  • Filed: 07/11/2005
  • Issued: 01/23/2013
  • Est. Priority Date: 09/30/2004
  • Status: Active Grant
First Claim
Patent Images

1. computer implemented method that is used for by concordance program the document of storer being carried out batched indexing, wherein said document is organized in the level, described level comprises website, substation point and comprises symbol, described storer is included in the server, described server comprise web services and for each document of the website that comprises in the described storer, comprise the object model that symbol and substation point provide object, described method comprises:

  • To first request of described web services transmission to the information relevant with the website in the level, wherein said web services is forwarded to described object model with described the first request, and described object model returns to described web services with the tabulation of a plurality of substation points;

    In response to described the first request, receive described tabulation as url list from described web services;

    Transmission is to the second request of the information relevant with a sub-website in the tabulation of described a plurality of substation points;

    In response to described the second request, receive the tabulation that comprises symbol that is included in the described sub-website from described web services;

    Transmission comprises the 3rd request that accords with interior first batch of document data to of being stored in the described tabulation that comprises symbol;

    Receive described first batch of document data, wherein said first batch of document data is based on the metadata that received by described web services and definite, and described metadata is corresponding to the document in the described storer and be used for determining being included in document data in the described storer of described first batch of document data;

    The described first batch of document data of index;

    To the request of described web services transmission to the current location in the change journal of storing in the described server, wherein said request is corresponding to the request that is changed identifier the last time in the described change journal;

    Receive and store the described last identifier that changes;

    The 4th request of the change that the document of transmission subtend website inner tissue is made, described the 4th request comprise the described last identifier that changes;

    In response to described the 4th request, be received from from the described last one batch of document data through changing that has changed since the identifier, and receive current variation identifier;

    According to one batch that the receives document data through changing, upgrade index;

    Described current variation identifier is stored as for next time last time changes identifier.

View all claims
    ×
    ×

    Thank you for your feedback

    ×
    ×