Method for global search of text containing four-byte character

Method for global search of text containing four-byte character

  • CN 1,694,092 A
  • Filed: 05/31/2005
  • Published: 11/09/2005
  • Est. Priority Date: 05/31/2005
  • Status: Active Application
First Claim
Patent Images

1. , a kind of method that the text that contains four-byte character is carried out full-text search, its step comprises:

  • (1) when setting up index, at first in word flow, adopt the method for character examination one by one to judge whether the character that will set up index is four-byte character;

    (2) four-byte character in this way, the four-byte character that this is single adds inverted index as index terms;

    As not being four-byte character, determine keyword by the participle mode of search engine routine, keyword is added inverted index as indexing units.(3) in retrieval, at first in word flow, adopt the method for character examination one by one to judge whether character to be checked is four-byte character;

    (4) four-byte character in this way, the four-byte character that this is single is as a query word;

    As not being four-byte character, determine keyword by the participle mode of search engine routine, with the keyword that obtains as a query word;

    (5) search engine being sent in aforesaid query word set inquires about.

View all claims
    ×
    ×

    Thank you for your feedback

    ×
    ×