×

Information processing for searching categorizing information in a document based on a categorization hierarchy and extracted phrases

  • US 6,571,240 B1
  • Filed: 02/02/2000
  • Issued: 05/27/2003
  • Est. Priority Date: 02/02/2000
  • Status: Expired due to Term
First Claim
Patent Images

1. A method to process a document from a Web site, based on a categorization hierarchy which has a plurality of categories, each category including one or more phrases, the method comprising:

  • extracting phrases from the document;

    categorizing at least one of the extracted phrases under a category of the categorization hierarchy; and

    identifying at least one of the extracted phrases that cannot be categorized into the categorization hierarchy for analysis;

    such that information in the document can be appropriately categorized and the document can be systematically retrieved by a natural language responding engine when needed;

    wherein the location of the document is related to a URL;

    wherein the document includes at least an image when the document is displayed on the Web site, and the method includes not categorizing the image; and

    wherein the document includes at least a phrase that is hidden when the document is displayed on the Web site, and the method includes extracting that phrase for categorizing.

View all claims
  • 14 Assignments
Timeline View
Assignment View
    ×
    ×