×

System, method, and computer program product for identifying multi-page documents in hypertext collections

  • US 20050071310A1
  • Filed: 09/30/2003
  • Published: 03/31/2005
  • Est. Priority Date: 09/30/2003
  • Status: Abandoned Application
First Claim
Patent Images

1. A method for improving information retrieval, classification, indexing, and summarization, comprising:

  • identifying a compound document as a coherent body of hyperlinked material on a single topic as created by a number of collaborating authors;

    analyzing the content and structure of the compound document to find a preferred entry point for the compound document;

    processing the compound document as a whole, including at least one of indexing, classification, and retrieval; and

    processing the compound document from the entry point, including at least one of creating at least one of presentation of results from retrieval, summarization, and classification.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×