×

UNSTRUCTURED AND SEMISTRUCTURED DOCUMENT PROCESSING AND SEARCHING

  • US 20080263032A1
  • Filed: 04/19/2007
  • Published: 10/23/2008
  • Est. Priority Date: 04/19/2007
  • Status: Active Grant
First Claim
Patent Images

1. A method for analyzing and indexing an unstructured or semistructured document, comprising:

  • receiving an unstructured or semistructured document;

    converting the document to one or more text streams;

    analyzing the one or more text streams for identifying textual contents of the document;

    analyzing the one or more text streams for identifying logical sections of the document;

    associating the textual contents with the logical sections;

    indexing the textual contents and their association with the logical sections; and

    saving a result of the indexing in a data storage device.

View all claims
  • 14 Assignments
Timeline View
Assignment View
    ×
    ×