×

Data format conversion

  • US 20070009161A1
  • Filed: 06/30/2006
  • Published: 01/11/2007
  • Est. Priority Date: 07/08/2005
  • Status: Active Grant
First Claim
Patent Images

1. A method of converting intermediate document data representing document text derived from data in an image data format into a semantically-meaningful tagged text data format, the method comprising:

  • inputting intermediate document data derived from document image data, said intermediate document data comprising character data corresponding to characters in the document and attribute data corresponding to one or more attributes of characters in the document;

    processing the intermediate document data according to attribute-dependent rules; and

    generating tagged text data comprising tagged sections of said document text, the tags defining semantically meaningful portions of said text determined according to said attribute data.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×