×

Method and apparatus for identifying words described in a portable electronic document

  • US 5,832,530 A
  • Filed: 06/27/1997
  • Issued: 11/03/1998
  • Est. Priority Date: 09/12/1994
  • Status: Expired due to Term
First Claim
Patent Images

1. A method for identifying words in a document comprising:

  • (a) retrieving a text segment including its x,y position from a portable electronic document that has a page including a plurality of characters that have been identified as characters but not identified as words and a plurality of text segments and associated position data;

    (b) creating a text object from each text segment and entering the text object into a linked list of text objects;

    (c) identifying words from the linked list by analyzing the text object for word breaks and by analyzing a gap between the text object with a prior text object using the associated position data;

    (d) adding identified words to a word list; and

    (e) repeating steps (a) to (e) until the end of the page is reached.

View all claims
  • 0 Assignments
Timeline View
Assignment View
    ×
    ×