Selective document retrieval method and system
Abstract
A method and system for storing and selectively retrieving information, such as words, from a document set. The method includes generating an image data set representative of the information contained in the document set. The method also involves generating a text data set representative of a text portion of the information contained in the document set. A text-image correspondence (TIC) table is generated that includes data representative of coordinate information corresponding to each phrase of the document set. A search phrase is determined in response to user-specified search criteria and is identified in the text data set. Then, the TIC table is used to identify the coordinate information corresponding to the search phrase identified in the text data set. A display of the portion of the page containing the search phrase is generated using the coordinate information.
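The lookup flow described in the abstract can be illustrated with a minimal sketch. Everything named here (`Location`, `build_tic_table`, `find_coordinates`, the sample words, and the case-insensitive matching) is a hypothetical assumption for illustration, not something the patent text specifies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Location:
    """Hypothetical record: where a phrase sits in the stored image data set."""
    page: int
    x: int
    y: int


def build_tic_table(words):
    """Build a text-image correspondence (TIC) table.

    `words` is an iterable of (phrase, page, x, y) tuples. Lower-casing the
    key is an assumption made here so that lookups ignore case.
    """
    table = {}
    for phrase, page, x, y in words:
        table.setdefault(phrase.lower(), []).append(Location(page, x, y))
    return table


def find_coordinates(table, search_phrase):
    """Return every stored location of the search phrase (empty list if none)."""
    return table.get(search_phrase.lower(), [])


# Made-up sample entries standing in for the text data set.
sample_words = [
    ("retrieval", 1, 120, 340),
    ("document", 1, 40, 80),
    ("retrieval", 2, 60, 500),
]
tic = build_tic_table(sample_words)
hits = find_coordinates(tic, "Retrieval")
```

In this sketch the returned locations would then drive the display step, which shows the matching portion of the page image rather than the text itself.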
29 Claims
1. A method implemented on a digital computer for storing and selectively retrieving information contained in a set of documents originally located external to said computer, wherein said document set includes at least one page, said method comprising:
A. generating a bit-mapped image data set representative of information contained in the document set;
B. storing the image data set in a first memory storage device associated with said computer;
C. generating a text data set representative of a text portion of the information contained in the document set;
D. storing the text data set in a second memory storage device associated with said computer;
E. generating a text-image correspondence table including information representative of correlations between each phrase within the stored text data set and two-dimensional coordinates of a corresponding location within the stored image data set;
F. identifying a search phrase, corresponding to user-specified search criteria, in the stored text data set;
G. identifying two-dimensional coordinates corresponding to the search phrase from the text-image correspondence table; and
H. generating a display of at least that image data, from within the stored image data set and by using said identified two-dimensional coordinates, corresponding to said search phrase.
Dependent claims: 2-16
17. A method implemented on a digital computer for storing and selectively retrieving information contained in a set of documents originally located external to said computer, wherein said document set includes at least one page, said method comprising:
A. generating a bit-mapped image data set representative of information contained in the document set;
B. storing the image data set in a first memory storage device associated with said computer;
C. generating a text data set representative of a text portion of the information contained in the document set;
D. storing the text data set in a second memory storage device associated with said computer;
E. generating a text-image correspondence table including information representative of correlations between each phrase within the stored text data set and two-dimensional coordinates of a corresponding location within the stored image data set;
F. generating a set of non-literal search terms, in accordance with a predetermined set of rules, corresponding to user-specified search criteria;
G. identifying at least one of the non-literal search terms in the stored text data set;
H. identifying two-dimensional coordinates corresponding to the non-literal search term(s) from the text-image correspondence table; and
I. generating a display of at least that image data identified in Step H, from within the stored image data set and by using said identified two-dimensional coordinates.
Dependent claims: 18-23
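Steps F to H of claim 17 can be sketched as follows. The claim does not say what the "predetermined set of rules" is, so the two rules used here (a naive singular/plural variant and a synonym lookup) are purely illustrative assumptions, as are the function names, the table layout (phrase mapped to a list of `(page, x, y)` tuples), and the choice to exclude the literal query from the generated terms.

```python
def expand_terms(query, synonyms):
    """Generate non-literal search terms from a query.

    Hypothetical rules, chosen only for illustration:
      1. add a naive singular or plural variant;
      2. add any synonyms listed for the query.
    The literal query itself is excluded (an assumption).
    """
    base = query.lower()
    terms = set()
    if base.endswith("s"):
        terms.add(base[:-1])
    else:
        terms.add(base + "s")
    terms.update(synonyms.get(base, ()))
    terms.discard(base)
    return sorted(terms)


def find_non_literal(tic_table, query, synonyms):
    """Look up each generated term and collect the coordinates of those found."""
    hits = {}
    for term in expand_terms(query, synonyms):
        if term in tic_table:
            hits[term] = tic_table[term]
    return hits


# Made-up table: phrase -> list of (page, x, y) locations.
tic_table = {
    "contract": [(2, 5, 5)],
    "contracts": [(1, 50, 200)],
    "agreement": [(3, 10, 90)],
}
synonyms = {"contract": ["agreement"]}
matches = find_non_literal(tic_table, "contract", synonyms)
```

Here the query "contract" finds the pages containing "agreement" and "contracts" even though neither matches the query literally.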
24. A computer-based system for storing and selectively retrieving information contained in a set of documents originally located external to said computer, said document set including at least one page, said system comprising:
A. a stored bit-mapped image data set associated with said computer and representative of information contained in the document set;
B. a stored text data set associated with said computer and representative of a text portion of the information contained in the document set;
C. a text-image correspondence table including information representative of correlations between each phrase within the stored text data set and two-dimensional coordinates of a corresponding location within the stored image data set;
D. means for identifying a search phrase, corresponding to user-specified search criteria, in the stored text data set;
E. means for identifying two-dimensional coordinates corresponding to the search phrase from the text-image correspondence table; and
F. means for generating a display of at least that image data identified in paragraph E above from within the stored image data set and by using said identified two-dimensional coordinates.
Dependent claims: 25-27
28. A method implemented on a digital computer for storing and selectively retrieving information contained in a set of documents originally located external to said computer, wherein said document set includes at least one page, said method comprising:
A. generating a bit-mapped image data set representative of information contained in the document set;
B. storing the image data set in a first memory storage device associated with said computer;
C. generating a text data set representative of a text portion of the information contained in the document set;
D. storing the text data set in a second memory storage device associated with said computer;
E. generating a text-image correspondence table including information representative of correlations between each phrase within the text data set and two-dimensional coordinates of a corresponding location within the image data set;
F. identifying a search phrase, corresponding to user-specified search criteria, in the text data set;
G. identifying, within the image data set and by means of using said coordinates, image data corresponding to the search phrase identified in the text data set; and
H. generating a display of at least that image data corresponding to said search phrase, said generating step not displaying any text data;
wherein each corresponding location within the image data set comprises a two-dimensional beginning boundary point and a two-dimensional ending boundary point.
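The beginning and ending boundary points recited in claim 28 can be pictured as the two corners of a region cut out of the bit-mapped page. The sketch below assumes, without support from the claim text, that the beginning point is the top-left corner, the ending point is the bottom-right corner, both are inclusive, and the bitmap is a list of pixel rows; `crop` is a hypothetical helper name.

```python
def crop(bitmap, begin, end):
    """Return the sub-image between two boundary points, inclusive.

    `bitmap` is a list of rows of 0/1 pixels; `begin` and `end` are (x, y)
    points, assumed here to be the top-left and bottom-right corners.
    """
    (x0, y0), (x1, y1) = begin, end
    return [row[x0:x1 + 1] for row in bitmap[y0:y1 + 1]]


# Tiny made-up page image.
page = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 1, 0],
    [0, 0, 0, 0, 0],
]
region = crop(page, (1, 1), (3, 2))
```

Only image data is returned, which matches the requirement in step H that the display step not show any text data.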
29. A method implemented on a digital computer for storing and selectively retrieving information contained in a set of documents originally located external to said computer, wherein said document set includes at least one page, said method comprising:
A. generating a bit-mapped image data set representative of information contained in the document set;
B. storing the image data set in a first memory storage device associated with said computer;
C. generating a text data set representative of a text portion of the information contained in the document set;
D. storing the text data set in a second memory storage device associated with said computer;
E. generating a text-image correspondence table including information representative of correlations between each phrase within the text data set and two-dimensional coordinates of a corresponding location within the image data set;
F. identifying a search phrase, corresponding to user-specified search criteria, in the text data set;
G. identifying, within the image data set and by means of using said coordinates, image data corresponding to the search phrase identified in the text data set; and
H. generating a display of at least that image data corresponding to said search phrase, said generating step not displaying any text data;
wherein the image data set is divided by a first set of parallel lines into a set of zones and then further divided by a second set of parallel lines, orthogonal to the first set of parallel lines, to create a set of points that define said two-dimensional coordinates.
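The grid recited in the wherein clause of claim 29 can be sketched as two orthogonal sets of lines whose intersections form the coordinate points. The claim does not state how the lines are spaced or oriented, so the even spacing, the horizontal-then-vertical ordering, the page dimensions, and the name `grid_points` are all assumptions made for illustration.

```python
def grid_points(width, height, n_horizontal, n_vertical):
    """Return the intersection points of two orthogonal sets of lines.

    The first set (horizontal lines) splits the page into zones; the second
    set (vertical lines) crosses them. Even spacing is an assumption.
    Each count must be at least 2 so that both page edges carry a line.
    """
    ys = [i * height // (n_horizontal - 1) for i in range(n_horizontal)]
    xs = [j * width // (n_vertical - 1) for j in range(n_vertical)]
    # points[row][col] is the point where horizontal line `row`
    # meets vertical line `col`.
    return [[(x, y) for x in xs] for y in ys]


# Made-up page size: 800 x 1000 pixels, 3 horizontal and 5 vertical lines.
points = grid_points(800, 1000, 3, 5)
middle = points[1][2]
```

A table entry could then refer to a point by its row and column in this grid instead of by raw pixel position.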
Specification