Methods and apparatus for automatic page break detection
Abstract
In one aspect of the present invention, page breaks are identified in the following manner. A set of ink data and a document description are processed by a variety of scoring methods, each of which generates a score for each possible insertion point in the ink. These scores are combined to produce a ranked list of hypothesized page breaks for the corresponding ink data. This ranked list is then used in one of three ways: to insert page breaks automatically, using a predefined threshold to determine a cut-off in the list; to present the hypothesized page breaks, on-line, to a human for verification or approval; or a mixture of the two, based on two thresholds, one for automatic insertion and the other for human verification. It is to be understood that not all scoring methods need be used; that is, one or more of the scoring methods may be used as needed.
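The mixed mode described in the abstract, with one threshold for automatic insertion and a second for human verification, can be illustrated with a minimal sketch. The function name, the threshold values, and the convention that a higher score means a more likely page break are assumptions made for illustration only; the source does not specify them.

```python
def route_hypotheses(ranked, auto_threshold, review_threshold):
    """Split (insertion_point, score) pairs into two groups.

    Assumption: higher scores indicate more likely page breaks, and
    auto_threshold >= review_threshold. Hypotheses scoring below
    review_threshold are dropped.
    """
    auto_insert, needs_review = [], []
    for point, score in ranked:
        if score >= auto_threshold:
            auto_insert.append(point)      # inserted without confirmation
        elif score >= review_threshold:
            needs_review.append(point)     # shown to a human for approval
    return auto_insert, needs_review


# Hypothetical ranked list: (stroke index before which to break, score).
ranked_list = [(12, 0.95), (40, 0.6), (7, 0.2)]
auto, review = route_hypotheses(ranked_list, auto_threshold=0.9, review_threshold=0.5)
```

Setting both thresholds to the same value reduces this to the purely automatic mode, since no hypothesis can then fall between them.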
16 Claims
1. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein the step of automatically identifying one or more potential page breaks further comprises the steps of:
- measuring a spatial difference between consecutive pairs of strokes made in accordance with the handwriting system; and
- labeling spatial differences not below a threshold value as possible insertion points.
Dependent claims: 2, 3.

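The two spatial-difference steps of claim 1 can be sketched as follows. This is only an illustration: the claim does not say how the spatial difference is measured, so the sketch assumes each stroke is summarized by a centroid and uses the Euclidean distance between consecutive centroids. The function name and sample coordinates are hypothetical.

```python
import math


def spatial_gap_candidates(centroids, threshold):
    """Return indices i such that a page break may be inserted before stroke i.

    Assumption: each stroke is reduced to an (x, y) centroid, and the
    spatial difference is the Euclidean distance between the centroids
    of consecutive strokes.
    """
    candidates = []
    for i in range(1, len(centroids)):
        (x0, y0), (x1, y1) = centroids[i - 1], centroids[i]
        gap = math.hypot(x1 - x0, y1 - y0)
        if gap >= threshold:               # "not below a threshold value"
            candidates.append(i)
    return candidates


# Hypothetical strokes: three near the top of a page, then a large jump.
strokes = [(10, 10), (12, 10), (14, 11), (11, 200), (13, 201)]
print(spatial_gap_candidates(strokes, threshold=50))
```
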
4. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein the step of automatically identifying one or more potential page breaks further comprises the steps of:
- measuring a temporal difference between consecutive pairs of strokes made in accordance with the handwriting system; and
- labeling temporal differences not below a threshold value as possible insertion points.
Dependent claims: 5.

6. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein the step of automatically identifying one or more potential page breaks further comprises the step of identifying as a possible insertion point a point before a stroke, made in accordance with the handwriting system, wherein the stroke falls within a constrained region on a page associated with the document and wherein the stroke is not immediately preceded by another stroke in the same constrained region.

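The constrained-region test of claim 6 can be sketched as below. The claim does not define the shape of the region or how a stroke is located, so the sketch assumes an axis-aligned rectangle and one representative point per stroke. It also assumes that the first stroke of the ink is never a candidate, since no page break can usefully precede it. All names are hypothetical.

```python
def in_region(point, region):
    """True if an (x, y) point lies inside an (x_min, y_min, x_max, y_max) box."""
    x, y = point
    x_min, y_min, x_max, y_max = region
    return x_min <= x <= x_max and y_min <= y <= y_max


def region_entry_candidates(points, region):
    """Indices of strokes that enter the constrained region.

    A stroke qualifies when it falls inside the region and the stroke
    immediately before it does not.
    """
    candidates = []
    for i in range(1, len(points)):
        if in_region(points[i], region) and not in_region(points[i - 1], region):
            candidates.append(i)
    return candidates


# Hypothetical header band at the top of each page.
header = (0, 0, 100, 20)
ink = [(50, 10), (50, 300), (50, 600), (40, 12), (60, 15), (50, 400), (50, 5)]
print(region_entry_candidates(ink, header))
```
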
7. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein, in an electronic document that corresponds to a form with fields, the step of automatically identifying one or more potential page breaks further comprises the steps of:
- computing a measure of field appropriateness for each stroke made in accordance with the handwriting system to indicate how well a stroke fits within a particular field; and
- for N consecutive strokes which do not fit the field of a particular page of the document, identifying a potential page break before these N consecutive strokes.
Dependent claims: 8.

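One way to picture the field-appropriateness steps of claim 7 is the sketch below. The claim leaves the measure itself open; here it is assumed to be the best fraction of a stroke's sample points that fall inside any single rectangular field of the current page, and a stroke is assumed "not to fit" when that fraction is below a hypothetical cut-off `min_fit`. Strokes are assumed to be non-empty lists of (x, y) points.

```python
def field_fit(stroke, fields):
    """Best fraction of the stroke's points lying inside any one field.

    Assumption: fields are (x_min, y_min, x_max, y_max) rectangles and
    the stroke is a non-empty list of (x, y) points.
    """
    best = 0.0
    for x_min, y_min, x_max, y_max in fields:
        inside = sum(1 for x, y in stroke
                     if x_min <= x <= x_max and y_min <= y <= y_max)
        best = max(best, inside / len(stroke))
    return best


def misfit_run_candidates(strokes, fields, n, min_fit=0.5):
    """Indices where a run of at least n consecutive misfitting strokes begins."""
    candidates = []
    run_start = None
    for i, stroke in enumerate(strokes):
        if field_fit(stroke, fields) < min_fit:
            if run_start is None:
                run_start = i
            if i - run_start + 1 == n:     # run has just reached length n
                candidates.append(run_start)
        else:
            run_start = None
    return candidates


# Hypothetical form page with a single field near the top.
page_fields = [(0, 0, 100, 20)]
fits = [(10, 5), (20, 6)]
misses = [(10, 50), (20, 55)]
print(misfit_run_candidates([fits, fits, misses, misses, misses, fits], page_fields, n=2))
```
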
9. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein the step of automatically identifying one or more potential page breaks further comprises the steps of:
- computing a measure of overlap for each stroke with a previous stroke; and
- for N consecutive strokes with a total measure of overlap which is not less than a threshold value, identifying a potential page break before these N consecutive strokes.

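The overlap steps of claim 9 might be sketched as follows. The overlap measure is not defined in the claim; this illustration assumes each stroke is represented by a bounding box and takes the overlap of a stroke to be its largest intersection area with any earlier stroke's box. Every window of N consecutive strokes whose summed overlap meets the threshold yields a candidate at the start of that window. The names and data are hypothetical.

```python
def intersection_area(a, b):
    """Area shared by two (x_min, y_min, x_max, y_max) boxes, 0 if disjoint."""
    width = min(a[2], b[2]) - max(a[0], b[0])
    height = min(a[3], b[3]) - max(a[1], b[1])
    return width * height if width > 0 and height > 0 else 0.0


def overlap_scores(boxes):
    """For each stroke, the largest overlap with any earlier stroke."""
    return [
        max((intersection_area(box, prev) for prev in boxes[:i]), default=0.0)
        for i, box in enumerate(boxes)
    ]


def overlap_candidates(boxes, n, threshold):
    """Start indices of n-stroke windows whose total overlap is >= threshold."""
    scores = overlap_scores(boxes)
    return [start for start in range(len(scores) - n + 1)
            if sum(scores[start:start + n]) >= threshold]


# Hypothetical ink: three strokes on one page, then two written over
# the same positions after the physical page was turned.
ink_boxes = [(0, 0, 10, 10), (20, 0, 30, 10), (40, 0, 50, 10),
             (0, 0, 10, 10), (20, 0, 30, 10)]
print(overlap_candidates(ink_boxes, n=2, threshold=150))
```
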
10. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein the step of automatically identifying one or more potential page breaks further comprises the steps of:
- computing a moving average of spatial positions of strokes, made in accordance with the handwriting system, on a page using a predetermined window width, the computation of the moving average resulting in a spatial position moving average curve;
- computing a moving average of a slope associated with the spatial position moving average curve, the computation of the moving average resulting in a slope moving average curve; and
- identifying one or more negative slopes in the slope moving average curve as potential page breaks.
Dependent claims: 11.

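The moving-average steps of claim 10 can be sketched as below. Several details are assumptions rather than statements from the source: the spatial position is taken to be a single vertical coordinate per stroke that grows down the page, the moving average is a trailing window that shortens at the start of the sequence, the slope is the difference between successive points on the position curve, and a candidate is reported only where the smoothed slope first turns negative.

```python
def moving_average(values, width):
    """Trailing moving average; early entries average over fewer samples."""
    averaged = []
    for i in range(len(values)):
        window = values[max(0, i - width + 1): i + 1]
        averaged.append(sum(window) / len(window))
    return averaged


def negative_slope_candidates(y_positions, width):
    """Stroke indices where the smoothed slope of the position curve turns negative."""
    position_curve = moving_average(y_positions, width)
    # Slope k describes the step from stroke k to stroke k + 1.
    slopes = [b - a for a, b in zip(position_curve, position_curve[1:])]
    slope_curve = moving_average(slopes, width)
    candidates = []
    for k, slope in enumerate(slope_curve):
        turned_negative = slope < 0 and (k == 0 or slope_curve[k - 1] >= 0)
        if turned_negative:
            candidates.append(k + 1)
    return candidates


# Hypothetical ink: writing moves down one page, then restarts at the top.
ys = [10, 20, 30, 40, 50, 60, 10, 20, 30, 40]
print(negative_slope_candidates(ys, width=2))
```

Wider windows suppress more noise but also delay the point at which the smoothed slope changes sign, so the reported index should be read as approximate.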
12. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein the step of automatically identifying one or more potential page breaks further comprises the steps of:
- computing a moving average of temporal positions of strokes, made in accordance with the handwriting system, on a page using a predetermined window width, the computation of the moving average resulting in a temporal position moving average curve;
- computing a moving average of a slope associated with the temporal position moving average curve, the computation of the moving average resulting in a slope moving average curve; and
- identifying one or more positive slopes in the slope moving average curve as potential page breaks.
Dependent claims: 13.

14. A computer-based method of processing an electronic document generated in accordance with a handwriting system, the method comprising the steps of:
- obtaining electronic ink data from the handwriting system, the ink data being associated with the electronic document; and
- automatically identifying, using at least a portion of the electronic ink data, one or more potential page breaks for possible insertion in the electronic document to maintain a page correspondence between the electronic document and a physical document, also generated in accordance with the handwriting system, and so as to at least partially reduce asynchrony between an electronic page and a physical page;
wherein the step of automatically identifying one or more potential page breaks further comprises the steps of:
- performing two or more scoring procedures, each scoring procedure generating a list whose elements include a possible insertion point and a corresponding score;
- merging the lists generated by the two or more scoring procedures to form a combined list; and
- selecting one or more top scoring possible insertion points as the one or more potential page breaks.
Dependent claims: 15, 16.

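The merge-and-select steps of claim 14 can be illustrated with the following sketch. The claim does not say how scores from different procedures are combined; this example assumes an unweighted sum per insertion point and a fixed number of top results, both of which are illustrative choices. The list names and score values are hypothetical.

```python
from collections import defaultdict


def merge_and_select(score_lists, top_k):
    """Merge per-procedure (insertion_point, score) lists and keep the best.

    Assumption: scores for the same insertion point are summed without
    weighting. Ties are broken by the smaller insertion point.
    """
    combined = defaultdict(float)
    for scores in score_lists:
        for point, score in scores:
            combined[point] += score
    ranked = sorted(combined.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_k]


# Hypothetical outputs of two scoring procedures.
spatial_scores = [(6, 0.75), (3, 0.25)]
temporal_scores = [(6, 0.5), (9, 0.5)]
print(merge_and_select([spatial_scores, temporal_scores], top_k=2))
```

An insertion point proposed by several procedures accumulates score and so rises in the combined list, which is one plausible reading of why the lists are merged before selection.
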
Specification