Line extraction in digital ink
First Claim
Patent Images
1. A searchable electronic filing system including:
- (a) a first database for storing a digital representation of each of a plurality of printed documents, each printed document including machine-readable identification data which distinguishes it from the others;
(b) a second database for storing annotation digital ink, the annotation digital ink representing at least one physical handwritten annotation on at least one of the printed documents;
(c) a processor configured to;
i. segment the annotation digital ink into lines by;
obtaining a stroke sequence comprised of a sequence of strokes;
removing small strokes less than a threshold length;
segmenting the strokes into a sequence of substrokes by;
computing a relative angle for each line segment connecting successive stroke data points; and
segmenting the stroke into substrokes if the difference between successive relative angles is greater than a preset threshold; and
identifying the substrokes at line breaks, thereby facilitating line extraction, by;
computing N-neighbourhood centroids;
computing angular differences between N-neighbouring substrokes, or groups of substrokes; and
computing local maxima of the angular differences in the sequence of substrokes;
ii. accept a digital ink query;
iii. match the digital ink query against at least some of the annotation digital ink in the second database, to thereby identify a corresponding one of the documents in the first database.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and system of line extraction in a digital ink sequence of handwritten text data points, the method including the steps of: obtaining are provided in which a stroke sequence comprised of a sequence of are strokes is obtained, the strokes are segmented into a sequence of substrokes by applying a stroke segmentation algorithm angular differences are calculated between neighboring groups of substrokes, in the sequence of substrokes, and the positions of the extrema of the angular differences are determined, thereby indentifying the substrokes at line breaks and enabling line extraction.
-
Citations
2 Claims
-
1. A searchable electronic filing system including:
-
(a) a first database for storing a digital representation of each of a plurality of printed documents, each printed document including machine-readable identification data which distinguishes it from the others; (b) a second database for storing annotation digital ink, the annotation digital ink representing at least one physical handwritten annotation on at least one of the printed documents; (c) a processor configured to; i. segment the annotation digital ink into lines by; obtaining a stroke sequence comprised of a sequence of strokes; removing small strokes less than a threshold length; segmenting the strokes into a sequence of substrokes by; computing a relative angle for each line segment connecting successive stroke data points; and segmenting the stroke into substrokes if the difference between successive relative angles is greater than a preset threshold; and identifying the substrokes at line breaks, thereby facilitating line extraction, by; computing N-neighbourhood centroids; computing angular differences between N-neighbouring substrokes, or groups of substrokes; and computing local maxima of the angular differences in the sequence of substrokes; ii. accept a digital ink query; iii. match the digital ink query against at least some of the annotation digital ink in the second database, to thereby identify a corresponding one of the documents in the first database. - View Dependent Claims (2)
-
Specification