Method and system for separating text and drawings in digital ink
First Claim
1. A computer readable medium having computer-executable instructions, comprising,accessing a plurality of stroke samples, the stroke samples representing more than one class, wherein at least on class represented is a text class and at least one class represented is a drawing class;
- extracting curvature features of each of the strokes for each class; and
using the curvature features, training a support vector machine to classify strokes for each class, wherein the curvature features of a stroke comprise a discrete curvature stroke, the discrete curvature being defined using a difference between angles determined in accordance with points along the stroke.
2 Assignments
0 Petitions
Accused Products
Abstract
A system for separating text and drawings in a digital ink file (e.g., a handwritten digital ink file). A stroke analyzer classifies single strokes that have been input by a user as “text” or “unknown.” The stroke analyzer utilizes a trainable classifier, such as a support vector machine. A grouping component is provided that groups text strokes in an attempt to form text objects (e.g., words, characters, or letters). The grouping component also groups unknown strokes in an attempt to form objects (e.g., shapes, drawings, or even text). A trainable classifier, such as a support vector machine, evaluates the grouped strokes to determine if they are objects.
30 Citations
14 Claims
-
1. A computer readable medium having computer-executable instructions, comprising,
accessing a plurality of stroke samples, the stroke samples representing more than one class, wherein at least on class represented is a text class and at least one class represented is a drawing class; -
extracting curvature features of each of the strokes for each class; and using the curvature features, training a support vector machine to classify strokes for each class, wherein the curvature features of a stroke comprise a discrete curvature stroke, the discrete curvature being defined using a difference between angles determined in accordance with points along the stroke. - View Dependent Claims (2, 3, 4)
-
-
5. A computer readable medium having computer-executable instructions, comprising:
-
accessing a digital ink file having at least one stroke therein; extracting curvature features of the at least one stroke; based upon an analysis of the curvature features, determining whether the at least one stroke is text by evaluating the stroke with a support vector machine; and based upon the curvature features, determining whether the at least one stroke is classified as an unknown stroke. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A computer readable medium having computer-executable instructions, comprising:
-
accessing a digital ink file having at least one stroke therein; extracting the tangent histogram of the at least one stroke; based upon an analysis of the the tangent histogram, determining whether the at least one stroke is text; and based upon the the tangent histogram, determining whether the at least one stroke is classified as an unknown stroke.
-
Specification