Distinguishing text from non-text in digital ink
First Claim
Patent Images
1. A method of distinguishing between text strokes and non-text strokes in digital ink, the method comprising:
- detecting a gap between a first stroke and a second stroke in the digital ink;
generating a classification model based on at least one gap feature of the gap, wherein the model defines a posterior probability of the second stroke being a test stroke or a non-text stroke; and
labeling the gap as a change gap or a non-change gap based on the posterior probability.
2 Assignments
0 Petitions
Accused Products
Abstract
A discriminative machine learning system for labels text and non-text strokes in digital ink. The learning system considers stroke features and the context of the strokes, such as temporal information about one or more strokes, in a probabilistic framework. The learning system can also consider gap features within the probabilistic framework to label associated strokes.
26 Citations
63 Claims
-
1. A method of distinguishing between text strokes and non-text strokes in digital ink, the method comprising:
-
detecting a gap between a first stroke and a second stroke in the digital ink;
generating a classification model based on at least one gap feature of the gap, wherein the model defines a posterior probability of the second stroke being a test stroke or a non-text stroke; and
labeling the gap as a change gap or a non-change gap based on the posterior probability. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer program product encoding a computer program that executes a computer process for distinguishing between text strokes and non-text strokes in digital ink, the computer process comprising:
-
detecting a gap between a first stroke and a second stroke in the digital ink;
generating a classification model based on at least one gap feature of the gap, wherein the model defines a posterior probability of the second stroke being a test stroke or a non-text stroke; and
labeling the gap as a change gap or a non-change gap based on the posterior probability. - View Dependent Claims (10, 11, 12, 13, 14, 15, 16)
-
-
17. A method of distinguishing between text strokes and non-text strokes in digital ink, the method comprising:
-
detecting a sequence of at least two strokes in the digital ink;
generating a classification model based on at least one stroke feature of at least one of the strokes, wherein the model incorporates a transition probability between the at least two stokes; and
labeling the sequence as a sequence of text strokes or non-text strokes based on the transition probability. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
-
29. A computer program product encoding a computer program that executes a computer process for distinguishing between text strokes and non-text strokes in digital ink, the computer process comprising:
-
detecting a sequence of at least two strokes in the digital ink;
generating a classification model based on at least one stroke feature of at least one of the strokes, wherein the model incorporates a transition probability between the at least two strokes; and
labeling the sequence as a sequence of text strokes or non-text strokes based on the transition probability. - View Dependent Claims (30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40)
-
-
41. A method of distinguishing between text strokes and non-text strokes in digital ink, the method comprising:
-
detecting a stroke in the digital ink;
generating a classification model based on at least one stroke feature of the stroke, wherein the model defines a posterior probability that the stroke is a text stroke; and
labeling the stroke as a text stroke or a non-text stroke based on the posterior probability. - View Dependent Claims (42, 43, 44, 45, 46, 47, 48, 49, 50)
-
-
51. A computer program product encoding a computer program for distinguishing between text strokes and non-text strokes in digital ink, the computer process comprising:
-
detecting a stroke in the digital ink;
generating a classification model based on at least one stroke feature of the stroke, wherein the model defines a posterior probability that the stroke is a text stroke; and
labeling the stroke as a text stroke or a non-text stroke based on the posterior probability. - View Dependent Claims (52, 53, 54, 55, 56, 57, 58, 59, 60)
-
-
61. A system for distinguishing between text strokes and non-text strokes in digital ink, the system comprising:
-
a stroke feature extractor module that detects a stroke in the digital ink; and
a stroke classifier module that generates a classification model based on at least one stroke feature of the stroke, wherein the model defines a posterior probability that the stoke is a text stroke, and labels the stroke as a text stroke or a non-text stroke based on the posterior probability.
-
-
62. A system for distinguishing between text strokes and non-text strokes in digital ink, the system comprising:
-
a stroke feature extractor module that detects a sequence of at least two strokes in the digital ink; and
a stroke classifier module that generates a classification model based on at least one stroke feature of a least one of the strokes; and
a sequence model that incorporates a transition probability between the at least two strokes and labels the sequence of strokes as a sequence of text strokes or non-text strokes based on the transition probability and the classification model.
-
-
63. A system for distinguishing between text strokes and non-text strokes in digital ink, the system comprising:
-
a gap feature extractor module detecting a gap between a first stroke and a second stroke in the digital ink; and
a gap classifier module generating a classification model based on at least one gap feature of the gap; and
a sequence model that defines a posterior probability of the second stroke being a test stroke or a non-text stroke and labels the gap as a change gap or a non-change gap based on the posterior probability. 308238.01 MS 1-2023US
-
Specification