Contextualizing noisy samples by substantially minimizing noise induced variance
First Claim
1. A method of contextualizing noisy samples comprising:
- receiving a sample comprising exemplar content corresponding to one of a plurality of exemplars and noise, wherein the noise comprises a modification to the exemplar content relative to the exemplar content as originally comprised by a physical object from which the sample was generated, and wherein variance induced by the noise differentiates the sample from one or more of the plurality of exemplars;
generalizing the sample and the plurality of exemplars in order to minimize the variance induced by the noise;
comparing the generalized sample to each of the plurality of generalized exemplars to identify which of the plurality of exemplars corresponds to the exemplar content of the sample;
contextualizing the sample based on a document type corresponding to the identified exemplar of the plurality of exemplars; and
presenting the contextualized sample to a user to facilitate interpretation thereof and, in response thereto, receiving data representative of a user determination associated with the noise.
2 Assignments
0 Petitions
Accused Products
Abstract
A system for contextualizing noisy samples by substantially minimizing noise induced variance may include a memory, an interface, and a processor. The memory is operative to store exemplars. The processor is operative to receive, via the interface, a sample which includes exemplar content corresponding to one of the exemplars, and noise. Variance induced by the noise may differentiate the sample from one or more of the exemplars. The processor may generalize the sample and the exemplars in order to substantially minimize the variance. The processor may compare the generalized sample to the generalized exemplars to identify the exemplar corresponding to the exemplar content of the sample. The processor may contextualize the sample based on a document type of the identified exemplar. The processor may present the contextualized sample to a user to facilitate interpretation thereof, and in response thereto, receive data representative of a user determination associated with the noise.
51 Citations
26 Claims
-
1. A method of contextualizing noisy samples comprising:
-
receiving a sample comprising exemplar content corresponding to one of a plurality of exemplars and noise, wherein the noise comprises a modification to the exemplar content relative to the exemplar content as originally comprised by a physical object from which the sample was generated, and wherein variance induced by the noise differentiates the sample from one or more of the plurality of exemplars; generalizing the sample and the plurality of exemplars in order to minimize the variance induced by the noise; comparing the generalized sample to each of the plurality of generalized exemplars to identify which of the plurality of exemplars corresponds to the exemplar content of the sample; contextualizing the sample based on a document type corresponding to the identified exemplar of the plurality of exemplars; and presenting the contextualized sample to a user to facilitate interpretation thereof and, in response thereto, receiving data representative of a user determination associated with the noise. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 25)
-
-
9. A method of contextualizing noisy samples comprising:
-
receiving a sample electronic document image comprising exemplar information corresponding to one of a plurality of exemplar electronic document images and random information, wherein the random information comprises a modification to the exemplar information relative to the exemplar information as originally comprised by a physical object from which the sample electronic document image was generated, and wherein variance induced by the random information renders the sample electronic document image distinguishable from one or more of the plurality of exemplar electronic document images; applying a filter to the sample electronic document image and the plurality of exemplar electronic document images in order to minimize the variance induced by the random information of the sample electronic document image; identifying the one of the plurality of exemplar electronic document images corresponding to the exemplar information of the sample electronic document image by comparing the filtered sample electronic document image to each of the plurality of filtered exemplar electronic document images; contextualizing the sample electronic document image based on a document type corresponding to the identified exemplar electronic document image; and presenting the contextualized sample electronic document image to a user to facilitate interpretation thereof and, in response thereto, receiving data representative of a user determination associated with the random information.
-
-
10. A method of contextualizing noisy samples comprising:
-
receiving a sample electronic document image comprising exemplar content and noise, wherein the noise comprises a modification to the exemplar content relative to the exemplar content as originally comprised by a physical object from which the sample electronic document image was generated; generating a plurality of transformations of the sample electronic document image, wherein each of the plurality of transformations of the sample electronic document image corresponds to a different level of filtering of the exemplar content and the noise; comparing the plurality of transformations of the sample electronic document image to a plurality of transformations of each of a plurality of exemplar electronic document images; determining one of the plurality of exemplar electronic document images of which a greatest number of the plurality of transformations satisfy a matching criteria with respect to the plurality of transformations of the sample electronic document image having a same level of filtering; contextualizing the sample electronic document image based on a document type corresponding to the determined one of the plurality of exemplar electronic document images; and presenting the contextualized sample electronic document image to a user to facilitate interpretation thereof and, in response thereto, receiving data representative of a user determination associated with the noise. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A method of classifying an electronic document image, the method comprising:
-
receiving a sample electronic document image comprising content and noise, wherein the noise comprises a modification to the content relative to the content as originally comprised by a physical object from which the sample electronic document image was generated; filtering the sample electronic document image to account for variance induced by the noise; filtering a the plurality of exemplar electronic document images to a same level of filtering as the sample electronic document image; determining one of the plurality of exemplar electronic document images corresponding to the content of the sample electronic document image by comparing the filtered sample electronic document image to each of the plurality of filtered exemplar electronic document images; contextualizing the sample electronic document based on a document type corresponding to the determined one of the plurality of exemplar electronic document images; and presenting the contextualized sample electronic document image to a user to facilitate interpretation thereof and, in response thereto, receiving data representative of a user determination associated with the noise.
-
-
16. A method of classifying an electronic document image, the method comprising:
-
identifying a plurality of exemplar electronic document images wherein each of the plurality of exemplar electronic document images is characterized by a document type; receiving a sample electronic document image comprising a variation of one of the plurality of exemplar electronic document images, wherein the variation comprises a modification to content as originally comprised by a physical object from which the sample electronic document image was generated, and wherein variance induced by the variation distinguishes the sample electronic document image from one or more of the plurality of exemplar electronic document images; filtering the sample electronic document image and the plurality of exemplar electronic document images in order to minimize the variance induced by the variation; comparing the filtered sample electronic document image to each of the plurality of filtered exemplar electronic document images to determine one of the plurality of exemplar electronic document images corresponding to the sample electronic document image; contextualizing the sample electronic document based on the document type corresponding to the determined one of the plurality of exemplar electronic document images; and presenting the contextualized sample electronic document image to a user to facilitate interpretation thereof and, in response thereto, receiving data representative of a user determination associated with the variation.
-
-
17. A system for contextualizing noisy samples comprising:
-
a memory operative to store a plurality of exemplars; an interface coupled with the memory and operative to receive a sample comprising exemplar content corresponding to one of the plurality of exemplars and noise, wherein the noise comprises a modification to the exemplar content relative to the exemplar content as originally comprised by a physical object from which the sample was generated; and a processor coupled with the interface and operative to receive, via the interface, the sample comprising the exemplar content corresponding to one of the plurality of exemplars and the noise, wherein variance induced by the noise differentiates the sample from one or more of the plurality of exemplars, generalize the sample and the plurality of exemplars to minimize the variance induced by the noise, compare the generalized sample to each of the plurality of generalized exemplars to identify which of the plurality of exemplars corresponds to the exemplar content of the sample, contextualize the sample based on a document type corresponding to the identified exemplar, and present the contextualized sample to a user to facilitate interpretation thereof, and in response thereto, receive data representative of a user determination associated with the noise. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 26)
-
Specification