SYSTEMS, DEVICES, AND/OR METHODS FOR MANAGING SAMPLE SELECTION BIAS
First Claim
Patent Images
1. A system for managing sample selection bias, comprising;
- a memory that stores instructions; and
a processor that executes the instructions to perform operations, the operations comprising;
randomly selecting sentences in a source language to obtain background data, wherein the background data has a background data sample selection bias that is substantially equivalent to an occurrence data sample selection bias of occurrence data;
sampling sentences in the source language that contain a predetermined word to obtain the occurrence data; and
determining an unbiased estimate of a distribution utilized for language translation from the occurrence data, wherein the occurrence data is related to the background data.
3 Assignments
0 Petitions
Accused Products
Abstract
Certain exemplary embodiments can provide a method that can include, via a special purpose processor, automatically determining an unbiased estimate of a distribution from occurrence data having an occurrence data sample selection bias substantially equivalent to a background data sample selection bias, the occurrence data related to background data, the background data chosen with the background data sample selection bias, the occurrence data representing a physically-measurable variable of one or more physical and tangible objects or substances.
-
Citations
20 Claims
-
1. A system for managing sample selection bias, comprising;
-
a memory that stores instructions; and a processor that executes the instructions to perform operations, the operations comprising; randomly selecting sentences in a source language to obtain background data, wherein the background data has a background data sample selection bias that is substantially equivalent to an occurrence data sample selection bias of occurrence data; sampling sentences in the source language that contain a predetermined word to obtain the occurrence data; and determining an unbiased estimate of a distribution utilized for language translation from the occurrence data, wherein the occurrence data is related to the background data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for managing sample selection bias, comprising;
-
randomly selecting sentences in a source language to obtain background data, wherein the background data has a background data sample selection bias that is substantially equivalent to an occurrence data sample selection bias of occurrence data; sampling sentences in the source language that contain a predetermined word to obtain the occurrence data; and determining an unbiased estimate of a distribution utilized for speech interpretation from the occurrence data by utilizing instructions stored in memory and executed by a processor, wherein the occurrence data is related to the background data. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A tangible computer-readable medium comprising instructions, which when executed by a processor, cause the processor to perform operations comprising:
-
randomly selecting sentences in a source language to obtain background data, wherein the background data has a background data sample selection bias that is substantially equivalent to an occurrence data sample selection bias of occurrence data; sampling sentences in the source language that contain a predetermined word to obtain the occurrence data; and determining an unbiased estimate of a distribution from the occurrence data, wherein the occurrence data is related to the background data.
-
Specification