SPEECH-TO-TEXT TRAINING DATA BASED ON INTERACTIVE RESPONSE DATA
Abstract
A device includes a processor configured to, in response to determining that an input phrase includes a first term that is included in a term hierarchy, generate a second phrase by replacing the first term in the input phrase with a second term included in the term hierarchy. The processor is configured to determine that interactive response (IR) training data indicates that the input phrase is associated with a user intent indicator. The processor is configured to determine that user interaction data indicates that a first proportion of user phrases received by an IR system corresponds to the user intent indicator. The processor is configured to update speech-to-text training data based on the input phrase and the second phrase so that a second proportion of training phrases of the speech-to-text training data corresponds to the user intent indicator. The second proportion is based on the first proportion. A speech-to-text model is trained based on the speech-to-text training data.
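The phrase-generation step in the abstract can be illustrated with a minimal sketch. It assumes the term hierarchy is a simple mapping from a parent term to its child terms and that a "second term" is a sibling of the first term under the same parent; neither detail is stated in the text above. The hierarchy contents and the function name `generate_second_phrases` are hypothetical.

```python
import re

# Hypothetical term hierarchy: parent term -> child terms (an assumption,
# the source does not specify how the hierarchy is represented).
TERM_HIERARCHY = {
    "card": ["credit card", "debit card", "gift card"],
    "account": ["checking account", "savings account"],
}


def generate_second_phrases(input_phrase, hierarchy):
    """Return variants of input_phrase with a hierarchy term swapped for its siblings."""
    variants = []
    for siblings in hierarchy.values():
        for first_term in siblings:
            # Match the term only on word boundaries.
            pattern = r"\b" + re.escape(first_term) + r"\b"
            if not re.search(pattern, input_phrase):
                continue
            for second_term in siblings:
                if second_term != first_term:
                    # A callable replacement avoids escape handling in the new term.
                    variants.append(
                        re.sub(pattern, lambda _m, t=second_term: t, input_phrase)
                    )
    return variants


if __name__ == "__main__":
    for phrase in generate_second_phrases("cancel my credit card", TERM_HIERARCHY):
        print(phrase)
```

Each generated phrase would inherit the user intent indicator of the input phrase it was derived from, which is what lets the later proportion step treat originals and variants as one pool per intent.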
20 Claims
1. A device comprising:
a memory configured to store speech-to-text training data; and
a processor configured to:
access interactive response (IR) training data of an IR system, the IR training data associating input phrases supported by the IR system to user intent indicators;
in response to determining that a first input phrase of the input phrases includes a first term that is included in a term hierarchy, generate a second phrase by replacing the first term in the first input phrase with a second term included in the term hierarchy;
determine that the IR training data indicates that the first input phrase is associated with a first user intent indicator;
determine that user interaction data indicates that a first proportion of user phrases received by the IR system from users corresponds to the first user intent indicator; and
update the speech-to-text training data based on the first input phrase and the second phrase so that a second proportion of training phrases of the speech-to-text training data corresponds to the first user intent indicator, the second proportion based on the first proportion, wherein a speech-to-text model is trained based on the speech-to-text training data.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10

11. A method comprising:
accessing, at a device, interactive response (IR) training data of an IR system, the IR training data associating input phrases supported by the IR system to user intent indicators;
determining, at the device, that a first input phrase of the input phrases includes a first term that is included in a term hierarchy;
in response to determining that the first input phrase includes the first term, generating a second phrase by replacing the first term in the first input phrase with a second term included in the term hierarchy;
determining, at the device, that the IR training data indicates that the first input phrase is associated with a first user intent indicator;
determining, at the device, that user interaction data indicates that a first proportion of user phrases received by the IR system from users corresponds to the first user intent indicator; and
updating, at the device, speech-to-text training data based on the first input phrase and the second phrase so that a second proportion of training phrases of the speech-to-text training data corresponds to the first user intent indicator, the second proportion based on the first proportion, wherein a speech-to-text model is trained based on the speech-to-text training data.
Dependent claims: 12, 13, 14, 15, 16, 17, 18

19. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to perform operations comprising:
accessing interactive response (IR) training data of an IR system, the IR training data associating input phrases supported by the IR system to user intent indicators;
determining that a first input phrase of the input phrases includes a first term that is included in a term hierarchy;
in response to determining that the first input phrase includes the first term, generating a second phrase by replacing the first term in the first input phrase with a second term included in the term hierarchy;
determining that the IR training data indicates that the first input phrase is associated with a first user intent indicator;
determining that user interaction data indicates that a first proportion of user phrases received by the IR system from users corresponds to the first user intent indicator; and
updating speech-to-text training data based on the first input phrase and the second phrase so that a second proportion of training phrases of the speech-to-text training data corresponds to the first user intent indicator, the second proportion based on the first proportion, wherein a speech-to-text model is based on the speech-to-text training data.
Dependent claims: 20

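The proportion-matching step recited in the independent claims can be sketched as follows. This is a minimal illustration under stated assumptions, not the claimed implementation: it assumes the user interaction data is a list of observed intent labels, that the second proportion is set equal to the first (the claims only require it to be "based on" the first), and that phrases are repeated cyclically to fill each intent's quota. The function names, intent labels, and `target_size` parameter are hypothetical.

```python
from collections import Counter
from itertools import cycle, islice


def intent_proportions(observed_intents):
    """First proportion: share of received user phrases per intent indicator."""
    counts = Counter(observed_intents)
    total = sum(counts.values())
    return {intent: n / total for intent, n in counts.items()}


def build_stt_training_phrases(phrases_by_intent, proportions, target_size):
    """Fill the speech-to-text training set so each intent's share tracks usage.

    Assumption: the second proportion simply equals the first proportion.
    """
    training = []
    for intent, phrases in phrases_by_intent.items():
        quota = round(proportions.get(intent, 0.0) * target_size)
        # Repeat the available phrases (originals plus generated variants)
        # in order until the quota for this intent is met.
        training.extend((p, intent) for p in islice(cycle(phrases), quota))
    return training


if __name__ == "__main__":
    observed = ["cancel_card"] * 3 + ["check_balance"]
    phrases = {
        "cancel_card": ["cancel my credit card", "cancel my debit card"],
        "check_balance": ["what is my balance"],
    }
    props = intent_proportions(observed)
    for phrase, intent in build_stt_training_phrases(phrases, props, target_size=8):
        print(intent, "|", phrase)
```

Cycling through phrases is only one way to meet a quota; weighted sampling or per-phrase weights in the training loss would serve the same purpose and are equally consistent with the claim language.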
Specification