System and method for identifying semantic intent from acoustic information
First Claim
1. A computer implemented method of processing acoustic information, comprising:
extracting data representing a plurality of sets of acoustic information of interest from a data store;
performing speech recognition using a computer with an application grammar and a second grammar on the data to obtain speech recognition results;
identifying whether the speech recognition results are generated with the application grammar or the second grammar;
performing a semantic analysis of the second data;
clustering the sets of acoustic information into clusters based on the semantic analysis of the speech recognition results;
ranking the clusters based on a number of instances of utterances contained in each cluster;
removing clusters based on a consistency threshold calculation of each cluster that indicates when a cluster has a number of unlike utterances that have meaningful semantics;
identifying, for each cluster, and storing an indicator of a set of acoustic information as being representative of a corresponding cluster, wherein the identified set of acoustic information is selected from the sets of acoustic information in each corresponding cluster;
identifying the representative set of acoustic information for a given cluster as representing either a semantic intent covered by the application or as an unrepresented semantic intent based on the speech recognition results assigned to the given cluster were generated using the application grammar or the second grammar; and
generating a revision to the application grammar to accommodate for the unrepresented semantic intent.
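The clustering, ranking, consistency filtering, and covered/unrepresented labeling steps of claim 1 can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the `Recognition` record and its field names, the use of a coarse `topic` key for clustering, the definition of consistency as the share of the most common intent among utterances that carry an intent, the 0.6 threshold, and the majority rule over grammars.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass
class Recognition:
    """One recognized utterance. All field names are hypothetical."""
    audio_id: str  # pointer to the stored acoustic information
    topic: str     # coarse key from semantic analysis, used for clustering
    intent: str    # finer semantic label; "" when no meaningful semantics
    grammar: str   # "application" or "second": which grammar matched


def cluster(results):
    """Group utterances that share the same coarse semantic key."""
    clusters = defaultdict(list)
    for r in results:
        clusters[r.topic].append(r)
    return clusters


def consistency(members):
    """Share of the most common intent among utterances that have one.

    This is an assumed stand-in for the claimed consistency threshold
    calculation: a low value means many unlike but meaningful utterances.
    """
    intents = [m.intent for m in members if m.intent]
    if not intents:
        return 0.0
    return Counter(intents).most_common(1)[0][1] / len(intents)


def label(members):
    """Assumed rule: a cluster matched mostly by the second grammar
    is treated as an unrepresented semantic intent."""
    second = sum(1 for m in members if m.grammar == "second")
    return "unrepresented" if second > len(members) / 2 else "covered"


def analyze(results, min_consistency=0.6):
    """Cluster, rank by utterance count, drop inconsistent clusters, label."""
    ranked = sorted(cluster(results).items(),
                    key=lambda kv: len(kv[1]), reverse=True)
    return [(topic, len(members), label(members))
            for topic, members in ranked
            if consistency(members) >= min_consistency]


if __name__ == "__main__":
    sample = [
        Recognition("a1", "billing", "pay_bill", "application"),
        Recognition("a2", "billing", "pay_bill", "application"),
        Recognition("a3", "cancel", "cancel_service", "second"),
    ]
    for row in analyze(sample):
        print(row)
```

Clusters labeled "unrepresented" are the candidates for which a revision to the application grammar would then be drafted; that revision step is not modeled here.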
Abstract
In accordance with one embodiment of the present invention, unanticipated semantic intents are discovered in audio data in an unsupervised manner. For instance, the audio acoustics are clustered based on semantic intent and representative acoustics are chosen for each cluster. A human reviewer then need only listen to a small number of representative acoustics for each cluster (possibly only one per cluster) to identify the unforeseen semantic intents.
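The idea of choosing a few representative acoustics per cluster can be sketched as follows. The abstract does not say how representatives are chosen, so ranking by recognizer confidence is purely an assumption, as are the function name `pick_representatives` and the `(audio_id, confidence)` tuple layout.

```python
def pick_representatives(clusters, per_cluster=1):
    """Return up to `per_cluster` audio ids for each cluster.

    `clusters` maps a cluster name to a list of (audio_id, confidence)
    tuples. Selecting by highest confidence is an assumed heuristic,
    not the method described in the patent.
    """
    reps = {}
    for name, members in clusters.items():
        best = sorted(members, key=lambda m: m[1], reverse=True)[:per_cluster]
        reps[name] = [audio_id for audio_id, _ in best]
    return reps


if __name__ == "__main__":
    clusters = {
        "cancel": [("a4", 0.71), ("a5", 0.93)],
        "billing": [("a1", 0.88)],
    }
    print(pick_representatives(clusters))
```

With `per_cluster=1`, a reviewer would listen to a single recording per cluster, which matches the "possibly only one per cluster" case mentioned in the abstract.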
33 Claims
23. A system for processing acoustic information, comprising:
a computer memory including instructions to execute a clustering component configured to:
cluster sets of acoustic information, from an application, into clusters based on a semantic analysis of speech recognition results of speech recognition performed on the sets of acoustic information;
rank the clusters based on a number of instances of utterances contained in each cluster;
remove clusters based on a consistency threshold calculation of each cluster that indicates when a cluster has a number of unlike utterances that have meaningful semantics;
identify, for each cluster, a set of acoustic information as being representative of a corresponding cluster, wherein the identified set of acoustic information is selected from the sets of acoustic information in each corresponding cluster;
identify the representative set of acoustic information for a given cluster as representing either a semantic intent covered by the application or as an unrepresented semantic intent based on the speech recognition results assigned to the given cluster were generated using an application grammar or a second grammar; and
generate a revision to the application grammar to accommodate for the unrepresented semantic intent.
31. A computer storage medium storing instructions which, when executed by a computer, cause the computer to process acoustic information by performing steps of:
extracting a plurality of sets of acoustic information of interest from a data store;
performing speech recognition on the acoustic information to obtain speech recognition results;
identifying whether the speech recognition results are generated with the application grammar or the second grammar;
clustering the sets of acoustic information into clusters based on a semantic analysis of the speech recognition results;
ranking the clusters based on a number of instances of utterances contained in each cluster;
removing clusters based on a consistency threshold calculation of each cluster that indicates when a cluster has a number of unlike utterances that have meaningful semantics;
identifying, for each cluster, and storing an indicator of a set of acoustic information as being representative of a corresponding cluster, wherein the identified set of acoustic information is selected from the sets of acoustic information in each corresponding cluster;
selecting, for each cluster, a set of acoustic information from the sets of acoustic information in a particular cluster as being representative of the particular cluster;
identifying the representative set of acoustic information for a given cluster as representing either a semantic intent covered by the application or as an unrepresented semantic intent based on the speech recognition results assigned to the given cluster were generated using the application grammar or the second grammar; and
generating a revision to the application grammar to accommodate for the unrepresented semantic intent.
Specification