Grammar fragment acquisition using syntactic and semantic clustering
First Claim
Patent Images
1. A method comprising:
- selecting candidate multi-word phrases from a set of words, wherein a maximum number of candidate multi-word phrases is based on a highest of a number of preceding contexts, a number of candidate fragments, and a number of call-types;
using a semantic similarity and a syntactic similarity in the candidate multi-word phrases to generate semantic measurements and syntactic measurements associated with a succeeding context of a succeeding phrase and a preceding context of a preceding phrase; and
clustering, via a processor, the candidate multi-word phrases into a grammar fragment based on the semantic measurements and the syntactic measurements, to yield a clustered grammar fragment representing similar phrases that are both syntactically and semantically coherent.
4 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus are provided for automatically acquiring grammar fragments for recognizing and understanding fluently spoken language. Grammar fragments representing a set of syntactically and semantically similar phrases may be generated using three probability distributions: of succeeding words, of preceding words, and of associated call-types. The similarity between phrases may be measured by applying Kullback-Leibler distance to these three probability distributions. Phrases being close in all three distances may be clustered into a grammar fragment.
-
Citations
16 Claims
-
1. A method comprising:
-
selecting candidate multi-word phrases from a set of words, wherein a maximum number of candidate multi-word phrases is based on a highest of a number of preceding contexts, a number of candidate fragments, and a number of call-types; using a semantic similarity and a syntactic similarity in the candidate multi-word phrases to generate semantic measurements and syntactic measurements associated with a succeeding context of a succeeding phrase and a preceding context of a preceding phrase; and clustering, via a processor, the candidate multi-word phrases into a grammar fragment based on the semantic measurements and the syntactic measurements, to yield a clustered grammar fragment representing similar phrases that are both syntactically and semantically coherent. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A method comprising:
-
selecting a set of candidate multi-word phrases from a database of utterances based on a frequency of occurrence of the candidate phrases, wherein a maximum number of candidate multi-word phrases is based on a highest of a number of preceding contexts, a number of candidate fragments, and a number of call-types; forming, via a processor, a measurement of similarity between the selected candidate phrases, wherein the measurement of similarity is associated with a succeeding context of a succeeding phrase and a preceding context of a preceding phrase; and based on the measurement, clustering a plurality of the set of candidate phrases into a grammar fragment to yield a cluster grammar fragment representing similar phrases that are both syntactically coherent and semantically coherent. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A method comprising:
-
(1) processing an utterance associated with a call by applying a grammar fragment to the utterance, the grammar fragment being formed by; (a) selecting a set of candidate multi-word phrases from a database of utterances based on a frequency of occurrence of the candidate phrases, wherein a maximum number of candidate multi-word phrases is based on a highest of a number of preceding contexts, a number of candidate fragments, and a number of call-types; (b) forming, via a processor, a measurement of similarity between the selected candidate phrases, the measurement being associated with a succeeding context of a succeeding phrase and a preceding context of preceding phrase; and (c) based on the measurement, clustering a plurality of the selected candidate phrases into a grammar fragment to yield a cluster grammar fragment representing similar phrases that are both syntactically coherent and semantically coherent; and (2) determining, based on the processing, an appropriate handling for the call. - View Dependent Claims (16)
-
Specification