Systems and method for resolving ambiguity
First Claim
1. A method of resolving ambiguity comprising the steps of:
determining recognized speech information;
determining discourse functions in the recognized speech information;
determining a predictive model of discourse functions based on prosodic features;
determining at least one set of candidate discourse functions for the recognized speech information;
determining a rank of the at least one set of discourse functions based on the predictive model of discourse functions; and
resolving the ambiguity between the set of at least one discourse functions based on the determined rank.
Abstract
Techniques are provided for resolving ambiguity in natural language speech. Speech is recognized using automatic speech recognition. A theory of discourse analysis is determined, and at least one set of candidate discourse functions is determined based on that theory. Prosodic features in the speech, and a correlation between the prosodic features and the discourse functions, are determined. The sets of candidate discourse functions are ranked based on the prosodic features in the speech information and their correlation to the prosodic features expected for the determined discourse functions. Ambiguity between sets of candidate discourse functions is resolved based on the rank information.
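As an illustration only, the correlation between prosodic features and discourse functions mentioned in the abstract could be captured by a very simple predictive model: the mean prosodic feature vector observed for each discourse function in labeled training speech. The following minimal Python sketch assumes that form of model. The function name `build_model`, the labels "question" and "statement", and the two-element feature vectors (imagined here as pitch change and pause length) are hypothetical and are not taken from the patent.

```python
from collections import defaultdict
from statistics import fmean


def build_model(examples):
    """Return {discourse_function: expected_feature_vector}.

    `examples` is an iterable of (label, feature_vector) pairs, where every
    feature vector has the same length. The "expected" vector for a label is
    assumed here to be the per-feature mean over that label's examples.
    """
    grouped = defaultdict(list)
    for label, features in examples:
        grouped[label].append(features)
    # zip(*rows) transposes the rows so each column holds one feature.
    return {
        label: [fmean(column) for column in zip(*rows)]
        for label, rows in grouped.items()
    }


# Hypothetical training data: (discourse function, [pitch change, pause length])
training = [
    ("question", [2.0, 4.0]),
    ("question", [4.0, 6.0]),
    ("statement", [1.0, 1.0]),
]
model = build_model(training)
```

A per-label mean is only one possible choice; the abstract does not say how the correlation is determined, so any statistical or learned model mapping discourse functions to expected prosodic features could stand in its place.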
19 Claims
1. A method of resolving ambiguity comprising the steps of:
determining recognized speech information;
determining discourse functions in the recognized speech information;
determining a predictive model of discourse functions based on prosodic features;
determining at least one set of candidate discourse functions for the recognized speech information;
determining a rank of the at least one set of discourse functions based on the predictive model of discourse functions; and
resolving the ambiguity between the set of at least one discourse functions based on the determined rank.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. A system for synthesizing speech using discourse function level prosodic features comprising:
an input/output circuit for retrieving recognized speech and prosodic features;
a processor that determines at least one set of candidate discourse functions in the recognized speech information;
determines a predictive model of discourse functions;
determines a rank of the at least one set of candidate discourse functions based on the predictive model of discourse functions and the prosodic features of the recognized speech and disambiguates between the at least one set of candidate discourse functions based on a measure of prosodic correlation between the prosodic features for the recognized speech and the expected prosodic features associated with each discourse function in the predictive model of discourse functions.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. Computer readable storage medium comprising:
computer readable program code embodied on the computer readable storage medium, the computer readable program code usable to program a computer to resolve ambiguity comprising the steps of:
determining recognized speech information;
determining discourse functions in the recognized speech information;
determining a predictive model of discourse functions based on prosodic features;
determining at least one set of candidate discourse functions for the recognized speech information;
determining a rank of the at least one set of discourse functions based on the predictive model of discourse functions; and
resolving the ambiguity between the set of at least one discourse functions based on the determined rank.
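The ranking and resolving steps recited in the claims can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the patented implementation: the "measure of prosodic correlation" is assumed to be the Pearson correlation between observed and expected feature vectors, a candidate set is scored by its mean correlation across segments, and the names `pearson`, `rank_candidates`, `resolve`, and the example labels and feature values are all hypothetical.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    dev_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (dev_x * dev_y) if dev_x and dev_y else 0.0


def rank_candidates(candidates, observed, expected):
    """Return (score, candidate) pairs sorted from best to worst.

    candidates: list of tuples of discourse-function labels, one label per segment
    observed:   list of observed prosodic feature vectors, one per segment
    expected:   dict mapping each label to its expected feature vector
    """
    scored = []
    for candidate in candidates:
        correlations = [
            pearson(features, expected[label])
            for label, features in zip(candidate, observed)
        ]
        scored.append((sum(correlations) / len(correlations), candidate))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored


def resolve(candidates, observed, expected):
    """Resolve the ambiguity by picking the top-ranked candidate set."""
    return rank_candidates(candidates, observed, expected)[0][1]


# Hypothetical predictive model: expected [pitch, intensity, duration] per function.
expected = {"question": [1.0, 2.0, 3.0], "statement": [3.0, 2.0, 1.0]}
observed = [[10.0, 20.0, 30.0]]  # one segment whose features rise like a question
candidates = [("statement",), ("question",)]
best = resolve(candidates, observed, expected)
```

Here the observed features rise in the same pattern as the expected "question" vector, so that candidate set ranks first. Averaging per-segment correlations is one simple way to score a whole set; a weighted sum or a probabilistic score would fit the claim language equally well.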
Specification