Architecture for multi-domain utterance processing
First Claim
1. A system comprising:
- a computer-readable memory storing executable instructions; and
one or more processors in communication with the computer-readable memory, wherein the one or more processors are programmed by the executable instructions to at least;
receive data regarding an utterance of a user;
generate a transcription of the utterance using automatic speech recognition;
process the transcription with a first natural language understanding (“
NLU”
) module to produce a first plurality of interpretations of a requested action in the transcription, wherein the first NLU module is associated with a first domain of actions, and wherein at least a first interpretation of the first plurality of interpretations is associated with a first score indicative of whether the first interpretation corresponds to the requested action in the transcription;
process the transcription with a second NLU module to produce a second plurality of interpretations of the requested action in the transcription, wherein the second NLU module is associated with a second domain of actions, and wherein at least a second interpretation of the second plurality of interpretations is associated with a second score indicative of whether the second interpretation corresponds to the requested action in the transcription;
select, from the first plurality of interpretations or the second plurality of interpretations, a selected interpretation based at least in part on a score associated with the selected interpretation, wherein the score corresponds to one of the first score or the second score; and
generate a response based at least partly on the selected interpretation.
1 Assignment
0 Petitions
Accused Products
Abstract
Features are disclosed for processing a user utterance with respect to multiple subject matters or domains, and for selecting a likely result from a particular domain with which to respond to the utterance or otherwise take action. A user utterance may be transcribed by an automatic speech recognition (“ASR”) module, and the results may be provided to a multi-domain natural language understanding (“NLU”) engine. The multi-domain NLU engine may process the transcription(s) in multiple individual domains rather than in a single domain. In some cases, the transcription(s) may be processed in multiple individual domains in parallel or substantially simultaneously. In addition, hints may be generated based on previous user interactions and other data. The ASR module, multi-domain NLU engine, and other components of a spoken language processing system may use the hints to more efficiently process input or more accurately generate output.
347 Citations
29 Claims
-
1. A system comprising:
-
a computer-readable memory storing executable instructions; and one or more processors in communication with the computer-readable memory, wherein the one or more processors are programmed by the executable instructions to at least; receive data regarding an utterance of a user; generate a transcription of the utterance using automatic speech recognition; process the transcription with a first natural language understanding (“
NLU”
) module to produce a first plurality of interpretations of a requested action in the transcription, wherein the first NLU module is associated with a first domain of actions, and wherein at least a first interpretation of the first plurality of interpretations is associated with a first score indicative of whether the first interpretation corresponds to the requested action in the transcription;process the transcription with a second NLU module to produce a second plurality of interpretations of the requested action in the transcription, wherein the second NLU module is associated with a second domain of actions, and wherein at least a second interpretation of the second plurality of interpretations is associated with a second score indicative of whether the second interpretation corresponds to the requested action in the transcription; select, from the first plurality of interpretations or the second plurality of interpretations, a selected interpretation based at least in part on a score associated with the selected interpretation, wherein the score corresponds to one of the first score or the second score; and generate a response based at least partly on the selected interpretation. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method comprising:
under control of one or more computing devices configured with specific computer-executable instructions, receiving text corresponding to a request of a user; processing the text in a first natural language understanding (“
NLU”
) module to generate a first interpretation of the transcription, and in a second NLU module to generate a second interpretation of the transcription,wherein first NLU module is associated with a first domain, and the second NLU module is associated with a second domain, wherein the first interpretation is associated with a first score indicative of whether the first interpretation corresponds to an action requested by the user, and wherein the second interpretation is associated with a second score indicative of whether the second interpretation corresponds to the action requested by the user; selecting the first interpretation based at least partly on the first score and the second score; and generating a response based at least partly on the first interpretation. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15, 16, 17, 18)
-
19. One or more non-transitory computer readable media comprising executable code that, when executed, cause one or more computing devices to perform a process comprising:
-
receiving text corresponding to a request of a user; processing the text in a first natural language understanding (“
NLU”
) module to generate a first interpretation of the transcription, and in a second NLU module to generate a second interpretation of the transcription,wherein first NLU module is associated with a first domain, and the second NLU module is associated with a second domain, and wherein the first interpretation is associated with a first score indicative of whether the first interpretation corresponds to an action requested by the user, and the second interpretation is associated with a second score indicative of whether the second interpretation corresponds to the action requested by the user; selecting the first interpretation based at least partly on the first score and the second score; and generating a response based at least partly on the first interpretation. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27, 28, 29)
-
Specification