DIALOG DEVICE WITH DIALOG SUPPORT GENERATED USING A MIXTURE OF LANGUAGE MODELS COMBINED USING A RECURRENT NEURAL NETWORK
First Claim
1. A dialog device comprising:
- a natural language interfacing device comprising a chat interface or a telephonic device;
a natural language output device comprising the chat interface, a display device, or a speech synthesizer outputting speech to the telephonic device; and
a computer programmed to store natural language dialog conducted via the natural language interfacing device and to construct a current natural language utterance word-by-word with each word of the current natural language utterance being chosen by operations including;
applying a plurality of language models to a context comprising a concatenation of the stored natural language dialog and the current natural language utterance up to but not including the word being chosen to output, for each applied language model, a distribution over the words of a vocabulary;
normalizing the distributions output by the plurality of language models to generate corresponding normalized distributions;
applying a recurrent neural network (RNN) to the normalized distributions to generate a mixture distribution; and
choosing the next word using the mixture distribution;
wherein the natural language output device is configured to output the current natural language utterance after it has been constructed by the computer.
6 Assignments
0 Petitions
Accused Products
Abstract
A dialog device comprises a natural language interfacing device (chat interface or a telephonic device), and a natural language output device (the chat interface, a display device, or a speech synthesizer outputting to the telephonic device). A computer stores natural language dialog conducted via the interfacing device and constructs a current utterance word-by-word. Each word is chosen by applying a plurality of language models to a context comprising concatenation of the stored dialog and the current utterance thus far. Each language model outputs a distribution over the words of a vocabulary. A recurrent neural network (RNN) is applied to the distributions to generate a mixture distribution. The next word is chosen using the mixture distribution. The output device outputs the current natural language utterance after it has been constructed by the computer.
226 Citations
20 Claims
-
1. A dialog device comprising:
-
a natural language interfacing device comprising a chat interface or a telephonic device; a natural language output device comprising the chat interface, a display device, or a speech synthesizer outputting speech to the telephonic device; and a computer programmed to store natural language dialog conducted via the natural language interfacing device and to construct a current natural language utterance word-by-word with each word of the current natural language utterance being chosen by operations including; applying a plurality of language models to a context comprising a concatenation of the stored natural language dialog and the current natural language utterance up to but not including the word being chosen to output, for each applied language model, a distribution over the words of a vocabulary; normalizing the distributions output by the plurality of language models to generate corresponding normalized distributions; applying a recurrent neural network (RNN) to the normalized distributions to generate a mixture distribution; and choosing the next word using the mixture distribution; wherein the natural language output device is configured to output the current natural language utterance after it has been constructed by the computer. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A dialog method comprising:
-
conducting a natural language dialog using a chat interface or a telephonic device; while conducting the natural language dialog, constructing a current natural language utterance word-by-word using a computer programmed to choose each word of the current natural language utterance by operations including; applying a plurality of language models to a context comprising a concatenation of the natural language dialog and the current natural language utterance up to but not including the word being chosen, applying a recurrent neural network (RNN) to word distributions output by the applied plurality of language models to generate a mixture distribution, and choosing the next word using the mixture distribution; and outputting the constructed current natural language utterance via one of the chat interface, a display device, and a speech synthesizer outputting speech to the telephonic device. - View Dependent Claims (12, 13, 14, 15, 16, 17)
-
-
18. A non-transitory storage medium storing instructions readable and executable by a computer to construct a current natural language utterance for continuing a natural language dialog by a method in which each word of the current natural language utterance is chosen by operations including:
-
applying a plurality of language models to a context comprising a concatenation of the natural language dialog and the current natural language utterance up to but not including the word being chosen; normalizing word distributions output by the plurality of language models to generate corresponding normalized word distributions; applying a recurrent neural network (RNN) to the normalized distributions to generate a mixture distribution; and choosing the next word using the mixture distribution. - View Dependent Claims (19, 20)
-
Specification