Evaluating transcriptions with a semantic parser
First Claim
Patent Images
1. A method performed by data processing apparatus, the method comprising:
- providing, over a network, audio data for an utterance;
receiving, at a client device and over the network, information that indicates (i) candidate transcriptions for the utterance and (ii) semantic information for the candidate transcriptions;
using a semantic parser at the client device to evaluate each of at least a plurality of the candidate transcriptions; and
selecting one of the plurality of the candidate transcriptions based on at least (i) the received semantic information and (ii) the output of the semantic parser at the client device for the plurality of candidate transcriptions that are evaluated.
2 Assignments
0 Petitions
Accused Products
Abstract
In some implementations, audio data for an utterance is provided over a network. At a client device and over the network, information is received that indicates candidate transcriptions for the utterance and semantic information for the candidate transcriptions. A semantic parser is used at the client device to evaluate each of at least a plurality of the candidate transcriptions. One of the candidate transcriptions is selected based on at least the received semantic information and the output of the semantic parser for the plurality of candidate transcriptions that are evaluated.
-
Citations
25 Claims
-
1. A method performed by data processing apparatus, the method comprising:
-
providing, over a network, audio data for an utterance; receiving, at a client device and over the network, information that indicates (i) candidate transcriptions for the utterance and (ii) semantic information for the candidate transcriptions; using a semantic parser at the client device to evaluate each of at least a plurality of the candidate transcriptions; and selecting one of the plurality of the candidate transcriptions based on at least (i) the received semantic information and (ii) the output of the semantic parser at the client device for the plurality of candidate transcriptions that are evaluated. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A non-transitory computer storage medium encoded with instructions that, when executed by a user device, cause the user device to perform operations comprising:
-
providing, over a network, audio data for an utterance; receiving, at the user device and over the network, information that indicates (i) candidate transcriptions for the utterance and (ii) semantic information for the candidate transcriptions; using a semantic parser at the user device to evaluate each of at least a plurality of the candidate transcriptions; and selecting one of the plurality of the candidate transcriptions based on at least (i) the received semantic information and (ii) the output of the semantic parser at the user device for the plurality of candidate transcriptions that are evaluated. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A system comprising:
-
a user device and one or more storage devices storing instructions that are operable, when executed by the user device, to cause the user device to perform operations comprising; providing, over a network, audio data for an utterance; receiving, at the user device and over the network, information that indicates (i) candidate transcriptions for the utterance and (ii) semantic information for the candidate transcriptions; using a semantic parser at the user device to evaluate each of at least a plurality of the candidate transcriptions; and selecting one of the plurality of the candidate transcriptions based on at least (i) the received semantic information and (ii) the output of the semantic parser at the user device for the plurality of candidate transcriptions that are evaluated. - View Dependent Claims (17, 18, 19, 20)
-
-
21. A method performed by data processing apparatus, the method comprising:
-
providing, from a client device to a server system over a network, audio data for an utterance; receiving, at the client device and from the server system over the network, information that indicates (i) candidate transcriptions for the utterance and (ii) semantic information comprising output of a first semantic parser for at least one of the candidate transcriptions; using a second semantic parser at the client device to evaluate each of at least a plurality of the candidate transcriptions; and selecting one of the plurality of the candidate transcriptions based on at least (i) the received semantic information comprising output of the first semantic parser for at least one of the candidate transcriptions and (ii) the output of the second semantic parser at the client device for the plurality of candidate transcriptions that are evaluated. - View Dependent Claims (22, 23, 24, 25)
-
Specification