Systems and method to resolve audio-based requests in a networked environment
First Claim
1. A system to resolve requests in an audio-based networked system, comprising a computing device comprising one or more processors and a memory, the one or more processors to execute:
a proficiency detector to:
receive a vocal utterance captured at a client device;
determine a vocal characteristic of the vocal utterance captured at the client device;
a speech-to-text module to select a query understanding model from a plurality of candidate query understanding models based on the vocal characteristic;
an intent matcher to determine an intent of the vocal utterance using the query understanding model;
a fulfillment module to select a content item based on the intent and one or more keywords parsed from the vocal utterance; and
an interface to transmit the content item to the client device.
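The data flow recited in the claim can be illustrated with a minimal sketch. The claim names the modules but does not specify their internals, so everything below is a hypothetical stand-in: the `pitch_hz` feature, the 250 Hz cutoff, the two toy query understanding models, and the small content table are assumptions chosen only to make the example runnable.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass
class Utterance:
    transcript: str   # text recognised from the captured audio (assumed given)
    pitch_hz: float   # hypothetical stand-in for a "vocal characteristic"


def detect_vocal_characteristic(utt: Utterance) -> str:
    """Proficiency detector: reduce the utterance to a coarse label.
    The 250 Hz cutoff is an arbitrary illustrative value."""
    return "high_pitch" if utt.pitch_hz >= 250.0 else "low_pitch"


def strict_model(text: str) -> str:
    """Toy query understanding model that expects a well-formed command."""
    return "play_media" if text.lower().startswith("play ") else "unknown"


def lenient_model(text: str) -> str:
    """Toy model that tolerates loose grammar and a mispronounced verb."""
    words = text.lower().split()
    return "play_media" if any(w in ("play", "pway", "plays") for w in words) else "unknown"


# Candidate query understanding models, keyed by vocal characteristic.
QUERY_MODELS: Dict[str, Callable[[str], str]] = {
    "low_pitch": strict_model,
    "high_pitch": lenient_model,
}

# Hypothetical content catalogue keyed by (intent, keyword).
CONTENT: Dict[Tuple[str, str], str] = {
    ("play_media", "music"): "music_stream",
    ("play_media", "story"): "story_audio",
}


def parse_keywords(text: str) -> List[str]:
    """Keep only the words that appear as keywords in the catalogue."""
    known = {keyword for (_, keyword) in CONTENT}
    return [w for w in text.lower().split() if w in known]


def resolve(utt: Utterance) -> str:
    """Run the pipeline: characteristic -> model -> intent -> content item."""
    characteristic = detect_vocal_characteristic(utt)
    model = QUERY_MODELS[characteristic]            # model selection step
    intent = model(utt.transcript)                  # intent matching step
    for keyword in parse_keywords(utt.transcript):  # fulfillment step
        item = CONTENT.get((intent, keyword))
        if item is not None:
            return item
    return "fallback_response"


if __name__ == "__main__":
    print(resolve(Utterance("me want pway story", pitch_hz=300.0)))
    print(resolve(Utterance("me want pway story", pitch_hz=120.0)))
```

In this sketch the same transcript resolves differently depending on which model the vocal characteristic selects, which is the behaviour the claim is directed at. Transmission to the client device is omitted.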
Abstract
Techniques are described herein for enabling an automated assistant to adjust its behavior depending on a detected vocabulary level or other vocal characteristics of an input utterance provided to the automated assistant. The estimated vocabulary level or other vocal characteristics may be used to influence various aspects of a data processing pipeline employed by the automated assistant. In some implementations, one or more tolerance thresholds associated with, for example, grammatical tolerances or vocabulary tolerances, may be adjusted based on the estimated vocabulary level or vocal characteristics of the input utterance.
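One way to picture an adjustable tolerance threshold is shown below. This is a sketch under stated assumptions, not the patented implementation: the abstract does not say how a threshold is computed, so the linear mapping from a vocabulary level in [0, 1] to a similarity cutoff, the 0.6 and 0.95 endpoints, and the use of `difflib.SequenceMatcher` as the similarity measure are all illustrative choices.

```python
from difflib import SequenceMatcher


def tolerance_threshold(vocabulary_level: float) -> float:
    """Map an estimated vocabulary level in [0, 1] to a similarity cutoff.
    Lower levels give a lower cutoff, i.e. more tolerance for imprecise input.
    The endpoints 0.6 and 0.95 are arbitrary illustrative values."""
    level = min(max(vocabulary_level, 0.0), 1.0)  # clamp out-of-range estimates
    return 0.6 + 0.35 * level


def matches(utterance: str, canonical: str, vocabulary_level: float) -> bool:
    """Accept the utterance as the canonical phrase if it is similar enough
    for a speaker at the given vocabulary level."""
    similarity = SequenceMatcher(None, utterance.lower(), canonical.lower()).ratio()
    return similarity >= tolerance_threshold(vocabulary_level)


if __name__ == "__main__":
    # The same imprecise input is judged against two different tolerances.
    print(matches("pway moosic", "play music", vocabulary_level=0.0))
    print(matches("pway moosic", "play music", vocabulary_level=1.0))
```

The design point is that the matching logic stays fixed while only the cutoff moves with the estimated vocabulary level, so a single pipeline can be strict with one speaker and forgiving with another.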
17 Citations
20 Claims
1. A system to resolve requests in an audio-based networked system, comprising a computing device comprising one or more processors and a memory, the one or more processors to execute:
a proficiency detector to:
receive a vocal utterance captured at a client device;
determine a vocal characteristic of the vocal utterance captured at the client device;
a speech-to-text module to select a query understanding model from a plurality of candidate query understanding models based on the vocal characteristic;
an intent matcher to determine an intent of the vocal utterance using the query understanding model;
a fulfillment module to select a content item based on the intent and one or more keywords parsed from the vocal utterance; and
an interface to transmit the content item to the client device.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A method implemented using one or more processors, comprising:
receiving, by a proficiency detector executed by one or more processors, a vocal utterance captured at a client device;
determining, by the proficiency detector executed by the one or more processors, a vocal characteristic based on the vocal utterance captured at the client device;
selecting, by a speech-to-text module executed by the one or more processors, a query understanding model from a plurality of candidate query understanding models based on the vocal characteristic;
determining, by an intent matcher executed by the one or more processors, an intent of the vocal utterance using the query understanding model;
selecting, by a fulfillment module executed by the one or more processors, a content item based on the intent and one or more keywords parsed from the vocal utterance; and
transmitting, by the one or more processors, the content item to the client device.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19, 20
Specification