Controlling offensive content in output
First Claim
1. A system comprising:
a computer-readable memory storing executable instructions; and
one or more processors in communication with the computer-readable memory, wherein the one or more processors are programmed by the executable instructions to at least:
obtain first audio input data regarding a first user utterance of a user;
obtain a characteristic of the user;
perform speech processing on the first audio input data to generate first speech processing results, the speech processing results including contextual information indicating a context to which the first utterance relates;
determine a first response to the first user utterance using the first speech processing results, wherein the first response comprises a name of a content item;
generate a profanity score for the name using an output filter model, the name, and the contextual information, the output filter model adapted to provide an output profanity score based upon an input word and a context in which the input word is used;
identify a sensitivity threshold for users having the characteristic, the sensitivity threshold indicating an acceptable degree of offensiveness for users having the characteristic;
determine the profanity score for the name exceeds the sensitivity threshold;
generate first output audio data using the first response and text-to-speech synthesis, wherein (i) a portion of the first output audio data corresponding to the name of the content item is modified or (ii) the name of the content item is modified before generating the first output audio data; and
transmit the first audio output to a user device.
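The scoring and thresholding steps recited above can be illustrated with a minimal sketch. It is not the claimed output filter model: the lexicon, the context weights, the threshold table, and every function name below are hypothetical placeholders chosen for illustration, and the text-to-speech step is left out. The sketch follows option (ii), modifying the name before any output is generated.

```python
# Hypothetical toy lexicon: word -> base offensiveness in [0, 1].
# The entries are mild placeholders, not taken from the source.
BASE_SCORES = {"darn": 0.4, "heck": 0.3}

# Hypothetical context multipliers: the same word may score lower
# in some contexts than in others.
CONTEXT_WEIGHTS = {"music": 0.5, "general": 1.0}

# Hypothetical sensitivity thresholds keyed by a user characteristic.
THRESHOLDS = {"child": 0.1, "adult": 0.6}


def profanity_score(name: str, context: str) -> float:
    """Return the highest context-weighted score of any word in the name."""
    weight = CONTEXT_WEIGHTS.get(context, 1.0)
    words = name.lower().split()
    return max((BASE_SCORES.get(w, 0.0) * weight for w in words), default=0.0)


def mask_name(name: str) -> str:
    """Replace each lexicon word with its first letter followed by asterisks."""
    masked = []
    for word in name.split():
        if word.lower() in BASE_SCORES:
            masked.append(word[0] + "*" * (len(word) - 1))
        else:
            masked.append(word)
    return " ".join(masked)


def build_response_text(name: str, context: str, characteristic: str) -> str:
    """Mask the content name when its score exceeds the user's threshold."""
    # Unknown characteristics fall back to the strictest known threshold.
    threshold = THRESHOLDS.get(characteristic, min(THRESHOLDS.values()))
    if profanity_score(name, context) > threshold:
        name = mask_name(name)
    return f"Playing {name}"
```

Under these assumed values, the same title is masked for a user with the "child" characteristic and passed through unchanged for one with the "adult" characteristic, since only the first threshold is exceeded.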
Abstract
Features are disclosed for recognizing inappropriate content in an output. The offensive content may be generated as a result of a speech processing error. A system may identify the inappropriate elements of a generated output and select among different appropriate alternatives. The system may be adjusted based on certain user characteristics. The system may be localized based on language and cultural features. The system may modify the generated output based on characteristics such as the tolerance threshold of known persons in the proximity of the system. The tolerance threshold may further be used to personalize and modify available content. Models used by the system may be further trained using input from a user.
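As a rough illustration of the proximity idea in the abstract, the sketch below picks one threshold for a group of known persons and uses it to filter available content. Taking the lowest (most restrictive) threshold is an assumption made here, not something the abstract states, and the function names and score values are hypothetical.

```python
from typing import Iterable, List, Mapping


def effective_threshold(thresholds: Iterable[float], default: float = 0.0) -> float:
    """Return the lowest threshold among the persons present.

    Assumption: the most sensitive person governs. If nobody is known,
    fall back to the given default.
    """
    return min(thresholds, default=default)


def allowed_items(scores: Mapping[str, float], threshold: float) -> List[str]:
    """Keep titles whose score does not exceed the threshold, in input order."""
    return [title for title, score in scores.items() if score <= threshold]
```

For example, with hypothetical per-person thresholds of 0.6 and 0.1, the effective threshold is 0.1, and only content scored at or below 0.1 would remain available.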
23 Claims
1. A system comprising the elements recited in full under First Claim above.
Dependent claims: 2, 3, 4.
5. A computer-implemented method comprising:
under control of one or more computing devices configured with specific computer-executable instructions,
obtaining first input data regarding a first user utterance by a user;
obtaining a characteristic of the user;
performing speech processing on the first audio data to generate first speech processing results, the first speech processing results including contextual information indicating a context to which the first utterance relates;
determining a first response using the first speech processing results, wherein the first response comprises a name of a content item;
generating a profanity score for the name using an output filter model, the name, and the contextual information;
identifying a sensitivity threshold for users having the characteristic;
determining the profanity score for the name exceeds the sensitivity threshold;
generating a first output using the first response, wherein (i) a portion of the first output corresponding to the name of the content item is modified or (ii) the name of the content item is modified before generating the first output; and
transmitting the first output to a client device.
Dependent claims: 6, 7, 8, 9, 10, 11, 12, 13, 14.
15. One or more non-transitory computer readable media comprising executable code that, when executed, cause one or more computing devices to perform a process comprising:
obtaining first input data regarding a first user utterance;
performing speech processing on the first input data to generate first speech processing results, the first speech processing results including contextual information indicating a context to which the first utterance relates;
determining a first response to the first user utterance using the first speech processing results, wherein the first response comprises a name of a content item;
generating a profanity score for the name using an output filter model, the name, and the contextual information;
identifying a sensitivity threshold using the first input data;
determining the profanity score for the name exceeds the sensitivity threshold; and
generating a first output using the first response, wherein (i) a portion of the first output corresponding to the name of the content item is modified or (ii) the name of the content item is modified before generating the first output.
Dependent claims: 16, 17, 18, 19, 20, 21, 22, 23.
Specification