SYSTEM AND METHOD FOR USING PROSODY FOR VOICE-ENABLED SEARCH
First Claim
1. A method of processing speech, the method comprising:
- receiving a word lattice generated by an automatic speech recognizer based on a user speech representing a query;
receiving a prosodic analysis of the user speech;
generating a reweighted word lattice based on the word lattice and the prosodic analysis;
approximating based on the reweighted word lattice at least one relevant response to the query; and
presenting to a user one of the at least one relevant response to the query.
3 Assignments
0 Petitions
Accused Products
Abstract
Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating relevant responses to a user query with voice-enabled search. A system practicing the method receives a word lattice generated by an automatic speech recognizer based on a user speech and a prosodic analysis of the user speech, generates a reweighted word lattice based on the word lattice and the prosodic analysis, approximates based on the reweighted word lattice one or more relevant responses to the query, and presents to a user the responses to the query. The prosodic analysis examines metalinguistic information of the user speech and can identify the most salient subject matter of the speech, assess how confident a speaker is in the content of his or her speech, and identify the attitude, mood, emotion, sentiment, etc. of the speaker. Other information not described in the content of the speech can also be used.
66 Citations
20 Claims
-
1. A method of processing speech, the method comprising:
-
receiving a word lattice generated by an automatic speech recognizer based on a user speech representing a query; receiving a prosodic analysis of the user speech; generating a reweighted word lattice based on the word lattice and the prosodic analysis; approximating based on the reweighted word lattice at least one relevant response to the query; and presenting to a user one of the at least one relevant response to the query. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. A system comprising:
-
a processor; a first module configured to control the processor to receive a word lattice generated by an automatic speech recognizer based on a user speech; a second module configured to control the processor to receive a prosodic analysis of the user speech; a third module configured to control the processor to generate a reweighted word lattice based on the word lattice and the prosodic analysis; a fourth module configured to control the processor to approximate based on the reweighted word lattice at least one relevant response to the query; and a fifth module configured to control the processor to present to a user one of the at least one relevant response to the query. - View Dependent Claims (12, 13, 14, 15, 16)
-
-
17. A non-transitory computer-readable storage medium storing instructions which, when executed by a computing device, cause the computing device to process speech, the instructions comprising:
-
receiving a word lattice generated by an automatic speech recognizer based on a user speech; receiving a prosodic analysis of the user speech; generating a reweighted word lattice based on the word lattice and the prosodic analysis; approximating based on the reweighted word lattice at least one relevant response to the query; and presenting to a user one of the at least one relevant response to the query. - View Dependent Claims (18, 19, 20)
-
Specification