Web triggered word set boosting for speech interfaces to the world wide web
First Claim
1. A method for user speech actuation of access to information stored in a computer system, the method comprising:
- storing the information in a memory of the computer system;
displaying a selected subset of the information via the computer on a display viewable by the user;
forming a web triggered word set from a portion of the information stored in the computer, including a plurality of individual words contained in the displayed subset;
receiving a speech input comprising one or more spoken words from the user;
performing a speech recognition process based on the speech input from the user to determine statistically a set of probable words based on the speech input of the user, including;
processing the input speech in accordance with a set of language/acoustic model and speech recognition search parameters which produce probability scores for at least individual words;
modifying the language/acoustic model and speech recognition search parameters dynamically using the web triggered word set to boost the probability score of at least individual ones of the probable words; and
updating the display to display a new subset of the information in accordance with the set of probable words determined from the speech recognition search as modified using the web-triggered word set.
3 Assignments
0 Petitions
Accused Products
Abstract
A computer system for user speech actuation of access to stored information, the system including a central processing unit, a memory and a user input/output interface including a microphone for input of user speech utterances and audible sound signal processing circuitry, and a file system for accessing and storing information in the memory of the computer. A speech recognition processor operating on the computer system recognizes words based on the input speech utterances of the user in accordance with a set of language/acoustic model and speech recognition search parameters. Software running on the CPU scans a document accessed by a web browser to form a web triggered word set from a selected subset of information in the document. The language/acoustic model and speech recognition search parameters are modified dynamically using the web triggered word set, and used by the speech recognition processor for generating a word string for input to the browser to initiate a change in the information accessed.
345 Citations
12 Claims
-
1. A method for user speech actuation of access to information stored in a computer system, the method comprising:
-
storing the information in a memory of the computer system; displaying a selected subset of the information via the computer on a display viewable by the user; forming a web triggered word set from a portion of the information stored in the computer, including a plurality of individual words contained in the displayed subset; receiving a speech input comprising one or more spoken words from the user; performing a speech recognition process based on the speech input from the user to determine statistically a set of probable words based on the speech input of the user, including; processing the input speech in accordance with a set of language/acoustic model and speech recognition search parameters which produce probability scores for at least individual words; modifying the language/acoustic model and speech recognition search parameters dynamically using the web triggered word set to boost the probability score of at least individual ones of the probable words; and updating the display to display a new subset of the information in accordance with the set of probable words determined from the speech recognition search as modified using the web-triggered word set. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A computer system for user speech actuation of access to stored information, the system comprising:
-
a computer including a central processing unit, a memory and a user input/output interface including a microphone for input of user speech utterances and audible sound signal processing circuitry; means for accessing and storing information in the memory of the computer system; a speech recognition processor operating on the computer system for recognizing words including individual words based on the input speech utterances of the user in accordance with a set of language/acoustic model and speech recognition search parameters which produce a probability score for at least the individual words; means for forming a web triggered word set from a selected subset of information in the document, the web triggered word set including a plurality of individual words contained in the displayed subset; means for modifying the language/acoustic model and speech recognition search parameters dynamically using the web triggered word set to boost the probability score of at least the individual words; and means responsive to the speech recognition processor for generating a word string based on the probability score as boosted by the modifying means for input to the accessing and storing means to initiate a change in the information accessed. - View Dependent Claims (10, 11, 12)
-
Specification