Historical database storing relationships of successively spoken words
First Claim
1. A system for generating text in response to a succession of audio signals representing spoken input events provided by a user, said system comprising:
means for comparing each spoken input event with a plurality of tokens representing vocabulary words thereby to identify a plurality of candidate tokens which may correspond to the spoken input event, each candidate being scored as to likelihood of match;
computer implemented means for generating and storing, for each spoken input event, a data record which includes;
the identity of the best matching candidate token;
the identity of the correct candidate; and
data placing the chronology of the data record relative other data records in the database;
thereby to generate a dictation event database useful for improving recognizer accuracy by learning a user's speech behavior.
Abstract
Text is generated from voice input by a system and method that divide the processing of each spoken word into a dictation event and a text event. Each dictation event handles the processing of data relating to the input into the system, and each text event deals with the generation of text from the input voice signals. To distinguish dictation events from one another, and text events from one another, the system and method create a data structure that stores certain information about each individual event. These data structures enable the system and method to process both simple spoken words and spoken commands, and either to generate the corresponding text or to execute the appropriate function in response to a command.
16 Claims
1. A system for generating text in response to a succession of audio signals representing spoken input events provided by a user, said system comprising:
means for comparing each spoken input event with a plurality of tokens representing vocabulary words thereby to identify a plurality of candidate tokens which may correspond to the spoken input event, each candidate being scored as to likelihood of match;
computer implemented means for generating and storing, for each spoken input event, a data record which includes;
the identity of the best matching candidate token;
the identity of the correct candidate; and
data placing the chronology of the data record relative other data records in the database;
thereby to generate a dictation event database useful for improving recognizer accuracy by learning a user's speech behavior.
Dependent claims: 2, 3, 4
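For illustration only, the data record recited in claim 1 might be sketched as below. The names `DictationEvent`, `DictationEventDatabase`, and `misrecognition_counts` are hypothetical and do not come from the patent. The sketch assumes that a higher score means a more likely match, uses a running sequence number as the chronology data, and tallies recognizer errors as one assumed way such a database could be used to learn a user's speech behavior.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class DictationEvent:
    """Hypothetical record for one spoken input event."""
    sequence: int                  # chronology relative to other records
    best_match: str                # best-scoring candidate token
    correct: Optional[str] = None  # correct candidate, filled in once known


class DictationEventDatabase:
    """Hypothetical store of dictation events, kept in the order spoken."""

    def __init__(self) -> None:
        self._events: List[DictationEvent] = []

    def record(self, scored_candidates: List[Tuple[str, float]]) -> DictationEvent:
        # Assumption: a higher score means a more likely match.
        best, _ = max(scored_candidates, key=lambda pair: pair[1])
        event = DictationEvent(sequence=len(self._events), best_match=best)
        self._events.append(event)
        return event

    def misrecognition_counts(self) -> Counter:
        """Count (best_match, correct) pairs where the recognizer was wrong."""
        return Counter(
            (e.best_match, e.correct)
            for e in self._events
            if e.correct is not None and e.correct != e.best_match
        )
```

A caller would set `correct` on each event once the user confirms or corrects a word; pairs that recur in `misrecognition_counts()` point at words the recognizer tends to confuse for this user.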
5. A system for generating text in response to a succession of audio signals representing spoken input events provided by a user, said system comprising:
means for comparing each spoken input event with a plurality of tokens representing vocabulary words thereby to identify a plurality of candidate words which may correspond to the spoken input event, each candidate being scored as to likelihood of match;
means enabling a user to accept one of said candidate words;
computer implemented means for generating and storing, for each spoken input event, a data record which includes;
the identity of the candidate tokens;
the identity of the best matching candidate token;
the identity of the word accepted by the user; and
data placing the chronology of the respective data record relative other data records;
thereby to generate a dictation event database useful for backing up in the generated text and changing the word accepted by the user.
Dependent claims: 6, 7
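The "backing up" use recited in claim 5 can be pictured with a short sketch. Everything named here (`DictationRecord`, `make_record`, `change_accepted`, `render`) is hypothetical, and the sketch assumes that candidates are listed best first and that the best match is accepted by default until the user chooses otherwise.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DictationRecord:
    """Hypothetical record for one spoken input event."""
    sequence: int          # chronology relative to other records
    candidates: List[str]  # candidate words, assumed ordered best first
    best_match: str        # best-scoring candidate
    accepted: str          # word currently accepted by the user


def make_record(sequence: int, candidates: List[str]) -> DictationRecord:
    # Assumption: the best match is accepted until the user says otherwise.
    return DictationRecord(sequence, list(candidates), candidates[0], candidates[0])


def change_accepted(records: List[DictationRecord], steps_back: int, word: str) -> DictationRecord:
    """Back up `steps_back` records from the most recent one and re-choose."""
    record = records[-1 - steps_back]
    if word not in record.candidates:
        raise ValueError(f"{word!r} was not a candidate for event {record.sequence}")
    record.accepted = word
    return record


def render(records: List[DictationRecord]) -> str:
    """Regenerate the text from the accepted words, in chronological order."""
    ordered = sorted(records, key=lambda r: r.sequence)
    return " ".join(r.accepted for r in ordered)
```

Because each record keeps the full candidate list alongside the accepted word, a correction made several words later needs no second recognition pass; the original best match also stays on record.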
8. A system for generating text in response to a succession of audio signals representing spoken input events provided by a user, said system comprising:
means for comparing each spoken input event with a plurality of tokens representing vocabulary words thereby to identify a plurality of candidate tokens which may correspond to the spoken input event;
means enabling a user to accept one of said candidate tokens;
computer implemented means responsive to an accepted candidate token for generating and storing a data record which includes;
data defining the respective input event;
data placing the chronology of the record with respect to other similarly generated records;
data defining any hierarchical relationship to other similarly generated records;
thereby to generate a text event database useful in backing up in the generated text and making corrections.
9. A system for generating text in response to a succession of audio signals representing spoken input events provided by a user, said system comprising:
means for comparing each spoken input event with a plurality of tokens representing vocabulary words thereby to identify a plurality of candidate words which may correspond to the spoken input event, each candidate being scored as to likelihood of match;
means enabling a user to accept one of said candidate words;
computer implemented means for generating and storing, for each spoken input event, a dictation event data record which includes;
the identity of the candidate words;
the identity of the best matching candidate word;
the identity of the word accepted by the user; and
data placing the chronology of the respective data record relative other data records;
means responsive to an accepted candidate word for generating a text event data record which includes;
data defining the respective input event;
data placing the chronology of the record with respect to other similarly generated records;
data defining any hierarchical relationship to other similarly generated records;
thereby to generate a database useful in backing up in the generated text and making corrections.
Dependent claims: 10
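Claim 9 combines both record types. One assumed way to relate them is for each text event to hold the identifier of the dictation event it came from, as in the sketch below. The class and field names (`EventStore`, `hear`, `accept`, `candidates_for`, `dictation_id`, `parent_id`) are hypothetical, and candidates are assumed to arrive ordered best first.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class DictationEvent:
    event_id: int                   # chronology among dictation events
    candidates: List[str]           # candidate words, assumed best first
    best_match: str
    accepted: Optional[str] = None  # set when the user accepts a word


@dataclass
class TextEvent:
    event_id: int                    # chronology among text events
    dictation_id: int                # the input event this text came from
    text: str
    parent_id: Optional[int] = None  # hierarchical relationship, if any


class EventStore:
    """Hypothetical combined dictation event and text event database."""

    def __init__(self) -> None:
        self.dictation: Dict[int, DictationEvent] = {}
        self.text: List[TextEvent] = []

    def hear(self, candidates: List[str]) -> DictationEvent:
        event = DictationEvent(len(self.dictation), list(candidates), candidates[0])
        self.dictation[event.event_id] = event
        return event

    def accept(self, dictation_id: int, word: str,
               parent_id: Optional[int] = None) -> TextEvent:
        # Accepting a word updates the dictation record and creates a text event.
        self.dictation[dictation_id].accepted = word
        event = TextEvent(len(self.text), dictation_id, word, parent_id)
        self.text.append(event)
        return event

    def candidates_for(self, text_event_id: int) -> List[str]:
        """Back up from a piece of generated text to the alternatives heard."""
        text_event = self.text[text_event_id]
        return self.dictation[text_event.dictation_id].candidates
```

With this link in place, a correction that starts from a position in the generated text can reach the original candidate list through `candidates_for`.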
11. A system for generating text in response to a succession of input events provided by a user, said input events including audio signals representing speech input and manual events representing manual input events, said system comprising:
means for comparing each spoken input event with a plurality of tokens representing vocabulary words thereby to identify a plurality of candidate tokens which may correspond to the spoken input event;
means enabling a user to accept one of said candidate tokens;
computer implemented means responsive to an accepted candidate token or a manual event for generating and storing a data record which includes;
data defining the type of input event;
data placing the chronology of the record with respect to other similarly generated records;
thereby to generate a text event database useful in backing up in the generated text and making corrections.
Dependent claims: 12, 13
14. A system for generating structured text in response to a succession of audio signals representing spoken input events provided by a user, said system comprising:
a plurality of tokens representing vocabulary words including both text words and command words, at least one of said command words being associated with a text form having fields to be filled in;
means for comparing each spoken input event with at least a preselected group of said tokens thereby to identify candidate words which correspond to the spoken input events;
means enabling a user to accept a candidate word;
computer implemented means responsive to an accepted candidate word for generating and storing a text event data record which includes;
data defining the respective input event;
data defining any hierarchical relationship to other similarly generated records, text event data records associated with filling in fields in said text form being inferior to a text event data record associated with said text form;
thereby to generate a text event database useful in subsequent editing of the structured text.
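The hierarchy recited in claim 14, where records for filled-in fields are inferior to the record for the form itself, resembles a parent/child tree. The sketch below is an assumption-laden illustration: `TextEvent`, `add_child`, `depth`, and `outline` are hypothetical names, and the form name and field contents in the usage note are invented examples.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class TextEvent:
    """Hypothetical text event record with a hierarchical relationship."""
    text: str
    # repr/compare are disabled so the parent/child cycle cannot recurse.
    parent: Optional["TextEvent"] = field(default=None, repr=False, compare=False)
    children: List["TextEvent"] = field(default_factory=list)

    def add_child(self, text: str) -> "TextEvent":
        """Create a text event that is inferior to this one."""
        child = TextEvent(text, parent=self)
        self.children.append(child)
        return child

    def depth(self) -> int:
        return 0 if self.parent is None else 1 + self.parent.depth()


def outline(event: TextEvent) -> List[str]:
    """List an event and everything beneath it, indented by depth."""
    lines = ["  " * event.depth() + event.text]
    for child in event.children:
        lines.extend(outline(child))
    return lines
```

For example, a form event created with `TextEvent("chest-form")` followed by `add_child("heart: normal")` and `add_child("lungs: clear")` yields a three-line outline with the two fields indented under the form, which is the kind of structure a later editing pass could walk.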
15. A system for generating structured text in response to a succession of audio signals representing spoken input events provided by a user, said system comprising:
a plurality of tokens representing vocabulary words including both text words and command words, at least one of said command words being associated with a text form having fields to be filled in;
means for comparing each spoken input event with at least a preselected group of said tokens thereby to identify candidate words which correspond to the spoken input events;
means enabling a user to accept a candidate word;
computer implemented means for generating and storing, for each spoken input event, a data record which includes;
the identity of the candidate words;
the identity of the best matching candidate word; and
the identity of the correct candidate;
computer implemented means responsive to an accepted candidate word for generating and storing a text event data record which includes;
data defining the respective input event;
data defining any hierarchical relationship to other similarly generated records, text event data records associated with filling in fields in said text form being inferior to a text event data record associated with said text form;
thereby to generate a database useful in subsequent editing of the structured text.
Dependent claims: 16
Specification