System and method for speech recognition utilizing a merged dictionary
First Claim
Patent Images
1. A system for performing a speech recognition procedure, comprising:
- a sound sensor that converts a spoken utterance into input speech data;
a recognizer configured to compare said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented by utilizing a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merging technique being based upon a particle context from each of said two or more related phrases, said particle context indicating an intended mood of an initial speaker of said input speech data, each of said two or more related phrases including a command followed by said particle context, one of said two or more related phrases having an assertive particle context to indicate said intended mood of said initial speaker of said input speech data; and
a processor configured to control said recognizer to perform said speech recognition procedure.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention comprises a system and method for speech recognition utilizing a merged dictionary, and may include a recognizer that is configured to compare input speech data to a series of dictionary entries from the merged dictionary to detect a recognized phrase or command. The merged dictionary may be implemented by utilizing a merging technique that maps two or more related phrases or commands with similar meanings to a single one of the dictionary entries. The recognizer may thus achieve more accurate speech recognition accuracy by merging phrases or commands which might otherwise be erroneously mistaken for each other.
12 Citations
38 Claims
-
1. A system for performing a speech recognition procedure, comprising:
-
a sound sensor that converts a spoken utterance into input speech data; a recognizer configured to compare said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented by utilizing a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merging technique being based upon a particle context from each of said two or more related phrases, said particle context indicating an intended mood of an initial speaker of said input speech data, each of said two or more related phrases including a command followed by said particle context, one of said two or more related phrases having an assertive particle context to indicate said intended mood of said initial speaker of said input speech data; and a processor configured to control said recognizer to perform said speech recognition procedure. - View Dependent Claims (2)
-
-
3. A system for performing a speech recognition procedure, comprising:
-
a sound sensor that converts a spoken utterance into input speech data; a recognizer configured to compare said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented by utilizing a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merging technique being based upon a particle context from each of said two or more related phrases, said particle context indicating an intended mood of an initial speaker of said input speech data, each of said two or more related phrases including a command followed by said particle context, one of said two or more related phrases having a neutral particle context to indicate said intended mood of said initial speaker of said input speech data; and a processor configured to control said recognizer to perform said speech recognition procedure. - View Dependent Claims (4)
-
-
5. A system for performing a speech recognition procedure, comprising:
-
a sound sensor that converts a spoken utterance into input speech data; a recognizer configured to compare said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented by utilizing a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merging technique being based upon a particle context from each of said two or more related phrases, said particle context indicating an intended mood of an initial speaker of said input speech data, each of said two or more related phrases including a command followed by said particle context, one of said two or more related phrases having a polite particle context to indicate said intended mood of said initial speaker of said input speech data; and a processor configured to control said recognizer to perform said speech recognition procedure. - View Dependent Claims (6)
-
-
7. A system for performing a speech recognition procedure, comprising:
-
a sound sensor that converts a spoken utterance into input speech data; a recognizer configured to compare said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented by utilizing a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merged dictionary being implemented to include dictionary entries that represent phone strings of a Cantonese language without utilizing corresponding tonal information as part of said phone strings; and a processor configured to control said recognizer to perform said speech recognition procedure. - View Dependent Claims (8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method for performing a speech recognition procedure, comprising:
-
converting a spoken utterance into input speech data by using a sound sensor; utilizing a recognizer for comparing said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented with a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merging technique being based upon a particle context from each of said two or more related phrases, said particle context indicating an intended mood of an initial speaker of said input speech data, each of said two or more related phrases including a command followed by said particle context, one of said two or more related phrases having an assertive particle context to indicate said intended mood of said initial speaker of said input speech data. - View Dependent Claims (21)
-
-
22. A method for performing a speech recognition procedure, comprising:
-
converting a spoken utterance into input speech data by using a sound sensor; utilizing a recognizer for comparing said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented with a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merging technique being based upon a particle context from each of said two or more related phrases, said particle context indicating an intended mood of an initial speaker of said input speech data, each of said two or more related phrases including a command followed by said particle context, one of said two or more related phrases having a neutral particle context to indicate said intended mood of said initial speaker of said input speech data. - View Dependent Claims (23)
-
-
24. A method for performing a speech recognition procedure, comprising:
-
converting a spoken utterance into input speech data by using a sound sensor; utilizing a recognizer for comparing said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented with a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merging technique being based upon a particle context from each of said two or more related phrases, said particle context indicating an intended mood of an initial speaker of said input speech data, each of said two or more related phrases including a command followed by said particle context, one of said two or more related phrases having a polite particle context to indicate said intended mood of said initial speaker of said input speech data. - View Dependent Claims (25)
-
-
26. A method for performing a speech recognition procedure, comprising:
-
converting a spoken utterance into input speech data by using a sound sensor; utilizing a recognizer for comparing said input speech data to dictionary entries from a merged dictionary, said merged dictionary being implemented with a merging technique that maps two or more related phrases with similar meanings to a single one of said dictionary entries, said two or more related phrases each having a different final particle that does not alter a basic shared meaning of said two or more related phrases, said merged dictionary being implemented to include dictionary entries that represent phone strings of a Cantonese language without utilizing corresponding tonal information as part of said phone strings. - View Dependent Claims (27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38)
-
Specification