Hierarchical transcription and display of input speech
First Claim
1. A method for hierarchical transcription and display of input speech, the method comprising the steps of:
- converting a speech portion to a word;
determining a confidence of the word;
displaying the word if the confidence of the word meets a threshold confidence; and
displaying at least one syllable, corresponding to the word, if the confidence of the word does not meet the threshold confidence.
2 Assignments
0 Petitions
Accused Products
Abstract
Generally, the present invention provides the ability to present a mixed display of a transcription to a user. The mixed display is preferably organized in a hierarchical fashion. Words, syllables and phones can be placed on the same display by the present invention, and the present invention can select the appropriate symbol transcription based on the parts of speech that meet minimum confidences. Words are displayed if they meet a minimum confidence or else syllables, which make up the word, are displayed. Additionally, if a syllable does not meet a predetermined confidence, then phones, which make up the syllable, may be displayed. A transcription, in one aspect of the present invention, may also be described as a hierarchical transcription, because a unique confidence is derived that accounts for mixed word/syllable/phone data.
55 Citations
41 Claims
-
1. A method for hierarchical transcription and display of input speech, the method comprising the steps of:
-
converting a speech portion to a word;
determining a confidence of the word;
displaying the word if the confidence of the word meets a threshold confidence; and
displaying at least one syllable, corresponding to the word, if the confidence of the word does not meet the threshold confidence. - View Dependent Claims (2)
-
-
3. A method comprising the steps of:
-
providing a recognized sentence portion comprising words and syllables;
transforming a plurality of hypothesis scores of the recognized sentence portion to phone level;
determining, by using the transformed hypothesis scores, confidence of the recognized sentence portion as a function of time; and
using the confidence as a function of time to determine confidences for parts of speech in the recognized sentence portion. - View Dependent Claims (4, 5, 6, 7, 8, 9, 10)
-
-
11. A method for hierarchical transcription and display of input speech, the method comprising the steps of:
-
determining for a speech portion which of a plurality of parts of speech meets predetermined criteria for that part of speech; and
displaying the part of speech that meets the predetermined criteria for that part of speech. - View Dependent Claims (12, 13, 14, 15, 16, 17, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 35, 36)
-
-
18. A system comprising:
-
a memory that stores computer-readable code; and
a processor operatively coupled to the memory, the processor configured to implement the computer-readable code, the computer-readable code configured to;
provide a recognized sentence portion comprising words and syllables;
transform a plurality of hypothesis scores of the recognized sentence portion to phone level;
determine, by using the transformed hypothesis scores, confidence of the recognized sentence portion as a function of time; and
use the confidence as a function of time to determine confidences for parts of speech in the recognized sentence portion.
-
-
25. A system for hierarchical transcription and display of input speech, the system comprising:
-
a memory that stores computer-readable code; and
a processor operatively coupled to the memory, the processor configured to implement the computer-readable code, the computer-readable code configured to;
determine for a speech portion which of a plurality of parts of speech meets predetermined criteria for that part of speech; and
display the part of speech that meets the predetermined criteria for that part of speech.
-
-
30. An article of manufacture comprising:
a computer-readable medium having computer-readable code embodied thereon, the computer-readable code comprising;
a step to provide a recognized sentence portion comprising words and syllables;
a step to transform a plurality of hypothesis scores of the recognized sentence portion to phone level;
a step to determine, by using the transformed hypothesis scores, confidence of the recognized sentence portion as a function of time; and
a step to use the confidence as a function of time to determine confidences for parts of speech in the recognized sentence portion. - View Dependent Claims (31, 32, 33, 34, 38, 39, 40, 41)
-
37. An article of manufacture comprising:
a computer-readable medium having computer-readable code embodied thereon, the computer-readable code comprising;
a step to determine for a speech portion which of a plurality of parts of speech meets predetermined criteria for that part of speech; and
a step to display the part of speech that meets the predetermined criteria for that part of speech.
Specification