System and method for searching, analyzing and displaying text transcripts of speech after imperfect speech recognition
First Claim
Patent Images
1. A method for processing text transcripts of speech after imperfect speech recognition, the method comprising the steps of:
- converting a speech document to text;
processing the text to determine salient terms; and
displaying the text by emphasizing the salient terms and minimizing non-salient terms, wherein said salient terms are determined by;
determining high selectivity terms in the at least one text transcript;
determining how many high selectivity terms there are;
determining how many multiword and high selectivity terms there are that are above a first predetermined selectivity;
when there are ten or more high selectivity and multiword terms that have confidences greater than a first predetermined selectivity, displaying the ten or more high selectivity and multiword terms; and
when there are less than ten high selectivity and multiword terms that are above a first predetermined selectivity;
determining how many single word and high selectivity terms there are;
determining if there are ten or more single word and multiword terms that are high selectivity terms that have selectivities greater than a first predetermined selectivity;
when there are ten or more single word and multiword terms that are high selectivity terms that have selectivity greater than a first predetermined selectivity, displaying all of the ten or more single word and multiword terms; and
when there are not ten or more single word and multiword terms that are high selectivity terms that have confidences greater than a first predetermined selectivity, displaying all single word and multiword terms that are high selectivity terms and that have confidences greater than a second predetermined selectivity.
3 Assignments
0 Petitions
Accused Products
Abstract
A speech conversation is changed to a text transcript, which is then pre-processed and subjected to text mining to determine salient terms. Salient terms are those terms that meet a predetermined level of selectivity in a collection. The text transcript of the speech conversation is displayed by emphasizing the salient terms and minimizing non-salient terms. An interface is provided that allows a user to select a salient term, whereupon the speech conversation is played beginning at the location, in the speech file, of the selected salient term.
83 Citations
31 Claims
-
1. A method for processing text transcripts of speech after imperfect speech recognition, the method comprising the steps of:
-
converting a speech document to text; processing the text to determine salient terms; and displaying the text by emphasizing the salient terms and minimizing non-salient terms, wherein said salient terms are determined by; determining high selectivity terms in the at least one text transcript; determining how many high selectivity terms there are; determining how many multiword and high selectivity terms there are that are above a first predetermined selectivity; when there are ten or more high selectivity and multiword terms that have confidences greater than a first predetermined selectivity, displaying the ten or more high selectivity and multiword terms; and when there are less than ten high selectivity and multiword terms that are above a first predetermined selectivity; determining how many single word and high selectivity terms there are; determining if there are ten or more single word and multiword terms that are high selectivity terms that have selectivities greater than a first predetermined selectivity; when there are ten or more single word and multiword terms that are high selectivity terms that have selectivity greater than a first predetermined selectivity, displaying all of the ten or more single word and multiword terms; and when there are not ten or more single word and multiword terms that are high selectivity terms that have confidences greater than a first predetermined selectivity, displaying all single word and multiword terms that are high selectivity terms and that have confidences greater than a second predetermined selectivity. - View Dependent Claims (2, 3, 4)
-
-
5. A method for processing text transcripts of speech after imperfect speech recognition, the method comprising the steps of:
-
converting a speech document to at least one text transcript; increasing sentence and paragraph structure in the at least one text transcript; removing non-word utterances from the at least one text transcript; determining salient terms in the at least one text transcript; and displaying text of the at least one text transcript, the step of displaying performed to emphasize display of the salient terms relative to non-salient terms, wherein said salient terms are determined by; determining high selectivity terms in the at least one text transcript; determining how many high selectivity terms there are; determining how many multiword and high selectivity terms there are that are above a first predetermined selectivity; when there are ten or more high selectivity and multiword terms that have confidences greater than a first predetermined selectivity, displaying the ten or more high selectivity and multiword terms; and when there are less than ten high selectivity and multiword terms that are above a first predetermined selectivity; determining how many single word and high selectivity terms there are; determining if there are ten or more single word and multiword terms that are high selectivity terms that have selectivities greater than a first predetermined selectivity; when there are ten or more single word and multiword terms that are high selectivity terms that have selectivity greater than a first predetermined selectivity, displaying all of the ten or more single word and multiword terms; and when there are not ten or more single word and multiword terms that are high selectivity terms that have confidences greater than a first predetermined selectivity, displaying all single word and multiword terms that are high selectivity terms and that have confidences greater than a second predetermined selectivity. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for processing text transcripts of speech after imperfect speech recognition, the system comprising:
-
a memory that stores computer-readable code; and a processor operatively coupled to the memory, the processor configured to implement the computer-readable code, the computer-readable code configured to; convert a speech document to text; process the text to determine salient terms; and display the text by emphasizing the salient terms and minimizing non-salient terms, wherein said salient terms are determined by; determining high selectivity terms in the at least one text transcript; determining how many high selectivity terms there are; determining how many multiword and high selectivity terms there are that are above a first predetermined selectivity; when there are ten or more high selectivity and multiword terms that have confidences greater than a first predetermined selectivity, displaying the ten or more high selectivity and multiword terms; and when there are less than ten high selectivity and multiword terms that are above a first predetermined selectivity; determining how many single word and high selectivity terms there are; determining if there are ten or more single word and multiword terms that are high selectivity terms that have selectivities greater than a first predetermined selectivity; when there are ten or more single word and multiword terms that are high selectivity terms that have selectivity greater than a first predetermined selectivity, displaying all of the ten or more single word and multiword terms; and when there are not ten or more single word and multiword terms that are high selectivity terms that have confidences greater than a first predetermined selectivity, displaying all single word and multiword terms that are high selectivity terms and that have confidences greater than a second predetermined selectivity. - View Dependent Claims (15, 16, 17)
-
-
18. A system for processing text transcripts of speech after imperfect speech recognition, the system comprising:
-
a memory that stores computer-readable code; and a processor operatively coupled to the memory, the processor configured to implement the computer-readable code, the computer-readable code configured to; convert a speech document to at least one text transcript; increase sentence and paragraph structure in the at least one text transcript; remove non-word utterances from the at least one text transcript; determine salient terms in the at least one text transcript; and display text of the at least one text transcript, the step of displaying performed to emphasize display of the salient terms relative to non-salient terms, wherein said salient terms are determined by; determining high selectivity terms in the at least one text transcript; determining how many high selectivity terms there are; determining how many multiword and high selectivity terms there are that are above a first predetermined selectivity; when there are ten or more high selectivity and multiword terms that have confidences greater than a first predetermined selectivity, displaying the ten or more high selectivity and multiword terms; and when there are less than ten high selectivity and multiword terms that are above a first predetermined selectivity; determining how many single word and high selectivity terms there are; determining if there are ten or more single word and multiword terms that are high selectivity terms that have selectivities greater than a first predetermined selectivity; when there are ten or more single word and multiword terms that are high selectivity terms that have selectivity greater than a first predetermined selectivity, displaying all of the ten or more single word and multiword terms; and when there are not ten or more single word and multiword terms that are high selectivity terms that have confidences greater than a first predetermined selectivity, displaying all single word and multiword terms that are high selectivity terms and that have confidences greater than a second predetermined selectivity. - View Dependent Claims (19, 20, 21, 22)
-
-
23. An article of manufacture for processing text transcripts of speech after imperfect speech recognition, the article of manufacture comprising:
-
a step to convert a speech document to text; a step to process the text to determine salient terms; and a step to display the text by emphasizing the salient terms and minimizing non-salient terms, wherein said salient terms are determined by; determining high selectivity terms in the at least one text transcript; determining how many high selectivity terms there are; determining how many multiword and high selectivity terms there are that are above a first predetermined selectivity; when there are ten or more high selectivity and multiword terms that have confidences greater than a first predetermined selectivity, displaying the ten or more high selectivity and multiword terms; and when there are less than ten high selectivity and multiword terms that are above a first predetermined selectivity; determining how many single word and high selectivity terms there are; determining if there are ten or more single word and multiword terms that are high selectivity terms that have selectivities greater than a first predetermined selectivity; when there are ten or more single word and multiword terms that are high selectivity terms that have selectivity greater than a first predetermined selectivity, displaying all of the ten or more single word and multiword terms; and when there are not ten or more single word and multiword terms that are high selectivity terms that have confidences greater than a first predetermined selectivity, displaying all single word and multiword terms that are high selectivity terms and that have confidences greater than a second predetermined selectivity. - View Dependent Claims (24, 25, 26)
-
-
27. An article of manufacture for processing text transcripts of speech after imperfect speech recognition, the article of manufacture comprising:
-
a step to convert a speech document to at least one text transcript; a step to increase sentence and paragraph structure in the at least one text transcript; a step to remove non-word utterances from the at least one text transcript; a step to determine salient terms in the at least one text transcript; and a step to display text of the at least one text transcript, the step of displaying performed to emphasize display of the salient terms relative to non-salient terms, wherein said salient terms are determined by; determining high selectivity terms in the at least one text transcript; determining how many high selectivity terms there are; determining how many multiword and high selectivity terms there are that are above a first predetermined selectivity; when there are ten or more high selectivity and multiword terms that have confidences greater than a first predetermined selectivity, displaying the ten or more high selectivity and multiword terms; and when there are less than ten high selectivity and multiword terms that are above a first predetermined selectivity; determining how many single word and high selectivity terms there are; determining if there are ten or more single word and multiword terms that are high selectivity terms that have selectivities greater than a first predetermined selectivity; when there are ten or more single word and multiword terms that are high selectivity terms that have selectivity greater than a first predetermined selectivity, displaying all of the ten or more single word and multiword terms; and when there are not ten or more single word and multiword terms that are high selectivity terms that have confidences greater than a first predetermined selectivity, displaying all single word and multiword terms that are high selectivity terms and that have confidences greater than a second predetermined selectivity. - View Dependent Claims (28, 29, 30, 31)
-
Specification