Method and system for dynamic speech recognition and tracking of prewritten script
DCFirst Claim
1. A computer-implemented method for dynamically presenting a prewritten text in a graphical user interface comprisinga) receiving a text artifact, said text artifact containing a plurality of artifact words;
- b) storing, via a processor, said text artifact in a memory device of a computer;
c) retrieving, via said processor, said text artifact;
d) displaying said text artifact on the display screen of said computer;
e) receiving a vocal input;
f) generating, via said processor, a text file representing the words spoken in said vocal input, said text file containing a plurality of hypothesis words;
g) comparing, via said processor, a predetermined number of said hypothesis words to a predetermined number of said artifact words;
h) determining a match location in said text artifact where a specific number of said predetermined number of hypothesis words match a specific number of said predetermined number of artifact words;
i) altering, via said processor, the display on said display screen to display said match location on said display screen of said computer;
j) determining, via said processor, the font size of the text of said text artifact as displayed on said display screen;
k) determining, via said processor, the orientation of the electronic screen displaying the text of the text artifact;
l) altering, via said processor, said predetermined number of artifact words utilized for comparing to said predetermined number of hypothesis words.
4 Assignments
Litigations
0 Petitions
Accused Products
Abstract
A computer-implemented method for dynamically presenting a prewritten text in a graphical user interface is disclosed. The method comprises receiving a text artifact, storing the text artifact in a memory device of a computer, retrieving the text artifact, displaying the text artifact on the display screen of the computer, receiving a vocal input, generating a text file representing the words spoken in the vocal input, comparing a predetermined number of the hypothesis words to a predetermined number of the artifact words, determining a match location in the text artifact where a specific number of the predetermined number of hypothesis words match a specific number of the predetermined number of artifact words, and altering the display on the display screen to display the match location on the display screen of the computer.
-
Citations
20 Claims
-
1. A computer-implemented method for dynamically presenting a prewritten text in a graphical user interface comprising
a) receiving a text artifact, said text artifact containing a plurality of artifact words; -
b) storing, via a processor, said text artifact in a memory device of a computer; c) retrieving, via said processor, said text artifact; d) displaying said text artifact on the display screen of said computer; e) receiving a vocal input; f) generating, via said processor, a text file representing the words spoken in said vocal input, said text file containing a plurality of hypothesis words; g) comparing, via said processor, a predetermined number of said hypothesis words to a predetermined number of said artifact words; h) determining a match location in said text artifact where a specific number of said predetermined number of hypothesis words match a specific number of said predetermined number of artifact words; i) altering, via said processor, the display on said display screen to display said match location on said display screen of said computer; j) determining, via said processor, the font size of the text of said text artifact as displayed on said display screen; k) determining, via said processor, the orientation of the electronic screen displaying the text of the text artifact; l) altering, via said processor, said predetermined number of artifact words utilized for comparing to said predetermined number of hypothesis words. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11)
-
-
12. A computer-implemented method for dynamically presenting a prewritten text in a graphical user interface comprising
a) receiving a text artifact, said text artifact containing a plurality of artifact words; -
b) storing, via a processor, said text artifact in a memory device of a computer; c) retrieving, via said processor, said text artifact; d) displaying said text artifact on the display screen of said computer; e) receiving a vocal input; f) generating, via said processor, a text file representing the words spoken in said vocal input, said text file containing a plurality of hypothesis words; g) comparing, via said processor, a predetermined number of said hypothesis words to a predetermined number of said artifact words, wherein comparing said predetermined number of said hypothesis words to said predetermined number of said artifact words comprises i) creating a first group containing said predetermined number of hypothesis words; ii) creating a second group containing said predetermined number of artifact words, said second group being displayed on said display screen of said computer; iii) respectively determining whether each of said predetermined number of hypothesis words is present in said first group; iv) respectively determining whether each of said predetermined number of artifact words is present in said first group; v) generating a single number, wherein said single number is representative of the degree of similarity between said first group and said second group; h) determining a match location in said text artifact where a specific number of said predetermined number of hypothesis words match a specific number of said predetermined number of artifact words; i) altering, via said processor, the display on said display screen to display said match location on said display screen of said computer. - View Dependent Claims (13, 14)
-
-
15. A computer-implemented method for dynamically presenting a prewritten text in a graphical user interface comprising
a) receiving a text artifact, said text artifact containing a plurality of artifact words; -
b) storing, via a processor, said text artifact in a memory device of a computer; c) retrieving, via said processor, said text artifact; d) displaying said text artifact on the display screen of said computer; e) receiving a vocal input; f) generating, via said processor, a text file representing the words spoken in said vocal input, said text file containing a plurality of hypothesis words; g) comparing, via said processor, a predetermined number of said hypothesis words to a predetermined number of said artifact words; h) determining a match location in said text artifact where a specific number of said predetermined number of hypothesis words match a specific number of said predetermined number of artifact words; i) altering, via said processor, the display on said display screen to display said match location on said display screen of said computer; j) determining two or more second groups of said predetermined number of artifact words having a generated single number greater than a predetermined number; k) creating a list of all bigrams contained in said predetermined number of hypothesis words; l) creating a list of all trigrams contained in said predetermined number of hypothesis words; m) respectively creating a list of all bigrams contained in each of said two or more second groups; n) respectively creating a list of all trigrams contained in each of said two or more second groups; o) respectively comparing each list of bigrams of said two or more second groups to said list of all bigrams contained in said predetermined number of hypothesis words; p) respectively comparing each list of trigrams of said two or more second groups to said list of all trigrams contained in said predetermined number of hypothesis words; q) determining one of said two or more second groups of said predetermined number of artifact words having the greatest total number of bigrams and trigrams present in said list of all bigrams and said list of all trigrams contained in said predetermined number of hypothesis words; r) selecting said one of said two or more second groups of said predetermined number of artifact words having the greatest total number of bigrams and trigrams present in said list of all bigrams and said list of all trigrams contained in said predetermined number of hypothesis words; s) selecting a portion of said text artifact, wherein said selected portion of text artifact contains said selected one of said two or more second groups of said predetermined number of artifact words having the greatest total number of bigrams and trigrams present in said list of all bigrams and said list of all trigrams contained in said predetermined number of hypothesis words. - View Dependent Claims (16, 17)
-
-
18. A computer-implemented method for dynamically presenting a prewritten text in a graphical user interface comprising, on a computer comprising a microprocessor, a visual display, and a nonvolatile memory unit, said nonvolatile memory unit storing instructions which when executed by said microprocessor cause the computer to perform operations comprising
a) receiving a text artifact, said text artifact containing a plurality of artifact words; -
b) storing, via a processor, said text artifact in a memory device of a computer; c) retrieving, via said processor, said text artifact; d) displaying said text artifact on the display screen of said computer; e) receiving a vocal input; f) generating, via said processor, a text file representing the words spoken in said vocal input, said text file containing a plurality of hypothesis words; g) comparing, via said processor, a predetermined number of said hypothesis words to a predetermined number of said artifact words; h) determining a match location in said text artifact where a specific number of said predetermined number of hypothesis words match a specific number of said predetermined number of artifact words; i) altering, via said processor, the display on said display screen to display said match location on said display screen of said computer; j) determining whether one or more words of said plurality of hypothesis words have been matched to a match location of said text artifact; k) removing one or more words from said plurality of hypothesis words, wherein said removed words have been previously matched to a match location of said text artifact; l) determining, via said processor, the font size of the text of said text artifact as displayed on said display screen; m) determining, via said processor, the orientation of the electronic screen displaying the text of the text artifact; n) altering, via said processor, said predetermined number of artifact words utilized for comparing to said predetermined number of hypothesis words. - View Dependent Claims (19, 20)
-
Specification