Methods and apparatus for proofing of a text input
First Claim
1. A method for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the method comprising:
- using at least one computer hardware processor to perform;
receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user;
converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and
for each text chunk in the plurality of text chunks;
automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and
providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for presenting data input as a plurality of data chunks including a first data chunk and a second data chunk. The techniques include converting the plurality of data chunks to a textual representation comprising a plurality of text chunks including a first text chunk corresponding to the first data chunk and a second text chunk corresponding to the second data chunk, respectively, and providing a presentation of at least part of the textual representation such that the first text chunk is presented differently than the second text chunk to, when presented, assist a user in proofing the textual representation.
-
Citations
27 Claims
-
1. A method for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the method comprising:
using at least one computer hardware processor to perform; receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user; converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and for each text chunk in the plurality of text chunks; automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
10. A system for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the system comprising:
at least one computer hardware processor configured to perform; receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user; converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and for each text chunk in the plurality of text chunks; automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
19. At least one non-transitory computer readable medium storing instructions that, when executed on at least one computer, cause the at least one computer to perform a method for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the method comprising:
-
receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user; converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and for each text chunk in the plurality of text chunks; automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation. - View Dependent Claims (20, 21, 22, 23, 24, 25, 26, 27)
-
Specification