Transcription of spoken communications
First Claim
1. A user terminal comprising:
- a microphone for capturing a portion of speech spoken by a near-end user of said user terminal;
a network interface for connecting to a communication network;
a touchscreen user interface;
a communication client application configured to;
conduct a communication session, over said communication network, between the near-end user and one or more far-end users of one or more far-end terminals, said communication session including an estimated transcription of said portion of speech that is capable of being sent in a message to the one or more far-end users;
obtain a plurality of alternative transcriptions for said portion of speech including an estimated probability of being correct for each transcription of the plurality of alternative transcriptions;
implement a vetting mechanism to allow the near-end user to vet the estimated transcription via the touchscreen user interface prior to the estimated transcription being sent in said message, the vetting mechanism including;
a first gesture received at the touchscreen user interface indicating acceptance of the estimated transcription to be included in a predetermined role in the message; and
one or more second gestures received at the touchscreen user interface indicating rejection of the estimated transcription from being included in said message; and
in response to receiving an indication of the one or more second gestures, select a next most probable transcription from the plurality of alternative transcriptions according to the respective estimated probability of being correct, and present the next most probable transcription with an option to accept or reject the next most probable transcription via the touchscreen user interface to be sent in said message.
1 Assignment
0 Petitions
Accused Products
Abstract
A portion of speech is captured when spoken by a near-end user. A near-end user terminal conducts a communication session, over a network, between the near-end user and one or more far-end users, the session including a message sent to the one or more far-end users. A vetting mechanism is provided via a touchscreen user interface of the near-end user terminal, to allow the near-end user to vet an estimated transcription of the portion of speech prior to being sent to the one or more far-end users in the message. According to the vetting mechanism: (i) a first gesture performed by the near-end user through the touchscreen user interface accepts the estimated transcription to be included in a predetermined role in the sent message, while (ii) one or more second gestures performed by the near-end user through the touchscreen user interface each reject the estimated transcription to be sent in the message.
-
Citations
20 Claims
-
1. A user terminal comprising:
-
a microphone for capturing a portion of speech spoken by a near-end user of said user terminal; a network interface for connecting to a communication network; a touchscreen user interface; a communication client application configured to; conduct a communication session, over said communication network, between the near-end user and one or more far-end users of one or more far-end terminals, said communication session including an estimated transcription of said portion of speech that is capable of being sent in a message to the one or more far-end users; obtain a plurality of alternative transcriptions for said portion of speech including an estimated probability of being correct for each transcription of the plurality of alternative transcriptions; implement a vetting mechanism to allow the near-end user to vet the estimated transcription via the touchscreen user interface prior to the estimated transcription being sent in said message, the vetting mechanism including; a first gesture received at the touchscreen user interface indicating acceptance of the estimated transcription to be included in a predetermined role in the message; and one or more second gestures received at the touchscreen user interface indicating rejection of the estimated transcription from being included in said message; and in response to receiving an indication of the one or more second gestures, select a next most probable transcription from the plurality of alternative transcriptions according to the respective estimated probability of being correct, and present the next most probable transcription with an option to accept or reject the next most probable transcription via the touchscreen user interface to be sent in said message. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 20)
-
-
17. A method comprising:
-
capturing a portion of speech spoken by a near-end user of a near-end user terminal; operating the near-end user terminal to conduct a communication session, over a network, between the near-end user and one or more far-end users of one or more far-end terminals, the communication session including an estimated transcription for said portion of speech that is capable of being sent in a message to the one or more far-end users; obtaining a plurality of alternative transcriptions for said portion of speech including an estimated probability of being correct for each transcription of the plurality of alternative transcriptions; implementing a vetting mechanism via a touchscreen user interface of the near-end user terminal, to allow the near-end user to vet an estimated transcription of said portion of speech prior to being sent to the one or more far-end users in said message, wherein said vetting mechanism includes a first gesture received at the touchscreen user interface indicating acceptance of the estimated transcription to be included in a predetermined role in said message and one or more second gestures received at the touchscreen user interface indicating rejection of the estimated transcription from being included in said message; and responsive to receiving an indication of the one or more second gestures, selecting a next most probable transcription from the plurality of alternative transcriptions according to the respective estimated probability of being correct, and presenting the next most probable transcription with an option to accept or reject the next most probable transcription via the touchscreen user interface to be sent in said message. - View Dependent Claims (19)
-
-
18. A computer-readable storage medium storing program code that is executable on a near-end user terminal to perform operations comprising:
-
capturing a portion of speech spoken by the near-end user; operating the near-end user terminal to conduct a communication session, over a network, between the near-end user and one or more far-end users of one or more far-end terminals, the communication session including an estimated transcription of said portion of speech that is capable of being sent in a message to the one or more far-end users; obtaining a plurality of alternative transcriptions for said portion of speech including an estimated probability of being correct for each transcription of the plurality of alternative transcriptions; implementing a vetting mechanism via a touchscreen user interface of the near-end user terminal to allow the near-end user to vet the estimated transcription prior to being sent in said message, wherein said vetting mechanism includes a first gesture received at the touchscreen user interface indicating acceptance of the estimated transcription to be included in a predetermined role in said message and one or more second gestures received at the touchscreen user interface indicating rejection of the estimated transcription from being included in said message; and responsive to receiving an indication of the one or more second gestures, selecting a next most probable transcription from the plurality of alternative transcriptions according to the respective estimated probability of being correct; and presenting the next most probable transcription with an option to accept or reject the next most probable transcription via the touchscreen user interface to be sent in said message.
-
Specification