Methods and systems for managing telecommunications and for translating voice messages to text messages
First Claim
1. A system, comprising:
- at least one computing device comprising hardware;
at least one network interface coupled to a Public Switched Telephone Network (PSTN);
non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
enabling a software application to be downloaded over a network to a computing device of a user;
recording a voice mail message from a caller;
transcribing some or all of the voice mail message from the caller, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
creating a text message from a selected portion of the transcribed text wherein resolution of transcribed text ambiguities is based at least in part on using a context and/or location of a word or phrase in the voice mail message; and
transmitting the text message over the network to the user computing device for display by the software application, the software application hosted by the computing device of the user, wherein the display of the transcribed text for an uncertain word is grayed.
1 Assignment
0 Petitions
Accused Products
Abstract
Systems and methods that can be utilized to convert a voice communication received over a telecommunication network to text are described. In an illustrative embodiment, a call processing system coupled to a telecommunications network receives a call from a caller intended for a first party, wherein the call is associated with call signaling information. At least a portion of the call signaling information is stored in a computer readable medium. A greeting is played the caller, and a voice communication from the caller is recorded. At least a portion of the voice communication is converted to text, which is analyzed to identify portions that are inferred to be relatively more important to communicate to the first party. A text communication is generated including at least some of the identified portions and including fewer words than the recorded voice communication. At least a portion of the text communication is made available to the first party over a data network.
-
Citations
20 Claims
-
1. A system, comprising:
-
at least one computing device comprising hardware; at least one network interface coupled to a Public Switched Telephone Network (PSTN); non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement; enabling a software application to be downloaded over a network to a computing device of a user; recording a voice mail message from a caller; transcribing some or all of the voice mail message from the caller, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer; creating a text message from a selected portion of the transcribed text wherein resolution of transcribed text ambiguities is based at least in part on using a context and/or location of a word or phrase in the voice mail message; and transmitting the text message over the network to the user computing device for display by the software application, the software application hosted by the computing device of the user, wherein the display of the transcribed text for an uncertain word is grayed. - View Dependent Claims (2, 3, 4, 5, 6)
-
-
7. A system, comprising:
-
at least one computing device comprising hardware; at least one network interface coupled to a Public Switched Telephone Network (PSTN); non-transitory memory coupled to an at least one networked computing device that stores a first instance of a software application configured to be installed on a computing device of a user, wherein the software application, when executed by the device of the user, is configured to receive and display text messages corresponding to voice messages transcribed to text and to receive transcription feedback from the user with respect to text messages corresponding to voice messages transcribed to text; non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement; receiving at the at least one network interface a call from a caller directed to a phone address, wherein the phone address is associated with the user; recording a voice message from the caller; transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer; creating a text message from at least a portion of the transcribed text; and transmitting the text message over the network to the user computing device for display by a second instance of the software application, wherein the second instance of the software application is hosted on the computing device of the user, and wherein the display of the transmitted text message to the user signals, for one or more transcribed words in the transmitted text message, an indication of the uncertainty of the word transcription, wherein the display of the transcribed text for an uncertain word is grayed, and wherein the second instance of the software application is configured to enable the user to provide transcription feedback regarding the received text message. - View Dependent Claims (8, 9, 10, 11)
-
-
12. A system, comprising:
-
at least one computing device comprising hardware; at least one network interface coupled to a Public Switched Telephone Network (PSTN); non-transitory memory coupled to an at least one networked computing device that stores a first instance of a software application configured to be installed on a computing device of a user, wherein the software application, when executed by the device of the user, is configured to receive and display text messages corresponding to voice messages transcribed to text and to receive transcription feedback from the user with respect to text messages corresponding to voice messages transcribed to text; non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement; receiving at the at least one network interface a call from a caller directed to a phone address, wherein the phone address is associated with the user; recording a voice message from the caller; transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer; creating a text message from at least a portion of the transcribed text; and transmitting the text message over the network to the user computing device for display by a second instance of the software application, wherein the second instance of the software application is hosted on the computing device of the user, and wherein the display of the transmitted text message to the user signals, for one or more transcribed words in the transmitted text message, an indication of the uncertainty of the word transcription, and wherein the second instance of the software application is configured to enable the user to provide transcription feedback regarding the received text message, and wherein the display of the transcribed text for a relatively higher certainty word comprises a regular font or a bold text font.
-
-
13. A system, comprising:
-
at least one computing device comprising hardware; at least one network interface coupled to a Public Switched Telephone Network (PSTN); non-transitory memory coupled to an at least one networked computing device that stores a first instance of a software application configured to be installed on a computing device of a user, wherein the software application, when executed by the device of the user, is configured to receive and display text messages corresponding to voice messages transcribed to text and to receive transcription feedback from the user with respect to text messages corresponding to voice messages transcribed to text; non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement; receiving at the at least one network interface a call from a caller directed to a phone address, wherein the phone address is associated with the user; recording a voice message from the caller; transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer; creating a text message from at least a portion of the transcribed text; and transmitting the text message over the network to the user computing device for display by a second instance of the software application, wherein the second instance of the software application is hosted on the computing device of the user, and wherein the display of the transmitted text message to the user signals, for one or more transcribed words in the transmitted text message, an indication of the uncertainty of the word transcription, and wherein the second instance of the software application is configured to enable the user to provide transcription feedback regarding the received text message, and receiving, from the second instance of the software application over the network, an indication that at least a one or more word or phrase in the text message is erroneously transcribed.
-
-
14. A system, comprising:
-
at least one computing device comprising hardware; at least one network interface coupled to a Public Switched Telephone Network (PSTN); non-transitory memory coupled to an at least networked one computing device that stores a first instance of a software application configured to be downloaded to a computing device of a user; non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement; receiving at the at least one network interface a call from a caller directed to the user; recording a voice message from the caller; transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer; creating a text message from a selected portion of the transcribed text; transmitting the text message over the network to a second instance of the software application, wherein the second instance of the software application is hosted on the computer device of the user; displaying, via a user interface of the second instance of the software application, the second instance of the software application executing on the computing device of the user, the text message transmitted over the network, wherein the display of the transmitted text message to the user signals, for each transcribed word in the selected portion of the transcribed text, an indication of the uncertainty of the word transcription, wherein the display of the transcribed text for a higher certainty word comprises a regular font or a bold text font; and providing a user interface control via the user interface of the second instance of the software application that enables the user to send a text reply to the caller. - View Dependent Claims (15, 16, 17, 18)
-
-
19. A system, comprising:
-
at least one computing device comprising hardware; at least one network interface coupled to a Public Switched Tele shone Network PSTN; non-transitory memory coupled to an at least networked one computing device that stores a first instance of a software application configured to be downloaded to a computing device of a user; non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement; receiving at the at least one network interface a call from a caller directed to the user; recording a voice message from the caller; transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer; creating a text message from a selected portion of the transcribed text; transmitting the text message over the network to a second instance of the software application, wherein the second instance of the software application is hosted on the computer device of the user; displaying, via a user interface of the second instance of the software application, the second instance of the software application executing on the computing device of the user, the text message transmitted over the network, wherein the display of the transmitted text message to the user signals, for each transcribed word in the selected portion of the transcribed text, an indication of the uncertainty of the word transcription; providing a user interface control via the user interface of the second instance of the software application that enables the user to send a text reply to the caller; receiving from the second instance of the software application over the network, a text message reply; and in response to receiving the text message reply from the second instance of the software application, sending, from the at least one computing device, an SMS message directed to the caller.
-
-
20. A system, comprising:
-
at least one computing device comprising hardware; at least one network interface coupled to a Public Switched Telephone Network (PSTN); non-transitory memory coupled to an at least networked one computing device that stores a first instance of a software application configured to be downloaded to a computing device of a user; non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement; receiving at the at least one network interface a call from a caller directed to the user; recording a voice message from the caller; transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizes; creating a text message from a selected portion of the transcribed text; transmitting the text message over the network to a second instance of the software application, wherein the second instance of the software application is hosted on the computer device of the user; displaying, via a user interface of the second instance of the software application, the second instance of the software application executing on the computing device of the user, the text message transmitted over the network, wherein the display of the transmitted text message to the user signals, for each transcribed word in the selected portion of the transcribed text, an indication of the uncertainty of the word transcription, wherein the display of the transcribed text for an uncertain word is grayed; and providing a user interface control via the user interface of the second instance of the software application that enables the user to send a text reply to the caller.
-
Specification