Methods and systems for managing telecommunications and for translating voice messages to text messages

US 9,277,043 B1
Filed: 03/03/2015
Issued: 03/01/2016
Est. Priority Date: 03/26/2007
Status: Expired due to Fees

Abstract

Systems and methods that can be utilized to convert a voice communication received over a telecommunication network to text are described. In an illustrative embodiment, a call processing system coupled to a telecommunications network receives a call from a caller intended for a first party, wherein the call is associated with call signaling information. At least a portion of the call signaling information is stored in a computer readable medium. A greeting is played to the caller, and a voice communication from the caller is recorded. At least a portion of the voice communication is converted to text, which is analyzed to identify portions that are inferred to be relatively more important to communicate to the first party. A text communication is generated including at least some of the identified portions and including fewer words than the recorded voice communication. At least a portion of the text communication is made available to the first party over a data network.
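As an illustration only, the condensing step described in the abstract (keep the portions inferred to be more important, so the text has fewer words than the recording) might be sketched as follows. The abstract does not say how importance is inferred; the cue patterns, the `condense` function, and the `max_sentences` parameter are all hypothetical.

```python
import re

# Hypothetical cue patterns; the abstract does not specify how importance is inferred.
CUES = [
    re.compile(r"\b\d{3}[-.\s]?\d{4}\b"),  # something shaped like a phone number
    re.compile(r"\b(call|urgent|meeting|tomorrow|today)\b", re.IGNORECASE),
]


def condense(transcript: str, max_sentences: int = 2) -> str:
    """Keep the sentences that match the most cues, in their original order."""
    sentences = [s.strip() for s in re.split(r"(?<=[.?!])\s+", transcript) if s.strip()]
    # Score each sentence by how many cue patterns it matches.
    scored = [(sum(bool(c.search(s)) for c in CUES), i, s) for i, s in enumerate(sentences)]
    top = sorted(scored, key=lambda t: (-t[0], t[1]))[:max_sentences]
    # Drop sentences with no cues and restore the original ordering.
    kept = sorted((t for t in top if t[0] > 0), key=lambda t: t[1])
    return " ".join(s for _, _, s in kept)


transcript = (
    "Hi it's Bob. Um I was just thinking about stuff. "
    "Please call me back at 555-1234. Thanks bye."
)
summary = condense(transcript)
```

Here `summary` keeps only the sentence carrying the callback request, so it is shorter than the full transcript. A real system would presumably use richer signals than keyword matching.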

20 Claims

1. A system, comprising:
- at least one computing device comprising hardware;
  
  at least one network interface coupled to a Public Switched Telephone Network (PSTN);
  
  non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
  
  enabling a software application to be downloaded over a network to a computing device of a user;
  
  recording a voice mail message from a caller;
  
  transcribing some or all of the voice mail message from the caller, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
  
  creating a text message from a selected portion of the transcribed text wherein resolution of transcribed text ambiguities is based at least in part on using a context and/or location of a word or phrase in the voice mail message; and
  
  transmitting the text message over the network to the user computing device for display by the software application, the software application hosted by the computing device of the user, wherein the display of the transcribed text for an uncertain word is grayed.
- Dependent Claims (2, 3, 4, 5, 6)
  - 2. The system as defined in claim 1, wherein the user computing device comprises a mobile phone.
  - 3. The system as defined in claim 1, wherein the location of the word or phrase resolution occurs at or near the beginning or at or near the end of the recorded voice mail message.
  - 4. The system as defined in claim 1, wherein the software application comprises a web browser.
  - 5. The system as defined in claim 1, wherein the network interface coupled to the PSTN comprises an Internet Protocol connection.
  - 6. The system as defined in claim 1, the operations further comprising:
    - identifying one or more of the following in the voice mail message;

      a pause;

      a clause initial conjunction;

      or a hesitation word.
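
The identification step recited in claim 6 (a pause, a clause initial conjunction, or a hesitation word) could be approximated as below. This is a minimal sketch, assuming the recognizer returns per-word timestamps. The `Word` type, the word lists, and the 0.7 second pause threshold are illustrative assumptions, not details taken from the patent.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float  # seconds from the start of the recording
    end: float


HESITATIONS = {"um", "uh", "er", "hmm"}    # assumed word list
CONJUNCTIONS = {"and", "but", "so", "or"}  # assumed word list
PAUSE_SECONDS = 0.7                        # assumed silence threshold


def find_markers(words):
    """Return (word index, marker kind) pairs for pauses, hesitations, and conjunctions."""
    markers = []
    for i, w in enumerate(words):
        gap = w.start - words[i - 1].end if i > 0 else 0.0
        after_pause = gap >= PAUSE_SECONDS
        if after_pause:
            markers.append((i, "pause"))
        token = w.text.lower().strip(".,!?")
        if token in HESITATIONS:
            markers.append((i, "hesitation"))
        elif token in CONJUNCTIONS and (i == 0 or after_pause):
            # Treat a conjunction as clause-initial only at the start or after a pause.
            markers.append((i, "clause_initial_conjunction"))
    return markers


words = [
    Word("Hi", 0.0, 0.3),
    Word("um", 0.4, 0.6),
    Word("but", 1.5, 1.7),
    Word("call", 1.8, 2.0),
    Word("and", 2.05, 2.2),
]
markers = find_markers(words)
```

In this example the final "and" is not flagged because it follows "call" without a pause. Using silence as the only clause boundary is a simplification.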

7. A system, comprising:
- at least one computing device comprising hardware;
  
  at least one network interface coupled to a Public Switched Telephone Network (PSTN);
  
  non-transitory memory coupled to an at least one networked computing device that stores a first instance of a software application configured to be installed on a computing device of a user, wherein the software application, when executed by the device of the user, is configured to receive and display text messages corresponding to voice messages transcribed to text and to receive transcription feedback from the user with respect to text messages corresponding to voice messages transcribed to text;
  
  non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
  
  receiving at the at least one network interface a call from a caller directed to a phone address, wherein the phone address is associated with the user;
  
  recording a voice message from the caller;
  
  transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
  
  creating a text message from at least a portion of the transcribed text; and
  
  transmitting the text message over the network to the user computing device for display by a second instance of the software application, wherein the second instance of the software application is hosted on the computing device of the user, and wherein the display of the transmitted text message to the user signals, for one or more transcribed words in the transmitted text message, an indication of the uncertainty of the word transcription, wherein the display of the transcribed text for an uncertain word is grayed, and wherein the second instance of the software application is configured to enable the user to provide transcription feedback regarding the received text message.
- Dependent Claims (8, 9, 10, 11)
  - 8. The system as defined in claim 7, wherein the first phone address is associated with the user computing device.
  - 9. The system as defined in claim 7, wherein the second instance of the software application comprises a web browser.
  - 10. The system as defined in claim 7, wherein the network interface coupled to the PSTN comprises an Internet Protocol connection.
  - 11. The system as defined in claim 7, the operations further comprising:
    - identifying one or more of the following in the voice message;

      a pause;

      a clause initial conjunction;

      or a hesitation word.

12. A system, comprising:
- at least one computing device comprising hardware;
  
  at least one network interface coupled to a Public Switched Telephone Network (PSTN);
  
  non-transitory memory coupled to an at least one networked computing device that stores a first instance of a software application configured to be installed on a computing device of a user, wherein the software application, when executed by the device of the user, is configured to receive and display text messages corresponding to voice messages transcribed to text and to receive transcription feedback from the user with respect to text messages corresponding to voice messages transcribed to text;
  
  non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
  
  receiving at the at least one network interface a call from a caller directed to a phone address, wherein the phone address is associated with the user;
  
  recording a voice message from the caller;
  
  transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
  
  creating a text message from at least a portion of the transcribed text; and
  
  transmitting the text message over the network to the user computing device for display by a second instance of the software application, wherein the second instance of the software application is hosted on the computing device of the user, and wherein the display of the transmitted text message to the user signals, for one or more transcribed words in the transmitted text message, an indication of the uncertainty of the word transcription, and wherein the second instance of the software application is configured to enable the user to provide transcription feedback regarding the received text message, and wherein the display of the transcribed text for a relatively higher certainty word comprises a regular font or a bold text font.
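
The display behavior recited in the claims (an uncertain word is grayed; a relatively higher certainty word uses a regular or bold font) might look like the following on the client side. This is a hedged sketch: the per-word confidence scores, the `low` and `high` thresholds, and the choice of HTML markup are assumptions made for illustration.

```python
from html import escape


def render(words, low=0.5, high=0.9):
    """Render (text, confidence) pairs as HTML with certainty-based styling.

    The thresholds are hypothetical; the claims do not state numeric values.
    """
    parts = []
    for text, conf in words:
        safe = escape(text)
        if conf < low:
            # Uncertain word: shown grayed.
            parts.append(f'<span style="color:gray">{safe}</span>')
        elif conf >= high:
            # Higher-certainty word: shown in bold.
            parts.append(f"<b>{safe}</b>")
        else:
            # Otherwise: regular font.
            parts.append(safe)
    return " ".join(parts)


html_text = render([("call", 0.95), ("Bob", 0.4), ("back", 0.7)])
```

With these inputs, "call" is bold, "Bob" is grayed, and "back" is left in the regular font.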

13. A system, comprising:
- at least one computing device comprising hardware;
  
  at least one network interface coupled to a Public Switched Telephone Network (PSTN);
  
  non-transitory memory coupled to an at least one networked computing device that stores a first instance of a software application configured to be installed on a computing device of a user, wherein the software application, when executed by the device of the user, is configured to receive and display text messages corresponding to voice messages transcribed to text and to receive transcription feedback from the user with respect to text messages corresponding to voice messages transcribed to text;
  
  non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
  
  receiving at the at least one network interface a call from a caller directed to a phone address, wherein the phone address is associated with the user;
  
  recording a voice message from the caller;
  
  transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
  
  creating a text message from at least a portion of the transcribed text; and
  
  transmitting the text message over the network to the user computing device for display by a second instance of the software application, wherein the second instance of the software application is hosted on the computing device of the user, and wherein the display of the transmitted text message to the user signals, for one or more transcribed words in the transmitted text message, an indication of the uncertainty of the word transcription, and wherein the second instance of the software application is configured to enable the user to provide transcription feedback regarding the received text message, and receiving, from the second instance of the software application over the network, an indication that at least a one or more word or phrase in the text message is erroneously transcribed.
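
One possible shape for the transcription feedback of claim 13 (an indication that one or more words in the text message were erroneously transcribed) is a small JSON payload sent from the application back to the system. The field names `message_id`, `erroneous`, `position`, and `word` are hypothetical, as are the two helper functions; the claim does not define a wire format.

```python
import json


def build_feedback(message_id, words, wrong_positions):
    """Application side: describe which word positions the user flagged as wrong."""
    flagged = [{"position": p, "word": words[p]} for p in sorted(set(wrong_positions))]
    return json.dumps({"message_id": message_id, "erroneous": flagged})


def read_feedback(payload):
    """System side: return the message id and the flagged word positions."""
    data = json.loads(payload)
    return data["message_id"], [item["position"] for item in data["erroneous"]]


payload = build_feedback("vm-1", ["call", "Bob", "back"], [1])
message_id, positions = read_feedback(payload)
```

Sending positions rather than corrected text keeps the sketch close to the claim language, which only requires an indication of the error.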

14. A system, comprising:
- at least one computing device comprising hardware;
  
  at least one network interface coupled to a Public Switched Telephone Network (PSTN);
  
  non-transitory memory coupled to an at least networked one computing device that stores a first instance of a software application configured to be downloaded to a computing device of a user;
  
  non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
  
  receiving at the at least one network interface a call from a caller directed to the user;
  
  recording a voice message from the caller;
  
  transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
  
  creating a text message from a selected portion of the transcribed text;
  
  transmitting the text message over the network to a second instance of the software application, wherein the second instance of the software application is hosted on the computer device of the user;
  
  displaying, via a user interface of the second instance of the software application, the second instance of the software application executing on the computing device of the user, the text message transmitted over the network, wherein the display of the transmitted text message to the user signals, for each transcribed word in the selected portion of the transcribed text, an indication of the uncertainty of the word transcription, wherein the display of the transcribed text for a higher certainty word comprises a regular font or a bold text font; and
  
  providing a user interface control via the user interface of the second instance of the software application that enables the user to send a text reply to the caller.
- Dependent Claims (15, 16, 17, 18)
  - 15. The system as defined in claim 14, wherein the user's computing device comprises a mobile phone.
  - 16. The system as defined in claim 14, wherein the second instance of the software application comprises a web browser.
  - 17. The system as defined in claim 14, wherein the coupled network interface to the PSTN comprises an Internet Protocol connection.
  - 18. The system as defined in claim 14, the operations further comprising:
    - identifying one or more of the following in the voice message;

      a pause;

      a clause initial conjunction;

      or a hesitation word.

19. A system, comprising:
- at least one computing device comprising hardware;
  
  at least one network interface coupled to a Public Switched Telephone Network (PSTN);
  
  non-transitory memory coupled to an at least networked one computing device that stores a first instance of a software application configured to be downloaded to a computing device of a user;
  
  non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
  
  receiving at the at least one network interface a call from a caller directed to the user;
  
  recording a voice message from the caller;
  
  transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
  
  creating a text message from a selected portion of the transcribed text;
  
  transmitting the text message over the network to a second instance of the software application, wherein the second instance of the software application is hosted on the computer device of the user;
  
  displaying, via a user interface of the second instance of the software application, the second instance of the software application executing on the computing device of the user, the text message transmitted over the network, wherein the display of the transmitted text message to the user signals, for each transcribed word in the selected portion of the transcribed text, an indication of the uncertainty of the word transcription;
  
  providing a user interface control via the user interface of the second instance of the software application that enables the user to send a text reply to the caller;
  
  receiving from the second instance of the software application over the network, a text message reply; and
  
  in response to receiving the text message reply from the second instance of the software application, sending, from the at least one computing device, an SMS message directed to the caller.
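
The reply path of claim 19 (a text reply arrives from the application, and the system then sends an SMS directed to the caller) can be sketched as a small handler. The `Reply` type and the injected `send_sms` callable are hypothetical stand-ins for whatever messaging gateway a real deployment would use.

```python
from dataclasses import dataclass


@dataclass
class Reply:
    caller_number: str  # number the original call came from
    text: str           # reply typed by the user in the application


def handle_reply(reply, send_sms):
    """Forward a reply received from the user's application as an SMS to the caller.

    `send_sms` is an assumed callable taking (destination number, message body).
    """
    if not reply.text.strip():
        return False  # nothing to send
    send_sms(reply.caller_number, reply.text)
    return True


sent = []
ok = handle_reply(Reply("+15550100", "On my way"), lambda to, body: sent.append((to, body)))
```

Passing the sender in as a parameter keeps the sketch independent of any particular SMS provider.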

20. A system, comprising:
- at least one computing device comprising hardware;
  
  at least one network interface coupled to a Public Switched Telephone Network (PSTN);
  
  non-transitory memory coupled to an at least networked one computing device that stores a first instance of a software application configured to be downloaded to a computing device of a user;
  
  non-transitory memory coupled to the at least one computing device that stores instructions that when executed by the at least one computing device cause, at least in part, the system to implement;
  
  receiving at the at least one network interface a call from a caller directed to the user;
  
  recording a voice message from the caller;
  
  transcribing some or all of the voice message, including a plurality of spoken words, to text using at least in part a speech-to-text recognizer;
  
  creating a text message from a selected portion of the transcribed text;
  
  transmitting the text message over the network to a second instance of the software application, wherein the second instance of the software application is hosted on the computer device of the user;
  
  displaying, via a user interface of the second instance of the software application, the second instance of the software application executing on the computing device of the user, the text message transmitted over the network, wherein the display of the transmitted text message to the user signals, for each transcribed word in the selected portion of the transcribed text, an indication of the uncertainty of the word transcription, wherein the display of the transcribed text for an uncertain word is grayed; and
  
  providing a user interface control via the user interface of the second instance of the software application that enables the user to send a text reply to the caller.

Current Assignee
CallWave Communications, LLC
Original Assignee
CallWave Communications, LLC
Inventors
Bladon, Anthony; Giannini, David; Hofstatter, David Frank; Kelley, Colin; McClintock, David C.; Smith, Robert F.; Trandal, David S.; Kirchhoff, Leland W.
Primary Examiner(s)
Gonzalez, Amancio

Application Number

US14/637,003
Time in Patent Office

364 Days
Field of Search

455/412.2, 455/414.4, 370/300, 379/71, 379/80, 379/85, 379/88.22, 379/88.25, 379/88.26, 704/200, 704/201, 704/251
US Class Current

1/1
CPC Class Codes

G10L 15/26   Speech to text systems G10L...

G10L 2015/088   Word spotting

H04L 51/063   Content adaptation, e.g. re...

H04L 51/066   Format adaptation, e.g. for...

H04L 51/52   for supporting social netwo...

H04M 1/72433   for voice messaging, e.g. d...

H04M 2201/60   Medium conversion

H04M 2203/4536   Voicemail combined with tex...

H04M 2203/554   Data synchronization

H04M 2207/12   intelligent networks

H04M 2207/203   composed of PSTN and data n...

H04M 3/42382   Text-based messaging servic...

H04M 3/533   Voice mail systems

H04M 3/53341   Message reply

H04M 3/5335   Message type or category, e...

H04M 7/0012   Details of application prog...

H04W 4/14   Short messaging services, e...

H04W 4/16   Communication-related suppl...

H04W 84/12   WLAN [Wireless Local Area N...
