USE OF METADATA TO POST PROCESS SPEECH RECOGNITION OUTPUT

  • US 20090248415A1
  • Filed: 03/31/2009
  • Published: 10/01/2009
  • Est. Priority Date: 03/31/2008
  • Status: Active Grant
First Claim

1. A method of utilizing metadata stored in a computer-readable medium to assist in the conversion of spoken audio input, received by a hand-held mobile communication device, into a textual representation for display on the hand-held mobile communication device, comprising the steps of:

    initializing a hand-held mobile communication device so that the hand-held mobile communication device is capable of communicating with a backend server via a data channel of the hand-held mobile communication device;

    upon receipt of an utterance by the hand-held mobile communication device, recording and storing an audio message, representative of the utterance, in the hand-held mobile communication device in the form of binary audio data;

    transmitting, via the data channel, the recorded and stored binary audio data, representing the utterance, from the hand-held mobile communication device to a backend server through a client-server communication protocol;

    in conjunction with the transmission of the recorded and stored binary audio data, transmitting metadata from the hand-held mobile communication device to the backend server through the client-server communication protocol;

    converting the transmitted binary audio data into a textual representation of the utterance in the backend server;

    comparing at least one portion of the textual representation to at least one portion of the metadata;

    replacing at least one portion of the textual representation with at least one portion of the metadata; and

    sending the converted textual representation of the utterance, with metadata replacement, from the server back to the hand-held mobile communication device.
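
The comparing and replacing steps recited above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the method disclosed in the patent: the claim does not say how the comparison is performed, so the sketch assumes the metadata is a list of terms (for example, contact names from the device) and substitutes approximate string matching from Python's standard `difflib` module. The function name `replace_with_metadata` and the `cutoff` threshold are hypothetical.

```python
import difflib


def replace_with_metadata(transcript, metadata_terms, cutoff=0.75):
    """Replace transcript words that closely match a metadata term.

    Assumption: metadata_terms is a list of single-word strings sent by
    the device alongside the audio. The similarity cutoff is arbitrary.
    """
    # Map lowercase forms back to the spelling used in the metadata.
    lookup = {term.lower(): term for term in metadata_terms}
    candidates = list(lookup)

    result = []
    for word in transcript.split():
        # Set trailing punctuation aside so it does not affect matching.
        core = word.rstrip(".,!?;:")
        tail = word[len(core):]

        # Compare this portion of the text to the metadata.
        matches = difflib.get_close_matches(
            core.lower(), candidates, n=1, cutoff=cutoff
        )

        # Replace the portion with the metadata term when one is close enough.
        result.append((lookup[matches[0]] if matches else core) + tail)

    return " ".join(result)


if __name__ == "__main__":
    print(replace_with_metadata("call jon smyth tomorrow", ["John", "Smith"]))
```

With the sample input shown, "jon" and "smyth" are similar enough to the metadata terms to be replaced, while "call" and "tomorrow" are left unchanged. A word-by-word comparison is the simplest choice here; multi-word metadata entries or phonetic matching would need a different comparison step.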

  • 5 Assignments