×

Methods and apparatus for speech segmentation using multiple metadata

  • US 10,229,686 B2
  • Filed: 08/18/2014
  • Issued: 03/12/2019
  • Est. Priority Date: 08/18/2014
  • Status: Active Grant
First Claim
Patent Images

1. A method of performing automated speech recognition (ASR) in a system having a speech enhancement module for generating an audio stream signal and metadata, coupled to an ASR module for performing speech recognition on the audio stream signal using the metadata, the method comprising:

  • by the speech enhancement module, processing microphone signals to generate the audio stream signal;

    by a first speech detector having a first response latency, generating first metadata that indicate the possible presence of speech in the audio stream signal with a first confidence level;

    by a second speech detector having a second response latency that is higher than the first response latency, generating second metadata that indicate the possible presence of speech in the audio stream signal with a second confidence level that is higher than the first confidence level;

    by the ASR module based on the first metadata, initiating buffering of the audio stream signal from an endpoint; and

    by the ASR module based on the second metadata, initiating speech recognition on the buffered audio stream signal from the endpoint.

View all claims
  • 2 Assignments
Timeline View
Assignment View
    ×
    ×