System and Method for Multimodal Utterance Detection

US 20140222430A1
Filed: 02/03/2014
Published: 08/07/2014
Est. Priority Date: 10/17/2008
Status: Active Grant

First Claim

1. A computer-implemented speech utterance detection method comprising:

a) generating a plurality of features from an audio stream;

b) obtaining a plurality of time aligned speech segments based on the features;

c) filtering the plurality of speech segments using general speech related knowledge and application specific knowledge to yield at least one candidate segment;

e) finding a desired speech segment from the at least one candidate segment based on multimodal timing information related to the desired speech segment; and

f) outputting the desired speech segment.
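
The steps of the claim can be illustrated with a minimal sketch. The claim does not say which features, segmentation rule, filtering knowledge, or additional mode is used, so everything concrete here is an assumption made for illustration: frame energy as the feature, a fixed energy threshold for segmentation, duration limits standing in for the general and application specific knowledge, and a single event timestamp (for example a key press) as the multimodal timing information. All function names and parameters are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Segment:
    start: float  # seconds from the start of the audio stream
    end: float    # seconds from the start of the audio stream

    @property
    def duration(self) -> float:
        return self.end - self.start


def frame_energies(samples: Sequence[float], frame_len: int) -> List[float]:
    """Step (a): one feature per frame (assumed: mean squared amplitude)."""
    energies = []
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        energies.append(sum(x * x for x in frame) / frame_len)
    return energies


def segments_from_energies(energies: Sequence[float], frame_sec: float,
                           threshold: float) -> List[Segment]:
    """Step (b): group consecutive above-threshold frames into timed segments."""
    segments: List[Segment] = []
    start = None
    for idx, energy in enumerate(energies):
        if energy > threshold and start is None:
            start = idx
        elif energy <= threshold and start is not None:
            segments.append(Segment(start * frame_sec, idx * frame_sec))
            start = None
    if start is not None:  # audio ended while still inside a segment
        segments.append(Segment(start * frame_sec, len(energies) * frame_sec))
    return segments


def filter_candidates(segments: Sequence[Segment], min_sec: float,
                      max_sec: float) -> List[Segment]:
    """Step (c): assumed knowledge: too-short segments are noise (general),
    too-long segments exceed what the application expects (specific)."""
    return [s for s in segments if min_sec <= s.duration <= max_sec]


def pick_by_event_time(candidates: Sequence[Segment],
                       event_time: float) -> Optional[Segment]:
    """Step (e): choose the candidate closest in time to an event from another mode."""
    def distance(seg: Segment) -> float:
        if seg.start <= event_time <= seg.end:
            return 0.0
        return min(abs(seg.start - event_time), abs(seg.end - event_time))
    return min(candidates, key=distance, default=None)


def detect_utterance(samples: Sequence[float], sample_rate: int,
                     event_time: float, frame_len: int = 160,
                     threshold: float = 0.01, min_sec: float = 0.05,
                     max_sec: float = 2.0) -> Optional[Segment]:
    """Steps (a) through (f) chained; returns the desired segment or None."""
    frame_sec = frame_len / sample_rate
    energies = frame_energies(samples, frame_len)
    segments = segments_from_energies(energies, frame_sec, threshold)
    candidates = filter_candidates(segments, min_sec, max_sec)
    return pick_by_event_time(candidates, event_time)
```

In this sketch, two bursts of audio separated by silence yield two candidate segments, and the event timestamp from the other mode decides which one is returned. A real system would likely use richer features and a more careful model of how event timings correlate with speech timings.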

Abstract

The disclosure describes a system and method for detecting one or more segments of desired speech utterances from an audio stream using timings of events from other modes that are correlated to the timings of the desired segments of speech. The redundant information from other modes results in highly accurate and robust utterance detection.

90 Citations

Current Assignee: Ashwin P. Rao
Original Assignee: Ashwin P. Rao
Inventors: Rao, Ashwin P.
Granted Patent: US 9,922,640 B2
US Class Current: 704/254
CPC Class Codes:
G10L 15/04 Segmentation; Word boundary...
G10L 2015/088 Word spotting
