Speech recognition candidate selection based on non-acoustic input
First Claim
Patent Images
1. An apparatus, comprising:
- a memory; and
a processor operatively coupled to the memory and configured to;
receive a speech input;
generate at least two speech recognition candidates from the speech input;
observe a scene related to the speech input using one or more non-acoustic sensors;
segment the observed scene into one or more regions;
compute one or more properties for the one or more regions, wherein the computation of the one or more properties comprises a determination of a textual label using optical character recognition; and
select one of the speech recognition candidates based on the one or more computed properties of the one or more regions.
1 Assignment
0 Petitions
Accused Products
Abstract
A method includes the following steps. A speech input is received. At least two speech recognition candidates are generated from the speech input. A scene related to the speech input is observed using one or more non-acoustic sensors. The observed scene is segmented into one or more regions. One or more properties for the one or more regions are computed. One of the speech recognition candidates is selected based on the one or more computed properties of the one or more regions.
-
Citations
20 Claims
-
1. An apparatus, comprising:
-
a memory; and a processor operatively coupled to the memory and configured to; receive a speech input; generate at least two speech recognition candidates from the speech input; observe a scene related to the speech input using one or more non-acoustic sensors; segment the observed scene into one or more regions; compute one or more properties for the one or more regions, wherein the computation of the one or more properties comprises a determination of a textual label using optical character recognition; and select one of the speech recognition candidates based on the one or more computed properties of the one or more regions. - View Dependent Claims (2, 3, 4, 5, 6, 7, 15, 16, 17)
-
-
8. An article of manufacture comprising a computer readable storage medium for storing computer readable program code which, when executed, causes a computer to:
-
receive a speech input; generate at least two speech recognition candidates from the speech input; observe a scene related to the speech input using one or more non-acoustic sensors; segment the observed scene into one or more regions; compute one or more properties for the one or more regions, wherein the computation of the one or more properties comprises program code to determine a textual label using optical character recognition; and select one of the speech recognition candidates based on the one or more computed properties of the one or more regions. - View Dependent Claims (9, 10, 11, 12, 13, 14, 18, 19, 20)
-
Specification