Multisensory speech detection
First Claim
Patent Images
1. A computer-implemented method comprising:
- identifying, by a mobile computing device, a first pose with which the mobile computing device is being held by a user, the first pose being identified from among a plurality of predetermined poses;
initiating, after identifying the first pose, a first audio recording process using a microphone of the mobile computing device;
selecting, based on the first pose identified by the mobile computing device, a first set of one or more parameters from among a plurality of parameters, wherein the first set of one or more parameters define settings that are specific to the first pose and that differentiate between, at least, speech and background noise when the mobile computing device is being held in the first pose;
detecting that the user has started speaking during the first audio recording process;
determining, after the user has started speaking during the first audio recording process and using the first set of one or more parameters, whether the user has stopped speaking during the first audio recording process;
stopping, based on the determining whether the user has stopped speaking during the first audio recording process, the first audio recording process;
identifying, by the mobile computing device, a second pose with which the mobile computing device is being held by the user, the second pose being identified from among the plurality of predetermined poses, and the second pose being distinct from the first pose;
initiating, after identifying the second pose, a second audio recording process using the microphone of the mobile computing device;
selecting, based on the second pose identified by the mobile computing device, a second set of one or more parameters from among the plurality of parameters, wherein the second set of one or more parameters define settings that are specific to the second pose and that differentiate between, at least, speech and background noise when the mobile computing device is being held in the second pose;
detecting that the user has started speaking during the second audio recording process;
determining, after the user has started speaking during the second audio recording process and using the second set of one or more parameters, whether the user has stopped speaking during the second audio recording process; and
stopping, based on the determining whether the user has stopped speaking during the second audio recording process, the second audio recording process.
2 Assignments
0 Petitions
Accused Products
Abstract
A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.
88 Citations
20 Claims
-
1. A computer-implemented method comprising:
-
identifying, by a mobile computing device, a first pose with which the mobile computing device is being held by a user, the first pose being identified from among a plurality of predetermined poses; initiating, after identifying the first pose, a first audio recording process using a microphone of the mobile computing device; selecting, based on the first pose identified by the mobile computing device, a first set of one or more parameters from among a plurality of parameters, wherein the first set of one or more parameters define settings that are specific to the first pose and that differentiate between, at least, speech and background noise when the mobile computing device is being held in the first pose; detecting that the user has started speaking during the first audio recording process; determining, after the user has started speaking during the first audio recording process and using the first set of one or more parameters, whether the user has stopped speaking during the first audio recording process; stopping, based on the determining whether the user has stopped speaking during the first audio recording process, the first audio recording process; identifying, by the mobile computing device, a second pose with which the mobile computing device is being held by the user, the second pose being identified from among the plurality of predetermined poses, and the second pose being distinct from the first pose; initiating, after identifying the second pose, a second audio recording process using the microphone of the mobile computing device; selecting, based on the second pose identified by the mobile computing device, a second set of one or more parameters from among the plurality of parameters, wherein the second set of one or more parameters define settings that are specific to the second pose and that differentiate between, at least, speech and background noise when the mobile computing device is being held in the second pose; detecting that the user has started speaking during the second audio recording process; determining, after the user has started speaking during the second audio recording process and using the second set of one or more parameters, whether the user has stopped speaking during the second audio recording process; and stopping, based on the determining whether the user has stopped speaking during the second audio recording process, the second audio recording process. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A mobile computing device comprising:
-
one or more computer processors; a pose identifier that is programmed to identify a first pose with which the mobile computing device is being held by a user and a second pose with which the mobile device is being held by the user, the first and second poses being identified from among a plurality of predetermined poses, and the first pose being distinct from the second pose; a microphone that is programmed to initiate a first audio recording process after identification of the first pose and a second audio recording process after identification of the second pose; a speech detector that is programmed i) to select a first set of one or more parameters from among a plurality of parameters based on the first pose, wherein the first set of one or more parameters define settings that are specific to the first pose to differentiate between, at least, speech and background noise when the mobile computing device is being held in the first pose, ii) to detect, using the first set of one or more parameters, that the user has started speaking during the first audio recording process, iii) to select a second set of one or more parameters from among the plurality of parameters based on the second pose, wherein the second set of one or more parameters define settings that are specific to the second pose to differentiate between, at least, speech and background noise when the mobile computing device is being held in the second pose, and iv) to detect, using the second set of one or more parameters, that the user has started speaking during the second audio recording process; and a speech endpointer that is programmed i) to determine, after the user has started speaking and using the first set of one or more parameters, whether the user has stopped speaking during the first audio recording process, ii) to stop the first audio recording process based on determining that the user has stopped speaking during the first audio recording process, iii) to determine, after the user has started speaking and using the second set of one or more parameters, whether the user has stopped speaking during the second audio recording process, and iv) to stop the second audio recording process based on determining that the user has stopped speaking during the second audio recording process. - View Dependent Claims (16, 17, 18, 19)
-
-
20. A computer program product embodied in a computer readable storage device storing instructions that, when executed, cause one or more computing devices to perform operations comprising:
-
identifying a first pose with which the mobile computing device is being held by a user, the first pose being identified from among a plurality of predetermined poses; initiating, after identifying the first pose, a first audio recording process using a microphone of the mobile computing device; selecting, based on the first pose identified by the mobile computing device, a first set of one or more parameters from among a plurality of parameters, wherein the first set of one or more parameters define settings that are specific to the first pose and that differentiate between, at least, speech and background noise when the mobile computing device is being held in the first pose; detecting that the user has started speaking during the first audio recording process; determining, after the user has started speaking during the first audio recording process and using the first set of one or more parameters, whether the user has stopped speaking during the first audio recording process; stopping, based on the determining whether the user has stopped speaking during the first audio recording process, the first audio recording process; identifying a second pose with which the mobile computing device is being held by the user, the second pose being identified from among the plurality of predetermined poses, and the second pose being distinct from the first pose; initiating, after identifying the second pose, a second audio recording process using the microphone of the mobile computing device; selecting, based on the second pose identified by the mobile computing device, a second set of one or more parameters from among the plurality of parameters, wherein the second set of one or more parameters define settings that are specific to the second pose and that differentiate between, at least, speech and background noise when the mobile computing device is being held in the second pose; detecting that the user has started speaking during the second audio recording process; determining, after the user has started speaking during the second audio recording process and using the second set of one or more parameters, whether the user has stopped speaking during the second audio recording process; and stopping, based on the determining whether the user has stopped speaking during the second audio recording process, the second audio recording process.
-
Specification