MOTION-BASED VOICE ACTIVITY DETECTION
Abstract
Motion-based voice activity detection may be provided. A data stream may be received and a determination may be made whether at least one non-audio element associated with the data stream indicates that the data stream comprises speech. In response to determining that the at least one non-audio element associated with the data stream indicates that the data stream comprises speech, a speech to text conversion may be performed on at least one audio element associated with the data stream.
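The gating idea in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the `DataStream` type, the `"motion"` key, the `indicates_speech` rule, and the `speech_to_text` callable are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional


@dataclass
class DataStream:
    """Hypothetical container: one audio element plus non-audio elements."""
    audio: bytes
    non_audio: Dict[str, bool] = field(default_factory=dict)


def indicates_speech(non_audio: Dict[str, bool]) -> bool:
    # Assumption: a single boolean "motion" flag stands in for whatever
    # non-audio element a real system would inspect.
    return bool(non_audio.get("motion", False))


def handle_stream(
    stream: DataStream,
    speech_to_text: Callable[[bytes], str],
) -> Optional[str]:
    """Convert the audio element only when a non-audio element indicates speech."""
    if indicates_speech(stream.non_audio):
        return speech_to_text(stream.audio)
    return None  # the audio is not treated as speech
```

Passing the recognizer in as a callable keeps the sketch independent of any particular speech-to-text engine.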
20 Claims
1. A method for providing voice activity detection, the method comprising:
receiving a data stream;
determining whether at least one non-audio element associated with the data stream indicates that the data stream comprises speech; and
in response to determining that the at least one non-audio element associated with the data stream indicates that the data stream comprises speech, processing at least one audio element associated with the data stream as speech.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9, 10
11. A computer-readable medium which stores a set of instructions which when executed performs a method for providing voice activity detection, the method executed by the set of instructions comprising:
receiving a data stream from a user;
determining whether a plurality of inputs associated with the data stream indicate that the data stream comprises speech;
in response to determining that the plurality of inputs associated with the data stream indicates that the data stream comprises speech, performing a speech to text conversion on at least one audio element associated with the data stream; and
displaying the converted text to the user.
Dependent claims: 12, 13, 14, 15, 16, 17, 18, 19
20. A system for providing voice activity detection, the system comprising:
a memory storage; and
a processing unit coupled to the memory storage, wherein the processing unit is operative to:
learn at least one gesture associated with the user indicating that the data stream comprises speech;
receive a data stream from a user;
determine whether the at least one learned gesture has been detected in association with the data stream;
in response to determining that the at least one learned gesture has not been detected, determine whether a plurality of non-audio inputs associated with the data stream indicate that the data stream comprises speech, wherein the plurality of inputs comprise at least one of the following: a sensor reading, a user input, a device status, and an application status;
in response to determining that the plurality of inputs associated with the data stream indicates that the data stream comprises speech, perform a speech to text conversion on at least one audio element associated with the data stream; and
display the converted text to the user.
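The decision flow recited in claim 20 might be sketched as follows. Several details are assumptions rather than anything the claim states: the `Stream` and `Detector` names, the treatment of a detected learned gesture as indicating speech, and the `min_votes` threshold used to combine the non-audio inputs are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Optional, Set


@dataclass
class Stream:
    """Hypothetical data stream: audio plus an optional gesture and non-audio inputs."""
    audio: bytes
    gesture: Optional[str] = None
    # Keys such as "sensor_reading", "user_input", "device_status",
    # "application_status"; each value says whether that input suggests speech.
    inputs: Dict[str, bool] = field(default_factory=dict)


class Detector:
    def __init__(self, speech_to_text: Callable[[bytes], str], min_votes: int = 2):
        self.speech_to_text = speech_to_text
        self.min_votes = min_votes  # assumption: simple vote threshold
        self.learned_gestures: Set[str] = set()

    def learn_gesture(self, name: str) -> None:
        """Record a gesture that this user makes when speaking."""
        self.learned_gestures.add(name)

    def is_speech(self, stream: Stream) -> bool:
        # Assumption: a detected learned gesture is taken to indicate speech.
        if stream.gesture in self.learned_gestures:
            return True
        # Otherwise fall back to the non-audio inputs.
        votes = sum(1 for suggests in stream.inputs.values() if suggests)
        return votes >= self.min_votes

    def process(
        self, stream: Stream, display: Callable[[str], None] = print
    ) -> Optional[str]:
        """Convert the audio to text and display it when speech is indicated."""
        if not self.is_speech(stream):
            return None
        text = self.speech_to_text(stream.audio)
        display(text)
        return text
```

A caller would register gestures with `learn_gesture` first, then pass each incoming `Stream` to `process`, supplying whatever display function the device uses.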
Specification