Voice-activated switch

US 5,430,826 A
Filed: 10/13/1992
Issued: 07/04/1995
Est. Priority Date: 10/13/1992
Status: Expired due to Term

First Claim

Patent Images

1. Apparatus for detecting human speech in an audio signal, comprising:

a single lag autocorrelation unit, that receives a digital signal representative of the audio signal and provides a respective single-lag autocorrelated signal, representative of each received digital signal multiplied by said each received digital signal delayed by the same period of time corresponding to a first formant frequency;

an initial decision unit for providing initial decisions associated with portions of said single-lag autocorrelated signal, wherein an initial decision indicates a talk state if an associated portion of said single-lag autocorrelated signal exceeds a scaled noise value and wherein said initial decision indicates a pause state otherwise; and

a final decision unit that determines when a portion of the audio signal contains human speech according to a plurality of said initial decisions.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Human speech is detected in an audio signal by first providing a single autocorrelated signal indicative of the audio signal multiplied by a time-delayed portion of the audio signal, the delay being an amount of time indicative of a period corresponding to a first formant frequency. Portions of the autocorrelated signal are compared with a scaled noise value. Human speech is detected by examining whether a plurality of portions of the autocorrelated signal exceed the scaled noise value.

51 Citations

View as Search Results

28 Claims

1. Apparatus for detecting human speech in an audio signal, comprising:
- a single lag autocorrelation unit, that receives a digital signal representative of the audio signal and provides a respective single-lag autocorrelated signal, representative of each received digital signal multiplied by said each received digital signal delayed by the same period of time corresponding to a first formant frequency;
  
  an initial decision unit for providing initial decisions associated with portions of said single-lag autocorrelated signal, wherein an initial decision indicates a talk state if an associated portion of said single-lag autocorrelated signal exceeds a scaled noise value and wherein said initial decision indicates a pause state otherwise; and
  
  a final decision unit that determines when a portion of the audio signal contains human speech according to a plurality of said initial decisions.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
- - 2. Apparatus for detecting human speech in an audio signal, according to claim 1, wherein said final decision unit deems a particular portion of the audio signal to contain speech if a final decision associated with an immediately preceding portion of the audio signal indicates a speech state and if an initial decision for the particular portion or a subsequent portion of the single-lag autocorrelated signal indicates a talk state.
  - 3. Apparatus for detecting human speech in an audio signal, according to claim 1, wherein the final decision unit deems a particular portion of the audio signal not to contain speech if an immediately preceding portion of the audio signal is deemed not to contain speech and if the initial decision for the particular portion or a subsequent portion indicates a pause state.
  - 4. Apparatus for detecting human speech in an audio signal, according to claim 1, wherein portions of the audio signal which are before and after a portion where speech is detected are also deemed to contain speech.
  - 5. Apparatus for detecting human speech in an audio signal, according to claim 1, wherein the scaled noise value equals the minimum of a predetermined number of portions of the audio signal multiplied by a constant value.
  - 6. Apparatus for detecting human speech in an audio signal, according to claim 5, wherein the constant value is user selectable.
  - 7. Apparatus for detecting human speech in an audio signal, according to claim 5, wherein the predetermined number of portions is forty-eight.
  - 8. Apparatus for detecting human speech in an audio signal, according to claim 1, wherein the delay is two msec.
  - 9. Apparatus for detecting human speech in an audio signal, according to claim 2, wherein the final decision unit deems a particular portion of the audio signal not to contain speech if an immediately preceding portion of the audio signal is deemed not to contain speech and if the initial decision for the particular portion or a subsequent portion indicates a pause state.
  - 10. Apparatus for detecting human speech in an audio signal, according to claim 9, wherein portions of the audio signal which are before and after a portion where speech is detected are also deemed to contain speech.
  - 11. Apparatus for detecting human speech in an audio signal, according to claim 10, wherein the scaled noise value equals the minimum of a predetermined number of portions of the audio signal multiplied by a constant value.
  - 12. Apparatus for detecting human speech in an audio signal, according to claim 11, wherein the constant value is selected by a user.
  - 13. A voice activated switch for detecting human speech in a sound signal, according to claim 12, wherein the predetermined number of portions is forty-eight.
  - 14. A voice activated switch for detecting human speech in a sound signal, according to claim 13, wherein the delay is two msec.

15. Method of detecting speech in an audio signal, comprising the steps of:
- providing a single autocorrelated signal corresponding to the audio signal multiplied by a portion of the audio signal delayed by only a single-lag period of time corresponding to a first formant frequency;
  
  associating a initial decisions with portions of said single-lag autocorrelated signal, wherein an initial decision indicates a talk state if an associated portion of said single-lag autocorrelated signal exceeds a scaled noise value and wherein said initial decision indicates a pause state otherwise; and
  
  deeming a portion of the audio signal to contain human speech according to a plurality of initial decisions.
- View Dependent Claims (16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
- - 16. Method of detecting speech in an audio signal, according to claim 15, wherein a portion of the audio signal is deemed to contain speech if a final decision associated with an immediately preceding portion of the audio signal indicates a speech state and if an initial decision for the particular portion or a subsequent portion of the single-lag autocorrelated signal indicates a talk state.
  - 17. Method of detecting speech in an audio signal, according to claim 15, wherein a particular portion of the audio signal is deemed not to contain speech if an immediately preceding portion of the audio signal is deemed not to contain speech and if the initial decision for the particular portion or a subsequent portion indicates a pause state.
  - 18. Method of detecting speech in an audio signal, according to claim 15, further comprising the step of:
    - deeming portions of the audio signal which are before and after a portion where speech is detected as containing speech.
  - 19. Method of detecting speech in an audio signal, according to claim 15, wherein the scaled noise value equals the minimum of a predetermined number of portions of the audio signal multiplied by a constant value.
  - 20. Method of detecting speech in an audio signal, according to claim 19, wherein the constant value is selected by a user.
  - 21. Method of detecting speech in an audio signal, according to claim 19, wherein the predetermined number of portions is forty-eight.
  - 22. Method of detecting speech in an audio signal, according to claim 15, wherein the delay is two msec.
  - 23. Method of detecting speech in an audio signal, according to claim 16, wherein a particular portion of the audio signal is deemed not to contain speech if an immediately preceding portion of the audio signal is deemed not to contain speech and if the initial decision for the particular portion or a subsequent portion indicates a pause state.
  - 24. Method of detecting speech in an audio signal, according to claim 23, wherein portions of the audio signal which are before and after a portion where speech is detected are also deemed to contain speech.
  - 25. Method of detecting speech in an audio signal, according to claim 24, wherein the scaled noise value equals the minimum of a predetermined number of portions of the audio signal multiplied by a constant value.
  - 26. Method of detecting speech in an audio signal, according to claim 25, wherein the constant value is selected by a user.
  - 27. Method of detecting speech in an audio signal, according to claim 26, wherein the predetermined number of portions is forty-eight.
  - 28. Method of detecting speech in an audio signal, according to claim 27, wherein the delay is two msec.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Harris Corporation (L3Harris Technologies, Inc.)
Original Assignee
Harris Corporation (L3Harris Technologies, Inc.)
Inventors
Sinclair, Gregory S., Webster, Mark A., Wright, Thomas H.
Primary Examiner(s)
MacDonald, Allen R.
Assistant Examiner(s)
Dorvil, Richemond

Application Number

US07/959,759
Time in Patent Office

994 Days
Field of Search

381/46, 381/49, 395/2.17, 395/2.18, 395/2.19, 395/2.23, 395/2.24, 395/2.42, 395/2.57, 395/2.62, 395/22.72, 395/2.26
US Class Current

704/200
CPC Class Codes

G10L 25/06   the extracted parameters be...

G10L 25/15   the extracted parameters be...

G10L 25/78   Detection of presence or ab...

Voice-activated switch

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

51 Citations

28 Claims

Specification

Solutions

Use Cases

Quick Links

Voice-activated switch

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

51 Citations

28 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links