Method and apparatus for performing prosody-based endpointing of a speech signal
First Claim
Patent Images
1. A method for processing a speech signal comprising:
- extracting prosodic features from a speech signal;
modeling the prosodic features to identify at least one speech endpoint; and
producing an endpoint signal corresponding to the occurrence of the at least one speech endpoint.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and apparatus for finding endpoints in speech by utilizing information contained in speech prosody. Prosody denotes the way speakers modulate the timing, pitch and loudness of phones, words, and phrases to convey certain aspects of meaning; informally, prosody includes what is perceived as the “rhythm” and “melody” of speech. Because speakers use prosody to convey units of speech to listeners, the method and apparatus performs endpoint detection by extracting and interpreting the relevant prosodic properties of speech.
48 Citations
21 Claims
-
1. A method for processing a speech signal comprising:
-
extracting prosodic features from a speech signal;
modeling the prosodic features to identify at least one speech endpoint; and
producing an endpoint signal corresponding to the occurrence of the at least one speech endpoint. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10)
-
-
11. Apparatus for processing a speech signal comprising:
-
a prosodic feature extractor for extracting prosodic features from the speech signal;
a prosodic feature analyzer for modeling the prosodic features to identify at least one speech endpoint; and
an endpoint signal producer that produces an endpoint signal corresponding to the occurrence of the at least one speech endpoint. - View Dependent Claims (12, 13, 14, 15, 16, 17, 18, 19, 20)
-
-
21. An electronic storage medium for storing a program that, when executed by a processor, causes a system to perform a method for processing a speech signal comprising:
-
extracting prosodic features from a speech signal;
modeling the prosodic features to identify at least one speech endpoint; and
producing an endpoint signal corresponding to the occurrence of the at least one speech endpoint.
-
Specification