Method and system for automatically providing linguistic formulations that are outside a recognition domain of an automatic speech recognition system
First Claim
1. A method for automatically providing a hypothesis of a linguistic formulation that is uttered by a user, the method comprising:
- automatically providing a hypothesis of a linguistic formulation that is uttered by a user of an automatic voice service based on an automatic speech recognition system that is outside a recognition domain of said automatic speech recognition system by;
providing a constrained speech recognition and an unconstrained speech recognition of a portion of a first input speech signal that is outside a recognition domain of said automatic speech recognition system, the constrained speech recognition includes constrained phonemes based on a sequence of time segments and the unconstrained speech recognition includes unconstrained phonemes based on the sequence of time segments;
identifying and temporally segmenting a given constrained phoneme of said constrained speech recognition corresponding to a time segment of the given constrained phoneme in order to determine whether the given constrained phoneme is outside said recognition domain, including;
computing confidence measures for the constrained phonemes of said constrained speech recognition, wherein the confidence measures include a discrete time quanta, andidentifying said given constrained phoneme of said constrained speech recognition outside said recognition domain based on said confidence measures;
identifying and temporally segmenting a given unconstrained phoneme of said unconstrained speech recognition corresponding to a time segment of the given unconstrained phoneme, the time segment of the given unconstrained phoneme being substantially the same as the time segment of the given constrained phoneme; and
providing said linguistic formulation hypothesis based on said identified unconstrained phoneme of said unconstrained speech recognition.
2 Assignments
0 Petitions
Accused Products
Abstract
A method for automatically providing a hypothesis of a linguistic formulation that is uttered by users of a voice service based on an automatic speech recognition system and that is outside a recognition domain of the automatic speech recognition system. The method includes providing a constrained and an unconstrained speech recognition from an input speech signal, identifying a part of the constrained speech recognition outside the recognition domain, identifying a part of the unconstrained speech recognition corresponding to the identified part of the constrained speech recognition, and providing the linguistic formulation hypothesis based on the identified part of the unconstrained speech recognition.
-
Citations
17 Claims
-
1. A method for automatically providing a hypothesis of a linguistic formulation that is uttered by a user, the method comprising:
-
automatically providing a hypothesis of a linguistic formulation that is uttered by a user of an automatic voice service based on an automatic speech recognition system that is outside a recognition domain of said automatic speech recognition system by; providing a constrained speech recognition and an unconstrained speech recognition of a portion of a first input speech signal that is outside a recognition domain of said automatic speech recognition system, the constrained speech recognition includes constrained phonemes based on a sequence of time segments and the unconstrained speech recognition includes unconstrained phonemes based on the sequence of time segments; identifying and temporally segmenting a given constrained phoneme of said constrained speech recognition corresponding to a time segment of the given constrained phoneme in order to determine whether the given constrained phoneme is outside said recognition domain, including; computing confidence measures for the constrained phonemes of said constrained speech recognition, wherein the confidence measures include a discrete time quanta, and identifying said given constrained phoneme of said constrained speech recognition outside said recognition domain based on said confidence measures; identifying and temporally segmenting a given unconstrained phoneme of said unconstrained speech recognition corresponding to a time segment of the given unconstrained phoneme, the time segment of the given unconstrained phoneme being substantially the same as the time segment of the given constrained phoneme; and providing said linguistic formulation hypothesis based on said identified unconstrained phoneme of said unconstrained speech recognition. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A system for automatically providing hypothesis of linguistic formulations that are uttered by users of an automatic voice service based on an automatic speech recognition system and that are outside a recognition domain of the automatic speech recognition system, said system comprising a computer configured to:
-
automatically provide a hypothesis of a linguistic formulation that is uttered by a user of an automatic voice service based on an automatic speech recognition system that is outside a recognition domain of said automatic speech recognition system by; providing a constrained speech recognition and an unconstrained speech recognition of a portion of a first input speech signal that is outside a recognition domain of said automatic speech recognition system, the constrained speech recognition includes constrained phonemes based on a sequence of time segments and the unconstrained speech recognition includes unconstrained phonemes based on the sequence of time segments; identifying and temporally segmenting a given constrained phoneme of said constrained speech recognition corresponding to a time segment of the given constrained phoneme in order to determine whether the identified constrained phoneme is outside said recognition domain, including; computing confidence measures for constrained phonemes of said constrained speech recognition, wherein the confidence measures include a discrete time quanta, and identifying said constrained phoneme of said constrained speech recognition outside said recognition domain based on said confidence measures; identifying and temporally segmenting a given unconstrained phoneme of said unconstrained speech recognition corresponding to a time segment of the given unconstrained phoneme, the time segment of the given unconstrained phoneme being substantially the same as the time segment of the given constrained phoneme; and providing said linguistic formulation hypothesis based on said identified unconstrained phoneme of said unconstrained speech recognition.
-
-
16. A non-transitory computer program medium encoded with a computer program comprising a computer program code, wherein the computer program code, when loaded in a computer, causes the computer to:
-
automatically provide a hypothesis of a linguistic formulation that is uttered by a user of an automatic voice service based on an automatic speech recognition system that is outside a recognition domain of said automatic speech recognition system by; providing a constrained speech recognition and an unconstrained speech recognition of a portion of a first input speech signal that is outside a recognition domain of said automatic speech recognition system, the constrained speech recognition includes constrained phonemes based on a sequence of time segments and the unconstrained speech recognition includes unconstrained phonemes based on the sequence of time segments; identifying and temporally segmenting a given constrained phoneme of said constrained speech recognition corresponding to a constrained phoneme time segment in order to determine whether the identified constrained phoneme is outside said recognition domain, including; computing confidence measures for the constrained phonemes of said constrained speech recognition, wherein the confidence measures include a discrete time quanta, and identifying said constrained phoneme of said constrained speech recognition outside said recognition domain based on said confidence measures; identifying and temporally segmenting a given unconstrained phoneme of said unconstrained speech recognition corresponding to time segment of the given unconstrained phoneme, the time segment of the given unconstrained phoneme being substantially the same as the time segment of the given constrained phoneme; and providing said linguistic formulation hypothesis based on said identified unconstrained phoneme of said unconstrained speech recognition.
-
-
17. A method for providing an automatic voice service based on an automatic speech recognition system, comprising:
-
receiving an input speech signal; performing an automatic speech recognition based on said input speech signal; and providing a hypothesis of a linguistic formulation that is uttered by a user of said automatic voice service and that is outside a recognition domain of said automatic speech recognition system, wherein said hypothesis is automatically provided by; automatically providing a hypothesis of a linguistic formulation that is uttered by a user of an automatic voice service based on an automatic speech recognition system that is outside a recognition domain of said automatic speech recognition system by; providing a constrained speech recognition and an unconstrained speech recognition of a portion of a first input speech signal that is outside a recognition domain of said automatic speech recognition system, the constrained speech recognition includes constrained phonemes based on a sequence of time segments and the unconstrained speech recognition includes unconstrained phonemes based on the sequence of time segments; identifying and temporally segmenting a given constrained phoneme of said constrained speech recognition corresponding to a time segment of the given constrained phoneme in order to determine whether the identified constrained phoneme is outside said recognition domain, including; computing confidence measures for the constrained phonemes of said constrained speech recognition, wherein the confidence measures include a discrete time quanta, and identifying said constrained phoneme of said constrained speech recognition outside said recognition domain based on said confidence measures; identifying and temporally segmenting a given unconstrained phoneme of said unconstrained speech recognition corresponding to a time segment of the given unconstrained phoneme, the time segment of the given unconstrained phoneme being substantially the same as the time segment of the given constrained phoneme; and providing said linguistic formulation hypothesis based on said identified unconstrained phoneme of said unconstrained speech recognition.
-
Specification