METHODS AND APPARATUS FOR ACOUSTIC DISAMBIGUATION
Abstract
Techniques for disambiguating at least one text segment from at least one acoustically similar word and/or phrase. The techniques include identifying at least one text segment, in a textual representation having a plurality of text segments, having at least one acoustically similar word and/or phrase, annotating the textual representation with disambiguating information to help disambiguate the at least one text segment from the at least one acoustically similar word and/or phrase, and synthesizing a speech signal, at least in part, by performing text-to-speech synthesis on at least a portion of the textual representation that includes the at least one text segment, wherein the speech signal includes speech corresponding to the disambiguating information located proximate the portion of the speech signal corresponding to the at least one text segment.
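The three steps named in the abstract (identify, annotate, synthesize) can be sketched as follows. This is a minimal illustration, not the patented implementation: the ACOUSTICALLY_SIMILAR table, the Segment type, and the choice to disambiguate by spelling the word aloud are all assumptions made for the example, and the final step only builds the text that would be handed to a text-to-speech engine rather than calling one.

```python
import re
from dataclasses import dataclass

# Hypothetical lookup table: each word maps to words that sound alike.
# A real system would need a pronunciation lexicon or similar source.
ACOUSTICALLY_SIMILAR = {
    "two": {"to", "too"},
    "to": {"two", "too"},
    "too": {"to", "two"},
    "their": {"there"},
    "there": {"their"},
}


@dataclass
class Segment:
    text: str                        # the word as it appears in the text
    similar: frozenset = frozenset() # sound-alike words, if any
    annotation: str = ""             # disambiguating information, if any


def identify(text):
    """Split the text into word segments and flag those with sound-alikes."""
    segments = []
    for word in text.split():
        key = re.sub(r"\W", "", word).lower()
        similar = frozenset(ACOUSTICALLY_SIMILAR.get(key, ()))
        segments.append(Segment(word, similar))
    return segments


def annotate(segments):
    """Attach disambiguating information (assumed here: a spelled-out form)."""
    for seg in segments:
        if seg.similar:
            letters = re.sub(r"\W", "", seg.text).upper()
            seg.annotation = "spelled " + " ".join(letters)
    return segments


def to_tts_input(segments):
    """Place each annotation right after its segment, ready for a TTS engine."""
    parts = []
    for seg in segments:
        parts.append(seg.text)
        if seg.annotation:
            parts.append(f"({seg.annotation})")
    return " ".join(parts)


print(to_tts_input(annotate(identify("Send two files"))))
# prints: Send two (spelled T W O) files
```

Placing the annotation immediately after the flagged word mirrors the requirement that the disambiguating speech be located proximate the speech for the text segment itself.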
27 Claims
1. A method comprising:
identifying at least one text segment, in a textual representation having a plurality of text segments, having at least one acoustically similar word and/or phrase;
annotating the textual representation with disambiguating information to help disambiguate the at least one text segment from the at least one acoustically similar word and/or phrase; and
synthesizing a speech signal, at least in part, by performing text-to-speech synthesis on at least a portion of the textual representation that includes the at least one text segment, wherein the speech signal includes speech corresponding to the disambiguating information located proximate the portion of the speech signal corresponding to the at least one text segment.
Dependent claims: 2, 3, 4, 5, 6, 7, 8, 9
10. At least one computer readable medium storing instructions that, when executed on at least one processor, perform a method comprising:
identifying at least one text segment, in a textual representation having a plurality of text segments, having at least one acoustically similar word and/or phrase;
annotating the textual representation with disambiguating information to help disambiguate the at least one text segment from the at least one acoustically similar word and/or phrase; and
synthesizing a speech signal, at least in part, by performing text-to-speech synthesis on at least a portion of the textual representation that includes the at least one text segment, wherein the speech signal includes speech corresponding to the disambiguating information located proximate the portion of the speech signal corresponding to the at least one text segment.
Dependent claims: 11, 12, 13, 14, 15, 16, 17, 18
19. A system comprising:
at least one input interface for receiving data from the user;
a conversion component configured to convert the data into a textual representation; and
a presentation component configured to provide an audio presentation of at least a portion of the textual representation by performing:
identifying at least one text segment, in a textual representation having a plurality of text segments, having at least one acoustically similar word and/or phrase;
annotating the textual representation with disambiguating information to help disambiguate the at least one text segment from the at least one acoustically similar word and/or phrase; and
synthesizing a speech signal, at least in part, by performing text-to-speech synthesis on at least a portion of the textual representation that includes the at least one text segment, wherein the speech signal includes speech corresponding to the disambiguating information located proximate the portion of the speech signal corresponding to the at least one text segment.
Dependent claims: 20, 21, 22, 23, 24, 25, 26, 27
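The component arrangement recited in claim 19 (input interface, conversion component, presentation component) might be wired together along the following lines. This is only a sketch under stated assumptions: the class names, the SOUND_ALIKES table, the spelled-out annotation, and the callables standing in for a speech recognizer and a text-to-speech engine are all hypothetical and do not come from the claims.

```python
# Hypothetical sound-alike table; a real system would need a fuller source.
SOUND_ALIKES = {"write": "right", "right": "write", "four": "for", "for": "four"}


class PresentationComponent:
    """Identifies sound-alike words, annotates them, and hands text to TTS."""

    def __init__(self, tts):
        # tts is a stand-in for a text-to-speech engine: str -> audio bytes.
        self.tts = tts

    def present(self, text):
        spoken = []
        for word in text.split():
            spoken.append(word)
            if word.lower() in SOUND_ALIKES:
                # Assumed form of disambiguation: spell the word out
                # immediately after it is spoken.
                spoken.append("(spelled " + " ".join(word.upper()) + ")")
        return self.tts(" ".join(spoken))


class System:
    """Input interface plus conversion and presentation components."""

    def __init__(self, convert, presenter):
        # convert is a stand-in for the conversion component
        # (for example a speech recognizer): raw data -> text.
        self.convert = convert
        self.presenter = presenter

    def handle_input(self, data):
        # Input interface: receive data, convert it, present it as audio.
        text = self.convert(data)
        return self.presenter.present(text)


# Usage with trivial stand-ins: "conversion" decodes bytes, "TTS" encodes text.
system = System(
    convert=lambda data: data.decode("utf-8"),
    presenter=PresentationComponent(tts=lambda text: text.encode("utf-8")),
)
print(system.handle_input(b"meet at four"))
# prints: b'meet at four (spelled F O U R)'
```

Passing the recognizer and the synthesizer in as callables keeps the sketch independent of any particular speech library; either stand-in could be replaced by a real engine without changing the component boundaries.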
Specification