Intelligent text-to-speech conversion
First Claim
1. A computer-implemented method of converting text to speech, the method comprising:
- selecting a document to be converted to speech, the selected document including base text and one or more links located within the base text;
parsing the selected document, wherein the parsing comprises;
resolving at least one of the one or more links in the selected document; and
retrieving pre-existing text from one or more documents obtained by said resolving;
appending at least a portion of the retrieved pre-existing text to the base text;
generating speech by converting to speech the base text and the portion of the retrieved pre-existing text appended to the base text; and
creating an audio file based on the converted text, wherein the audio file includes at least one audio cue configured to be beneficial to visually impaired listeners.
1 Assignment
0 Petitions
Accused Products
Abstract
Techniques for improved text-to-speech processing are disclosed. The improved text-to-speech processing can convert text from an electronic document into an audio output that includes speech associated with the text as well as audio contextual cues. One aspect provides audio contextual cues to the listener when outputting speech (spoken text) pertaining to a document. The audio contextual cues can be based on an analysis of a document prior to a text-to-speech conversion. Another aspect can produce an audio summary for a file. The audio summary for a document can thereafter be presented to a user so that the user can hear a summary of the document without having to process the document to produce its spoken text via text-to-speech conversion.
887 Citations
18 Claims
-
1. A computer-implemented method of converting text to speech, the method comprising:
-
selecting a document to be converted to speech, the selected document including base text and one or more links located within the base text; parsing the selected document, wherein the parsing comprises; resolving at least one of the one or more links in the selected document; and retrieving pre-existing text from one or more documents obtained by said resolving; appending at least a portion of the retrieved pre-existing text to the base text; generating speech by converting to speech the base text and the portion of the retrieved pre-existing text appended to the base text; and creating an audio file based on the converted text, wherein the audio file includes at least one audio cue configured to be beneficial to visually impaired listeners. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A computer-implemented method of generating an audio summary for a document, the method comprising:
-
parsing a document to extract metadata from the document; generating an audio summary for the parsed document based on the extracted metadata; and associating the audio summary with the parsed document, wherein the associating the audio summary to the parsed document includes at least embedding the audio summary into the parsed document. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A computer-implemented method of generating an audio summary for a document, the method comprising:
-
parsing a document to extract metadata from the document; generating an audio summary for the parsed document based on the extracted metadata; and associating the audio summary with the parsed document by creating a software pointer from the parsed document to the audio summary, and embedding the software pointer into the parsed document. - View Dependent Claims (16)
-
-
17. A non-transitory computer readable storage medium including at least computer program code for converting text to speech, comprising:
-
computer program code for selecting a document to be converted to speech, the selected document including base text and one or more links located within the base text; computer program code for parsing the selected document, wherein the computer program code for parsing comprises; computer program code for resolving at least one of the one or more links in the selected document; and computer program code for retrieving pre-existing text from one or more documents obtained by the said resolving; computer program code for appending at least a portion of the retrieved pre-existing text to the base text; computer program code for generating speech by converting to speech the base text and the portion of the retrieved pre-existing text appended to the base text; and computer program code for creating an audio file based on the converted text, wherein the audio file includes at least one audio cue configured to be beneficial to visually impaired listeners. - View Dependent Claims (18)
-
Specification