Text-to-speech enriching system
First Claim
1. A computer-implemented method for producing an audible version of a document comprising:
- identifying a document including both a rich data portion of content, and an unstructured portion of content;
identifying, within the document, the rich data portion of content that corresponds to a pre-determined format, wherein the rich data portion of content is to be replaced with a summary of the content in the audible version of the document;
determining that the unstructured portion of the document includes alphanumeric text;
generating the summary of the rich data portion of content at a first level of detail based at least in part on a first portion of metadata associated with the rich data portion of content, wherein a second level of detail corresponds to a second portion of the metadata associated with the rich data portion of the content; and
audibly outputting the audible version of the document including both the alphanumeric text of the unstructured portion of the document and the generated summary, wherein the generated summary replaces the rich data portion content from the identified document in the output audible version of the document.
1 Assignment
0 Petitions
Accused Products
Abstract
Disclosed herein are system, method, and computer program product embodiments for a text-to-speech system. An embodiment operates by identifying a document including text, wherein the text includes both a structured portion of text, and an unstructured portion of text. Both the structured portion and unstructured portions of the text are identified within the document rich data, wherein the structured portion corresponds to a rich data portion that includes both a descriptor and content, and wherein an unstructured portion of the text includes alphanumeric text. A summary of the content of the rich data portion of the document at a specified level of detail is generated at a specified level of detail. An audible version of the document including both the text-only portion of the document and the summary of the content of the rich data portion of the document is output.
10 Citations
20 Claims
-
1. A computer-implemented method for producing an audible version of a document comprising:
-
identifying a document including both a rich data portion of content, and an unstructured portion of content; identifying, within the document, the rich data portion of content that corresponds to a pre-determined format, wherein the rich data portion of content is to be replaced with a summary of the content in the audible version of the document; determining that the unstructured portion of the document includes alphanumeric text; generating the summary of the rich data portion of content at a first level of detail based at least in part on a first portion of metadata associated with the rich data portion of content, wherein a second level of detail corresponds to a second portion of the metadata associated with the rich data portion of the content; and audibly outputting the audible version of the document including both the alphanumeric text of the unstructured portion of the document and the generated summary, wherein the generated summary replaces the rich data portion content from the identified document in the output audible version of the document. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A system comprising:
-
a memory; and at least one processor coupled to the memory and configured to perform operations comprising; identifying a document including both a rich data portion of content, and an unstructured portion of content; identifying, within the document, the rich data portion of content that corresponds to a pre-determined format, wherein the rich data portion of content is to be replaced with a summary of the content in the audible version of the document; determining that the unstructured portion of the document includes alphanumeric text; generating the summary of the rich data portion of content at a first level of detail based at least in part on a first portion of metadata associated with the rich data portion of content, wherein a second level of detail corresponds to a second portion of the metadata associated with the rich data portion of the content; and audibly outputting the audible version of the document including both the alphanumeric text of the unstructured portion of the document and the generated summary, wherein the generated summary replaces the rich data portion content from the identified document in the output audible version of the document. - View Dependent Claims (9, 10, 11, 12, 13, 14)
-
-
15. A non-transitory computer-readable device having instructions stored thereon that, when executed by at least one computing device, cause the at least one computing device to perform operations comprising:
-
identifying a document including both a rich data portion of content, and an unstructured portion of content; identifying, within the document, the rich data portion of content that corresponds to a pre-determined format, wherein the rich data portion of content is to be replaced with a summary of the content in the audible version of the document; determining that the unstructured portion of the document includes alphanumeric text;
generating the summary of the rich data portion of content at a first level of detail based at least in part on a first portion of metadata associated with the rich data portion of content, wherein a second level of detail corresponds to a second portion of the metadata associated with the rich data portion of the content; andaudibly outputting the audible version of the document including both the alphanumeric text of the unstructured portion of the document and the generated summary, wherein the generated summary replaces the rich data portion content from the identified document in the output audible version of the document. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification