Community audio narration generation
First Claim
Patent Images
1. One or more computer readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
- selecting a text-based work that includes at least one content section without a corresponding audio reading;
presenting the text-based work to a plurality of human readers to solicit an audio reading of the at least one content section of the text-based work;
obtaining a group of audio recordings from the plurality of human readers, each audio recording having metadata that identifies a respective location within a corresponding content section of the text-based work;
combining the group of audio recordings in order using the respective location identified by the metadata of the audio recordings to produce an audio file that includes the audio reading for at least the content section of the text-based work; and
distributing an integrated product that includes a copy of the text-based work and a copy of the audio file to an electronic device.
1 Assignment
0 Petitions
Accused Products
Abstract
The community-based generation of audio narrations for a text-based work leverages collaboration of a community of people to provide human-voiced audio readings. During the community-based generation, a collection of audio recordings for the text-based work may be collected from multiple human readers in a community. An audio recording for each section in the text-based work may be selected from the collection of audio recordings. The selected audio recordings may be then combined to produce an audio reading of at least a portion of the text-based work.
-
Citations
30 Claims
-
1. One or more computer readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
-
selecting a text-based work that includes at least one content section without a corresponding audio reading; presenting the text-based work to a plurality of human readers to solicit an audio reading of the at least one content section of the text-based work; obtaining a group of audio recordings from the plurality of human readers, each audio recording having metadata that identifies a respective location within a corresponding content section of the text-based work; combining the group of audio recordings in order using the respective location identified by the metadata of the audio recordings to produce an audio file that includes the audio reading for at least the content section of the text-based work; and distributing an integrated product that includes a copy of the text-based work and a copy of the audio file to an electronic device. - View Dependent Claims (2, 3, 4)
-
-
5. A computer implemented method, comprising:
-
receiving a group of audio recordings from a plurality of human readers for storage on a server, individual ones of the group of audio recordings including metadata that provides identification information and identifies a respective location within a corresponding section of a text-based work; identifying a set of audio recordings from the group of audio recordings as corresponding to the text-based work based at least on the metadata; and combining the set of audio recordings to produce an audio reading including at least one audio file for at least a portion of the text-based work by digitally splicing the set of audio recordings in an order based at least in part on the respective location identified by the metadata of the set of audio recordings. - View Dependent Claims (6, 7, 8, 9, 10, 11, 12, 13, 14, 15)
-
-
16. A server, comprising:
-
a processor; and memory storing components executable by the processor, the components comprising; a content presentation component that presents a text-based work that includes a content section without a corresponding audio reading to solicit an audio reading of the content section; an audio collection component to receive the audio reading of the content section from a human reader, the audio reading of the content section including metadata that identifies the audio reading as corresponding to the content section and that identifies a location within the content section; and an integration component to digitally splice the audio reading with an additional audio reading of another content section of the text-based work in response to determining, based at least on the metadata and additional metadata that is associated with the additional audio reading, that the audio reading and the additional reading are related, wherein an order in which the audio reading is digitally spliced with the additional audio reading is based at least in part on the location identified by the metadata. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
-
23. One or more computer readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
-
receiving an audio reading for a content section of a text-based work from a human reader, the audio reading including metadata identifying a location within the content section; determining whether spoken words in the audio reading match at least a threshold amount of text in the content section based at least in part on a speech-to-text analysis of at least a portion of the audio reading; storing the audio reading in a data store when the spoken words at least match the threshold amount of the text in the content section; prompting the human reader to submit a repeat audio reading of at least a portion of the content section when the spoken words fail to match at least the threshold amount of the text in the content section; and combining the audio reading with at least one additional audio reading, wherein an order in which the audio reading is combined with the at least one additional audio reading is based at least in part on the location identified by the metadata. - View Dependent Claims (24, 25, 26, 27, 28, 29, 30)
-
Specification