IDENTIFYING CORRESPONDING REGIONS OF CONTENT
First Claim
1. A system comprising:
- an electronic data store configured to store;
an audiobook; and
an electronic book that is a companion to the audiobook;
a computing device in communication with the electronic data store, the computing device configured to;
generate a textual transcription of the audiobook;
identify, with respect to the textual transcription, an uncertain region in the electronic book;
identify a region of the audiobook that is preliminarily aligned to the uncertain region;
apply a language model to the region of the audiobook to generate an updated textual transcription;
determine that a portion of the updated textual transcription substantially corresponds to a portion of the electronic book including the uncertain region; and
generate content synchronization information, wherein the content synchronization information facilitates the synchronous presentation of the portion of the electronic book including the uncertain region and a portion of the audiobook from which the substantially corresponding portion of the updated textual transcription was generated.
1 Assignment
0 Petitions
Accused Products
Abstract
A content alignment service may generate content synchronization information to facilitate the synchronous presentation of audio content and textual content. In some embodiments, a region of the textual content whose correspondence to the audio content is uncertain may be analyzed to determine whether the region of textual content corresponds to one or more words that are audibly presented in the audio content, or whether the region of textual content is a mismatch with respect to the audio content. In some embodiments, words in the textual content that correspond to words in the audio content are synchronously presented, while mismatched words in the textual content may be skipped to maintain synchronous presentation. Accordingly, in one example application, an audiobook is synchronized with an electronic book, so that as the electronic book is displayed, corresponding words of the audiobook are audibly presented.
69 Citations
29 Claims
-
1. A system comprising:
-
an electronic data store configured to store; an audiobook; and an electronic book that is a companion to the audiobook; a computing device in communication with the electronic data store, the computing device configured to; generate a textual transcription of the audiobook; identify, with respect to the textual transcription, an uncertain region in the electronic book; identify a region of the audiobook that is preliminarily aligned to the uncertain region; apply a language model to the region of the audiobook to generate an updated textual transcription; determine that a portion of the updated textual transcription substantially corresponds to a portion of the electronic book including the uncertain region; and generate content synchronization information, wherein the content synchronization information facilitates the synchronous presentation of the portion of the electronic book including the uncertain region and a portion of the audiobook from which the substantially corresponding portion of the updated textual transcription was generated. - View Dependent Claims (2, 3, 4)
-
-
5. A computer-implemented method comprising:
as implemented by one or more computing devices configured with specific computer-executable instructions, identifying an uncertain region in an item of textual content; identifying a region of a companion item of audio content that is preliminarily aligned to the uncertain region; generating a textual transcription of the region of the item of audio content; determining that a portion of the textual transcription substantially corresponds to a portion of the item of textual content to which the uncertain region belongs; and generating content synchronization information for synchronizing presentation of the portion of the item of textual content and a portion of the item of audio content from which the substantially corresponding portion of the updated textual transcription was generated. - View Dependent Claims (6, 7, 8, 9, 10)
-
11. A system for synchronizing presentation of an item of audio content to a companion item of textual content, the system comprising:
-
an electronic data store configured to store content synchronization information; and a computing device in communication with the electronic data store, the computing device being configured to; identify, with respect to a textual transcription of the item of audio content, an uncertain region in the companion item of textual content, the uncertain region comprising one or more words; identify a region of the textual transcription that is preliminarily aligned to the uncertain region, the region of the textual transcription comprising one or more words; convert the one or more words of the uncertain region to a first phoneme string; convert the one or more words of the region of the textual transcription to a second phoneme string; and if the first phoneme string substantially corresponds to the second phoneme string, generate content synchronization information that facilitates the synchronous presentation of the one or more words of the uncertain region in the companion item of textual content and the one or more words of the item of audio content from which the region of the textual transcription was generated. - View Dependent Claims (12, 13, 14, 15)
-
-
16. A non-transitory computer-readable medium having a computer-executable module, the computer-executable module configured to:
-
identify an uncertain region in an item of textual content, the uncertain region comprising one or more words; identify a region of the item of audio content that is preliminarily aligned to the uncertain region; generate a textual transcription of the region of the item of audio content, the textual transcription comprising one or more words; determine whether the one or more words of the uncertain region substantially correspond to the one or more words of the textual transcription of the preliminarily aligned region of the item of audio content; and if the one or more words of the uncertain region substantially correspond to the one or more words of the updated textual transcription of the preliminarily aligned region of audio content, generate content synchronization information; wherein the content synchronization information facilitates the synchronous presentation of the uncertain region in the item of textual content and the preliminarily aligned region of the item of audio content. - View Dependent Claims (17, 18, 19, 20, 21, 22)
-
-
23. A computer-implemented method comprising:
as implemented by one or more computing devices configured with specific computer-executable instructions, identifying an uncertain region in an item of textual content; identifying a region of the item of audio content that is preliminarily aligned to the uncertain region; generating a textual transcription of the region of the item of audio content; identifying a significant corresponding word, wherein the significant corresponding word appears in both the textual transcription and the item of textual content; and generating content synchronization information, wherein the content synchronization information facilitates the synchronous presentation of the significant corresponding word in both the item of textual content and the item of audio content. - View Dependent Claims (24, 25, 26, 27, 28, 29)
Specification