Aligning body matter across content formats

  • US 10,109,278 B2
  • Filed: 09/05/2012
  • Issued: 10/23/2018
  • Est. Priority Date: 08/02/2012
  • Status: Active Grant
First Claim

1. A system for aligning content, the system comprising:

  • an electronic data store configured to store:

    an electronic book comprising:

    a plurality of paragraphs of body text, and matter other than body text, wherein the matter other than body text comprises text within at least front matter and back matter; and

    an audiobook that is a companion to the electronic book; and

  • a physical computing device in communication with the electronic data store, the physical computing device configured to:

    generate a textual transcription of the audiobook by applying a speech-to-text recognition routine on the audiobook;

    identify a portion of the textual transcription that includes text also included in a paragraph of the electronic book;

    determine a level of correlation between words in the paragraph of the electronic book and words in the portion of the textual transcription;

    determine that the level of correlation satisfies a threshold value;

    in response to determining that the level of correlation satisfies the threshold value, identify the paragraph of the electronic book as body text;

    identify a first portion of the electronic book that does not satisfy the threshold value with respect to the textual transcription;

    determine that the first portion of the electronic book that does not satisfy the threshold value is front matter based at least in part on a determination that the first portion of the electronic book that does not satisfy the threshold value appears within the electronic book prior to an earliest portion of the electronic book for which a corresponding portion of the audiobook is identified;

    identify a second portion of the electronic book that does not satisfy the threshold value with respect to the textual transcription;

    determine that the second portion of the electronic book that does not satisfy the threshold value is back matter based at least in part on a determination that the second portion of the electronic book that does not satisfy the threshold value appears within the electronic book after a last portion of the electronic book for which a corresponding portion of the audiobook is identified; and

    generate content synchronization information that identifies (a) portions of the audiobook that correspond to the paragraphs of the body text and (b) further identifies the matter other than body text in the electronic book, wherein the content synchronization information indicates that the matter other than body text in the electronic book, including the first portion and second portion of the electronic book, does not correspond to any portion of the audiobook, wherein the content synchronization information indicates that the paragraph, excluding the matter other than body text, should be presented in synchronization with a portion of the audiobook from which the corresponding portion of the textual transcription was generated.
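
The steps recited in the claim can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not say how the "level of correlation" is computed, so the word-overlap ratio below, the 0.8 threshold, and the names `correlation` and `build_sync_info` are all assumptions made for illustration. The speech-to-text step is omitted; the transcription is assumed to arrive already split into portions.

```python
import re
from collections import Counter


def words(text):
    """Lowercase a string and split it into word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def correlation(paragraph, transcript_portion):
    """Assumed measure: fraction of the paragraph's words (counted with
    multiplicity) that also occur in the transcript portion."""
    para_counts = Counter(words(paragraph))
    portion_counts = Counter(words(transcript_portion))
    total = sum(para_counts.values())
    if total == 0:
        return 0.0
    shared = sum((para_counts & portion_counts).values())
    return shared / total


def build_sync_info(paragraphs, transcript_portions, threshold=0.8):
    """Label each e-book paragraph and record its matching audio portion.

    The threshold value is hypothetical; the claim only requires that
    some threshold be satisfied.
    """
    matches = []
    for para in paragraphs:
        best_index, best_score = None, 0.0
        for i, portion in enumerate(transcript_portions):
            score = correlation(para, portion)
            if score > best_score:
                best_index, best_score = i, score
        matched = best_index is not None and best_score >= threshold
        matches.append(best_index if matched else None)

    # Positions of the earliest and last paragraphs that have audio.
    body = [i for i, m in enumerate(matches) if m is not None]
    first, last = (body[0], body[-1]) if body else (None, None)

    sync_info = []
    for i, m in enumerate(matches):
        if m is not None:
            label = "body"
        elif first is not None and i < first:
            label = "front_matter"   # precedes the earliest matched paragraph
        elif last is not None and i > last:
            label = "back_matter"    # follows the last matched paragraph
        else:
            label = "unmatched"      # no audio, but not at either end
        sync_info.append({"paragraph": i, "label": label, "audio_portion": m})
    return sync_info
```

In this sketch, an entry whose `audio_portion` is `None` plays the role of the claim's indication that the matter "does not correspond to any portion of the audiobook", while entries labeled `body` carry the index of the transcript portion to present in synchronization.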
