Aligning content items to identify differences
Abstract
Techniques for aligning content items with one another are described herein. These techniques may align different versions of a same content item for the purpose of identifying differences between the versions, identifying commonalities between the versions, mapping annotations made in one version to the other version, aggregating annotations across the different versions, or for any other reason. The content items may include electronic books, songs, videos, documents, or the like.
118 Citations
28 Claims
1. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed, cause one or more processors to perform acts comprising:
determining an overall similarity between a first electronic book and a second electronic book;
determining that the overall similarity is less than a first overall similarity threshold and greater than a second overall similarity threshold;
selecting a first string of words of the first electronic book and a second string of words of the second electronic book;
comparing the first string of words to the second string of words;
determining that a similarity in alignment between the first string of words and the second string of words is greater than an alignment threshold based at least in part on the comparing;
selecting a third string of words of the first electronic book at a predetermined location relative to the first string of words;
identifying a fourth string of words of the second electronic book at the predetermined location relative to the second string of words;
determining a similarity metric between the third string of words and the fourth string of words; and
performing word-to-word matching between the third string of words and the fourth string of words, based at least in part on the similarity metric being greater than a predetermined threshold.
Dependent claims: 2, 3, 4, 5, 6, 7
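
The acts recited in claim 1 can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and does not come from the patent: the use of Python's `difflib.SequenceMatcher` as the similarity measure, the function names `similarity` and `align_books`, the threshold values, and the choice of the opening words as the first and second strings with the "predetermined location" taken to be the window immediately following them.

```python
from difflib import SequenceMatcher


def similarity(a_words, b_words):
    """Return a ratio in [0, 1] describing how closely two word lists match."""
    return SequenceMatcher(None, a_words, b_words, autojunk=False).ratio()


def align_books(book_a, book_b, *, upper=0.98, lower=0.30,
                align_threshold=0.8, match_threshold=0.6, window=5):
    """Return (index_in_a, index_in_b) word pairs, or [] if any gate fails.

    All thresholds and the window size are hypothetical values.
    """
    words_a, words_b = book_a.split(), book_b.split()

    # Overall similarity must fall between the second (lower) and
    # first (upper) overall similarity thresholds.
    overall = similarity(words_a, words_b)
    if not (lower < overall < upper):
        return []

    # First and second strings of words: here, the opening window of each book.
    first, second = words_a[:window], words_b[:window]
    if similarity(first, second) <= align_threshold:
        return []

    # Third and fourth strings: the window at the same relative location
    # (immediately after the first and second strings).
    third = words_a[window:2 * window]
    fourth = words_b[window:2 * window]
    if similarity(third, fourth) <= match_threshold:
        return []

    # Word-to-word matching between the third and fourth strings.
    matcher = SequenceMatcher(None, third, fourth, autojunk=False)
    pairs = []
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            pairs.append((window + block.a + k, window + block.b + k))
    return pairs


book_a = "the quick brown fox jumps over the lazy dog and then runs far away today"
book_b = "the quick brown fox jumps over the sleepy dog and then runs far away now"
print(align_books(book_a, book_b))
```

In this sketch, identical books are skipped because their overall similarity exceeds the upper threshold, and unrelated books are skipped because it falls below the lower one; only the band in between reaches the word-level comparison.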
8. A method comprising:
under control of one or more computer systems configured with specific executable instructions,
associating a first string of words of a first version of an electronic book with a first position within the first version of the electronic book;
identifying a second string of words of a second version of the electronic book that correspond to the first string of words of the first version of the electronic book;
associating the second string of words of the second version of the electronic book with a second position within the second version of the electronic book, wherein the first version of the electronic book is in a same electronic book format as the second version of the electronic book;
aligning the first string of words of the first version of the electronic book with the second position of the second string of words of the second version of the electronic book; and
mapping the second string of words of the second version of the electronic book to the first position within the first version of the electronic book based at least in part on the aligning.
Dependent claims: 9, 10, 11, 12, 13, 14, 15, 16
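
The position mapping in claim 8 can be pictured as a lookup from word positions in the second version to word positions in the first. The sketch below is an illustration only, under stated assumptions: positions are treated as word indices, `difflib.SequenceMatcher` stands in for whatever alignment the patent contemplates, and `map_positions` is a hypothetical name.

```python
from difflib import SequenceMatcher


def map_positions(v1_words, v2_words):
    """Map each aligned word index in version 2 to its index in version 1."""
    matcher = SequenceMatcher(None, v1_words, v2_words, autojunk=False)
    mapping = {}
    for block in matcher.get_matching_blocks():
        # Each block is a run of identical words: v1[a:a+size] == v2[b:b+size].
        for k in range(block.size):
            mapping[block.b + k] = block.a + k
    return mapping


v1 = "call me ishmael some years ago".split()
v2 = "preface call me ishmael some years ago".split()
mapping = map_positions(v1, v2)
print(mapping)
```

Words that exist only in the second version (such as the added "preface" above) have no entry in the mapping, which is one simple way to represent text with no counterpart in the first version.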
17. A method comprising:
under control of one or more computer systems configured with specific executable instructions,
calculating an overall similarity between a first content item and a second content item;
aligning the first content item with the second content item based at least in part on the overall similarity being greater than a predetermined threshold;
identifying an annotation that has been associated with a particular position within the first content item by a user that previously accessed the first content item;
determining a position in the second content item that corresponds to the particular position within the first content item at least partly based on the aligning; and
associating the annotation with the position in the second content item at least partly based on the determining.
Dependent claims: 18, 19
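
As a rough illustration of claim 17, the sketch below carries annotations from one content item to another once the two are similar enough to align. The assumptions are mine rather than the patent's: annotations are modeled as a dictionary keyed by word index, the threshold of 0.5 is arbitrary, `difflib.SequenceMatcher` is a stand-in similarity and alignment measure, and `transfer_annotations` is a hypothetical name.

```python
from difflib import SequenceMatcher


def transfer_annotations(src_words, dst_words, annotations, threshold=0.5):
    """Return {dst_index: note} for annotations whose source word is aligned.

    `annotations` maps a word index in the source item to a user's note.
    """
    matcher = SequenceMatcher(None, src_words, dst_words, autojunk=False)

    # Align only when the overall similarity exceeds the threshold.
    if matcher.ratio() <= threshold:
        return {}

    # Build the source-to-destination position map from the matching runs.
    src_to_dst = {}
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            src_to_dst[block.a + k] = block.b + k

    # Re-associate each annotation with the corresponding destination position.
    return {src_to_dst[pos]: note
            for pos, note in annotations.items()
            if pos in src_to_dst}


src = "it was the best of times".split()
dst = "chapter one it was the best of times".split()
notes = {3: "famous opening"}  # annotation on the word "best"
moved = transfer_annotations(src, dst, notes)
print(moved)
```

An annotation whose source position has no aligned counterpart is dropped in this sketch; a fuller treatment might attach it to the nearest aligned position instead.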
20. A system comprising:
one or more processors;
memory;
one or more modules, stored in the memory and executable on the one or more processors to:
determine an overall similarity between a first electronic book and a second electronic book;
select a string of words of the first electronic book and a string of words of the second electronic book based at least in part on the overall similarity being less than a first overall similarity threshold and greater than a second overall similarity threshold;
determine a similarity in alignment for the string of words of the first electronic book and the string of words of the second electronic book;
at least partly in response to the similarity in alignment being greater than an alignment threshold, determine a similarity metric for words subsequent to the string of words of the first electronic book and words subsequent to the string of words of the second electronic book; and
performing word-to-word matching between the words subsequent to the string of words of the first electronic book and the words subsequent to the string of words of the second electronic book, based at least in part on the similarity metric being greater than a predetermined threshold.
Dependent claims: 21, 22, 23, 24, 25, 26, 27, 28
Specification