System and method for determining originality of data content
First Claim
1. A computer-implemented method for providing a recommendation for an item based on originality of content of similar items, the computer-implemented method comprising:
- as implemented by one or more computing devices configured with specific executable instructions,for at least one item in a plurality of items, determining a subset of items in the plurality of items that are similar to the at least one item;
for each item of the subset of items determined to be similar to the at least one item,generating an originality score for a pairing including the item in the subset of similar items and the at least one item, the originality score indicating a degree to which content of the item in the subset of similar items is diverse from content of the at least one item, and the originality score being generated based at least in part on a comparison of the content of the item in the subset of similar items with the content of the at least one item;
storing the generated originality scores in a diversity matrix;
receiving an indication of a selection of an item of interest, the item of interest being one of the items in the plurality of items for which originality scores have been generated with respect to a plurality of items determined to be similar to the item of interest;
for each item in the plurality of items determined to be similar to the item of interest, obtaining, from the diversity matrix, a generated originality score for a pairing including the item in the plurality of items determined to be similar to the item of interest and the item of interest; and
selecting, based on the obtained originality scores, an item within the plurality of items that are similar to the item of interest as a recommended item, the selected item having content that is most diverse, among items in the plurality of items determined to be similar to the item of interest, from content of the item of interest.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention provides systems and methods for determining the originality of data content. In one embodiment, the determined originality of a particular item (e.g., a book) as compared to one or more other items can be used as a factor in recommending the item to a user. For example, in one embodiment, upon a user'"'"'s selection of an item (e.g., a book), one or more items that have content most diverse from the selected item are determined and provided to the user. In another embodiment, various versions of an item are compared to each other to determine how content in each version differs from that in another version. In another embodiment, content in a collection of items are compared against content from publicly (freely) available sources (e.g., web pages) to determine the originality of the content in the collection of items.
32 Citations
19 Claims
-
1. A computer-implemented method for providing a recommendation for an item based on originality of content of similar items, the computer-implemented method comprising:
as implemented by one or more computing devices configured with specific executable instructions, for at least one item in a plurality of items, determining a subset of items in the plurality of items that are similar to the at least one item; for each item of the subset of items determined to be similar to the at least one item, generating an originality score for a pairing including the item in the subset of similar items and the at least one item, the originality score indicating a degree to which content of the item in the subset of similar items is diverse from content of the at least one item, and the originality score being generated based at least in part on a comparison of the content of the item in the subset of similar items with the content of the at least one item; storing the generated originality scores in a diversity matrix; receiving an indication of a selection of an item of interest, the item of interest being one of the items in the plurality of items for which originality scores have been generated with respect to a plurality of items determined to be similar to the item of interest; for each item in the plurality of items determined to be similar to the item of interest, obtaining, from the diversity matrix, a generated originality score for a pairing including the item in the plurality of items determined to be similar to the item of interest and the item of interest; and selecting, based on the obtained originality scores, an item within the plurality of items that are similar to the item of interest as a recommended item, the selected item having content that is most diverse, among items in the plurality of items determined to be similar to the item of interest, from content of the item of interest. - View Dependent Claims (2, 3)
-
4. A computer-readable medium having a computer-executable component for determining originality of data content, the computer-executable component comprising:
-
an original content determination component for; for at least one item in a plurality of items, determining a subset of items in the plurality of items that are similar to the at least one item; for each item of the subset of items determined to be similar to the at least one item, generating an originality score for a pairing including the item in the subset of similar items and the at least one item, the originality score indicating a degree to which content of the item in the subset of similar items is diverse from content of the at least one item, and the originality score being generated based at least in part on a comparison of the content of the item in the subset of similar items with the content of the at least one item; storing the generated originality scores; receiving an indication of a selection of an item of interest, the item of interest being an item in the plurality of items for which originality scores have been generated with respect to a subset of the plurality of items determined to be similar to the item of interest; for each item in the subset of items determined to be similar to the item of interest, obtaining, from among the stored originality scores, an originality score for a pairing including the item in the subset of items determined to be similar to the item of interest and the item of interest; and selecting an item within the subset of items determined to be similar to the item of interest as a recommended item, the selecting based at least in part on the obtained originality scores. - View Dependent Claims (5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16)
-
-
17. A system for providing a recommendation for an item based on originality of content of similar items, the system comprising:
-
a data store that stores data relating to a plurality of items; and a computing device, comprising one or more processors, in communication with the data store that is configured to; for at least one item in the plurality of items, determine a subset of items in the plurality of items that are similar to the at least one item; for each item of the subset of items determined to be similar to the at least one item, generate an originality score for a pairing including the item in the subset of similar items and the at least one item, the originality score indicating a degree to which content of the item in the subset of similar items is diverse from content of the at least one item, and the originality score being generated based at least in part on a comparison of the content of the item in the subset of similar items with the content of the at least one item; store the generated originality scores in a diversity matrix; receive an indication of a selection of an item of interest, the item of interest being one of the items in the plurality of items for which originality scores have been generated with respect to a plurality of items determined to be similar to the item of interest; for each item in the plurality of items determined to be similar to the item of interest, obtain, from the diversity matrix, a generated originality score for a pairing including the item in the plurality of items determined to be similar to the item of interest and the item of interest; and select, based on the obtained originality scores, an item within the plurality of items that are similar to the item of interest as a recommended item, the selected item having content that is most diverse, among items in the plurality of items determined to be similar to the item of interest, from content of the item of interest. - View Dependent Claims (18, 19)
-
Specification