×

Identifying book title sets

  • US 9,881,009 B1
  • Filed: 03/15/2011
  • Issued: 01/30/2018
  • Est. Priority Date: 03/15/2011
  • Status: Active Grant
First Claim
Patent Images

1. A computer-implemented method, comprising:

  • under control of one or more processors configured with executable instructions,receiving, from a device of an author and via a content ingestion service associated with a network, an electronic book having first body text and first metadata;

    normalizing the electronic book by removing illustrations from the electronic book, removing extraneous characters from the electronic book, and converting characters of the electronic book to a single case;

    determining, in response to the normalizing of the electronic book, whether the first metadata of the electronic book matches metadata of any existing book title sets;

    based at least partly on a first determination that the first metadata of the electronic book matches second metadata of no more than a single existing book title set that includes at least one book, adding the electronic book to the single existing book title set such that the single existing book title set includes the at least one book and the electronic book;

    based at least partly on a second determination that the first metadata of the electronic book matches third metadata of multiple existing book title sets, calculating a text matching score corresponding to individual ones of the existing book title sets, the text matching score indicating a comparison of a first frequency of one or more words included in the first body text of the electronic book and a second frequency of the one or more words included in second body text of the corresponding existing book title set; and

    adding the electronic book to an existing book title set of the multiple existing book title sets based at least partly on the text matching score corresponding to the existing book title set being greater than a specified threshold, the existing book title set including the electronic book and one or more other books.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×