×

Identifying topics in a digital work

  • US 9,613,003 B1
  • Filed: 03/28/2012
  • Issued: 04/04/2017
  • Est. Priority Date: 09/23/2011
  • Status: Active Grant
First Claim
Patent Images

1. One or more non-transitory computer-readable media maintaining instructions executable by one or more processors to perform operations comprising:

  • extracting text from a digital work;

    identifying a plurality of noun phrases from the text extracted from the digital work;

    searching a network accessible resource having a plurality of entries to identify a set of one or more entries in the network accessible resource that contain information related to at least one noun phrase of the plurality of noun phrases, wherein each noun phrase corresponding to an entry in the set of one or more entries is a candidate topic in a set of candidate topics;

    ranking the candidate topics based, at least in part, on at least one of a number of incoming links or a number of outgoing links between each of the entries corresponding to the candidate topics;

    excluding, from the set of candidate topics, one or more candidate topics ranked below a first threshold;

    comparing a first term frequency-inverse document frequency (tf-idf) value with a second tf-idf value, wherein the first tf-idf value is determined with respect to the digital work for each candidate topic remaining in the set of candidate topics, and wherein the second tf-idf value is determined for the candidate topics with respect to a corpus of works;

    excluding, from the set of candidate topics, one or more candidate topics for which a difference between the first tf-idf value and the second tf-idf value is less than a second threshold;

    generating a digital supplemental information file comprising at least one reference to supplemental information relating to at least one candidate topic remaining in the set of candidate topics;

    receiving a request for the digital supplemental information file from an electronic device; and

    transmitting the digital supplemental information file to the electronic device, the digital supplemental information file to cause the digital work to include at least one selectable portion that enables display of the at least one reference to supplemental information and a visual representation of at least a location in the digital work of each occurrence of the at least one candidate topic remaining in the set of candidate topics, wherein the visual representation comprises an object with markings corresponding to each occurrence.

View all claims
  • 1 Assignment
Timeline View
Assignment View
    ×
    ×