Methods and systems for analyzing reading logs and documents thereof

  • US 10,467,255 B2
  • Filed: 12/29/2015
  • Issued: 11/05/2019
  • Est. Priority Date: 12/11/2015
  • Status: Active Grant
First Claim

1. A method for analyzing reading logs and documents corresponding thereto, comprising:

  • acquiring reading logs related to webpages and documents corresponding thereto, wherein the reading logs at least include reading-related information about the documents within a predetermined period of time and the reading-related information at least includes an interesting reading time and a number of interesting readings;

    selecting a plurality of interesting document sets from the documents in each time segment of the predetermined period of time according to the interesting reading times and the number of interesting readings of the documents in the reading logs, each of the interesting document sets corresponding to one of the time segments of the predetermined period of time;

    performing a document content pre-processing on the interesting document sets to determine keyword sets corresponding to the interesting document sets;

    performing a cluster calculation on the keyword sets to obtain topics and calculating cohesion of each topic;

    deleting topics with insufficient cohesion among the topics obtained to obtain a plurality of high-relevance topics and classifying each high-relevance topic into one of a plurality of predetermined topic classes by comparing the respective keyword sets of the high-relevance topics with a plurality of keyword sets of the predetermined topic classes;

    obtaining reading statistics for documents of each predetermined topic class and calculating a plurality of degrees of interest for documents of each predetermined topic class during each time segment; and

    determining a reading trend on each predetermined topic class according to changes in the degrees of interest, wherein the document content pre-processing step further comprises performing the following steps on each document of the interesting document sets:

    obtaining a plurality of keywords;

    paragraphing the document and calculating a frequency at which the keywords appear in each paragraph to calculate a plurality of importance-weightings corresponding to all of the paragraphs and determining at least one key paragraph according to the importance-weightings; and

    generating the set of keywords for the document based on the keywords within the at least one key paragraph.
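
As an illustration only, three of the claimed steps (selecting interesting document sets per time segment, deriving a keyword set from key paragraphs, and reading a trend from changes in degree of interest) might be sketched in Python as follows. The claim does not specify thresholds, a weighting formula, or trend labels, so everything here is an assumption made for the sketch: the function names, the `min_seconds` and `min_reads` thresholds, the weighting (keyword occurrences divided by paragraph length), the restriction to single-word keywords, and the "rising"/"falling"/"flat" labels.

```python
import re
from collections import Counter


def select_interesting(log_entries, min_seconds, min_reads):
    """Group documents into one interesting set per time segment.

    log_entries: iterable of (segment, doc_id, reading_seconds, read_count).
    The two thresholds are hypothetical; the claim only says selection
    depends on reading time and number of readings.
    """
    sets = {}
    for segment, doc_id, seconds, reads in log_entries:
        if seconds >= min_seconds and reads >= min_reads:
            sets.setdefault(segment, set()).add(doc_id)
    return sets


def key_paragraph_keywords(document, keywords, top_k=1):
    """Return the keywords that occur in the top_k highest-weighted paragraphs.

    Paragraphs are split on blank lines. The importance weighting used here
    (keyword occurrences / paragraph token count) is an assumed formula.
    Only single-word keywords are matched.
    """
    paragraphs = [p for p in re.split(r"\n\s*\n", document) if p.strip()]
    vocab = {k.lower() for k in keywords}
    scored = []
    for paragraph in paragraphs:
        tokens = re.findall(r"[a-z0-9]+", paragraph.lower())
        counts = Counter(t for t in tokens if t in vocab)
        weight = sum(counts.values()) / len(tokens) if tokens else 0.0
        scored.append((weight, counts))
    # Highest weight first; the sort is stable, so ties keep document order.
    scored.sort(key=lambda item: item[0], reverse=True)
    result = set()
    for _weight, counts in scored[:top_k]:
        result.update(counts)
    return result


def reading_trend(degrees_of_interest, tolerance=0.0):
    """Label a topic class from its per-segment degrees of interest.

    Compares the last segment with the first; a fuller version might fit
    a slope across all segments instead.
    """
    if len(degrees_of_interest) < 2:
        return "flat"
    change = degrees_of_interest[-1] - degrees_of_interest[0]
    if change > tolerance:
        return "rising"
    if change < -tolerance:
        return "falling"
    return "flat"
```

For example, `key_paragraph_keywords(text, ["solar", "energy", "cats"])` keeps only the keywords found in the paragraph where those keywords are densest, which mirrors the claim's use of a key paragraph to narrow a document's keyword set before clustering. The cluster calculation, cohesion filtering, and topic classification steps are not sketched here.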

  • 1 Assignment