DETERMINING CONCEPT BLOCKS BASED ON CONTEXT
First Claim
1. A method of generating a set of concept blocks, the method comprising:
- accessing a corpus of documents;
generating a plurality of target words from the corpus;
determining context strings for the target words, wherein the context strings include words that are adjacent to the target words;
obtaining pattern types, wherein the pattern types are based on number of words and position of words relative to the target words;
assigning weights to each of the context strings, such that a weight of a context string having a particular
1 Assignment
0 Petitions
Accused Products
Abstract
A method for generating a set of concept blocks is presented, wherein the concept blocks are words in a corpus of documents that can be processed to extract trends, build an efficient inverted search index, or generate a summary report of the content. The method entails generating a plurality of target words from the corpus, determining context strings for the target words, obtaining pattern types that are based on number of words and position of words relative to the target words, and assigning weights to each of the context strings having a particular pattern type. The target words are then expressed as vectors that reflect the weights of the context strings. The vectors are compared and grouped into clusters based on similarity. Target words in the resulting clusters are concept blocks. A subgroup of clusters may be selected for another iteration of the process to catch new concept blocks.
20 Citations
25 Claims
-
1. A method of generating a set of concept blocks, the method comprising:
-
accessing a corpus of documents; generating a plurality of target words from the corpus; determining context strings for the target words, wherein the context strings include words that are adjacent to the target words; obtaining pattern types, wherein the pattern types are based on number of words and position of words relative to the target words; assigning weights to each of the context strings, such that a weight of a context string having a particular - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19)
-
-
20. A method of generating a summary of discussions in SNS posts, comprising:
-
obtaining a categorically arranged set of keywords from a data storage; identifying occurrences of the keywords in posts that are associated with an SNS user account; and arranging the identified keywords according to their categories to generate a summary report including subjects of discussion and members of the subjects. - View Dependent Claims (21, 22, 23, 24)
-
-
25. A method of generating a set of keywords, the method comprising:
-
accessing a corpus of documents; generating target words from the corpus of documents; determining context strings for the target words, wherein the context strings include words that are adjacent to the target words; obtaining pattern types, wherein the pattern types are based on number of words and position of words relative to the target words; assigning weights to each of the context strings, such that a weight of a context string having a particular
-
Specification