SYSTEM AND METHOD FOR IDENTIFYING AND VISUALISING TOPICS AND THEMES IN COLLECTIONS OF DOCUMENTS
First Claim
1. A method for estimating and visualising a plurality of topics in a collection of documents, wherein the collection of documents comprises a plurality of words and each document comprises one or more of the plurality of words, the method comprising:
- performing two rounds of topic modelling to the collection of documents, wherein the first round of topic modelling estimates a plurality of topics associated with the collection of documents and each topic comprises one or more words, and the second round identifies a plurality of themes associated with the topics, wherein each theme comprises one or more topics; and
visually representing the topics and themes to a user.
1 Assignment
0 Petitions
Accused Products
Abstract
Method and systems for estimating and visualising a plurality of topics in a collection of documents, wherein the collection of documents comprises a plurality of words and each document comprises one or more of the plurality of words, the method comprising: performing two rounds of topic modelling to the collection of documents, wherein the first round of topic modelling estimates a plurality of topics associated with the collection of documents and each topic comprises one or more words, and the second round identifies a plurality of themes associated with the topics, wherein each theme comprises one or more topics; and visually representing the topics and themes to a user.
61 Citations
9 Claims
-
1. A method for estimating and visualising a plurality of topics in a collection of documents, wherein the collection of documents comprises a plurality of words and each document comprises one or more of the plurality of words, the method comprising:
-
performing two rounds of topic modelling to the collection of documents, wherein the first round of topic modelling estimates a plurality of topics associated with the collection of documents and each topic comprises one or more words, and the second round identifies a plurality of themes associated with the topics, wherein each theme comprises one or more topics; and visually representing the topics and themes to a user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
Specification