Method and apparatus for identifiying similar questions in a consultation system
First Claim
Patent Images
1. A method comprising:
- receiving at an online consultation system a question associated with a predetermined category;
extracting a plurality of candidate topics from the received question;
applying a first term frequency inverse document frequency (TFIDF) filter to the extracted plurality of candidate topics to identify a first sorted list of the plurality of extracted candidate topics according to an affinity of the candidate topics to the predetermined category;
applying a second TFIDF filter to the first sorted list to identify a second sorted list of the plurality of extracted candidate topics according to an affinity of the candidate topics to the received question;
identifying as similar questions a plurality of previously submitted questions having topics matching the second sorted list of the plurality of extracted candidate topics;
creating excerpts of the identified similar questions by selecting one or more sections of each of the identified similar questions;
presenting to one or more users via the online consultation system the excerpts of the identified plurality of previously submitted questionswhile the user is waiting for an answer to the question.
1 Assignment
0 Petitions
Accused Products
Abstract
Embodiments of the present invention further provide systems and methods for automatically identifying questions on topics similar to a newly submitted question to an online the consultation system.
-
Citations
10 Claims
-
1. A method comprising:
-
receiving at an online consultation system a question associated with a predetermined category; extracting a plurality of candidate topics from the received question; applying a first term frequency inverse document frequency (TFIDF) filter to the extracted plurality of candidate topics to identify a first sorted list of the plurality of extracted candidate topics according to an affinity of the candidate topics to the predetermined category; applying a second TFIDF filter to the first sorted list to identify a second sorted list of the plurality of extracted candidate topics according to an affinity of the candidate topics to the received question; identifying as similar questions a plurality of previously submitted questions having topics matching the second sorted list of the plurality of extracted candidate topics; creating excerpts of the identified similar questions by selecting one or more sections of each of the identified similar questions; presenting to one or more users via the online consultation system the excerpts of the identified plurality of previously submitted questions while the user is waiting for an answer to the question. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A topic extraction apparatus in an online consultation system, the apparatus comprising a plurality of engines for:
-
receiving at an online consultation system a question associated with a predetermined category; extracting a plurality of candidate topics from the received question; applying a first term frequency inverse document frequency (TFIDF) filter to the extracted plurality of candidate topics to identify a first sorted list of the plurality of extracted candidate topics according to an affinity to the predetermined category; applying a second TFIDF filter to the first sorted list to identify a second sorted list of the plurality of extracted candidate topics according to an affinity to the received question; identifying as similar questions a plurality of previously submitted questions having topics matching the second sorted list of the plurality of extracted candidate topics; creating excerpts of the identified similar questions by selecting one or more sections of each of the identified similar questions; presenting to one or more users via the online consultation system the excerpts of the identified plurality of previously submitted questions while the user is waiting for an answer to the question.
-
-
10. A non-transitory machine-readable storage medium having embodied thereon instructions which when executed by at least one processor, causes a machine to perform operations comprising:
-
receiving at an online consultation system a question associated with a predetermined category; extracting a plurality of candidate topics from the received question; applying a first term frequency inverse document frequency (TFIDF) filter to the extracted plurality of candidate topics to identify a first sorted list of the plurality of extracted candidate topics according to an affinity to the predetermined category; applying a second TFIDF filter to the first sorted list to identify a second sorted list of the plurality of extracted candidate topics according to an affinity to the received question; identifying as similar questions a plurality of previously submitted questions having topics matching the second sorted list of the plurality of extracted candidate topics; creating excerpts of the identified similar questions by selecting one or more sections of each of the identified similar questions; presenting to one or more users via the online consultation system the excerpts of the identified plurality of previously submitted questions while the user is waiting for an answer to the question.
-
Specification