Corpus specific natural language query completion assistant
First Claim
1. A method for completing a question comprising:
- receiving, from a user, a question prefix representing a portion of a natural language question;
selecting, based on the question prefix, at least one index comprising a short text index or a question term index, the short text index comprising a plurality of text entries, each entry corresponding to an associated document, the question term index comprising a plurality of question entries, each entry corresponding to a partial question phrase;
searching the selected at least one index using the question prefix and retrieving a plurality of retrieved entries;
selecting a subset of the plurality of retrieved entries;
responsive to selecting the subset, creating a language model from the subset, the language model comprising a plurality of n-grams each with an n-gram probability;
creating a plurality of question completion suggestions based on the language model and the question prefix; and
causing presentation of the plurality of question completion suggestions to the user via a user interface.
1 Assignment
0 Petitions
Accused Products
Abstract
Representative embodiments disclose mechanisms to complete partial natural language questions. Users enter a partial question. The system comprises a plurality of indexes, one index comprising common phrases associated with natural language questions and other indexes comprising short text entries associated with documents, such as document titles. The partial question is used to search one or more of the indexes. The search yields a ranked list of results. The top k entries of the list are selected and one or more language models are created from the top k entries. Each language model comprises n-grams from the top k entries from an index and an associated probability for each n-gram. A question completion generator creates question completion suggestions by matching n-grams with the partial question, removing ungrammatical candidate suggestions, and filtering the remaining suggestions per a filtering criteria. The top N results are returned as suggestions to complete the question.
29 Citations
20 Claims
-
1. A method for completing a question comprising:
-
receiving, from a user, a question prefix representing a portion of a natural language question; selecting, based on the question prefix, at least one index comprising a short text index or a question term index, the short text index comprising a plurality of text entries, each entry corresponding to an associated document, the question term index comprising a plurality of question entries, each entry corresponding to a partial question phrase; searching the selected at least one index using the question prefix and retrieving a plurality of retrieved entries; selecting a subset of the plurality of retrieved entries; responsive to selecting the subset, creating a language model from the subset, the language model comprising a plurality of n-grams each with an n-gram probability; creating a plurality of question completion suggestions based on the language model and the question prefix; and causing presentation of the plurality of question completion suggestions to the user via a user interface. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system for completing a question comprising:
a processor and executable instructions accessible on a computer-storage medium that, when executed, cause the processor to perform operations comprising; receive, from a user, a question prefix representing a portion of a natural language question; selecting at least one index comprising a short text index or a question term index, the short text index comprising a plurality of text entries, each entry corresponding to an associated document, the question term index comprising a plurality of question entries, each entry corresponding to a partial question phrase; search the at least one index and retrieve a plurality of retrieved entries; identify a subset of the plurality of the retrieved entries; create a language model from the subset, the language model comprising a plurality of n-grams each with an n-gram probability; create a plurality of question completion suggestions based on the language model and the question prefix; return plurality of question completion suggestions to the user via a user interface; and store user interaction with the question completion suggestions. - View Dependent Claims (11, 12, 13, 14, 15, 16)
-
17. A computer storage medium comprising executable instructions that, when executed by a processor of a machine, cause the machine to perform operations comprising:
-
receive, from a user, a question prefix representing a portion of a natural language question; selecting, based on the question prefix, at least one index comprising a short text index or a question term index, the short text index comprising a plurality of text entries, each entry corresponding to an associate document, the question term index comprising a plurality of question entries, each entry corresponding to a partial question phrase; search the at least one index and retrieve a plurality of retrieved entries, the plurality of retrieved entries comprising the title of a document or the partial question phrase, or both; select a subset of the plurality of the retrieved entries; create a language model from the subset, the language model comprising a plurality of n-grams each with an n-gram probability; create a plurality of question completion suggestions based on the language model and the question prefix; and return plurality of question completion suggestions to the user via a user interface. - View Dependent Claims (18, 19, 20)
-
Specification