Natural question generation from query data using natural language processing system
First Claim
1. A computer-implemented method, comprising:
- identifying fully formed questions from a query database stored on a storage device, the query database containing a plurality of search query character strings;
identifying partially formed questions from the query database;
creating question templates from the identified fully formed questions;
storing the question templates in a template database;
identifying entities with the partially formed questions and classifying each entity;
storing the classified entities in an entity database; and
generating fully formed questions using the question templates from the template database and classified entities from the entity database.
1 Assignment
0 Petitions
Accused Products
Abstract
A training query generation system is usable to generate fully formed training questions from prior search queries, some of which may be fully formed search queries and some of which are not fully formed. The system may identify fully formed questions from a query database stored on a storage device. The query database includes a plurality of search query character string, The system further identifies partially formed questions from the query database, creates question templates from the identified fully formed questions, and stores the question templates in a template database. The system then identifies entities with the partially formed questions, classifies each entity, and stores the classified entities in an entity database. Fully formed questions can then be generated using the question templates from the template database and classified entities from the entity database.
7 Citations
20 Claims
-
1. A computer-implemented method, comprising:
-
identifying fully formed questions from a query database stored on a storage device, the query database containing a plurality of search query character strings; identifying partially formed questions from the query database; creating question templates from the identified fully formed questions; storing the question templates in a template database; identifying entities with the partially formed questions and classifying each entity; storing the classified entities in an entity database; and generating fully formed questions using the question templates from the template database and classified entities from the entity database. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A system, comprising:
-
one or more computing devices configured to execute a query service and a natural language service; one or more storage devices configured to store a query database, a template database and an entity database; wherein the query service is configured to; receive a search query character string; generate a topic identifier associated with the search query character string; and add a record to the query database including the search query character string, the topic identifier, and a time stamp; wherein natural language service is configured to; retrieve a plurality of records from the query database; for each record, determine whether the search query character string is a fully formed question or a partially formed question; for a fully formed question, replace a portion of the fully formed question with an argument value indicative of a role associated with the replaced portion of the fully formed question to generate a query template and store the query template in the template database; and for a partially formed question, identify an entity, classify each identified entity, and store the entity and a classification for the entity in the entity database. - View Dependent Claims (11, 12, 13, 14)
-
-
15. A computer program product for computer generation of training questions for training a language processing system, said computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to:
-
from a plurality of search query character strings stored in a storage device, identify fully formed questions and identify partially formed questions; create question templates from the identified fully formed questions and store the question templates in a template database; identify entities within the partially formed questions, classify each entity, and store the classified entities in an entity database; and generate fully formed questions through insertion of entities from the entity database into the question templates from the template database. - View Dependent Claims (16, 17, 18, 19, 20)
-
Specification