Meta learning for question classification
First Claim
1. A computer-implemented method for automatically classifying a first question, the method comprising:
- receiving unlabeled audio or digital text data from an input module, said unlabeled audio or digital text data comprising data that is not previously associated with an expected answer;
automatically labeling said unlabeled audio or digital text data using a processor to produce first labeled audio or digital text data associating a first answer with the unlabeled audio or digital text data using a first artificial neural network, said first artificial neural network comprising a first set of weights, said first artificial neural network producing the first labeled audio or digital text data by performing one or more auxiliary tasks analyzing characteristics of said unlabeled audio or digital text data;
transferring said first set of weights to a second artificial neural network;
receiving second labeled audio or digital text data comprising a second question and a corresponding answer;
training said second artificial neural network with the processor using said second labeled audio or digital text data by modifying a second set of weights associated with the second artificial neural network responsive to the second labeled audio or digital text data and freezing the first set of weights;
receiving the first question from the input module; and
associating a question category with the first question using said second artificial neural network, said question category identifying a source for retrieving text data or audio data describing an answer corresponding to the first question.
1 Assignment
0 Petitions
Accused Products
Abstract
A system and a method are disclosed for automatic question classification and answering. A multipart artificial neural network (ANN) comprising a main ANN and an auxiliary ANN classifies a received question according to one of a plurality of defined categories. Unlabeled data is received from a source, such as a plurality of human volunteers. The unlabeled data comprises additional questions that might be asked of an autonomous machine such as a humanoid robot, and is used to train the auxiliary ANN in an unsupervised mode. The unsupervised training can comprise multiple auxiliary tasks that generate labeled data from the unlabeled data, thereby learning an underlying structure. Once the auxiliary ANN has trained, the weights are frozen and transferred to the main ANN. The main ANN can then be trained using labeled questions. The original question to be answered is applied to the trained main ANN, which assigns one of the defined categories. The assigned category is used to map the original question to a database that most likely contains the appropriate answer. An object and/or a property within the original question can be identified and used to formulate a query, using, for example, system query language (SQL), to search for the answer within the chosen database. The invention makes efficient use of available information, and improves training time and error rate relative to use of single part ANNs.
65 Citations
25 Claims
-
1. A computer-implemented method for automatically classifying a first question, the method comprising:
-
receiving unlabeled audio or digital text data from an input module, said unlabeled audio or digital text data comprising data that is not previously associated with an expected answer; automatically labeling said unlabeled audio or digital text data using a processor to produce first labeled audio or digital text data associating a first answer with the unlabeled audio or digital text data using a first artificial neural network, said first artificial neural network comprising a first set of weights, said first artificial neural network producing the first labeled audio or digital text data by performing one or more auxiliary tasks analyzing characteristics of said unlabeled audio or digital text data; transferring said first set of weights to a second artificial neural network; receiving second labeled audio or digital text data comprising a second question and a corresponding answer; training said second artificial neural network with the processor using said second labeled audio or digital text data by modifying a second set of weights associated with the second artificial neural network responsive to the second labeled audio or digital text data and freezing the first set of weights; receiving the first question from the input module; and associating a question category with the first question using said second artificial neural network, said question category identifying a source for retrieving text data or audio data describing an answer corresponding to the first question. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 20, 21)
-
-
15. An apparatus for automatically classifying a first question, the apparatus comprising:
-
an input module configured to; receive unlabeled audio or digital text data comprising data not previously associated with an expected answer, receive first labeled audio or digital text data comprising a second question and a corresponding answer, and receive the first question; a processor module configured to; automatically label said unlabeled audio or digital text data to produce second labeled audio or digital text data associating a first answer with the unlabeled audio or digital text data using a first artificial neural network, said first artificial neural network comprising a first set of weights and producing the second labeled audio or digital text data by performing one or more auxiliary tasks analyzing characteristics of said unlabeled audio or digital text data, transfer said first set of weights to a second artificial neural network, train said second artificial neural network using said first audio or digital text labeled data by modifying a second set of weights associated with the second artificial neural network responsive to the second audio or digital text labeled data and freezing the first set of weights, and associate a question category with the first question using said second artificial neural network, said question category identifying a source for retrieving text data describing an answer corresponding to the first question. - View Dependent Claims (16, 22)
-
-
17. An apparatus for automatically classifying a first question, the apparatus comprising:
-
means for receiving unlabeled audio or digital text data comprising data that is not previously associated with an expected answer; means for automatically labeling said unlabeled audio or digital text data to produce first labeled audio or digital text data associating a first answer with the unlabeled audio or digital text data using a first artificial neural network, said first artificial neural network comprising a first set of weights, said first artificial neural network producing the first labeled audio or digital text data by performing one or more auxiliary tasks analyzing characteristics of said unlabeled audio or digital text data; means for transferring said first set of weights to a second artificial neural network; means for receiving second labeled data comprising a second question and a corresponding answer; means for training said second artificial neural network using said second labeled audio or digital text data by modifying a second set of weights associated with the second artificial neural network responsive to the second labeled audio or digital text data and freezing the first set of weights; means for receiving the first question; and means for associating a question category with the first question using said second artificial neural network, said question category identifying a source for retrieving text data or audio data describing an answer corresponding to the first question. - View Dependent Claims (18, 23)
-
-
19. A computer program product, comprising a computer-readable medium having computer program instructions embodied thereon to cause a computer processor to implement a method for automatically classifying a first question, the method comprising:
-
automatically labeling unlabeled audio or digital text data comprising data that is not previously associated with an expected answer to produce first labeled audio or digital text data associating a first answer with the unlabeled audio or digital text data using a first artificial neural network, said first artificial neural network comprising a first set of weights and producing the first labeled text data by performing one or more auxiliary tasks analyzing characteristics of said unlabeled audio or digital text data; transferring said first set of weights to a second artificial neural network; receiving second labeled audio or digital text data comprising a second question and a corresponding answer; training said second artificial neural network using said second labeled audio or digital text data by modifying a second set of weights associated with the second artificial neural network responsive to the second labeled audio or digital text data and freezing the first set of weights; receiving the first question; and associating a question category with the first question using said second artificial neural network, said question category identifying a source for retrieving audio data or text data describing an answer corresponding to the first question. - View Dependent Claims (24)
-
-
25. A method of using a computer to perform the steps of:
-
receiving unlabeled audio or digital text data from an input module, said unlabeled audio or digital text data comprising data that is not previously associated with an expected answer; automatically labeling said unlabeled audio or digital text data using a processor to produce first labeled audio or digital text data associating a first answer with the unlabeled audio or digital text data using a first artificial neural network, said first artificial neural network comprising a first set of weights, said first artificial neural network producing the first labeled audio or digital text data by performing one or more auxiliary tasks analyzing characteristics of said unlabeled audio or digital text data; transferring said first set of weights to a second artificial neural network; receiving second labeled audio or digital text data comprising a second question and a corresponding answer; training said second artificial neural network with the processor using said second labeled audio or digital text data by modifying a second set of weights associated with the second artificial neural network responsive to the second labeled audio or digital text data and freezing the first set of weights; and storing said second set of weights associated with the second artificial neural network and said first set of weights.
-
Specification