Question-answering system and method based on semantic labeling of text documents and user questions
First Claim
1. A method for question-answering based on automatic semantic labeling of text documents and user questions, the method comprising:
- electronically receiving natural language text documents using at least one computer processor coupled to at least one non-transitory storage medium;
electronically receiving a user question formulated in a natural language;
performing a basic linguistic analysis of the text documents and the user question;
performing semantic labeling of the text documents through semantic analysis, including identifying target words in the text documents using linguistic patterns stored in a database and assigning question types to the target words in the text documents;
storing the semantically labeled text documents in a labeled text documents database;
performing semantic labeling of the user question through semantic analysis, including identifying one or more target words in the user question using linguistic patterns stored in a database and assigning a question type to each of the one or more target words in the user question;
searching the labeled text documents database for text fragments relevant to the semantically labeled user question at least in part by matching a question type assigned to a target word in the labeled text documents to a question type assigned to a target word in the user question, wherein relevance is based on a ranking of the text fragments relative to the semantically labeled user question; and
synthesizing answers to the user question from the relevant text fragments, and electronically presenting the synthesized answer to the user.
4 Assignments
0 Petitions
Accused Products
Abstract
A question-answering system for searching exact answers in text documents provided in the electronic or digital form to questions formulated by user in the natural language is based on automatic semantic labeling of text documents and user questions. The system performs semantic labeling with the help of markers in terms of basic knowledge types, their components and attributes, in terms of question types from the predefined classifier for target words, and in terms of components of possible answers. A matching procedure makes use of mentioned types of semantic labels to determine exact answers to questions and present them to the user in the form of fragments of sentences or a newly synthesized phrase in the natural language. Users can independently add new types of questions to the system classifier and develop required linguistic patterns for the system linguistic knowledge base.
163 Citations
28 Claims
-
1. A method for question-answering based on automatic semantic labeling of text documents and user questions, the method comprising:
-
electronically receiving natural language text documents using at least one computer processor coupled to at least one non-transitory storage medium; electronically receiving a user question formulated in a natural language; performing a basic linguistic analysis of the text documents and the user question; performing semantic labeling of the text documents through semantic analysis, including identifying target words in the text documents using linguistic patterns stored in a database and assigning question types to the target words in the text documents; storing the semantically labeled text documents in a labeled text documents database; performing semantic labeling of the user question through semantic analysis, including identifying one or more target words in the user question using linguistic patterns stored in a database and assigning a question type to each of the one or more target words in the user question; searching the labeled text documents database for text fragments relevant to the semantically labeled user question at least in part by matching a question type assigned to a target word in the labeled text documents to a question type assigned to a target word in the user question, wherein relevance is based on a ranking of the text fragments relative to the semantically labeled user question; and synthesizing answers to the user question from the relevant text fragments, and electronically presenting the synthesized answer to the user. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
-
15. A computer program product comprising a non-transitory computer-readable medium having stored therein computer-executable instructions for performing a method for question-answering based on automatic semantic labeling of text documents and user questions, the method comprising:
-
electronically receiving natural language text documents; electronically receiving a user question formulated in a natural language; performing a basic linguistic analysis of the text documents and the user question; performing semantic labeling of the text documents through semantic analysis, including identifying target words in the text documents using linguistic templates and assigning question types to the target words, storing the semantically labeled text documents in a labeled text documents database; performing semantic labeling of the user question through semantic analysis, including identifying one or more target words in the user question using linguistic templates and assigning question types to the one or more target words; searching the labeled text documents database for text fragments relevant to the semantically labeled user question, wherein relevance is based on a ranking of the text fragments relative to the semantically labeled user question; and synthesizing answers to the user question from the relevant text fragments. - View Dependent Claims (16)
-
-
17. A question-answering system that uses automatic semantic labeling of text documents and user questions in electronic or digital form formulated in natural language, the system comprising:
-
one or more processors; a linguistic knowledge base that includes a plurality of question-classification based linguistic patterns; and a linguistic analyzer configured to produce linguistically analyzed text documents and user questions using the one or more processors, the linguistic analyzer comprising a semantic analyzer that includes; an expanded Subject-Action-Object (eSAO) recognizer and a Cause-Effect recognizer configured to produce semantically analyzed text documents and user questions, including recognizing one or more facts in the form of one or more eSAO sets based on the text documents and on the user questions, wherein eSAO and Cause-Effect recognition is based on patterns stored in the linguistic knowledge base. - View Dependent Claims (18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28)
-
Specification