Relationship analysis system and method for semantic disambiguation of natural language
First Claim
1. A computer storage medium comprising computer executable instructions for performing a computer-implemented method for domain-independent natural language understanding, comprising the steps of:
- forming pairs of words and/or phrases in independent phrases in a sentence, wherein each word and/or phrase is paired with every other word and/or phrase;
determining meanings for the words and/or phrases;
assigning a numeric code from the pre-existing class and category structure that uniquely identifies semantic concepts to each meaning;
adjusting the numeric codes using a weighting component;
storing the adjusted numeric codes that uniquely identify semantic concepts in a computer memory;
comparing the adjusted numeric codes that uniquely identify semantic concepts of each word and/or phrase with each adjusted numeric code that uniquely identifies the semantic concept of the other word and/or phrase in the pair;
comparing syntactically identical sentences with a potential idiom that changes adjusted numeric codes that uniquely identify semantic concepts depending on the context using an ontology;
determining whether a potential idiom is identified and employing appropriate class and category structure to calculate a first adjusted numeric code total, wherein said first adjusted numeric code total corresponds to the best word and/or phrase pairings;
determining an overall-adjusted numeric code total based on said first adjusted numeric code total by comparing remaining words in a sentence and before selecting the pairs of adjusted numeric codes that uniquely identify semantic concepts with the best relationships based on numerical comparisons;
selecting the pairs of adjusted numeric codes that uniquely indentify semantic concepts with the best relationships based on numerical comparisons;
storing the pairs in the computer memory;
combining highly-related pairs with other highly-related pairs to form longer groups of adjusted numeric codes that uniquely identify semantic concepts;
exchanging adjusted numeric codes that uniquely identify semantic concepts for the longer groups with adjusted numeric codes that uniquely identify semantic concepts having a weaker relationship to determine if the exchanged adjusted numeric codes that uniquely identify semantic concepts provide an overall stronger relationship based on numerical comparisons;
forming longer and longer groups until the adjusted numeric codes that uniquely identify semantic concepts are selected in context for all words and phrases in the sentence; and
relating the adjusted numeric codes that uniquely identify semantic concepts to words and phrases.
0 Assignments
0 Petitions
Accused Products
Abstract
A system, method, and computer program product for domain-independent natural language understanding, including at least one of forming pairs of words and/or phrases in a sentence, wherein each word and/or phrase is paired with every other word and/or phrase; determining meanings for the words and/or phrases; assigning numeric codes that uniquely identify semantic concepts to those word and/or phrase meanings; comparing the numeric code of each word and/or phrase with each numeric code of the other word and/or phrase in the pair; selecting the pairs with the best relationships; combining highly-related pairs with other highly-related pairs to form longer groups of words; exchanging numeric codes for the longer groups with numeric codes having a weaker relationship to determine if the exchanged numeric codes provide an overall stronger relationship; and forming longer and longer groups until the sentence is understood.
120 Citations
27 Claims
-
1. A computer storage medium comprising computer executable instructions for performing a computer-implemented method for domain-independent natural language understanding, comprising the steps of:
-
forming pairs of words and/or phrases in independent phrases in a sentence, wherein each word and/or phrase is paired with every other word and/or phrase; determining meanings for the words and/or phrases; assigning a numeric code from the pre-existing class and category structure that uniquely identifies semantic concepts to each meaning; adjusting the numeric codes using a weighting component; storing the adjusted numeric codes that uniquely identify semantic concepts in a computer memory; comparing the adjusted numeric codes that uniquely identify semantic concepts of each word and/or phrase with each adjusted numeric code that uniquely identifies the semantic concept of the other word and/or phrase in the pair; comparing syntactically identical sentences with a potential idiom that changes adjusted numeric codes that uniquely identify semantic concepts depending on the context using an ontology; determining whether a potential idiom is identified and employing appropriate class and category structure to calculate a first adjusted numeric code total, wherein said first adjusted numeric code total corresponds to the best word and/or phrase pairings; determining an overall-adjusted numeric code total based on said first adjusted numeric code total by comparing remaining words in a sentence and before selecting the pairs of adjusted numeric codes that uniquely identify semantic concepts with the best relationships based on numerical comparisons; selecting the pairs of adjusted numeric codes that uniquely indentify semantic concepts with the best relationships based on numerical comparisons; storing the pairs in the computer memory; combining highly-related pairs with other highly-related pairs to form longer groups of adjusted numeric codes that uniquely identify semantic concepts; exchanging adjusted numeric codes that uniquely identify semantic concepts for the longer groups with adjusted numeric codes that uniquely identify semantic concepts having a weaker relationship to determine if the exchanged adjusted numeric codes that uniquely identify semantic concepts provide an overall stronger relationship based on numerical comparisons; forming longer and longer groups until the adjusted numeric codes that uniquely identify semantic concepts are selected in context for all words and phrases in the sentence; and relating the adjusted numeric codes that uniquely identify semantic concepts to words and phrases. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13)
-
-
14. A system for domain-independent natural language understanding, comprising:
-
means for forming pairs of words and/or phrases in independent phrases in a sentence, wherein each word and/or phrase is paired with every other word and/or phrase; means for determining meanings for the words and/or phrases by assigning numeric codes from the pre-existing class and category structure that uniquely identify semantic concepts; means for adjusting the numeric codes using a weighting component; means for storing the adjusted numeric codes that uniquely identify semantic concepts; means for comparing the adjusted numeric codes that uniquely identify semantic concepts of each word and/or phrase with each adjusted numeric code that uniquely identifies the semantic concept of the other word and/or phrase in the pair; comparing syntactically identical sentences with a potential idiom that changes adjusted numeric codes that uniquely identify semantic concepts depending on the context using an ontology; means for determining whether a potential idiom is identified and employing appropriate class and category structure to calculate a first adjusted numeric code total, wherein said first adjusted numeric code total corresponds to the best word and/or phrase pairings; means for determining an overall-adjusted numeric code total based on said first adjusted numeric code total by comparing remaining words in a sentence and selecting the pairs adjusted numeric codes that uniquely identify semantics concepts with the best relationships based on numerical comparisons; means for selecting the pairs with the best relationships based on numerical comparisons; means for storing the pairs; means for combining highly-related pairs with other highly-related pairs to form longer groups of adjusted numeric codes that uniquely identify semantic concepts; means for exchanging adjusted numeric codes that uniquely identify semantic concepts for the longer groups with adjusted numeric codes that uniquely identify semantic concepts having a weaker relationship to determine if the exchanged adjusted numeric codes that uniquely identify semantic concepts provide an overall stronger relationship based on numerical comparisons; means for forming longer and longer groups until the adjusted numeric codes that uniquely identify semantic concepts are selected in context for all words and phrases in the sentence; and means for relating the adjusted numeric codes that uniquely identify semantic concepts to words and phrases. - View Dependent Claims (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26)
-
-
27. A computer program product for domain-independent natural language understanding, including one or more computer readable instructions embedded on a computer readable medium and configured to cause one or more computer processors to perform the steps of:
-
forming pairs of words and/or phrases in independent phrases in a sentence, wherein each word and/or phrase is paired with every other word and/or phrase; determining meanings for the words and/or phrases; assigning a numeric code from the pre-existing class and category structure that uniquely identifies the semantic concept to each meaning; adjusting the numeric codes using a weighting component; storing the adjusted numeric codes that uniquely identify semantic concepts in a computer memory; comparing the adjusted numeric codes that uniquely identify semantic concepts of each word and/or phrase with each adjusted numeric code that uniquely identifies the semantic concept of the other word and/or phrase in the pair; comparing syntactically identical sentences with a potential idiom that changes adjusted numeric codes that uniquely identify semantic concepts depending on the context using an ontology; determining whether a potential idiom is identified and employing appropriate class and category structure to calculate a first adjusted numeric code total, wherein said first adjusted numeric code total corresponds to the best word and/or phrase pairings; determining an overall-adjusted numeric code total based on said first adjusted numeric code total by comparing remaining words in a sentence and selecting the pairs of adjusted numeric codes that uniquely identify semantic concepts with the best relationships based on numerical comparisons; selecting the pairs of adjusted numeric codes that uniquely indentify semantic concepts with the best relationships based on numerical comparisons would be considered a better relationship than codes 4 and 10; storing the pairs in the computer memory; combining highly-related pairs with other highly-related pairs to form longer groups of adjusted numeric codes that uniquely identify semantic concepts; exchanging adjusted numeric codes that uniquely identify semantic concepts for the longer groups with adjusted numeric codes that uniquely identify semantic concepts having a weaker relationship to determine if the exchanged adjusted numeric codes that uniquely identify semantic concepts provide an overall stronger relationship based on numerical comparisons; forming longer and longer groups until the adjusted numeric codes that uniquely identify semantic concepts are selected in context for all words and phrases in the sentence; and relating the adjusted numeric codes that uniquely identify semantic concepts to words and phrases.
-
Specification