Using wildcards in semantic parsing
Abstract
A method and apparatus perform semantic parsing by designating one or more words in an input text stream as wildcards. Under some embodiments, partially constructed parses formed from other words in the text stream are used to control when a later word will be identified as a wildcard. In particular, if a partial parse is expecting a semantic token that begins with a wildcard, the next word in the input text stream is designated as a wildcard term. If none of the existing partial parses is expecting a semantic token that begins with a wildcard, the next word is not designated as a wildcard.
23 Claims
1. A method of generating a semantic representation of a string of words from a language, the method comprising:
identifying at least one semantic token activated by a word;
placing the identified semantic token in a list of potential semantic tokens for the semantic representation of the string of words;
placing a wildcard semantic token in the list of potential semantic tokens for the word, the wildcard semantic token being capable of being activated by every word in the language; and
building the semantic representation in part by utilizing one of the semantic tokens in the list of potential semantic tokens through steps comprising:
generating a semantic representation that spans the entire string of words;
identifying a skipped word in the string of words that is not associated with a semantic token in the semantic representation;
determining that a word to the left of the skipped word is associated with a wildcard semantic token; and
associating the skipped word with the wildcard semantic token.
Dependent claims: 2, 3, 4, 5, 6, 7
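The final three steps of claim 1 (finding a skipped word, checking the word to its left, and attaching the skipped word to that wildcard) can be illustrated with a minimal Python sketch. The `Assignment` class, the `WILDCARD` label, and the example token names are hypothetical and are not taken from the patent. The sketch also assumes that a run of consecutive skipped words is absorbed one word at a time, which the claim does not state explicitly.

```python
from dataclasses import dataclass
from typing import List, Optional

WILDCARD = "<wildcard>"  # hypothetical label for the wildcard semantic token


@dataclass
class Assignment:
    """One word of the input and the semantic token it ended up under."""
    word: str
    token: Optional[str]  # None marks a skipped word


def absorb_skipped_words(assignments: List[Assignment]) -> List[Assignment]:
    """Attach each skipped word to a wildcard token immediately to its left.

    Words are visited left to right, so a run of skipped words following a
    wildcard is absorbed one after another (an assumption of this sketch).
    """
    result: List[Assignment] = []
    for a in assignments:
        left = result[-1] if result else None
        if a.token is None and left is not None and left.token == WILDCARD:
            # The left neighbour is a wildcard: associate the skipped word with it.
            result.append(Assignment(a.word, WILDCARD))
        else:
            # Either the word already has a token, or it stays skipped.
            result.append(Assignment(a.word, a.token))
    return result


if __name__ == "__main__":
    parse = [
        Assignment("about", "SubjectPreamble"),
        Assignment("budget", WILDCARD),
        Assignment("review", None),  # skipped by the parser
        Assignment("with", "AttendeePreamble"),
    ]
    for a in absorb_skipped_words(parse):
        print(a.word, a.token)
```

A skipped word with no wildcard to its left is left unassigned in this sketch, since the claim only describes the case where the left neighbour is a wildcard.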
8. (canceled)
9. A method of parsing text to form a semantic structure, the semantic structure formed of terminal nodes and non-terminal nodes, each non-terminal node being represented by a semantic token that is defined by child nodes beneath the semantic token, the method comprising:
generating a semantic token that has a word in the text as a child node;
generating a wildcard semantic token for a word in the text, the wildcard semantic token being capable of having any word as a child node, generating a wildcard semantic token comprising:
determining if any generated semantic tokens have an unfilled child node that is defined in part by a left-most child node that is expecting a wildcard semantic token; and
generating the wildcard semantic token based on the fact that an unfilled child node has a left-most child node that is expecting a wildcard semantic token;
generating a semantic token that has a wildcard semantic token as a child node; and
utilizing at least one of the generated semantic tokens in the semantic structure.
Dependent claims: 10, 12, 13
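One way to picture the wildcard-generation test in claim 9 is as a left-most-child check over a grammar. The Python sketch below is illustrative only: the toy `GRAMMAR`, the `Node` and `PartialToken` classes, and the function names are all assumptions and do not come from the patent. A wildcard token is generated for a word only when some partially built token has an unfilled child whose left-most child can be a wildcard.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set, Union

WILDCARD = "<wildcard>"  # hypothetical label for the wildcard semantic token

# Hypothetical toy grammar: each non-terminal token lists its child sequences.
# Symbols that are not keys (and are not WILDCARD) are treated as plain words.
GRAMMAR: Dict[str, List[List[str]]] = {
    "ScheduleMeeting": [["schedule", "meeting", "Subject"]],
    "Subject": [["about", "SubjectText"]],
    "SubjectText": [[WILDCARD]],
}


@dataclass
class Node:
    """A node of the semantic structure; children are nodes or words."""
    token: str
    children: List[Union["Node", str]] = field(default_factory=list)


@dataclass
class PartialToken:
    """A semantic token whose first `filled` children have been matched."""
    token: str
    rule: List[str]
    filled: int

    def next_expected(self) -> Optional[str]:
        return self.rule[self.filled] if self.filled < len(self.rule) else None


def can_begin_with_wildcard(symbol: str, grammar: Dict[str, List[List[str]]],
                            seen: Optional[Set[str]] = None) -> bool:
    """True if the left-most child chain of `symbol` can reach a wildcard."""
    if symbol == WILDCARD:
        return True
    seen = set() if seen is None else seen
    if symbol in seen or symbol not in grammar:
        return False  # a plain word, or a left-recursive loop
    seen.add(symbol)
    return any(rule and can_begin_with_wildcard(rule[0], grammar, seen)
               for rule in grammar[symbol])


def maybe_generate_wildcard(word: str, partials: List[PartialToken],
                            grammar: Dict[str, List[List[str]]]) -> Optional[Node]:
    """Generate a wildcard token for `word` only if some partial token expects one."""
    for p in partials:
        expected = p.next_expected()
        if expected is not None and can_begin_with_wildcard(expected, grammar):
            return Node(WILDCARD, [word])
    return None
```

For example, once "about" has been matched inside a `Subject` token, the unfilled `SubjectText` child begins with a wildcard, so the next word would receive a wildcard token; before that point it would not.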
11. (canceled)
14. A computer-readable medium having computer-executable instructions for generating a semantic representation of an input text string by performing steps comprising:
constructing potential portions of the semantic representation based on the input text string;
determining that one of the potential portions of the semantic representation can be extended by a wildcard;
designating a word in the text string as a wildcard to extend the potential portion of the semantic representation;
identifying a skipped word in the input text string that is not directly linked to the semantic representation;
determining that a word adjacent to the skipped word was designated as a wildcard; and
designating the skipped word as part of the wildcard.
15. (canceled)
16. A method of generating a semantic representation of a string of words from a language, the method comprising:
identifying at least one semantic token activated by a word;
placing the identified semantic token in a list of potential semantic tokens for the semantic representation of the string of words;
placing a wildcard semantic token in the list of potential semantic tokens for the word, the wildcard semantic token being capable of being activated by every word in the language, wherein placing the wildcard semantic token in the list of potential semantic tokens comprises only placing the wildcard semantic token in the list under two alternative conditions comprising:
A) if the word is the first word in the string of words and the entire sentence can begin with a wildcard according to the grammar; and
B) if a partial parse in the chart needs a semantic token that can begin with a wildcard semantic token; and
building the semantic representation in part by utilizing one of the semantic tokens in the list of potential semantic tokens.
Dependent claims: 17, 18, 19, 20
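The two alternative conditions in claim 16 amount to a gate on when the wildcard token joins a word's candidate list. A minimal Python sketch of that gate follows. The function name, its parameters, and the idea of passing in a precomputed set of tokens that can begin with a wildcard are assumptions made for illustration, not details from the patent.

```python
from typing import Iterable, List, Set

WILDCARD = "<wildcard>"  # hypothetical label for the wildcard semantic token


def candidate_tokens(position: int,
                     activated: Iterable[str],
                     sentence_can_begin_with_wildcard: bool,
                     needed_by_partial_parses: Iterable[str],
                     wildcard_initial: Set[str]) -> List[str]:
    """Build the list of potential semantic tokens for one word.

    position: index of the word in the string (0 is the first word).
    activated: tokens the word itself activates.
    sentence_can_begin_with_wildcard: whether the grammar lets the whole
        sentence start with a wildcard.
    needed_by_partial_parses: tokens that partial parses in the chart are
        currently waiting for.
    wildcard_initial: tokens that can begin with a wildcard token
        (assumed to be precomputed from the grammar).
    """
    candidates = list(activated)
    # Condition A: first word, and the sentence may begin with a wildcard.
    cond_a = position == 0 and sentence_can_begin_with_wildcard
    # Condition B: some partial parse needs a token that can begin with a wildcard.
    cond_b = any(tok in wildcard_initial for tok in needed_by_partial_parses)
    if cond_a or cond_b:
        candidates.append(WILDCARD)
    return candidates
```

Gating the wildcard this way keeps it out of the candidate list for most words, so fewer wildcard hypotheses have to be considered than if every word received one.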
21. A method of parsing text to form a semantic structure, the semantic structure formed of terminal nodes and non-terminal nodes, each non-terminal node being represented by a semantic token that is defined by child nodes beneath the semantic token, the method comprising:
generating a semantic token that has a word in the text as a child node;
generating a wildcard semantic token for a word in the text, the wildcard semantic token being capable of having any word as a child node;
generating a semantic token that has a wildcard semantic token as a child node;
utilizing at least one of the generated semantic tokens in the semantic structure;
identifying a skipped word in the text that has not been associated with a semantic token in the semantic structure;
determining that a word to the left of the skipped word is associated with a wildcard semantic token; and
adding the skipped word as a child node of the wildcard semantic token.
Dependent claims: 22, 23
Specification