Method and apparatus for parsing text using mutual information
First Claim
Patent Images
1. A method of generating a score for a node identified during a parse of a text segment, the method comprising:
- identifying a phrase level for the node;
identifying a word class for at least one word that neighbors a text spanned by the node; and
generating a score based on the phrase level and the word class.
2 Assignments
0 Petitions
Accused Products
Abstract
A method and parser are provided that generate a score for a node identified during a parse of a text segment. The score is based on a mutual information score that measures the mutual information between a phrase level for the node and a word class of at least one word in the text segment.
24 Citations
22 Claims
-
1. A method of generating a score for a node identified during a parse of a text segment, the method comprising:
-
identifying a phrase level for the node;
identifying a word class for at least one word that neighbors a text spanned by the node; and
generating a score based on the phrase level and the word class. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A parser for generating a syntax structure from a text segment, the parser comprising:
-
a seeding unit for inserting words from the text segment into a candidate list as nodes;
a node selector for promoting nodes from the candidate list to a node chart;
a rule engine for combining nodes in the node chart to form a larger node; and
a metric calculator for generating a score for a node formed by the rule engine, the score being based in part on mutual information. - View Dependent Claims (11, 12, 13, 14, 15, 16, 17, 18)
-
-
19. A computer-readable medium having computer-executable instructions for performing steps comprising:
-
dividing a text segment into words;
forming syntax nodes that each represent a syntax structure for one or more words;
scoring a syntax node to indicate its likelihood of appearing in a full parse structure for the text segment, the score being a mutual information score; and
using the score for the syntax node when forming the full parse structure. - View Dependent Claims (20, 21, 22)
-
Specification