Method and apparatus for parsing in a spoken language translation system
First Claim
1. A method for performing spoken language translation, comprising:
- receiving at least one input expression;
determining at least one next action using at least one parsing table;
shifting at least one component of the at least one input expression into a data structure when the at least one next action is determined to be a shift action;
generating at least one new parse node;
associating at least one feature structure of the at least one component with the at least one new parse node;
placing the at least one new parse node on the data structure;
generating at least one packed shared parse forest comprising a plurality of parse trees and a plurality of nodes;
rebuilding at least one node of the at least one packed shared parse forest; and
outputting at least one structural analysis of the at least one input expression.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for parsing in a spoken language translation system are provided, wherein an input is received comprising at least one input sentence or expression. A parsing table is accessed and consulted for a next action, wherein the parser looks up in the next action in the parsing table. During parsing operations, the parser may perform shift actions and reduce actions. In performing a shift action, a next item of the input string is shifted onto a stack or intermediate data structure of the parser. A new parse node is generated, and a feature structure or lexical feature structure of the shifted input item is obtained from a morphological analyzer and associated with the new parse node. The new node is placed on the stack or intermediate data structure. In performing a reduce action, a grammar rule and an associated compiled feature structure manipulation are applied. When the manipulations succeed, a new parse node is generated comprising the new feature structures resulting from the successful feature structure manipulations. When the entire input is successfully parsed then an accept action is performed, a rebuilding procedure is performed, and a structural analysis of the input is provided comprising a number of parse trees and sentential feature structures.
261 Citations
36 Claims
-
1. A method for performing spoken language translation, comprising:
-
receiving at least one input expression;
determining at least one next action using at least one parsing table;
shifting at least one component of the at least one input expression into a data structure when the at least one next action is determined to be a shift action;
generating at least one new parse node;
associating at least one feature structure of the at least one component with the at least one new parse node;
placing the at least one new parse node on the data structure;
generating at least one packed shared parse forest comprising a plurality of parse trees and a plurality of nodes;
rebuilding at least one node of the at least one packed shared parse forest; and
outputting at least one structural analysis of the at least one input expression. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12)
applying at least one grammar rule to the at least one component when the at least one next action is determined to be a reduce action;
executing at least one feature structure manipulation corresponding to the at least one grammar rule; and
determining whether the at least one feature structure manipulation succeeds.
-
-
3. The method of claim 1, wherein rebuilding comprises:
-
performing local ambiguity packing; and
recursively rebuilding the at least one node.
-
-
4. The method of claim 3, wherein recursively rebuilding comprises:
-
marking each of a plurality of nodes for which at least one feature structure is to be rebuilt;
maintaining at least one log comprising each of the plurality of nodes for which the at least one feature structure is to be rebuilt;
locating at least one farthermost marked node from a root node when traversing at least one branch path of the at least one packed shared parse forest;
rebuilding the at least one feature structure of the at least one farthermost marked node;
rebuilding the at least one feature structure of each marked node in succession along the at least one branch path between the at least one farthermost marked and the root node; and
rebuilding the root node.
-
-
5. The method of claim 1, further comprising:
-
generating the at least one parsing table from at least one set of grammar rules;
compiling at least one set of feature structure manipulations corresponding to the at least one set of grammar rules;
maintaining at least one graph-structured stack comprising at least one parsing state.
-
-
6. The method of claim 5, wherein the at least one set of grammar rules comprise context-free grammar rules, wherein the context-free grammar rules use a plurality of non-terminal symbols.
-
7. The method of claim 5, wherein the at least one set of grammar rules comprise English parsing grammar rules and Japanese parsing grammar rules.
-
8. The method of claim 1, wherein shifting comprises:
-
creating at least one parse node for a shifted lexicon; and
generating a feature structure for the at least one parse node by coping a feature structure of the lexicon.
-
-
9. The method of claim 1, wherein the at least one input expression comprises a grammatical sentence, wherein the grammatical sentence comprises at least one spoken word, wherein at least one feature structure is available for each lexicon of the at least one spoken word.
-
10. The method of claim 1, wherein the at least one input expression comprises at least one lexical feature structure.
-
11. The method of claim 1, wherein the at least one structural analysis comprises a plurality of parse trees and sentential feature structures.
-
12. The method of claim 1, wherein the at least one feature structure comprises information from at least one lexical level of at least one morphological analysis.
-
13. An apparatus for spoken language translation comprising:
-
at least one processor;
an input coupled to the at least one processor, the input capable of receiving at least one input expression, the at least one processor configured to translate the received at least one input expression by, determining at least one next action using at least one parsing table, shifting at least one component of the at least one input expression into a data structure when the at least one next action is determined to be a shift action, generating at least one new parse node, associating at least one feature structure of the at least one component with the at least one new parse node, placing the at least one new parse node on the data structure, generating at least one packed shared parse forest comprising a plurality of parse trees and a plurality of nodes, rebuilding at least one node of the at least one packed shared parse forest; and
an output coupled to the at least one processor, the output capable of providing at least one structural analysis of the at least one input. - View Dependent Claims (14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24)
applying at least one grammar rule to the at least one component when the at least one next action is determined to be a reduce action;
executing at least one feature structure manipulation corresponding to the at least one grammar rule; and
determining whether the at least one feature structure manipulation succeeds.
-
-
15. The apparatus of claim 13, wherein rebuilding comprises:
-
performing local ambiguity packing; and
recursively rebuilding the at least one node.
-
-
16. The apparatus of claim 15, wherein recursively rebuilding comprises:
-
marking each of a plurality of nodes for which at least one feature structure is to be rebuilt;
maintaining at least one log comprising each of the plurality of nodes for which the at least one feature structure is to be rebuilt;
locating at least one farthermost marked node from a root node when traversing at least one branch path of the at least one packed shared parse forest;
rebuilding the at least one feature structure of the at least one farthermost marked node;
rebuilding the at least one feature structure of each marked node in succession along the at least one branch path between the at least one farthermost marked and the root node; and
rebuilding the root node.
-
-
17. The apparatus of claim 13, wherein the processor is further configured to translate by:
-
generating the at least one parsing table from at least one set of grammar rules;
compiling at least one set of feature structure manipulations corresponding to the at least one set of grammar rules;
maintaining at least one graph-structured stack comprising at least one parsing state.
-
-
18. The apparatus of claim 17, wherein the at least one set of grammar rules comprise context-free grammar rules, wherein the context-free grammar rules use approximately 78 non-terminal symbols.
-
19. The apparatus of claim 17, wherein the at least one set of grammar rules comprise English parsing grammar rules and Japanese parsing grammar rules.
-
20. The apparatus of claim 13, wherein shifting comprises:
-
creating at least one parse node for a shifted lexicon; and
generating a feature structure for the at least one parse node by coping a feature structure of the lexicon.
-
-
21. The apparatus of claim 13, wherein the at least one input expression comprises a grammatical sentence, wherein the grammatical sentence comprises at least one spoken word, wherein at least one feature structure is available for each lexicon of the at least one spoken word.
-
22. The apparatus of claim 13, wherein the at least one input expression comprises at least one lexical feature structure.
-
23. The apparatus of claim 13, wherein the at least one structural analysis comprises a plurality of parse trees and sentential feature structures.
-
24. The apparatus of claim 13, wherein the at least one feature structure comprises information from at least one lexical level of at least one morphological analysis.
-
25. A computer readable medium containing executable instructions which, when executed in a processing system, causes the system to perform a method for spoken language translation, the method comprising:
-
receiving at least one input expression;
determining at least one next action using at least one parsing table;
shifting at least one component of the at least one input expression into a data structure when the at least one next action is determined to be a shift action;
generating at least one new parse node;
associating at least one feature structure of the at least one component with the at least one new parse node;
placing the at least one new parse node on the data structure;
generating at least one packed shared parse forest comprising a plurality of parse trees and a plurality of nodes;
rebuilding at least one node of the at least one packed shared parse forest; and
outputting at least one structural analysis of the at least one input expression. - View Dependent Claims (26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36)
applying at least one grammar rule to the at least one component when the at least one next action is determined to be a reduce action;
executing at least one feature structure manipulation corresponding to the at least one grammar rule; and
determining whether the at least one feature structure manipulation succeeds.
-
-
27. The computer readable medium of claim 25, wherein rebuilding comprises:
-
performing local ambiguity packing; and
recursively rebuilding the at least one node.
-
-
28. The computer readable medium of claim 27, wherein recursively rebuilding comprises:
-
marking each of a plurality of nodes for which at least one feature structure is to be rebuilt;
maintaining at least one log comprising each of the plurality of nodes for which the at least one feature structure is to be rebuilt;
locating at least one farthermost marked node from a root node when traversing at least one branch path of the at least one packed shared parse forest;
rebuilding the at least one feature structure of the at least one farthermost marked node;
rebuilding the at least one feature structure of each marked node in succession along the at least one branch path between the at least one farthermost marked and the root node; and
rebuilding the root node.
-
-
29. The computer readable medium of claim 25, wherein the method further comprises:
-
generating the at least one parsing table from at least one set of grammar rules;
compiling at least one set of feature structure manipulations corresponding to the at least one set of grammar rules;
maintaining at least one graph-structured stack comprising at least one parsing state.
-
-
30. The computer readable medium of claim 29, wherein the at least one set of grammar rules comprise context-free grammar rules, wherein the context-free grammar rules use approximately 78 non-terminal symbols.
-
31. The computer readable medium of claim 29, wherein the at least one set of grammar rules comprise English parsing grammar rules and Japanese parsing grammar rules.
-
32. The computer readable medium of claim 25, wherein shifting comprises:
-
creating at least one parse node for a shifted lexicon; and
generating a feature structure for the at least one parse node by coping a feature structure of the lexicon.
-
-
33. The computer readable medium of claim 25, wherein the at least one input expression comprises a grammatical sentence, wherein the grammatical sentence comprises at least one spoken word, wherein at least one feature structure is available for each lexicon of the at least one spoken word.
-
34. The computer readable medium of claim 25, wherein the at least one input expression comprises at least one lexical feature structure.
-
35. The computer readable medium of claim 25, wherein the at least one structural analysis comprises a plurality of parse trees and sentential feature structures.
-
36. The computer readable medium of claim 25, wherein the at least one feature structure comprises information from at least one lexical level of at least one morphological analysis.
Specification