PRUNING AND LABEL SELECTION IN HIDDEN MARKOV MODEL-BASED OCR
First Claim
1. A computer-implemented method performed by a data processing apparatus, the method comprising:
- receiving a node of a Hidden Markov Model, wherein the node is a label transition node;
receiving a frame, wherein the frame is a predicted segmentation point; and
pruning the node from a possible nodes list with label transition node pruning,wherein label transition node pruning comprises;
scoring the node at the frame to obtain a score, andpruning the node when the score is greater than the sum of a best score at the frame and a beam threshold minus a penalty term.
2 Assignments
0 Petitions
Accused Products
Abstract
Systems and techniques are provided for pruning a node from a possible nodes list for Hidden Markov Model with label transition node pruning. The node may be a label transition node. A frame may be at a predicted segmentation point in decoding input with the Hidden Markov Model. The node may be scored at the frame. The node may be pruned from the possible nodes list for the frame when score for the node is greater than the sum of a best score among nodes on the possible nodes list for the frame and a beam threshold minus a penalty term. A possible nodes list may be generated for a subsequent frame using label selection. A second node may be pruned from the possible nodes list for the subsequent frame with early pruning.
22 Citations
37 Claims
-
1. A computer-implemented method performed by a data processing apparatus, the method comprising:
-
receiving a node of a Hidden Markov Model, wherein the node is a label transition node; receiving a frame, wherein the frame is a predicted segmentation point; and pruning the node from a possible nodes list with label transition node pruning, wherein label transition node pruning comprises; scoring the node at the frame to obtain a score, and pruning the node when the score is greater than the sum of a best score at the frame and a beam threshold minus a penalty term. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17)
-
-
18. A computer-implemented method performed by a data processing apparatus, the method comprising:
-
receiving an input comprising frames and for each frame of the input that is at a predicted segmentation point; receiving a possible nodes list comprising nodes, wherein the nodes are label transition nodes; pruning the nodes having a score greater than a sum of a score for a best scoring node at the frame and a beam threshold minus a penalty term; generating a possible nodes list for a subsequent frame of the input comprising nodes that are connected to the nodes not pruned from the possible nodes and are first nodes for labels having a label score less than or equal to the sum of a score for a best scoring label at the subsequent frame and a first parameter or a label rank less than or equal to a second parameter; and pruning nodes from the possible nodes list for the subsequent frame having a sum of an observation score for the node and a score for a previous node to the node at a previous frame to the subsequent frame greater than the sum of the score for the best scoring node at the subsequent frame and a beam threshold. - View Dependent Claims (19, 20, 21, 22, 23, 24, 25, 26, 27)
-
-
28. A computer-implemented system for stitching tables comprising:
-
a Hidden Markov Model comprising nodes and transitions, where the nodes comprise emission probabilities and are associated with transition probabilities; a decoder adapted to receive the Hidden Markov Model and frames, decode the frames, and output predictions; a label transition node pruner adapted to receive a frame from the frames and a possible nodes list for the frame and prune one of the nodes from the possible nodes list based on a score for the one of nodes, a best scoring node from the possible nodes list, and a penalty term; a label selector adapted to receive the frame and the possible nodes list for the frame and generate a possible nodes list for a subsequent frame to the frame comprising at least one node connected to one of the nodes from the possible nodes list based on a score for a label associated with the at least one node, a highest scoring label at the subsequent frame, a first parameter, a label rank for the label, and a second parameter; and an early pruner adapted receive the subsequent frame and the possible nodes list for the subsequent frame and prune one of the nodes from the possible nodes list for the subsequent frame based on an observation score for the one of the nodes, a score for a previous node from a previous frame, and the best scoring node from the possible nodes list for the subsequent frame. - View Dependent Claims (29, 30, 31, 32, 33, 34)
-
-
35. A system comprising:
- one or more computers and one or more storage devices storing instructions which are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising;
receiving a node of a Hidden Markov Model, wherein the node is a label transition node; receiving a frame, wherein the frame is a predicted segmentation point; and pruning the node from a possible nodes list with label transition node, wherein label transition node pruning comprises; scoring the node at the frame to obtain a score, and pruning the node when the score is greater than the sum of a best score at the frame and a beam threshold minus a penalty term. - View Dependent Claims (36, 37)
- one or more computers and one or more storage devices storing instructions which are operable, when executed by the one or more computers, to cause the one or more computers to perform operations comprising;
Specification