Character segmentation and recognition method
First Claim
1. A character segmentation and recognition method, comprising:
- collecting image data to obtain a to-be-recognized image;
positioning a character line candidate region on the to-be-recognized image, wherein the character line candidate region comprises a plurality of characters which do not overlap with each other;
obtaining pre-set character line prior information, wherein the character line prior information comprises a quantity of the plurality of characters, a spacing among the plurality of characters, and a size of the plurality of characters;
obtaining a segmentation point template based on the character line prior information, wherein the segmentation point template comprises boundary frames of each of the plurality of characters;
traversing the segmentation point template within the character line candidate region, to obtain credible degrees of different positions in the character line candidate region;
determining an optimal segmentation position from the different positions, wherein the credible degree of the optimal segmentation position is largest among the credible degrees of the different positions;
segmenting the character line candidate region based on the segmentation point template and the optimal segmentation position to obtain a plurality of single-character regions which do not overlap with each other, wherein each of the plurality of single-character regions comprises one of the plurality of characters; and
performing character recognition on each of the plurality of single-character regions to obtain a corresponding recognition result;
wherein traversing the segmentation point template within the character line candidate region, to obtain credible degrees of different positions in the character line candidate region comprises;
recording position information of the segmentation point template, in response to the segmentation point being traversed to each of the different positions; and
obtaining the credible degrees corresponding to the position information; and
wherein the obtaining the credible degrees corresponding to the position information comprises;
obtaining the first number of first effective pixel points of the character line candidate region on a left boundary of a character segmentation point, wherein the segmentation point template comprises a plurality of character segmentation points, each of the character segmentation points corresponds to one character on a character line, each of the character segmentation points comprises the left boundary and a right boundary, and the first effective pixel points are pixel points with gray values in a pre-set first threshold range;
obtaining a first evaluation value based on the first number and a pre-set first weight;
obtaining the second number of second effective pixel points of the character line candidate region on the right boundary of the character segmentation point, wherein the second effective pixel points are pixel points with gray values in a pre-set second threshold range;
obtaining a second evaluation value based on the second number and a pre-set second weight;
obtaining the third number of third effective pixel points of the character line candidate region within the segmentation point template, wherein the third effective pixel points are pixel points with gray values in a pre-set third threshold range;
obtaining a third evaluation value based on the third number and a pre-set third weight; and
obtaining the credible degrees corresponding to the position information based on the first evaluation value, the second evaluation value and the third evaluation value.
1 Assignment
0 Petitions
Accused Products
Abstract
Provided is a character segmentation and recognition method. The method includes: collecting image data to obtain a to-be-recognized image; positioning a character line candidate region on the to-be-recognized image; obtaining pre-set character line prior information, where the character line prior information includes the number of characters, character spacing and a character size; obtaining a corresponding segmentation point template based on the character line prior information; obtaining credible degrees of different positions on the character line candidate region traversed by the segmentation point template; determining a position with the highest credible degree as an optimal segmentation position; segmenting the character line candidate region based on the segmentation point template and the optimal segmentation position to obtain multiple single character regions; and performing character recognition on each of the single character regions to obtain a corresponding recognition result.
-
Citations
14 Claims
-
1. A character segmentation and recognition method, comprising:
-
collecting image data to obtain a to-be-recognized image; positioning a character line candidate region on the to-be-recognized image, wherein the character line candidate region comprises a plurality of characters which do not overlap with each other; obtaining pre-set character line prior information, wherein the character line prior information comprises a quantity of the plurality of characters, a spacing among the plurality of characters, and a size of the plurality of characters; obtaining a segmentation point template based on the character line prior information, wherein the segmentation point template comprises boundary frames of each of the plurality of characters; traversing the segmentation point template within the character line candidate region, to obtain credible degrees of different positions in the character line candidate region; determining an optimal segmentation position from the different positions, wherein the credible degree of the optimal segmentation position is largest among the credible degrees of the different positions; segmenting the character line candidate region based on the segmentation point template and the optimal segmentation position to obtain a plurality of single-character regions which do not overlap with each other, wherein each of the plurality of single-character regions comprises one of the plurality of characters; and performing character recognition on each of the plurality of single-character regions to obtain a corresponding recognition result; wherein traversing the segmentation point template within the character line candidate region, to obtain credible degrees of different positions in the character line candidate region comprises; recording position information of the segmentation point template, in response to the segmentation point being traversed to each of the different positions; and obtaining the credible degrees corresponding to the position information; and wherein the obtaining the credible degrees corresponding to the position information comprises; obtaining the first number of first effective pixel points of the character line candidate region on a left boundary of a character segmentation point, wherein the segmentation point template comprises a plurality of character segmentation points, each of the character segmentation points corresponds to one character on a character line, each of the character segmentation points comprises the left boundary and a right boundary, and the first effective pixel points are pixel points with gray values in a pre-set first threshold range; obtaining a first evaluation value based on the first number and a pre-set first weight; obtaining the second number of second effective pixel points of the character line candidate region on the right boundary of the character segmentation point, wherein the second effective pixel points are pixel points with gray values in a pre-set second threshold range; obtaining a second evaluation value based on the second number and a pre-set second weight; obtaining the third number of third effective pixel points of the character line candidate region within the segmentation point template, wherein the third effective pixel points are pixel points with gray values in a pre-set third threshold range; obtaining a third evaluation value based on the third number and a pre-set third weight; and obtaining the credible degrees corresponding to the position information based on the first evaluation value, the second evaluation value and the third evaluation value. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14)
-
Specification