Correcting text with voice processing
First Claim
Patent Images
1. A system for correcting text, comprising:
- a processor in communication with one or more types of memory, a display and external devices, embodied in a computing device, the processor configured to;
determine a target text unit to be corrected in the text by selecting the target text unit from a plurality of recognized text units, wherein the target text unit is selected based on a lowest confidence value corresponding to the target text unit;
display the target text unit to a user on the display of the computing device and indicate the target text unit to be corrected on the display;
receive, via a graphical user interface of the computing device, a reference voice segment of the user, wherein the reference voice segment comprises a sentence of phrase;
performing automatic voice recognition on the reference voice segment to obtain a reference text segment, wherein the reference text segment comprises a plurality of reference text units;
determine that one of a plurality of reference text units has a pronunciation similar to a word in the target text unit, wherein the reference voice segment is in Chinese and the text is in Chinese characters that are divided into a first phoneme and a second phoneme and the pronunciation similarity between two Chinese characters is measured according to an average similarity per phoneme, wherein the average similarity per phoneme is obtained from a sum of the phoneme similarities of the two Chinese characters, that are being compared, divided by two; and
correct the word in the target unit in the text using the reference text unit with the similar pronunciation.
1 Assignment
0 Petitions
Accused Products
Abstract
The present invention relates to voice processing and provides a method and system for correcting a text. The method comprising: determining a target text unit to be corrected in a text; receiving a reference voice segment input by the user for the target text unit; determining a reference text unit whose pronunciation is similar to a word in the target text unit based on the reference voice segment; and correcting the word in the target text unit in the text by the reference text unit. The present invention enables the user to easily correct errors in the text vocally.
35 Citations
9 Claims
-
1. A system for correcting text, comprising:
-
a processor in communication with one or more types of memory, a display and external devices, embodied in a computing device, the processor configured to; determine a target text unit to be corrected in the text by selecting the target text unit from a plurality of recognized text units, wherein the target text unit is selected based on a lowest confidence value corresponding to the target text unit; display the target text unit to a user on the display of the computing device and indicate the target text unit to be corrected on the display; receive, via a graphical user interface of the computing device, a reference voice segment of the user, wherein the reference voice segment comprises a sentence of phrase; performing automatic voice recognition on the reference voice segment to obtain a reference text segment, wherein the reference text segment comprises a plurality of reference text units; determine that one of a plurality of reference text units has a pronunciation similar to a word in the target text unit, wherein the reference voice segment is in Chinese and the text is in Chinese characters that are divided into a first phoneme and a second phoneme and the pronunciation similarity between two Chinese characters is measured according to an average similarity per phoneme, wherein the average similarity per phoneme is obtained from a sum of the phoneme similarities of the two Chinese characters, that are being compared, divided by two; and correct the word in the target unit in the text using the reference text unit with the similar pronunciation. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
Specification