Correcting text with voice processing

US 9,502,036 B2
Filed: 01/16/2014
Issued: 11/22/2016
Est. Priority Date: 09/29/2012
Status: Expired due to Fees

First Claim

Patent Images

1. A system for correcting text, comprising:

a processor in communication with one or more types of memory, a display and external devices, embodied in a computing device, the processor configured to;

determine a target text unit to be corrected in the text by selecting the target text unit from a plurality of recognized text units, wherein the target text unit is selected based on a lowest confidence value corresponding to the target text unit;

display the target text unit to a user on the display of the computing device and indicate the target text unit to be corrected on the display;

receive, via a graphical user interface of the computing device, a reference voice segment of the user, wherein the reference voice segment comprises a sentence of phrase;

performing automatic voice recognition on the reference voice segment to obtain a reference text segment, wherein the reference text segment comprises a plurality of reference text units;

determine that one of a plurality of reference text units has a pronunciation similar to a word in the target text unit, wherein the reference voice segment is in Chinese and the text is in Chinese characters that are divided into a first phoneme and a second phoneme and the pronunciation similarity between two Chinese characters is measured according to an average similarity per phoneme, wherein the average similarity per phoneme is obtained from a sum of the phoneme similarities of the two Chinese characters, that are being compared, divided by two; and

correct the word in the target unit in the text using the reference text unit with the similar pronunciation.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

The present invention relates to voice processing and provides a method and system for correcting a text. The method comprising: determining a target text unit to be corrected in a text; receiving a reference voice segment input by the user for the target text unit; determining a reference text unit whose pronunciation is similar to a word in the target text unit based on the reference voice segment; and correcting the word in the target text unit in the text by the reference text unit. The present invention enables the user to easily correct errors in the text vocally.

35 Citations

View as Search Results

9 Claims

1. A system for correcting text, comprising:
- a processor in communication with one or more types of memory, a display and external devices, embodied in a computing device, the processor configured to;
  
  determine a target text unit to be corrected in the text by selecting the target text unit from a plurality of recognized text units, wherein the target text unit is selected based on a lowest confidence value corresponding to the target text unit;
  
  display the target text unit to a user on the display of the computing device and indicate the target text unit to be corrected on the display;
  
  receive, via a graphical user interface of the computing device, a reference voice segment of the user, wherein the reference voice segment comprises a sentence of phrase;
  
  performing automatic voice recognition on the reference voice segment to obtain a reference text segment, wherein the reference text segment comprises a plurality of reference text units;
  
  determine that one of a plurality of reference text units has a pronunciation similar to a word in the target text unit, wherein the reference voice segment is in Chinese and the text is in Chinese characters that are divided into a first phoneme and a second phoneme and the pronunciation similarity between two Chinese characters is measured according to an average similarity per phoneme, wherein the average similarity per phoneme is obtained from a sum of the phoneme similarities of the two Chinese characters, that are being compared, divided by two; and
  
  correct the word in the target unit in the text using the reference text unit with the similar pronunciation.
- View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
- - 2. The system according to claim 1, wherein the processer is further configured to:
    - obtain the text by automatic voice recognition of voice data.
  - 3. The system according to claim 1, wherein the processor if further configured to:
    - parse the reference text segment and the target text unitdetermine that the one of the reference text units has the pronunciation similar to the word in the target text unit based on similarity of the phonemes.
  - 4. The system according to claim 1, wherein, to determine that the one of the plurality of reference text units has the pronunciation similar to the word in the target text unit, the processor is further configured to, after the performing of voice recognition on the reference segment to obtain the reference text segment:
    - determine a voice sub-segment whose pronunciation is similar to the word in the target text unit from reference voice segment based on pronunciation similarity; and
      
      obtain the reference text unit corresponding to the voice sub-segment from the reference text segment.
  - 5. The system according to claim 1, wherein the determined reference text unit is multiple reference text units, wherein correcting the word in the target text unit in the text by the reference text unit further comprises:
    - receiving a selection by the user, using a mouse or selecting directly on a touch screen, for one of the multiple reference text units to correct at least one word in the target text unit.
  - 6. The system according to claim 1,wherein the determined reference text unit comprises multiple reference text units, andwherein the target text unit determining section module selects the reference text unit for correcting the word in the target text unit based on the confidences of the multiple reference text units.
  - 7. The system according to claim 1, wherein a boundary recognition section module recognizes unit boundaries of text units in the text.
  - 8. The system according to claim 1, wherein the target text unit determining section module receives a selection by the user for text units in the text to determine the target text unit to be corrected.
  - 9. The system according to claim 2, wherein the target text unit determining section module obtains the confidences of text units in the recognized text of the voice data, and determine the target text unit to be corrected based on the confidences.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
International Business Machines Corporation
Original Assignee
International Business Machines Corporation
Inventors
Bao, Sheng Hua, Chen, Jian, Liu, Wen, Qin, Yong, Shi, Qin, Su, Zhong, Zhang, Shi Lei
Primary Examiner(s)
Sirjani, Fariba

Application Number

US14/156,976
Publication Number

US 20140136198A1
Time in Patent Office

1,041 Days
Field of Search
US Class Current

1/1
CPC Class Codes

G06F 40/166   Editing, e.g. inserting or ...

G06F 40/232   Orthographic correction, e....

G06F 40/53   Processing of non-Latin tex...

G10L 15/22   Procedures used during a sp...

G10L 15/26   Speech to text systems G10L...

G10L 2015/025   Phonemes, fenemes or fenone...

Correcting text with voice processing

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

35 Citations

9 Claims

Specification

Solutions

Use Cases

Quick Links

Correcting text with voice processing

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

35 Citations

9 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links