Method and apparatus for phonetically annotating text

US 10,114,809 B2
Filed: 06/23/2016
Issued: 10/30/2018
Est. Priority Date: 05/07/2014
Status: Active Grant

First Claim

Patent Images

1. A method, comprising:

at a computing device having one or more processors and memory;

receiving a text input from a user, including receiving copied or scanned text for which context-appropriate phonetic annotation is to be performed at the computing device;

identifying a first polyphonic word segment and a first monophonic word segment in the text input, the first polyphonic word segment having at least a first pronunciation and a second pronunciation that is distinct from the first pronunciation, and the first monophonic word segment having a single pronunciation;

determining at least a first probability corresponding to the first pronunciation being a correct pronunciation for the first polyphonic word segment and a second probability corresponding to the second pronunciation being the correct pronunciation for the first polyphonic word segment, wherein the first probability is greater than the second probability;

determining a predetermined threshold difference based on;

(1) a comparison of the first probability and the second probability with a preset threshold probability value, respectively, and (2) a magnitude of a difference between the first probability and the second probability;

comparing the difference between the first probability and the second probability with the predetermined threshold difference; and

selecting the first pronunciation as a current pronunciation for the first polyphonic word segment in accordance with a determination that the difference between the first probability and the second probability exceeds the predetermined threshold difference; and

in a text presentation user interface, displaying the input text concurrently with context-appropriate pronunciation annotations to facilitate a user'"'"'s reading the input text aloud, including;

phonetically annotating the first monophonic word segment in the displayed input text with the single pronunciation of the first monophonic word segment;

phonetically annotating the first polyphonic word segment in the displayed input text with the first pronunciation of the first polyphonic word segment; and

forgoing phonetically annotating the first polyphonic word segment in the displayed input text with the second pronunciation of the first polyphonic word segment.

View all claims

1 Assignment

Timeline View

Assignment View

0 Petitions

Accused Products

Abstract

Method for phonetically annotating text is performed at a computing device. The method includes: identifying a first polyphonic word segment in a text input, the first polyphonic word segment having at least a first pronunciation and a second pronunciation; determining at least a first probability for the first pronunciation and a second probability for the second pronunciation; determining a predetermined threshold difference based on: a comparison of the first and second probabilities with a preset threshold probability value, respectively, and a magnitude of a difference between the first and second probabilities; comparing the difference between the first probability and the second probability with the predetermined threshold difference; and selecting the first pronunciation as a current pronunciation for the first polyphonic word segment in accordance with a determination that the difference between the first probability and the second probability exceeds the predetermined threshold difference.

Citations

20 Claims

1. A method, comprising:
- at a computing device having one or more processors and memory;
  
  receiving a text input from a user, including receiving copied or scanned text for which context-appropriate phonetic annotation is to be performed at the computing device;
  
  identifying a first polyphonic word segment and a first monophonic word segment in the text input, the first polyphonic word segment having at least a first pronunciation and a second pronunciation that is distinct from the first pronunciation, and the first monophonic word segment having a single pronunciation;
  
  determining at least a first probability corresponding to the first pronunciation being a correct pronunciation for the first polyphonic word segment and a second probability corresponding to the second pronunciation being the correct pronunciation for the first polyphonic word segment, wherein the first probability is greater than the second probability;
  
  determining a predetermined threshold difference based on;
  
  (1) a comparison of the first probability and the second probability with a preset threshold probability value, respectively, and (2) a magnitude of a difference between the first probability and the second probability;
  
  comparing the difference between the first probability and the second probability with the predetermined threshold difference; and
  
  selecting the first pronunciation as a current pronunciation for the first polyphonic word segment in accordance with a determination that the difference between the first probability and the second probability exceeds the predetermined threshold difference; and
  
  in a text presentation user interface, displaying the input text concurrently with context-appropriate pronunciation annotations to facilitate a user'"'"'s reading the input text aloud, including;
  
  phonetically annotating the first monophonic word segment in the displayed input text with the single pronunciation of the first monophonic word segment;
  
  phonetically annotating the first polyphonic word segment in the displayed input text with the first pronunciation of the first polyphonic word segment; and
  
  forgoing phonetically annotating the first polyphonic word segment in the displayed input text with the second pronunciation of the first polyphonic word segment.
- View Dependent Claims (2, 3, 4, 5, 6, 7)
- - 2. The method of claim 1, further comprising:
    - obtaining the text input for phonetic annotation;
      
      determining whether the text input contains at least one polyphonic character; and
      
      in accordance with a determination that the text input includes at least one polyphonic character, segmenting the text input into a plurality of word segments including the first polyphonic word segment.
  - 3. The method of claim 2, further comprising:
    - determining whether the plurality of word segments contain at least one polyphonic word segment, wherein the identification of the first polyphonic word segment in the text input is performed in accordance with a determination that the plurality of word segments contains at least one polyphonic word segment.
  - 4. The method of claim 1, further comprising:
    - selecting a current value for the predetermined threshold difference based on the first probability and the second probability.
  - 5. The method of claim 4, wherein selecting the current value for the predetermined threshold difference based on the first probability and the second probability further comprises:
    - selecting a first difference value for the predetermined threshold difference when both the first and the second probabilities are above a predetermined threshold probability value; and
      
      selecting a second difference value for the predetermined threshold difference when the first probability is above the predetermined threshold probability value and the second probability is below the predetermined threshold probability value.
  - 6. The method of claim 5, wherein the first difference value is smaller than the second difference value.
  - 7. The method of claim 6, wherein the first pronunciation and the second pronunciation have the two largest probabilities among all pronunciations of the first polyphonic word segment.

8. A non-transitory computer-readable medium having instructions stored thereon, the instructions, when executed by one or more processors cause the processors to perform operations comprising:
- receiving a text input from a user, including receiving copied or scanned text for which context-appropriate phonetic annotation is to be performed at the computing device;
  
  identifying a first polyphonic word segment and a first monophonic word segment in the text input, the first polyphonic word segment having at least a first pronunciation and a second pronunciation that is distinct from the first pronunciation, and the first monophonic word segment having a single pronunciation;
  
  determining at least a first probability corresponding to the first pronunciation being a correct pronunciation for the first polyphonic word segment and a second probability corresponding to the second pronunciation being the correct pronunciation for the first polyphonic word segment, wherein the first probability is greater than the second probability;
  
  determining a predetermined threshold difference based on;
  
  (1) a comparison of the first probability and the second probability with a preset threshold probability value, respectively, and (2) a magnitude of a difference between the first probability and the second probability;
  
  comparing the difference between the first probability and the second probability with the predetermined threshold difference; and
  
  selecting the first pronunciation as a current pronunciation for the first polyphonic word segment in accordance with a determination that the difference between the first probability and the second probability exceeds the predetermined threshold difference; and
  
  in a text presentation user interface, displaying the input text concurrently with context-appropriate pronunciation annotations to facilitate a user'"'"'s reading the input text aloud, including;
  
  phonetically annotating the first monophonic word segment in the displayed input text with the single pronunciation of the first monophonic word segment;
  
  phonetically annotating the first polyphonic word segment in the displayed input text with the first pronunciation of the first polyphonic word segment; and
  
  forgoing phonetically annotating the first polyphonic word segment in the displayed input text with the second pronunciation of the first polyphonic word segment.
- View Dependent Claims (9, 10, 11, 12, 13, 14)
- - 9. The computer-readable medium of claim 8, wherein the operations further comprise:
    - obtaining the text input for phonetic annotation;
      
      determining whether the text input contains at least one polyphonic character; and
      
      in accordance with a determination that the text input includes at least one polyphonic character, segmenting the text input into a plurality of word segments including the first polyphonic word segment.
  - 10. The computer-readable medium of claim 9, wherein the operations further comprise:
    - determining whether the plurality of word segments contain at least one polyphonic word segment, wherein the identification of the first polyphonic word segment in the text input is performed in accordance with a determination that the plurality of word segments contains at least one polyphonic word segment.
  - 11. The computer-readable medium of claim 8, wherein the operations further comprise:
    - selecting a current value for the predetermined threshold difference based on the first probability and the second probability.
  - 12. The computer-readable medium of claim 11, wherein selecting the current value for the predetermined threshold difference based on the first probability and the second probability further comprises:
    - selecting a first difference value for the predetermined threshold difference when both the first and the second probabilities are above a predetermined threshold probability value; and
      
      selecting a second difference value for the predetermined threshold difference when the first probability is above the predetermined threshold probability value and the second probability is below the predetermined threshold probability value.
  - 13. The computer-readable medium of claim 12, wherein the first difference value is smaller than the second difference value.
  - 14. The computer-readable medium of claim 13, wherein the first pronunciation and the second pronunciation have the two largest probabilities among all pronunciations of the first polyphonic word segment.

15. A system, comprising:
- one or more processors; and
  
  memory having instructions stored thereon, the instructions, when executed by the one or more processors, cause the processors to perform operations comprising;
  
  receiving a text input from a user, including receiving copied or scanned text for which context-appropriate phonetic annotation is to be performed at the computing device;
  
  identifying a first polyphonic word segment and a first monophonic word segment in the text input, the first polyphonic word segment having at least a first pronunciation and a second pronunciation that is distinct from the first pronunciation, and the first monophonic word segment having a single pronunciation;
  
  determining at least a first probability corresponding to the first pronunciation being a correct pronunciation for the first polyphonic word segment and a second probability corresponding to the second pronunciation being the correct pronunciation for the first polyphonic word segment, wherein the first probability is greater than the second probability;
  
  determining a predetermined threshold difference based on;
  
  (1) a comparison of the first probability and the second probability with a preset threshold probability value, respectively, and (2) a magnitude of a difference between the first probability and the second probability;
  
  comparing the difference between the first probability and the second probability with the predetermined threshold difference; and
  
  selecting the first pronunciation as a current pronunciation for the first polyphonic word segment in accordance with a determination that the difference between the first probability and the second probability exceeds the predetermined threshold difference; and
  
  in a text presentation user interface, displaying the input text concurrently with context-appropriate pronunciation annotations to facilitate a user'"'"'s reading the input text aloud, including;
  
  phonetically annotating the first monophonic word segment in the displayed input text with the single pronunciation of the first monophonic word segment;
  
  phonetically annotating the first polyphonic word segment in the displayed input text with the first pronunciation of the first polyphonic word segment; and
  
  forgoing phonetically annotating the first polyphonic word segment in the displayed input text with the second pronunciation of the first polyphonic word segment.
- View Dependent Claims (16, 17, 18, 19, 20)
- - 16. The system of claim 15, wherein the operations further comprise:
    - obtaining the text input for phonetic annotation;
      
      determining whether the text input contains at least one polyphonic character; and
      
      in accordance with a determination that the text input includes at least one polyphonic character, segmenting the text input into a plurality of word segments including the first polyphonic word segment.
  - 17. The system of claim 16, wherein the operations further comprise:
    - determining whether the plurality of word segments contain at least one polyphonic word segment, wherein the identification of the first polyphonic word segment in the text input is performed in accordance with a determination that the plurality of word segments contains at least one polyphonic word segment.
  - 18. The system of claim 15, wherein the operations further comprise:
    - selecting a current value for the predetermined threshold difference based on the first probability and the second probability.
  - 19. The system of claim 18, wherein selecting the current value for the predetermined threshold difference based on the first probability and the second probability further comprises:
    - selecting a first difference value for the predetermined threshold difference when both the first and the second probabilities are above a predetermined threshold probability value; and
      
      selecting a second difference value for the predetermined threshold difference when the first probability is above the predetermined threshold probability value and the second probability is below the predetermined threshold probability value.
  - 20. The system of claim 19, wherein the first difference value is smaller than the second difference value.

Specification

Resources

Litigation Campaign Assessment

Current Assignee
Tencent Technology Shenzhen Company Limited (Tencent Holdings Limited)
Original Assignee
Tencent Technology Shenzhen Company Limited (Tencent Holdings Limited)
Inventors
Wu, Xiaoping, Dai, Qiang
Primary Examiner(s)
Godbold, Douglas
Assistant Examiner(s)
Villena, Mark

Application Number

US15/191,309
Publication Number

US 20160306783A1
Time in Patent Office

859 Days
Field of Search
US Class Current
CPC Class Codes

G06F 40/129   Handling non-Latin characte...

G06F 40/169   Annotation, e.g. comment da...

G06F 40/253   Grammatical analysis; Style...

G06F 40/284   Lexical analysis, e.g. toke...

G10L 13/08   Text analysis or generation...

Method and apparatus for phonetically annotating text

First Claim

1 Assignment

0 Petitions

Accused Products

Abstract

Citations

20 Claims

Specification

Solutions

Use Cases

Quick Links

Method and apparatus for phonetically annotating text

First Claim

1 Assignment

Subscription Required

Subscription Required

0 Petitions

Subscription Required

Accused Products

Subscription Required

Abstract

Citations

20 Claims

Specification

Subscription Required

Solutions

Use Cases

Quick Links